Reviews

The literature-review database. Every paper Bob has reviewed (he has read many more), with a short summary, key findings, and tags. Browse, filter, search.

Search results

Check Now, Can You See It?: Exploring Voice and Video-Capable Language Models for Identifying and Spatially Locating Items of Interest for Blind and Low-Vision Travelers
Aziz N Zeidieh, JooYoung Seo · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
This experience report documents the lived experiences of two blind travelers — Aziz (28, blind in left eye, 20/2200 in right) and JooYoung (35, blind in right eye, limited vision in left) — as they adapted commercially available voice and video-capable language models (VVLMs)…
artificial intelligence · navigation · blindness and visual impairment · multimodal AI · large language models
Surfacing Variations to Calibrate Perceived Reliability of MLLM-generated Image Descriptions
Meng Chen, Akhil Iyer, Amy Pavel · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
This paper addresses a critical safety problem in AI-powered visual access technology: multimodal large language models (MLLMs) like GPT-4o, Gemini, and Claude produce fluent, confident image descriptions that can contain fabricated content, misinterpretations, and omissions…
blindness · low vision · image descriptions · multimodal AI · large language models
Temp access: Reflecting on multimodal GAI as an accessibility technology for temporary disability
Kate S. Glazko · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
This paper presents an autoethnographic account of using multimodal generative AI (GAI) tools as accessibility technology during a period of temporary disability. The author, an accessibility researcher, experienced an illness that simultaneously impacted verbal communication,…
generative AI · temporary disability · assistive technology · autoethnography · multimodal AI
DescribePro: Collaborative Audio Description with Human-AI Interaction
Maryam S Cheema, Sina Elahimanesh, Samuel Martin, Pooyan Fazli, Hasti Seifi · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
This paper presents DescribePro, a web-based platform that combines human expertise with AI capabilities to create and refine audio descriptions (AD) for video content. The system addresses the fundamental tension in AD production: human-crafted descriptions are high quality but…
audio description · video accessibility · human-AI collaboration · authoring tools · blind and low vision
AccessMenu: Enhancing Usability of Online Restaurant Menus for Screen Reader Users
Nithiya Venkatraman, Akshay Kolgar Nayak, Suyog Dahal, Yash Prakash, Hae-Na Lee, Vikas Ashok · 2025 · Proceedings of the 22nd International Web for All Conference (W4A)
This paper addresses the significant accessibility barriers that blind and visually impaired (BVI) screen reader users face when trying to access online restaurant menus, which are typically presented as images or PDFs. The research proceeds in two phases. First, an interview…
screen readers · blind users · visual document understanding · LLM accessibility · multimodal AI

5 results.

Tag

Search results

Check Now, Can You See It?: Exploring Voice and Video-Capable Language Models for Identifying and Spatially Locating Items of Interest for Blind and Low-Vision Travelers

Surfacing Variations to Calibrate Perceived Reliability of MLLM-generated Image Descriptions

Temp access: Reflecting on multimodal GAI as an accessibility technology for temporary disability

DescribePro: Collaborative Audio Description with Human-AI Interaction

AccessMenu: Enhancing Usability of Online Restaurant Menus for Screen Reader Users