← Writing · Glossary →

Reviews

The literature-review database. Every paper Bob has reviewed (he has read many more), with a short summary, key findings, and tags. Browse, filter, search.

Search results

  • Check Now, Can You See It?: Exploring Voice and Video-Capable Language Models for Identifying and Spatially Locating Items of Interest for Blind and Low-Vision Travelers

    Aziz N Zeidieh, JooYoung Seo · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility

    This experience report documents the lived experiences of two blind travelers — Aziz (28, blind in left eye, 20/2200 in right) and JooYoung (35, blind in right eye, limited vision in left) — as they adapted commercially available voice and video-capable language models (VVLMs)…

    artificial intelligence · navigation · blindness and visual impairment · multimodal AI · large language models

  • Surfacing Variations to Calibrate Perceived Reliability of MLLM-generated Image Descriptions

    Meng Chen, Akhil Iyer, Amy Pavel · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility

    This paper addresses a critical safety problem in AI-powered visual access technology: multimodal large language models (MLLMs) like GPT-4o, Gemini, and Claude produce fluent, confident image descriptions that can contain fabricated content, misinterpretations, and omissions…

    blindness · low vision · image descriptions · multimodal AI · large language models

  • Temp access: Reflecting on multimodal GAI as an accessibility technology for temporary disability

    Kate S. Glazko · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility

    This paper presents an autoethnographic account of using multimodal generative AI (GAI) tools as accessibility technology during a period of temporary disability. The author, an accessibility researcher, experienced an illness that simultaneously impacted verbal communication,…

    generative AI · temporary disability · assistive technology · autoethnography · multimodal AI

  • DescribePro: Collaborative Audio Description with Human-AI Interaction

    Maryam S Cheema, Sina Elahimanesh, Samuel Martin, Pooyan Fazli, Hasti Seifi · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility

    This paper presents DescribePro, a web-based platform that combines human expertise with AI capabilities to create and refine audio descriptions (AD) for video content. The system addresses the fundamental tension in AD production: human-crafted descriptions are high quality but…

    audio description · video accessibility · human-AI collaboration · authoring tools · blind and low vision

  • AccessMenu: Enhancing Usability of Online Restaurant Menus for Screen Reader Users

    Nithiya Venkatraman, Akshay Kolgar Nayak, Suyog Dahal, Yash Prakash, Hae-Na Lee, Vikas Ashok · 2025 · Proceedings of the 22nd International Web for All Conference (W4A)

    This paper addresses the significant accessibility barriers that blind and visually impaired (BVI) screen reader users face when trying to access online restaurant menus, which are typically presented as images or PDFs. The research proceeds in two phases. First, an interview…

    screen readers · blind users · visual document understanding · LLM accessibility · multimodal AI

5 results.