Reviews

The literature-review database. Every paper Bob has reviewed (he has read many more), with a short summary, key findings, and tags. Browse, filter, search.

Search results

  • How Multimodal Large Language Models Support Access to Visual Information: A Diary Study With Blind and Low Vision People

    Ricardo E. Gonzalez Penuela, Crescentia Jung, Sharon Lin, Ruiying Hu, Shiri Azenkot · 2026 · Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems (CHI '26)

    This CHI 2026 paper reports a two-week diary study with 20 blind and low vision (BLV) participants (ages 19–75; 11 female, 9 male; 13 blind, 7 low vision) investigating how multimodal large language models (MLLMs) support real-world access to visual information. The authors built…

    AI · accessibility · multimodal large language models · MLLM · visual question answering

  • ViDscribe: Multimodal AI for Customizing Audio Description and Question Answering in Online Videos

    Maryam S Cheema, Sina Elahimanesh, Pooyan Fazli, Hasti Seifi · 2026 · Extended Abstracts of the 2026 CHI Conference on Human Factors in Computing Systems (CHI EA '26)

    Cheema and colleagues (Arizona State University and Saarland University) present ViDscribe, a web platform that layers AI-generated audio description (AD) and conversational visual question answering (VQA) on top of arbitrary YouTube videos for blind and low vision (BLV)…

    video accessibility · audio description · blind and low vision · multimodal large language models · visual question answering

  • Sonic Stage: Automatically Generating an Interactive Spatial Soundscape to Facilitate Dialogue Video Comprehension for Blind and Low Vision Viewers

    Shuchang Xu, Xiaofu Jin, Gaurav Jain, Wenshuo Zhang, Huamin Qu, Brian A. Smith, Yukang Yan · 2026 · Extended Abstracts of the 2026 CHI Conference on Human Factors in Computing Systems (CHI EA '26)

    Xu and colleagues (HKUST, Columbia, Aalto, Rochester) tackle a well-known but largely unsolved problem in video accessibility: standard audio description (AD) is constrained not to overlap with dialogue, so dialogue-heavy scenes in films and TV - where characters' actions,…

    video accessibility · audio description · blind and low vision · spatial audio · sound design

3 results.