← Writing · Glossary →

Reviews

The literature-review database. Every paper Bob has reviewed (he has read many more), with a short summary, key findings, and tags. Browse, filter, search.

Search results

  • ADCanvas: Accessible and Conversational Audio Description Authoring for Blind and Low Vision Creators

    Franklin Mingzhe Li, Michael Xieyang Liu, Cynthia L Bennett, Shaun K. Kane · 2026 · Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems (CHI '26)

    Li and colleagues tackle a rarely examined corner of accessibility: the fact that the tools used to produce Audio Description (AD) are themselves largely inaccessible to the blind and low-vision (BLV) creators who are often its most skilled practitioners. Professional AD…

    audio description · blind and low vision · conversational agent · multimodal LLM · visual question answering

  • Co-Designing Multimodal Systems for Accessible Asynchronous Dance Instruction

    Ujjaini Das, Shreya Kappala, Meng Chen, Mina Huh, Amy Pavel · 2026 · Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems (CHI '26)

    This paper investigates how to design multimodal systems that make asynchronous dance instruction accessible to blind and low vision (BLV) learners. While online exercise videos have proliferated, particularly since COVID-19, dance tutorials rely heavily on visual demonstrations…

    blind and low vision · audio description · haptics · multimodal instruction · co-design

  • ViDscribe: Multimodal AI for Customizing Audio Description and Question Answering in Online Videos

    Maryam S Cheema, Sina Elahimanesh, Pooyan Fazli, Hasti Seifi · 2026 · Extended Abstracts of the 2026 CHI Conference on Human Factors in Computing Systems (CHI EA '26)

    Cheema and colleagues (Arizona State University and Saarland University) present ViDscribe, a web platform that layers AI-generated audio description (AD) and conversational visual question answering (VQA) on top of arbitrary YouTube videos for blind and low vision (BLV)…

    video accessibility · audio description · blind and low vision · multimodal large language models · visual question answering

  • Sonic Stage: Automatically Generating an Interactive Spatial Soundscape to Facilitate Dialogue Video Comprehension for Blind and Low Vision Viewers

    Shuchang Xu, Xiaofu Jin, Gaurav Jain, Wenshuo Zhang, Huamin Qu, Brian A. Smith, Yukang Yan · 2026 · Extended Abstracts of the 2026 CHI Conference on Human Factors in Computing Systems (CHI EA '26)

    Xu and colleagues (HKUST, Columbia, Aalto, Rochester) tackle a well-known but largely unsolved problem in video accessibility: standard audio description (AD) is constrained not to overlap with dialogue, so dialogue-heavy scenes in films and TV - where characters' actions,…

    video accessibility · audio description · blind and low vision · spatial audio · sound design

  • Enhancing Accessibility in Webtoons: Investigating Audio Effect Placement Strategies for Visually Impaired Users

    Heewon Lee, Juwon Cheong, Minsung Kim, Jia Kim, Hyunjung Kim · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility

    This extended abstract investigates how the timing of audio effect (AE) placement—before, during (overlapping), or after narration—affects the user experience of audio-described webtoons for visually impaired users. Webtoons are Korean-originated vertical-scrolling comics…

    blindness · low vision · audio description · webtoons · digital comics

  • DescribePro: Collaborative Audio Description with Human-AI Interaction

    Maryam S Cheema, Sina Elahimanesh, Samuel Martin, Pooyan Fazli, Hasti Seifi · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility

    This paper presents DescribePro, a web-based platform that combines human expertise with AI capabilities to create and refine audio descriptions (AD) for video content. The system addresses the fundamental tension in AD production: human-crafted descriptions are high quality but…

    audio description · video accessibility · human-AI collaboration · authoring tools · blind and low vision

  • Barriers to Employment: The Deaf Multimedia Authoring Tax

    Christian Vogler, Abraham Glasser, Raja Kushalnagar, Matthew Seita, Mariana Arroyo Chavez, Keith Delk, Paige DeVries, Molly Feanny, Bernard Thompson, James Waller · 2025 · Proceedings of the 22nd International Web for All Conference (W4A)

    This paper from Gallaudet University describes through firsthand experience the enormous additional burden — termed the "deaf multimedia authoring tax" — that deaf and hard of hearing (DHH) people face when creating accessible multimedia content for the workplace. Written by a…

    deaf and hard of hearing · sign language · content creation · workplace accessibility · captioning

  • Towards Accessible Musical Performances in Virtual Reality: Designing a Conceptual Framework for Omnidirectional Audio Descriptions

    Khang Dang, Grace Burke, Hamdi Korreshi, Sooyeon Lee · 2024 · Proceedings of the 26th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '24)

    This paper develops a conceptual framework for omnidirectional audio description (AD) designed to make musical performances in virtual reality accessible to blind and low-vision (BLV) users. Traditional AD — a monaural narration track describing visual elements — was developed…

    audio description · virtual reality · blind and low vision · spatial audio · musical performances

  • Audio Description Customization

    Rosiana Natalie, Ruei-Che Chang, Smitha Sheshadri, Anhong Guo, Kotaro Hara · 2024 · Proceedings of the 26th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2024)

    This paper investigates how audio descriptions (AD) for video content can be customized to meet the diverse preferences of blind and low-vision (BLV) users. Traditional ADs are fixed narratives created by sighted describers, offering no ability for users to adjust what…

    audio description · blind and low vision · customization · video accessibility · assistive technology

  • Direct or Immersive? Comparing Smartphone-based Museum Guide Systems for Blind Visitors

    Xiyue Wang, Seita Kayukawa, Hironobu Takagi, Giorgia Masoero, Chieko Asakawa · 2024 · Proceedings of the 21st International Web for All Conference (W4A)

    This paper presents the first direct comparison of two smartphone-based museum guide paradigms for blind visitors: a "direct" system using turn-by-turn navigation with VoiceOver-controlled audio descriptions, and an "immersive" system using spatialized sound navigation with…

    museum accessibility · blindness · indoor navigation · spatialized audio · screen readers

  • Making Accessible Movies Easily: An Intelligent Tool for Authoring and Integrating Audio Descriptions to Movies

    Ming Shen, Gang Huang, Yuxuan Wu, Shuyi Song, Sheng Zhou, Liangcheng Li, Zhi Yu, Wei Wang, Jiajun Bu · 2024 · Proceedings of the 21st International Web for All Conference (W4A)

    This paper introduces EasyAD, an intelligent tool that automates the process of authoring and integrating audio descriptions (AD) into movies for blind and visually impaired (BVI) users. The traditional AD production workflow is highly labor-intensive, requiring authors to…

    audio description · blind and low vision · media accessibility · multimodal AI · speech synthesis

  • Translating Color: Sonification as a Method of Sensory Substitution within the Museum

    Silvia Dini, Luca Andrea Ludovico, Sergio Mascetti, Maria Joaquina Valero Gisbert · 2023 · Proceedings of the 20th International Web for All Conference (W4A)

    This extended abstract proposes using sonification — the technique of translating data into sound — to make the chromatic elements of contemporary artworks accessible to people with visual impairments or blindness (VIB). The research addresses a fundamental challenge in museum…

    sonification · museum accessibility · visual impairment · sensory substitution · art accessibility

  • Exploring Community-Driven Descriptions for Making Livestreams Accessible

    Daniel Killough, Amy Pavel · 2023 · ASSETS '23: Proceedings of the 25th International ACM SIGACCESS Conference on Computers and Accessibility

    This paper investigates the feasibility of using livestream community members — sighted viewers who are domain experts in the content being streamed — to provide real-time descriptions that make livestreams accessible to viewers with visual impairments. Livestreams present…

    audio description · livestreaming · blind and low vision · crowdsourcing · video accessibility

  • A Gallery In My Hand: A Multi-Exhibition Investigation of Accessible and Inclusive Gallery Experiences for Blind and Low Vision Visitors

    Matthew Butler, Erica J. Tandori, Vince Dziekan, Kirsten Ellis, Jenna Hall, Leona M. Holloway, Ruth G. Nagassa, Kim Marriott · 2023 · Proceedings of the 25th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '23)

    This paper presents findings from a longitudinal collaboration between researchers and the Bendigo Art Gallery, a major Australian regional gallery, to develop accessible and inclusive experiences for blind and low-vision (BLV) visitors across two flagship exhibitions: Mary…

    museum accessibility · blind and low vision · tactile graphics · 3D printing · inclusive design

  • Beyond Audio Description: Exploring 360° Video Accessibility with Blind and Low Vision Users Through Collaborative Creation

    Lucy Jiang, Mahika Phutane, Shiri Azenkot · 2023 · Proceedings of the 25th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2023)

    This paper investigates how to make 360-degree videos accessible to blind and low vision (BLV) users while preserving their immersive nature — a challenge that goes well beyond simply adding traditional audio description (AD). The researchers conducted a two-part study with 14…

    audio description · 360 video · video accessibility · blind and low vision · co-design

  • The Potential of a Visual Dialogue Agent In a Tandem Automated Audio Description System for Videos

    Abigale Stangl, Shasta Ihorn, Yue-Ting Siu, Aditya Bodi, Mar Castanon, Lothar D Narins, Ilmi Yoon · 2023 · Proceedings of the 25th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2023)

    This paper presents and evaluates a tandem AI-based audio description (AD) system for videos that combines two complementary tools: NarrationBot, which delivers automated minimum viable descriptions (MVD) of video content, and InfoBot, a visual dialogue agent that allows users…

    audio description · blind and low vision · visual question answering · visual dialogue · AI

  • Machine Generation of Audio Description for Blind and Visually Impaired People

    Virgínia P. Campos, Tiago M. U. de Araújo, Guido L. de Souza Filho, Luiz M. G. Gonçalves · 2023 · ACM Transactions on Accessible Computing

    This paper presents an extension to CineAD, a system for automatically generating audio descriptions (AD) for videos. The authors address a critical accessibility gap: most videos, films, and cultural programming lack audio descriptions, leaving blind and visually impaired (BVI)…

    audio description · blind and visually impaired · computer vision · machine learning · video accessibility

  • AccessComics: An Accessible Digital Comic Book Reader for People with Visual Impairments

    Yunjung Lee, Hwayeon Joh, Suhyeon Yoo, Uran Oh · 2021 · Proceedings of the 18th International Web for All Conference (W4A)

    This paper from Ewha Womans University in Seoul presents AccessComics, a web-based accessible digital comic book reader for people with visual impairments (PVI). Comics are a popular medium available on many digital platforms, yet almost none support screen readers, leaving…

    visual impairment · blindness · low vision · content accessibility · screen readers

  • Slidecho: Flexible Non-Visual Exploration of Presentation Videos

    Yi-Hao Peng, Jeffrey P Bigham, Amy Pavel · 2021 · Proceedings of the 23rd International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '21)

    This paper presents Slidecho, a system that makes recorded presentation videos accessible to blind and visually impaired learners by automatically extracting slide content and synchronizing it with the presenter's speech. The core problem is that most presentation videos —…

    video accessibility · blind and low vision · audio description · presentations · screen reader

  • The Efficacy of Collaborative Authoring of Video Scene Descriptions

    Rosiana Natalie, Jolene Loh, Huei Suen Tan, Joshua Tseng, Ian Luke Yi-Ren Chan, Ebrima H Jarjue, Hernisa Kacorri, Kotaro Hara · 2021 · ASSETS '21: The 23rd International ACM SIGACCESS Conference on Computers and Accessibility

    The vast majority of online video content remains inaccessible to people with visual impairments because it lacks audio descriptions — verbal commentaries that depict visual information in scenes. Professional audio description services cost US$12 to US$75 per video minute and…

    audio description · video accessibility · visual impairment · crowdsourcing · collaborative authoring

  • Say It All: Feedback for Improving Non-Visual Presentation Accessibility

    Yi-Hao Peng, JiWoong Jang, Jeffrey P. Bigham, Amy Pavel · 2021 · Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems (CHI '21)

    This paper addresses the widespread problem of inaccessible slide-based presentations for blind and visually impaired audiences. When presenters fail to verbally describe the visual content on their slides — text, images, diagrams, graphs, and videos — audience members who…

    presentation accessibility · slides · audio description · blind and low vision · real-time feedback

  • Game Changer: Accessible Audio and Tactile Guidance for Board and Card Games

    Gabriella M. Johnson, Shaun K. Kane · 2020 · Proceedings of the 17th International Web for All Conference (W4A)

    This paper presents Game Changer, an augmented workspace system that makes board and card games accessible to blind and visually impaired (BVI) players through a combination of audio descriptions and tactile modifications. The system uses an overhead webcam to track ArUco…

    game accessibility · blind and low vision · board games · tangible interaction · audio description

  • Making GIFs Accessible

    Cole Gleason, Amy Pavel, Himalini Gururaj, Kris Kitani, Jeffrey Bigham · 2020 · Proceedings of the 22nd International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2020)

    This Carnegie Mellon University study examines the accessibility of GIFs on social media, a visual medium that has become central to online conversation but remains largely inaccessible to people with vision impairments. The researchers conducted a multi-part investigation:…

    GIF accessibility · alternative text · audio description · blind · low vision

  • Design Guidelines for an Interactive 3D Model as a Supporting Tool for Exploring a Cultural Site by Visually Impaired and Sighted People

    Barbara Leporini, Valentina Rossetti, Francesco Furfari, Susanna Pelagatti, Andrea Quarta · 2020 · ACM Transactions on Accessible Computing

    This paper presents a methodology for creating low-cost interactive 3D printed models combined with audio descriptions to enable both visually impaired and sighted people to explore cultural heritage sites autonomously. The prototype reproduces Piazza dei Miracoli in Pisa, Italy…

    3D printing · tactile graphics · cultural heritage · museum accessibility · audio description

  • Rescribe: Authoring and Automatically Editing Audio Descriptions

    Amy Pavel, Gabriel Reyes, Jeffrey P. Bigham · 2020 · Proceedings of the 33rd Annual ACM Symposium on User Interface Software and Technology (UIST '20)

    This paper introduces Rescribe, a tool that helps authors create and refine audio descriptions for videos. Audio descriptions make video content accessible to blind and visually impaired viewers by narrating important visual information during gaps in the existing audio track. A…

    audio description · video accessibility · blind and low vision · NLP · sentence compression