Reviews

The literature-review database. Every paper Bob has reviewed (he has read many more), with a short summary, key findings, and tags. Browse, filter, search.

Search results

ADCanvas: Accessible and Conversational Audio Description Authoring for Blind and Low Vision Creators
Franklin Mingzhe Li, Michael Xieyang Liu, Cynthia L Bennett, Shaun K. Kane · 2026 · Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems (CHI '26)
Li and colleagues tackle a rarely examined corner of accessibility: the fact that the tools used to produce Audio Description (AD) are themselves largely inaccessible to the blind and low-vision (BLV) creators who are often its most skilled practitioners. Professional AD…
audio description · blind and low vision · conversational agent · multimodal LLM · visual question answering
Co-Designing Multimodal Systems for Accessible Asynchronous Dance Instruction
Ujjaini Das, Shreya Kappala, Meng Chen, Mina Huh, Amy Pavel · 2026 · Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems (CHI '26)
This paper investigates how to design multimodal systems that make asynchronous dance instruction accessible to blind and low vision (BLV) learners. While online exercise videos have proliferated, particularly since COVID-19, dance tutorials rely heavily on visual demonstrations…
blind and low vision · audio description · haptics · multimodal instruction · co-design
ViDscribe: Multimodal AI for Customizing Audio Description and Question Answering in Online Videos
Maryam S Cheema, Sina Elahimanesh, Pooyan Fazli, Hasti Seifi · 2026 · Extended Abstracts of the 2026 CHI Conference on Human Factors in Computing Systems (CHI EA '26)
Cheema and colleagues (Arizona State University and Saarland University) present ViDscribe, a web platform that layers AI-generated audio description (AD) and conversational visual question answering (VQA) on top of arbitrary YouTube videos for blind and low vision (BLV)…
video accessibility · audio description · blind and low vision · multimodal large language models · visual question answering
Sonic Stage: Automatically Generating an Interactive Spatial Soundscape to Facilitate Dialogue Video Comprehension for Blind and Low Vision Viewers
Shuchang Xu, Xiaofu Jin, Gaurav Jain, Wenshuo Zhang, Huamin Qu, Brian A. Smith, Yukang Yan · 2026 · Extended Abstracts of the 2026 CHI Conference on Human Factors in Computing Systems (CHI EA '26)
Xu and colleagues (HKUST, Columbia, Aalto, Rochester) tackle a well-known but largely unsolved problem in video accessibility: standard audio description (AD) is constrained not to overlap with dialogue, so dialogue-heavy scenes in films and TV - where characters' actions,…
video accessibility · audio description · blind and low vision · spatial audio · sound design
Enhancing Accessibility in Webtoons: Investigating Audio Effect Placement Strategies for Visually Impaired Users
Heewon Lee, Juwon Cheong, Minsung Kim, Jia Kim, Hyunjung Kim · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
This extended abstract investigates how the timing of audio effect (AE) placement—before, during (overlapping), or after narration—affects the user experience of audio-described webtoons for visually impaired users. Webtoons are Korean-originated vertical-scrolling comics…
blindness · low vision · audio description · webtoons · digital comics
DescribePro: Collaborative Audio Description with Human-AI Interaction
Maryam S Cheema, Sina Elahimanesh, Samuel Martin, Pooyan Fazli, Hasti Seifi · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
This paper presents DescribePro, a web-based platform that combines human expertise with AI capabilities to create and refine audio descriptions (AD) for video content. The system addresses the fundamental tension in AD production: human-crafted descriptions are high quality but…
audio description · video accessibility · human-AI collaboration · authoring tools · blind and low vision
Barriers to Employment: The Deaf Multimedia Authoring Tax
Christian Vogler, Abraham Glasser, Raja Kushalnagar, Matthew Seita, Mariana Arroyo Chavez, Keith Delk, Paige DeVries, Molly Feanny, Bernard Thompson, James Waller · 2025 · Proceedings of the 22nd International Web for All Conference (W4A)
This paper from Gallaudet University describes through firsthand experience the enormous additional burden — termed the "deaf multimedia authoring tax" — that deaf and hard of hearing (DHH) people face when creating accessible multimedia content for the workplace. Written by a…
deaf and hard of hearing · sign language · content creation · workplace accessibility · captioning
Towards Accessible Musical Performances in Virtual Reality: Designing a Conceptual Framework for Omnidirectional Audio Descriptions
Khang Dang, Grace Burke, Hamdi Korreshi, Sooyeon Lee · 2024 · Proceedings of the 26th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '24)
This paper develops a conceptual framework for omnidirectional audio description (AD) designed to make musical performances in virtual reality accessible to blind and low-vision (BLV) users. Traditional AD — a monaural narration track describing visual elements — was developed…
audio description · virtual reality · blind and low vision · spatial audio · musical performances
Audio Description Customization
Rosiana Natalie, Ruei-Che Chang, Smitha Sheshadri, Anhong Guo, Kotaro Hara · 2024 · Proceedings of the 26th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2024)
This paper investigates how audio descriptions (AD) for video content can be customized to meet the diverse preferences of blind and low-vision (BLV) users. Traditional ADs are fixed narratives created by sighted describers, offering no ability for users to adjust what…
audio description · blind and low vision · customization · video accessibility · assistive technology
Direct or Immersive? Comparing Smartphone-based Museum Guide Systems for Blind Visitors
Xiyue Wang, Seita Kayukawa, Hironobu Takagi, Giorgia Masoero, Chieko Asakawa · 2024 · Proceedings of the 21st International Web for All Conference (W4A)
This paper presents the first direct comparison of two smartphone-based museum guide paradigms for blind visitors: a "direct" system using turn-by-turn navigation with VoiceOver-controlled audio descriptions, and an "immersive" system using spatialized sound navigation with…
museum accessibility · blindness · indoor navigation · spatialized audio · screen readers
Making Accessible Movies Easily: An Intelligent Tool for Authoring and Integrating Audio Descriptions to Movies
Ming Shen, Gang Huang, Yuxuan Wu, Shuyi Song, Sheng Zhou, Liangcheng Li, Zhi Yu, Wei Wang, Jiajun Bu · 2024 · Proceedings of the 21st International Web for All Conference (W4A)
This paper introduces EasyAD, an intelligent tool that automates the process of authoring and integrating audio descriptions (AD) into movies for blind and visually impaired (BVI) users. The traditional AD production workflow is highly labor-intensive, requiring authors to…
audio description · blind and low vision · media accessibility · multimodal AI · speech synthesis
Translating Color: Sonification as a Method of Sensory Substitution within the Museum
Silvia Dini, Luca Andrea Ludovico, Sergio Mascetti, Maria Joaquina Valero Gisbert · 2023 · Proceedings of the 20th International Web for All Conference (W4A)
This extended abstract proposes using sonification — the technique of translating data into sound — to make the chromatic elements of contemporary artworks accessible to people with visual impairments or blindness (VIB). The research addresses a fundamental challenge in museum…
sonification · museum accessibility · visual impairment · sensory substitution · art accessibility
Exploring Community-Driven Descriptions for Making Livestreams Accessible
Daniel Killough, Amy Pavel · 2023 · ASSETS '23: Proceedings of the 25th International ACM SIGACCESS Conference on Computers and Accessibility
This paper investigates the feasibility of using livestream community members — sighted viewers who are domain experts in the content being streamed — to provide real-time descriptions that make livestreams accessible to viewers with visual impairments. Livestreams present…
audio description · livestreaming · blind and low vision · crowdsourcing · video accessibility
A Gallery In My Hand: A Multi-Exhibition Investigation of Accessible and Inclusive Gallery Experiences for Blind and Low Vision Visitors
Matthew Butler, Erica J. Tandori, Vince Dziekan, Kirsten Ellis, Jenna Hall, Leona M. Holloway, Ruth G. Nagassa, Kim Marriott · 2023 · Proceedings of the 25th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '23)
This paper presents findings from a longitudinal collaboration between researchers and the Bendigo Art Gallery, a major Australian regional gallery, to develop accessible and inclusive experiences for blind and low-vision (BLV) visitors across two flagship exhibitions: Mary…
museum accessibility · blind and low vision · tactile graphics · 3D printing · inclusive design
Beyond Audio Description: Exploring 360° Video Accessibility with Blind and Low Vision Users Through Collaborative Creation
Lucy Jiang, Mahika Phutane, Shiri Azenkot · 2023 · Proceedings of the 25th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2023)
This paper investigates how to make 360-degree videos accessible to blind and low vision (BLV) users while preserving their immersive nature — a challenge that goes well beyond simply adding traditional audio description (AD). The researchers conducted a two-part study with 14…
audio description · 360 video · video accessibility · blind and low vision · co-design
The Potential of a Visual Dialogue Agent In a Tandem Automated Audio Description System for Videos
Abigale Stangl, Shasta Ihorn, Yue-Ting Siu, Aditya Bodi, Mar Castanon, Lothar D Narins, Ilmi Yoon · 2023 · Proceedings of the 25th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2023)
This paper presents and evaluates a tandem AI-based audio description (AD) system for videos that combines two complementary tools: NarrationBot, which delivers automated minimum viable descriptions (MVD) of video content, and InfoBot, a visual dialogue agent that allows users…
audio description · blind and low vision · visual question answering · visual dialogue · AI
Machine Generation of Audio Description for Blind and Visually Impaired People
Virgínia P. Campos, Tiago M. U. de Araújo, Guido L. de Souza Filho, Luiz M. G. Gonçalves · 2023 · ACM Transactions on Accessible Computing
This paper presents an extension to CineAD, a system for automatically generating audio descriptions (AD) for videos. The authors address a critical accessibility gap: most videos, films, and cultural programming lack audio descriptions, leaving blind and visually impaired (BVI)…
audio description · blind and visually impaired · computer vision · machine learning · video accessibility
AccessComics: An Accessible Digital Comic Book Reader for People with Visual Impairments
Yunjung Lee, Hwayeon Joh, Suhyeon Yoo, Uran Oh · 2021 · Proceedings of the 18th International Web for All Conference (W4A)
This paper from Ewha Womans University in Seoul presents AccessComics, a web-based accessible digital comic book reader for people with visual impairments (PVI). Comics are a popular medium available on many digital platforms, yet almost none support screen readers, leaving…
visual impairment · blindness · low vision · content accessibility · screen readers
Slidecho: Flexible Non-Visual Exploration of Presentation Videos
Yi-Hao Peng, Jeffrey P Bigham, Amy Pavel · 2021 · Proceedings of the 23rd International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '21)
This paper presents Slidecho, a system that makes recorded presentation videos accessible to blind and visually impaired learners by automatically extracting slide content and synchronizing it with the presenter's speech. The core problem is that most presentation videos —…
video accessibility · blind and low vision · audio description · presentations · screen reader
The Efficacy of Collaborative Authoring of Video Scene Descriptions
Rosiana Natalie, Jolene Loh, Huei Suen Tan, Joshua Tseng, Ian Luke Yi-Ren Chan, Ebrima H Jarjue, Hernisa Kacorri, Kotaro Hara · 2021 · ASSETS '21: The 23rd International ACM SIGACCESS Conference on Computers and Accessibility
The vast majority of online video content remains inaccessible to people with visual impairments because it lacks audio descriptions — verbal commentaries that depict visual information in scenes. Professional audio description services cost US$12 to US$75 per video minute and…
audio description · video accessibility · visual impairment · crowdsourcing · collaborative authoring
Say It All: Feedback for Improving Non-Visual Presentation Accessibility
Yi-Hao Peng, JiWoong Jang, Jeffrey P. Bigham, Amy Pavel · 2021 · Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems (CHI '21)
This paper addresses the widespread problem of inaccessible slide-based presentations for blind and visually impaired audiences. When presenters fail to verbally describe the visual content on their slides — text, images, diagrams, graphs, and videos — audience members who…
presentation accessibility · slides · audio description · blind and low vision · real-time feedback
Game Changer: Accessible Audio and Tactile Guidance for Board and Card Games
Gabriella M. Johnson, Shaun K. Kane · 2020 · Proceedings of the 17th International Web for All Conference (W4A)
This paper presents Game Changer, an augmented workspace system that makes board and card games accessible to blind and visually impaired (BVI) players through a combination of audio descriptions and tactile modifications. The system uses an overhead webcam to track ArUco…
game accessibility · blind and low vision · board games · tangible interaction · audio description
Making GIFs Accessible
Cole Gleason, Amy Pavel, Himalini Gururaj, Kris Kitani, Jeffrey Bigham · 2020 · Proceedings of the 22nd International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2020)
This Carnegie Mellon University study examines the accessibility of GIFs on social media, a visual medium that has become central to online conversation but remains largely inaccessible to people with vision impairments. The researchers conducted a multi-part investigation:…
GIF accessibility · alternative text · audio description · blind · low vision
Design Guidelines for an Interactive 3D Model as a Supporting Tool for Exploring a Cultural Site by Visually Impaired and Sighted People
Barbara Leporini, Valentina Rossetti, Francesco Furfari, Susanna Pelagatti, Andrea Quarta · 2020 · ACM Transactions on Accessible Computing
This paper presents a methodology for creating low-cost interactive 3D printed models combined with audio descriptions to enable both visually impaired and sighted people to explore cultural heritage sites autonomously. The prototype reproduces Piazza dei Miracoli in Pisa, Italy…
3D printing · tactile graphics · cultural heritage · museum accessibility · audio description
Rescribe: Authoring and Automatically Editing Audio Descriptions
Amy Pavel, Gabriel Reyes, Jeffrey P. Bigham · 2020 · Proceedings of the 33rd Annual ACM Symposium on User Interface Software and Technology (UIST '20)
This paper introduces Rescribe, a tool that helps authors create and refine audio descriptions for videos. Audio descriptions make video content accessible to blind and visually impaired viewers by narrating important visual information during gaps in the existing audio track. A…
audio description · video accessibility · blind and low vision · NLP · sentence compression

Reviews

Year

Tag

Search results

ADCanvas: Accessible and Conversational Audio Description Authoring for Blind and Low Vision Creators

Co-Designing Multimodal Systems for Accessible Asynchronous Dance Instruction

ViDscribe: Multimodal AI for Customizing Audio Description and Question Answering in Online Videos

Sonic Stage: Automatically Generating an Interactive Spatial Soundscape to Facilitate Dialogue Video Comprehension for Blind and Low Vision Viewers

Enhancing Accessibility in Webtoons: Investigating Audio Effect Placement Strategies for Visually Impaired Users

DescribePro: Collaborative Audio Description with Human-AI Interaction

Barriers to Employment: The Deaf Multimedia Authoring Tax

Towards Accessible Musical Performances in Virtual Reality: Designing a Conceptual Framework for Omnidirectional Audio Descriptions

Audio Description Customization

Direct or Immersive? Comparing Smartphone-based Museum Guide Systems for Blind Visitors

Making Accessible Movies Easily: An Intelligent Tool for Authoring and Integrating Audio Descriptions to Movies

Translating Color: Sonification as a Method of Sensory Substitution within the Museum

Exploring Community-Driven Descriptions for Making Livestreams Accessible

A Gallery In My Hand: A Multi-Exhibition Investigation of Accessible and Inclusive Gallery Experiences for Blind and Low Vision Visitors

Beyond Audio Description: Exploring 360° Video Accessibility with Blind and Low Vision Users Through Collaborative Creation

The Potential of a Visual Dialogue Agent In a Tandem Automated Audio Description System for Videos

Machine Generation of Audio Description for Blind and Visually Impaired People

AccessComics: An Accessible Digital Comic Book Reader for People with Visual Impairments

Slidecho: Flexible Non-Visual Exploration of Presentation Videos

The Efficacy of Collaborative Authoring of Video Scene Descriptions

Say It All: Feedback for Improving Non-Visual Presentation Accessibility

Game Changer: Accessible Audio and Tactile Guidance for Board and Card Games

Making GIFs Accessible

Design Guidelines for an Interactive 3D Model as a Supporting Tool for Exploring a Cultural Site by Visually Impaired and Sighted People

Rescribe: Authoring and Automatically Editing Audio Descriptions