Reviews

The literature-review database. Every paper Bob has reviewed (he has read many more), with a short summary, key findings, and tags. Browse, filter, search.

Search results

NeuroBridge: Using Generative AI to Bridge Cross-neurotype Communication Differences through Neurotypical Perspective-taking
Rukhshan Haroon, Kyle Wigdor, Katie Yang, Nicole Toumanios, Eileen T Crehan, Fahad Dogar · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
This paper presents NeuroBridge, an LLM-powered interactive platform designed to help neurotypical individuals better understand autistic communication styles and reflect on their own role in cross-neurotype communication breakdowns. The system is grounded in the double empathy…
autism · neurodiversity · large language models · cross-neurotype communication · perspective-taking
Benchmarking PDF Accessibility Evaluation: A Dataset and Framework for Assessing Automated and LLM-Based Approaches for Accessibility Testing
Anukriti Kumar, Tanushree Padath, Lucy Lu Wang · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
This paper addresses a critical gap in PDF accessibility evaluation by introducing the first expert-validated benchmark dataset and standardized evaluation framework for assessing how well different tools and approaches can evaluate PDF accessibility. Despite PDFs being the…
PDF accessibility · automated testing · large language models · WCAG · PDF/UA
AccessGuru: Leveraging LLMs to Detect and Correct Web Accessibility Violations in HTML Code
Nadeen Fathallah, Daniel Hernández, Steffen Staab · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
This paper introduces AccessGuru, a novel method that combines traditional automated accessibility testing tools with large language models (LLMs) to both detect and correct web accessibility violations in HTML code. The work addresses a persistent gap in accessibility tooling:…
automated testing · web accessibility · large language models · HTML remediation · prompt engineering
Check Now, Can You See It?: Exploring Voice and Video-Capable Language Models for Identifying and Spatially Locating Items of Interest for Blind and Low-Vision Travelers
Aziz N Zeidieh, JooYoung Seo · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
This experience report documents the lived experiences of two blind travelers — Aziz (28, blind in left eye, 20/2200 in right) and JooYoung (35, blind in right eye, limited vision in left) — as they adapted commercially available voice and video-capable language models (VVLMs)…
artificial intelligence · navigation · blindness and visual impairment · multimodal AI · large language models
CapTune: Adapting Non-Speech Captions With Anchored Generative Models
Jeremy Zhengqi Huang, Caluã De Lacerda Pataca, Saelyne Yang Wu, Dhruv Jain · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
CapTune is a system that enables customization of non-speech captions—descriptions of environmental sounds, music, and other audio cues—for Deaf and Hard of Hearing (DHH) viewers. Current captioning practices follow a one-size-fits-all model based on standardized guidelines like…
closed captioning · non-speech information · caption customization · deaf and hard of hearing · generative AI
Surfacing Variations to Calibrate Perceived Reliability of MLLM-generated Image Descriptions
Meng Chen, Akhil Iyer, Amy Pavel · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
This paper addresses a critical safety problem in AI-powered visual access technology: multimodal large language models (MLLMs) like GPT-4o, Gemini, and Claude produce fluent, confident image descriptions that can contain fabricated content, misinterpretations, and omissions…
blindness · low vision · image descriptions · multimodal AI · large language models
DescribePro: Collaborative Audio Description with Human-AI Interaction
Maryam S Cheema, Sina Elahimanesh, Samuel Martin, Pooyan Fazli, Hasti Seifi · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
This paper presents DescribePro, a web-based platform that combines human expertise with AI capabilities to create and refine audio descriptions (AD) for video content. The system addresses the fundamental tension in AD production: human-crafted descriptions are high quality but…
audio description · video accessibility · human-AI collaboration · authoring tools · blind and low vision
CARTGPT: Real-Time Correction of CART Captions Using Large Language Models
Liang-Yuan Wu, Andrea Kleiver, Dhruv Jain · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
This paper introduces CARTGPT, a real-time system that enhances Communication Access Realtime Translation (CART) captions by combining human-generated CART transcripts with automatic speech recognition (ASR) output and using GPT-4 to detect and correct transcription errors. CART…
deaf and hard of hearing · real-time captioning · CART · large language models · automatic speech recognition
Understanding Human-AI Misalignment in LLM-Based Job-Seeking Support for Neurodivergent Users
Kaely Hall, Marcus Ma, Xinyue Zhang, Vedant Das Swain, Jennifer G Kim · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
This paper examines how misalignments manifest between neurodivergent job-seekers and a GPT-4-powered career support chatbot deployed by Mentra, a neuroinclusive employment platform with over 46,000 neurodivergent users. The researchers analysed 348 real-world chat logs from 271…
neurodivergence · large language models · employment · AI alignment · autism
Examining Age-Bias and Stereotypes of Aging in LLMs
Sherwin Dewan, Ismail Shaikh, Connie Shaw, Abhilash Sahoo, Akshita Jha, Alisha Pradhan · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
This paper investigates how large language models encode and reproduce age-related stereotypes about older adults. Using prompts from the Bias Benchmarking Questionnaire (BBQ), a well-established fairness dataset, the researchers administered 1,648 age-bias prompts to ChatGPT…
ageism · AI bias · large language models · older adults · stereotypes
Making Lecture Videos Accessible for Students who are Blind or have Low Vision through AI-Assisted Navigation and Visual Question Answering
Katharina Anderer, Karin Müller, Lukas Strobel, Matthias Wölfel, Jan Niehues, Kathrin Gerling · 2025 · Proceedings of the 27th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2025)
This paper presents the design and evaluation of LectureAssistant, an AI-powered prototype that makes lecture videos more accessible for students who are blind or have low vision. The research follows a three-part human-centred design process. First, need-finding interviews with…
blind and low vision · lecture accessibility · higher education · large language models · vision-language models
When LLM-Generated Code Perpetuates User Interface Accessibility Barriers, How Can We Break the Cycle?
Alexandra-Elena Gurita, Radu-Daniel Vatavu · 2025 · Proceedings of the 22nd International Web for All Conference (W4A 2025)
This paper evaluates the ability of large language models (LLMs) to generate accessible web user interfaces, comparing ChatGPT (GPT-4-turbo) and Claude (3.5 Haiku) across two prompting strategies: accessibility-agnostic prompts ("Design the homepage of a banking app") and…
large language models · WCAG compliance · automated accessibility · prompt engineering · code generation
LLMs for Accessibility in Mobile Apps: Detection and Repair
Wajdi Aljedaani, Ahmed Aljohani, Marcelo M. Eler, Abdulrahman Habib, Hyunsook Do · 2025 · Proceedings of the 22nd International Web for All Conference (W4A 2025)
This study evaluates the capacity of three large language models—GPT-4o, Gemini 1.0 Pro, and Llama 3—to detect, classify, and remediate accessibility violations in Android mobile applications. While prior LLM accessibility research has focused primarily on web applications, this…
mobile accessibility · large language models · Android accessibility · automated accessibility testing · accessibility remediation
QuickQue: Enabling Quick Access to Information in User Reviews for Screen Reader Users
Mohan Sunkara, Akshay Kolgar Nayak, Sandeep Kalari, Yash Prakash, Sampath Jayarathna, Hae-Na Lee, Vikas Ashok · 2025 · Proceedings of the 22nd International Web for All Conference (W4A 2025)
This paper presents QuickCue, a Google Chrome browser extension that helps blind screen reader users efficiently access online customer reviews by using LLM-powered aspect and sentiment classification to organize and summarize review content. The current experience of reading…
screen readers · blind users · online reviews · large language models · browser extension
Morae: Proactively Pausing UI Agents for User Choices
Yi-Hao Peng, Dingzeyu Li, Jeffrey P. Bigham, Amy Pavel · 2025 · Proceedings of the 38th Annual ACM Symposium on User Interface Software and Technology (UIST '25)
This paper introduces Morae, a UI agent that proactively pauses during automated task execution to involve blind and low-vision (BLV) users in critical decisions, rather than completing tasks end-to-end without user input. The work is motivated by a field study with four BLV…
UI agents · blind and low vision · large language models · human-agent interaction · user agency
StepWrite: Adaptive Planning for Speech-Driven Text Generation
Hamza El Alaoui, Atieh Taheri, Yi-Hao Peng, Jeffrey P. Bigham · 2025 · Proceedings of the 38th Annual ACM Symposium on User Interface Software and Technology (UIST '25)
This paper introduces StepWrite, an LLM-powered voice-based writing system that enables structured, hands-free and eyes-free composition of longer-form texts. While speech-to-text tools handle short dictation well, composing structured emails or detailed responses requires…
voice interface · speech-to-text · hands-free interaction · eyes-free interaction · large language models
CodeA11y: Making AI Coding Assistants Useful for Accessible Web Development
Peya Mowar, Yi-Hao Peng, Jason Wu, Aaron Steinfeld, Jeffrey P. Bigham · 2025 · Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems (CHI '25)
This paper addresses a persistent problem: despite decades of accessibility standards and tools, ~96% of web pages contain accessibility violations. The authors argue that AI coding assistants like GitHub Copilot represent an untapped opportunity because developers already use…
web accessibility · AI coding assistants · developer tools · WCAG · automated testing
Policy Maps: Tools for Guiding the Unbounded Space of LLM Behaviors
Michelle S. Lam, Fred Hohman, Dominik Moritz, Jeffrey P. Bigham, Kenneth Holstein, Mary Beth Kery · 2025 · Proceedings of the 38th Annual ACM Symposium on User Interface Software and Technology (UIST '25)
This paper introduces "policy maps," an approach to AI policy design for large language models inspired by physical mapmaking. The core insight is that comprehensive policy coverage over an unbounded space of LLM inputs and outputs is impossible — just as no map can capture…
AI safety · AI policy · large language models · AI ethics · model evaluation
NoTeeline: Supporting Real-Time, Personalized Notetaking with LLM-Enhanced Micronotes
Faria Huq, Abdus Samee, David Chuan-En Lin, Alice Xiaodi Tang, Jeffrey P. Bigham · 2025 · Proceedings of the 30th International Conference on Intelligent User Interfaces (IUI '25)
This paper introduces NoTeeline, an interactive notetaking tool that uses LLMs to expand user-written "micronotes" — brief shorthand jottings like "plastic pol. ->" or "RNNs are unrolled l to r or opp" — into full-fledged notes that maintain the user's personal writing style.…
large language models · writing assistance · personalization · notetaking · cognitive load

19 results.

Tag

Search results