← Writing · Glossary →

Reviews

The literature-review database. Every paper Bob has reviewed (he has read many more), with a short summary, key findings, and tags. Browse, filter, search.

Search results

  • NeuroBridge: Using Generative AI to Bridge Cross-neurotype Communication Differences through Neurotypical Perspective-taking

    Rukhshan Haroon, Kyle Wigdor, Katie Yang, Nicole Toumanios, Eileen T Crehan, Fahad Dogar · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility

    This paper presents NeuroBridge, an LLM-powered interactive platform designed to help neurotypical individuals better understand autistic communication styles and reflect on their own role in cross-neurotype communication breakdowns. The system is grounded in the double empathy…

    autism · neurodiversity · large language models · cross-neurotype communication · perspective-taking

  • Benchmarking PDF Accessibility Evaluation: A Dataset and Framework for Assessing Automated and LLM-Based Approaches for Accessibility Testing

    Anukriti Kumar, Tanushree Padath, Lucy Lu Wang · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility

    This paper addresses a critical gap in PDF accessibility evaluation by introducing the first expert-validated benchmark dataset and standardized evaluation framework for assessing how well different tools and approaches can evaluate PDF accessibility. Despite PDFs being the…

    PDF accessibility · automated testing · large language models · WCAG · PDF/UA

  • AccessGuru: Leveraging LLMs to Detect and Correct Web Accessibility Violations in HTML Code

    Nadeen Fathallah, Daniel Hernández, Steffen Staab · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility

    This paper introduces AccessGuru, a novel method that combines traditional automated accessibility testing tools with large language models (LLMs) to both detect and correct web accessibility violations in HTML code. The work addresses a persistent gap in accessibility tooling:…

    automated testing · web accessibility · large language models · HTML remediation · prompt engineering

  • Check Now, Can You See It?: Exploring Voice and Video-Capable Language Models for Identifying and Spatially Locating Items of Interest for Blind and Low-Vision Travelers

    Aziz N Zeidieh, JooYoung Seo · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility

    This experience report documents the lived experiences of two blind travelers — Aziz (28, blind in left eye, 20/2200 in right) and JooYoung (35, blind in right eye, limited vision in left) — as they adapted commercially available voice and video-capable language models (VVLMs)…

    artificial intelligence · navigation · blindness and visual impairment · multimodal AI · large language models

  • CapTune: Adapting Non-Speech Captions With Anchored Generative Models

    Jeremy Zhengqi Huang, Caluã De Lacerda Pataca, Saelyne Yang Wu, Dhruv Jain · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility

    CapTune is a system that enables customization of non-speech captions—descriptions of environmental sounds, music, and other audio cues—for Deaf and Hard of Hearing (DHH) viewers. Current captioning practices follow a one-size-fits-all model based on standardized guidelines like…

    closed captioning · non-speech information · caption customization · deaf and hard of hearing · generative AI

  • Surfacing Variations to Calibrate Perceived Reliability of MLLM-generated Image Descriptions

    Meng Chen, Akhil Iyer, Amy Pavel · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility

    This paper addresses a critical safety problem in AI-powered visual access technology: multimodal large language models (MLLMs) like GPT-4o, Gemini, and Claude produce fluent, confident image descriptions that can contain fabricated content, misinterpretations, and omissions…

    blindness · low vision · image descriptions · multimodal AI · large language models

  • DescribePro: Collaborative Audio Description with Human-AI Interaction

    Maryam S Cheema, Sina Elahimanesh, Samuel Martin, Pooyan Fazli, Hasti Seifi · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility

    This paper presents DescribePro, a web-based platform that combines human expertise with AI capabilities to create and refine audio descriptions (AD) for video content. The system addresses the fundamental tension in AD production: human-crafted descriptions are high quality but…

    audio description · video accessibility · human-AI collaboration · authoring tools · blind and low vision

  • CARTGPT: Real-Time Correction of CART Captions Using Large Language Models

    Liang-Yuan Wu, Andrea Kleiver, Dhruv Jain · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility

    This paper introduces CARTGPT, a real-time system that enhances Communication Access Realtime Translation (CART) captions by combining human-generated CART transcripts with automatic speech recognition (ASR) output and using GPT-4 to detect and correct transcription errors. CART…

    deaf and hard of hearing · real-time captioning · CART · large language models · automatic speech recognition

  • Understanding Human-AI Misalignment in LLM-Based Job-Seeking Support for Neurodivergent Users

    Kaely Hall, Marcus Ma, Xinyue Zhang, Vedant Das Swain, Jennifer G Kim · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility

    This paper examines how misalignments manifest between neurodivergent job-seekers and a GPT-4-powered career support chatbot deployed by Mentra, a neuroinclusive employment platform with over 46,000 neurodivergent users. The researchers analysed 348 real-world chat logs from 271…

    neurodivergence · large language models · employment · AI alignment · autism

  • Examining Age-Bias and Stereotypes of Aging in LLMs

    Sherwin Dewan, Ismail Shaikh, Connie Shaw, Abhilash Sahoo, Akshita Jha, Alisha Pradhan · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility

    This paper investigates how large language models encode and reproduce age-related stereotypes about older adults. Using prompts from the Bias Benchmarking Questionnaire (BBQ), a well-established fairness dataset, the researchers administered 1,648 age-bias prompts to ChatGPT…

    ageism · AI bias · large language models · older adults · stereotypes

  • Making Lecture Videos Accessible for Students who are Blind or have Low Vision through AI-Assisted Navigation and Visual Question Answering

    Katharina Anderer, Karin Müller, Lukas Strobel, Matthias Wölfel, Jan Niehues, Kathrin Gerling · 2025 · Proceedings of the 27th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2025)

    This paper presents the design and evaluation of LectureAssistant, an AI-powered prototype that makes lecture videos more accessible for students who are blind or have low vision. The research follows a three-part human-centred design process. First, need-finding interviews with…

    blind and low vision · lecture accessibility · higher education · large language models · vision-language models

  • When LLM-Generated Code Perpetuates User Interface Accessibility Barriers, How Can We Break the Cycle?

    Alexandra-Elena Gurita, Radu-Daniel Vatavu · 2025 · Proceedings of the 22nd International Web for All Conference (W4A 2025)

    This paper evaluates the ability of large language models (LLMs) to generate accessible web user interfaces, comparing ChatGPT (GPT-4-turbo) and Claude (3.5 Haiku) across two prompting strategies: accessibility-agnostic prompts ("Design the homepage of a banking app") and…

    large language models · WCAG compliance · automated accessibility · prompt engineering · code generation

  • LLMs for Accessibility in Mobile Apps: Detection and Repair

    Wajdi Aljedaani, Ahmed Aljohani, Marcelo M. Eler, Abdulrahman Habib, Hyunsook Do · 2025 · Proceedings of the 22nd International Web for All Conference (W4A 2025)

    This study evaluates the capacity of three large language models—GPT-4o, Gemini 1.0 Pro, and Llama 3—to detect, classify, and remediate accessibility violations in Android mobile applications. While prior LLM accessibility research has focused primarily on web applications, this…

    mobile accessibility · large language models · Android accessibility · automated accessibility testing · accessibility remediation

  • QuickQue: Enabling Quick Access to Information in User Reviews for Screen Reader Users

    Mohan Sunkara, Akshay Kolgar Nayak, Sandeep Kalari, Yash Prakash, Sampath Jayarathna, Hae-Na Lee, Vikas Ashok · 2025 · Proceedings of the 22nd International Web for All Conference (W4A 2025)

    This paper presents QuickCue, a Google Chrome browser extension that helps blind screen reader users efficiently access online customer reviews by using LLM-powered aspect and sentiment classification to organize and summarize review content. The current experience of reading…

    screen readers · blind users · online reviews · large language models · browser extension

  • Morae: Proactively Pausing UI Agents for User Choices

    Yi-Hao Peng, Dingzeyu Li, Jeffrey P. Bigham, Amy Pavel · 2025 · Proceedings of the 38th Annual ACM Symposium on User Interface Software and Technology (UIST '25)

    This paper introduces Morae, a UI agent that proactively pauses during automated task execution to involve blind and low-vision (BLV) users in critical decisions, rather than completing tasks end-to-end without user input. The work is motivated by a field study with four BLV…

    UI agents · blind and low vision · large language models · human-agent interaction · user agency

  • StepWrite: Adaptive Planning for Speech-Driven Text Generation

    Hamza El Alaoui, Atieh Taheri, Yi-Hao Peng, Jeffrey P. Bigham · 2025 · Proceedings of the 38th Annual ACM Symposium on User Interface Software and Technology (UIST '25)

    This paper introduces StepWrite, an LLM-powered voice-based writing system that enables structured, hands-free and eyes-free composition of longer-form texts. While speech-to-text tools handle short dictation well, composing structured emails or detailed responses requires…

    voice interface · speech-to-text · hands-free interaction · eyes-free interaction · large language models

  • CodeA11y: Making AI Coding Assistants Useful for Accessible Web Development

    Peya Mowar, Yi-Hao Peng, Jason Wu, Aaron Steinfeld, Jeffrey P. Bigham · 2025 · Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems (CHI '25)

    This paper addresses a persistent problem: despite decades of accessibility standards and tools, ~96% of web pages contain accessibility violations. The authors argue that AI coding assistants like GitHub Copilot represent an untapped opportunity because developers already use…

    web accessibility · AI coding assistants · developer tools · WCAG · automated testing

  • Policy Maps: Tools for Guiding the Unbounded Space of LLM Behaviors

    Michelle S. Lam, Fred Hohman, Dominik Moritz, Jeffrey P. Bigham, Kenneth Holstein, Mary Beth Kery · 2025 · Proceedings of the 38th Annual ACM Symposium on User Interface Software and Technology (UIST '25)

    This paper introduces "policy maps," an approach to AI policy design for large language models inspired by physical mapmaking. The core insight is that comprehensive policy coverage over an unbounded space of LLM inputs and outputs is impossible — just as no map can capture…

    AI safety · AI policy · large language models · AI ethics · model evaluation

  • NoTeeline: Supporting Real-Time, Personalized Notetaking with LLM-Enhanced Micronotes

    Faria Huq, Abdus Samee, David Chuan-En Lin, Alice Xiaodi Tang, Jeffrey P. Bigham · 2025 · Proceedings of the 30th International Conference on Intelligent User Interfaces (IUI '25)

    This paper introduces NoTeeline, an interactive notetaking tool that uses LLMs to expand user-written "micronotes" — brief shorthand jottings like "plastic pol. ->" or "RNNs are unrolled l to r or opp" — into full-fledged notes that maintain the user's personal writing style.…

    large language models · writing assistance · personalization · notetaking · cognitive load

19 results.