Reviews

The literature-review database. Every paper Bob has reviewed (he has read many more), with a short summary, key findings, and tags. Browse, filter, search.

Search results

Multi-Perspective Visual Contrastive Decoding for Reliable Assistance
Bocheng Pan, Hailong Shi, Xingyu Gao · 2026 · ACM Transactions on Internet of Things
This technical paper presents MPVCD (Multi-Perspective Visual Contrastive Decoding), a framework designed to address the reliability of AI-generated visual descriptions for people who are blind or have low vision (BLV). The core problem it tackles: when BLV users photograph…
blindness and low vision · multimodal AI · image captioning · visual hallucination · assistive technology
"It's trained by non-disabled people": Evaluating How Image Quality Affects Product Captioning with Vision-Language Models
Kapil Garg, Xinru Tang, Jimin Heo, Dwayne R. Morgan, Darren Gergle, Erik B. Sudderth, Anne Marie Piper · 2026 · Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems (CHI '26)
Garg and colleagues investigate how well Vision-Language Models (VLMs) caption product images taken by blind and low-vision (BLV) people — a high-stakes everyday task that increasingly depends on tools like Be My AI, Microsoft Seeing AI, and general-purpose assistants such as…
blind and low vision · vision-language models · image captioning · product identification · hallucinations
VisualAid: Enhancing Accessibility for Visually Impaired Users Through AI
Wajdi Aljedaani, Sijo Rejigeorge, Priya Jha, Srija Yadavalli, Manikanta Kothakota, Marcelo M. Eler, Abdulrahman Habib · 2025 · Proceedings of the 22nd International Web for All Conference (W4A)
This technical note presents VisualAid, an AI-powered Android application designed to help visually impaired users understand and navigate their physical surroundings. The app integrates multiple AI technologies into a single mobile interface: YOLO11x for real-time object…
visual impairment · object detection · image captioning · OCR · voice interaction
Going Beyond One-Size-Fits-All Image Descriptions to Satisfy the Information Wants of People Who are Blind or Have Low Vision
Abigale Stangl, Nitin Verma, Kenneth R. Fleischmann, Meredith Ringel Morris, Danna Gurari · 2021 · ASSETS '21: The 23rd International ACM SIGACCESS Conference on Computers and Accessibility
Current image description practices typically produce a single, one-size-fits-all description for each image, yet the same image can appear across vastly different contexts — news websites, e-commerce platforms, social media feeds, travel sites, and personal photo libraries —…
image description · alternative text · blind · low vision · context-aware
Accessify: An ML Powered Application to Provide Accessible Images on Web Sites
Shivam Singh, Anurag Bhandari, Nishith Pathak · 2018 · Proceedings of the 15th International Web for All Conference (W4A 2018)
This demonstration paper presents Accessify, a browser plugin that uses machine learning to automatically generate alternative text descriptions for all images on a website, injecting them into the page’s DOM so screen readers can access them. The system addresses the persistent…
alternative text · image accessibility · machine learning · browser extension · computer vision

5 results.
