← Writing · Glossary →

Reviews

The literature-review database. Every paper Bob has reviewed (he has read many more), with a short summary, key findings, and tags. Browse, filter, search.

Search results

  • Multi-Perspective Visual Contrastive Decoding for Reliable Assistance

    Bocheng Pan, Hailong Shi, Xingyu Gao · 2026 · ACM Transactions on Internet of Things

    This technical paper presents MPVCD (Multi-Perspective Visual Contrastive Decoding), a framework designed to address the reliability of AI-generated visual descriptions for people who are blind or have low vision (BLV). The core problem it tackles: when BLV users photograph…

    blindness and low vision · multimodal AI · image captioning · visual hallucination · assistive technology

  • "It's trained by non-disabled people": Evaluating How Image Quality Affects Product Captioning with Vision-Language Models

    Kapil Garg, Xinru Tang, Jimin Heo, Dwayne R. Morgan, Darren Gergle, Erik B. Sudderth, Anne Marie Piper · 2026 · Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems (CHI '26)

    Garg and colleagues investigate how well Vision-Language Models (VLMs) caption product images taken by blind and low-vision (BLV) people — a high-stakes everyday task that increasingly depends on tools like Be My AI, Microsoft Seeing AI, and general-purpose assistants such as…

    blind and low vision · vision-language models · image captioning · product identification · hallucinations

  • VisualAid: Enhancing Accessibility for Visually Impaired Users Through AI

    Wajdi Aljedaani, Sijo Rejigeorge, Priya Jha, Srija Yadavalli, Manikanta Kothakota, Marcelo M. Eler, Abdulrahman Habib · 2025 · Proceedings of the 22nd International Web for All Conference (W4A)

    This technical note presents VisualAid, an AI-powered Android application designed to help visually impaired users understand and navigate their physical surroundings. The app integrates multiple AI technologies into a single mobile interface: YOLO11x for real-time object…

    visual impairment · object detection · image captioning · OCR · voice interaction

  • Going Beyond One-Size-Fits-All Image Descriptions to Satisfy the Information Wants of People Who are Blind or Have Low Vision

    Abigale Stangl, Nitin Verma, Kenneth R. Fleischmann, Meredith Ringel Morris, Danna Gurari · 2021 · ASSETS '21: The 23rd International ACM SIGACCESS Conference on Computers and Accessibility

    Current image description practices typically produce a single, one-size-fits-all description for each image, yet the same image can appear across vastly different contexts — news websites, e-commerce platforms, social media feeds, travel sites, and personal photo libraries —…

    image description · alternative text · blind · low vision · context-aware

  • Accessify: An ML Powered Application to Provide Accessible Images on Web Sites

    Shivam Singh, Anurag Bhandari, Nishith Pathak · 2018 · Proceedings of the 15th International Web for All Conference (W4A 2018)

    This demonstration paper presents Accessify, a browser plugin that uses machine learning to automatically generate alternative text descriptions for all images on a website, injecting them into the page’s DOM so screen readers can access them. The system addresses the persistent…

    alternative text · image accessibility · machine learning · browser extension · computer vision

5 results.