← Writing · Glossary →

Reviews

The literature-review database. Every paper Bob has reviewed (he has read many more), with a short summary, key findings, and tags. Browse, filter, search.

Search results

  • Multi-Perspective Visual Contrastive Decoding for Reliable Assistance

    Bocheng Pan, Hailong Shi, Xingyu Gao · 2026 · ACM Transactions on Internet of Things

    This technical paper presents MPVCD (Multi-Perspective Visual Contrastive Decoding), a framework designed to address the reliability of AI-generated visual descriptions for people who are blind or have low vision (BLV). The core problem it tackles: when BLV users photograph…

    blindness and low vision · multimodal AI · image captioning · visual hallucination · assistive technology

  • Expanding Perspectives to Improve Access to Visual Archives through Multimodal Image Enrichment

    Karina Rodriguez Echavarria, Myrsini Samaroudi · 2026 · ACM Journal on Computing and Cultural Heritage

    This paper addresses a pervasive challenge in the Galleries, Libraries, Archives and Museums (GLAM) sector: large-scale visual collections that have been digitised but remain undiscoverable because they lack descriptive metadata. The authors, from the University of Brighton,…

    cultural heritage · metadata enrichment · AI image classification · FAIR principles · information discovery

  • SceneScout: Towards AI-Driven Access to Street Level Imagery for Blind Users

    Gaurav Jain, Leah Findlater, Cole Gleason · 2026 · Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems (CHI '26)

    Jain, Findlater and Gleason present SceneScout, a prototype web interface that uses a multimodal large language model (GPT-4o) to make street level imagery — the panoramic pedestrian-height photography behind Apple Maps Look Around and Google Street View — directly usable by…

    accessibility · navigation · screen readers · AI · multimodal AI

  • From Struggle to Success: Context-Aware Guidance for Screen Reader Users in Computer Use

    Nan Chen, Jing Lu, Zilong Wang, Luna K. Qiu, Siming Chen, Yuqing Yang · 2026 · Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems (CHI '26)

    Chen, Lu, Wang, Qiu, Chen and Yang present AskEase, an NVDA add-on that delivers on-demand, step-by-step, screen-reader-friendly guidance for blind and low-vision computer users tackling unfamiliar desktop software. The work responds to a persistent problem: mainstream tutorials…

    accessibility · screen readers · AI · LLM · assistive technology

  • Mnemonic Tracing: Using Eye Gaze to Search for Visual Memories

    Wazeer Zulfikar, Yasith Samaradivakara, Paul Pu Liang, Pattie Maes · 2026 · Extended Abstracts of the 2026 CHI Conference on Human Factors in Computing Systems (CHI EA ’26)

    Mnemonic Tracing is a non-verbal image-retrieval interaction in which a user, wearing eye-tracking glasses, deliberately retraces the contents of a remembered image with their gaze on a blank surface. The paper builds on gaze-reinstatement research, which shows that when people…

    eye tracking · gaze interaction · gaze reinstatement · episodic memory · image retrieval

5 results.