Reviews

The literature-review database. Every paper Bob has reviewed (he has read many more), with a short summary, key findings, and tags. Browse, filter, search.

Search results

Making Memes Accessible
Cole Gleason, Amy Pavel, Xingyu Liu, Patrick Carrington, Lydia B. Chilton, Jeffrey P. Bigham · 2019 · Proceedings of the 21st International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2019)
This paper presents two semi-automatic methods for making internet image macro memes accessible to people with vision impairments: rich alternative text descriptions and audio macro memes. Memes are a pervasive form of online communication, yet they are almost entirely…
alternative text · blind · low vision · social media · image accessibility
Effect of Automatic Sign Recognition Performance on the Usability of Video-Based Search Interfaces for Sign Language Dictionaries
Oliver Alonzo, Abraham Glasser, Matt Huenerfauth · 2019 · Proceedings of the 21st International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2019)
This paper investigates how the performance of automatic sign recognition technology affects user satisfaction when searching for unfamiliar words in an ASL-to-English dictionary. Looking up an unknown sign in ASL is fundamentally harder than looking up an unknown written word:…
sign language · ASL · sign language recognition · dictionary · Deaf and hard of hearing
CaBot: Designing and Evaluating an Autonomous Navigation Robot for Blind People
João Guerreiro, Daisuke Sato, Saki Asakawa, Huixu Dong, Kris M. Kitani, Chieko Asakawa · 2019 · Proceedings of the 21st International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2019)
This paper presents CaBot (Carry-on roBot), an autonomous suitcase-shaped navigation robot designed to guide blind people to destinations in unfamiliar indoor environments while avoiding obstacles. Unlike smartphone navigation apps that provide turn-by-turn instructions but…
navigation robot · blind · orientation and mobility · haptic feedback · LiDAR
StateLens: A Reverse Engineering Solution for Making Existing Dynamic Touchscreens Accessible
Anhong Guo, Junhan Kong, Michael Rivera, Frank F. Xu, Jeffrey P. Bigham · 2019 · ACM Symposium on User Interface Software and Technology
This paper presents StateLens, a three-part system that makes existing dynamic touchscreen interfaces accessible to blind users without requiring any modification to the touchscreen hardware or software. Blind people routinely encounter inaccessible touchscreens on coffee…
touchscreen accessibility · blindness · computer vision · crowdsourcing · reverse engineering
Hacking Blind Navigation
João Guerreiro, Hernisa Kacorri, Jeffrey P. Bigham, Edward Cutrell, Daisuke Sato, Dragan Ahmetovic, Chieko Asakawa · 2019 · CHI Conference on Human Factors in Computing Systems Extended Abstracts
This workshop summary describes a one-day CHI 2019 workshop designed to bring together researchers from multiple disciplines working on assistive navigation technologies for blind people. The organizers — experts from Carnegie Mellon University, University of Maryland, Microsoft…
blind navigation · orientation and mobility · assistive technology · visual impairment · computer vision
BBeep: A Sonic Collision Avoidance System for Blind Travellers and Nearby Pedestrians
Seita Kayukawa, Keita Higuchi, João Guerreiro, Shigeo Morishima, Yoichi Sato, Kris Kitani, Chieko Asakawa · 2019 · Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems (CHI '19)
This paper presents BBeep, an assistive suitcase-shaped system designed to help blind travellers navigate crowded public spaces such as international airports, train stations, and shopping malls. The authors argue that existing blind-navigation research has focused largely on…
blind navigation · obstacle avoidance · collision avoidance · assistive technology · wearable technology
A TensorFlow-based Assistive Technology System for Users with Visual Impairments
Davide Mulfari · 2018 · Proceedings of the 15th International Web for All Conference (W4A 2018)
This extended abstract presents a wearable computer vision system that uses deep learning to classify objects in a blind user’s surroundings and provide audio descriptions via text-to-speech. The system addresses a limitation of smartphone-based object recognition apps: people…
computer vision · deep learning · blind · visual impairment · wearable technology
Accessify: An ML Powered Application to Provide Accessible Images on Web Sites
Shivam Singh, Anurag Bhandari, Nishith Pathak · 2018 · Proceedings of the 15th International Web for All Conference (W4A 2018)
This demonstration paper presents Accessify, a browser plugin that uses machine learning to automatically generate alternative text descriptions for all images on a website, injecting them into the page’s DOM so screen readers can access them. The system addresses the persistent…
alternative text · image accessibility · machine learning · browser extension · computer vision
Multi-view Mouth Renderization for Assisting Lip-reading
Andrea Britto Mattos, Dario Augusto Borges Oliveira · 2018 · Proceedings of the 15th International Web for All Conference (W4A)
This paper presents an assistive tool that uses Generative Adversarial Networks (GANs) to enhance video for people who rely on lip-reading. The core problem is that lip-readers generally prefer a frontal view of a speaker's face, but in real-world video the speaker may be…
lip-reading · hearing impairment · Deaf and hard of hearing · deep learning · generative adversarial networks
Investigating Cursor-based Interactions to Support Non-Visual Exploration in the Real World
Anhong Guo, Saige McVea, Xu Wang, Patrick Clary, Ken Goldman, Yang Li, Yu Zhong, Jeffrey P. Bigham · 2018 · Proceedings of the 20th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2018)
This paper from Google and Carnegie Mellon University defines and compares three cursor-based interaction techniques designed to help blind and low vision people attend to specific items within complex real-world visual scenes. While computer vision systems like Seeing AI and…
blindness · low vision · computer vision · non-visual exploration · mobile accessibility
BrowseWithMe: An Online Clothes Shopping Assistant for People with Visual Impairments
Abigale J. Stangl, Esha Kothari, Suyog D. Jain, Tom Yeh, Kristen Grauman, Danna Gurari · 2018 · Proceedings of the 20th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2018)
This paper from the University of Colorado Boulder and University of Texas at Austin addresses the inaccessibility of online clothes shopping for people with visual impairments through both empirical investigation and a prototype AI-powered assistant called BrowseWithMe. The…
blindness · low vision · web accessibility · online shopping · computer vision
A Demo of Talkit++: Interacting with 3D Printed Models Using an iOS Device
Lei Shi, Zhuohao Zhang, Shiri Azenkot · 2018 · Proceedings of the 20th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2018)
This Cornell Tech demo presents Talkit++, an iOS application that adds interactive multimedia labels to 3D printed tactile models for visually impaired students. Teachers of the visually impaired (TVIs) increasingly use 3D printers to create tactile learning materials, but these…
blindness · low vision · 3D printing · tactile accessibility · computer vision
Exploring the Performance of Facial Expression Recognition Technologies on Deaf Adults and Their Children
Irene Rogan Shaffer · 2018 · Proceedings of the 20th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2018)
This Boston University student research paper investigates how commercial facial expression recognition services perform on Deaf ASL signers and Children of Deaf Adults (CODAs) compared to hearing non-signers. The study is motivated by a critical problem: in ASL and other sign…
deaf and hard of hearing · sign language · facial expression recognition · emotion recognition · AI fairness
Crowd-AI Camera Sensing in the Real World
Anhong Guo, Anuraag Jain, Shomiron Ghose, Gierad Laput, Chris Harrison, Jeffrey P. Bigham · 2018 · Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies
This paper presents Zensors++, a hybrid crowd-AI camera sensing system that allows users to point a networked camera at a scene, define a natural language question about it (such as "Is the coffee machine in use?" or "How many people are in the room?"), and receive continuous,…
crowdsourcing · computer vision · human computation · machine learning · smart environments
Mind Your Crossings: Mining GIS Imagery for Crosswalk Localization
Dragan Ahmetovic, Roberto Manduchi, James M. Coughlan, Sergio Mascetti · 2017 · ACM Transactions on Accessible Computing (TACCESS)
This paper presents a computer vision system for automatically detecting and localizing zebra crosswalks in urban environments by mining existing geospatial image databases, specifically Google satellite imagery and Google Street View. The motivation is that blind pedestrians…
visual impairment · orientation and mobility · computer vision · navigation · crowdsourcing
Making Real-World Interfaces Accessible Through Crowdsourcing, Computer Vision, and Fabrication
Anhong Guo, Jeffrey P. Bigham · 2017 · Proceedings of the 14th International Web for All Conference (W4A)
This demonstration paper presents two complementary systems designed to make inaccessible physical interfaces usable by blind people: VizLens and Facade. The research addresses a fundamental and long-standing accessibility problem — the vast majority of physical interfaces in…
physical accessibility · computer vision · crowdsourcing · 3D printing · assistive technology
Real-Time Depth-Camera Based Hand Tracking for ASL Recognition
Brandon Taylor, Anind Dey, Daniel Siewiorek, Asim Smailagic · 2017 · Proceedings of the 19th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS)
This demonstration paper validates the use of a publicly available real-time hand tracking algorithm (Sphere-Mesh) for recognizing American Sign Language (ASL) handshapes using a depth camera. Sign Language Recognition (SLR) has long been a motivating goal for high-precision…
sign language recognition · hand tracking · computer vision · depth camera · machine learning
Speed-Accuracy Tradeoffs for Detecting Sign Language Content in Video Sharing Sites
Frank M. Shipman, Satyakiran Duggina, Caio D.D. Monteiro, Ricardo Gutierrez-Osuna · 2017 · Proceedings of the 19th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS)
This paper addresses the problem of automatically detecting sign language content in videos on sharing platforms like YouTube and Vimeo. For many deaf and hard-of-hearing people, sign language is their primary communication medium, and they rely on online video content to stay…
sign language · computer vision · video classification · information retrieval · deaf and hard of hearing
Identifying Visual Cues to Improve Independent Indoor Navigation for Blind Individuals
Cameron Tyler Cassidy · 2017 · Proceedings of the 19th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS)
This student research competition paper proposes a novel approach to indoor navigation for blind individuals that focuses on providing environmental awareness and a "sense of space" rather than traditional point-to-point directions. The author argues that most existing…
indoor navigation · blindness · computer vision · eye tracking · wayfinding
"Is Someone There? Do They Have a Gun": How Visual Information about Others Can Improve Personal Safety Management for Blind Individuals
Stacy M. Branham, Ali Abdolrahmani, William Easley, Morgan Scheuerman, Erick Ronquillo, Amy Hurst · 2017 · Proceedings of the 19th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '17)
This paper investigates an underexplored dimension of accessibility for people who are blind: the need for visual information about other people to manage personal physical safety. While decades of assistive technology research have focused on helping blind individuals navigate…
blindness · personal safety · facial recognition · assistive technology · navigation
Imagining Artificial Intelligence Applications with People with Visual Disabilities using Tactile Ideation
Cecily Morrison, Edward Cutrell, Anupama Dhareshwar, Kevin Doherty, Anja Thieme, Alex Taylor · 2017 · Proceedings of the 19th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '17)
This paper presents a novel tactile ideation methodology for working with people with visual disabilities to imagine future AI applications, and reports on cross-cultural workshops conducted in the UK (6 participants) and India (8 participants). Rather than asking users about…
artificial intelligence · blindness · visual impairment · participatory design · design methodology
"Hands On" Visual Recognition for Visually Impaired Users
Joan Sosa-García, Francesca Odone · 2017 · ACM Transactions on Accessible Computing
This paper presents a collaborative visual recognition system designed to help blind or visually impaired (BVI) users identify specific product instances — distinguishing between brands, models, or types of objects that feel similar when handled. While BVI individuals can often…
visual impairment · object recognition · computer vision · assistive technology · wearable technology
Improving the Accessibility of Mobile OCR Apps Via Interactive Modalities
Michael Cutter, Roberto Manduchi · 2017 · ACM Transactions on Accessible Computing
This paper addresses a critical usability challenge in mobile OCR for blind users: while OCR technology itself works well, it requires properly framed images at adequate resolution—something difficult to achieve without visual feedback. Blind users commonly hold the camera too…
OCR · mobile accessibility · blindness · camera guidance · computer vision
People with Visual Impairment Training Personal Object Recognizers: Feasibility and Challenges
Hernisa Kacorri, Kris M. Kitani, Jeffrey P. Bigham, Chieko Asakawa · 2017 · CHI Conference on Human Factors in Computing Systems
This paper explores whether people with visual impairments can train their own personalized object recognition systems using a smartphone camera and a small number of example photos. The authors address a fundamental limitation of existing object recognition tools for blind…
object recognition · computer vision · blindness · transfer learning · personalization
Computer vision-based analysis of web page structure for assistive interfaces
Michael Cormier · 2016 · Proceedings of the 13th International Web for All Conference (W4A)
This doctoral consortium paper proposes a novel approach to understanding web page structure by analyzing rendered page images using computer vision, rather than relying on the DOM tree or source code as most existing web page segmentation methods do. The author argues that the…
computer vision · web page segmentation · screen reader accessibility · cognitive accessibility · machine learning

Reviews

Year

Tag

Search results

Making Memes Accessible

Effect of Automatic Sign Recognition Performance on the Usability of Video-Based Search Interfaces for Sign Language Dictionaries

CaBot: Designing and Evaluating an Autonomous Navigation Robot for Blind People

StateLens: A Reverse Engineering Solution for Making Existing Dynamic Touchscreens Accessible

Hacking Blind Navigation

BBeep: A Sonic Collision Avoidance System for Blind Travellers and Nearby Pedestrians

A TensorFlow-based Assistive Technology System for Users with Visual Impairments

Accessify: An ML Powered Application to Provide Accessible Images on Web Sites

Multi-view Mouth Renderization for Assisting Lip-reading

Investigating Cursor-based Interactions to Support Non-Visual Exploration in the Real World

BrowseWithMe: An Online Clothes Shopping Assistant for People with Visual Impairments

A Demo of Talkit++: Interacting with 3D Printed Models Using an iOS Device

Exploring the Performance of Facial Expression Recognition Technologies on Deaf Adults and Their Children

Crowd-AI Camera Sensing in the Real World

Mind Your Crossings: Mining GIS Imagery for Crosswalk Localization

Making Real-World Interfaces Accessible Through Crowdsourcing, Computer Vision, and Fabrication

Real-Time Depth-Camera Based Hand Tracking for ASL Recognition

Speed-Accuracy Tradeoffs for Detecting Sign Language Content in Video Sharing Sites

Identifying Visual Cues to Improve Independent Indoor Navigation for Blind Individuals

"Is Someone There? Do They Have a Gun": How Visual Information about Others Can Improve Personal Safety Management for Blind Individuals

Imagining Artificial Intelligence Applications with People with Visual Disabilities using Tactile Ideation

"Hands On" Visual Recognition for Visually Impaired Users

Improving the Accessibility of Mobile OCR Apps Via Interactive Modalities

People with Visual Impairment Training Personal Object Recognizers: Feasibility and Challenges

Computer vision-based analysis of web page structure for assistive interfaces