Reviews

The literature-review database. Every paper Bob has reviewed (he has read many more), with a short summary, key findings, and tags. Browse, filter, search.

Search results

Towards More Robust Speech Interactions for Deaf and Hard of Hearing Users
Raymond Fok, Harmanpreet Kaur, Skanda Palani, Martez E. Mott, Walter S. Lasecki · 2018 · Proceedings of the 20th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2018)
This University of Michigan study addresses a largely overlooked accessibility gap: while much research has focused on providing deaf users access to spoken output (via captioning or sign language), almost no work has addressed improving deaf users' ability to provide speech…
deaf and hard of hearing · automatic speech recognition · deaf speech · crowdsourcing · speech intelligibility
Usability Evaluation of Captions for People Who Are Deaf or Hard of Hearing
Sushant Kafle, Matt Huenerfauth · 2018 · SIGACCESS Accessibility and Computing Newsletter (Issue 122)
This is a SIGACCESS Newsletter article summarizing a line of research by Kafle and Huenerfauth on building a caption-quality evaluation metric that actually reflects the experience of Deaf and Hard-of-Hearing (DHH) readers — rather than simply counting speech-recognition errors.…
automatic speech recognition · captioning · captions · caption quality · accessibility metrics
Methods for Evaluation of Imperfect Captioning Tools by Deaf or Hard-of-Hearing Users at Different Reading Literacy Levels
Larwan Berke, Sushant Kafle, Matt Huenerfauth · 2018 · Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems (CHI '18)
This CHI 2018 paper (awarded an Honourable Mention) is the originating methodological study behind the group’s later Alonzo et al. work on Automatic Text Simplification evaluation. It asks: when Deaf and Hard-of-Hearing (DHH) participants evaluate imperfect captions produced by…
captioning · deaf and hard of hearing · automatic speech recognition · research methodology · literacy
Personal Perspectives on Using Automatic Speech Recognition to Facilitate Communication between Deaf Students and Hearing Customers
James R. Mallory, Michael Stinson, Lisa Elliot, Donna Easton · 2017 · Proceedings of the 19th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS)
This experience report examines the use of Automatic Speech Recognition (ASR) via the WhatsApp smartphone app to facilitate communication between deaf and hard-of-hearing (D/HH) students and hearing business customers in real workplace settings. The study took place at the…
automatic speech recognition · deaf and hard of hearing · workplace accessibility · deaf education · speech recognition
Deaf, Hard of Hearing, and Hearing Perspectives on Using Automatic Speech Recognition in Conversation
Abraham Glasser, Kesavan Kushalnagar, Raja Kushalnagar · 2017 · Proceedings of the 19th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS)
This experience report describes the real-world accessibility challenges encountered by five participants — two deaf, one hard of hearing, and two hearing — including the authors, when using the top seven most popular ASR applications (DEAFCOM, Dragon Dictation, Siri, Virtual…
automatic speech recognition · deaf and hard of hearing · speech recognition · communication accessibility · voice interface
Evaluating the Usability of Automatically Generated Captions for People who are Deaf or Hard of Hearing
Sushant Kafle, Matt Huenerfauth · 2017 · Proceedings of the 19th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '17)
This paper addresses a fundamental problem in automatic captioning for Deaf and Hard of Hearing (DHH) users: the standard metric used to evaluate automatic speech recognition (ASR) systems — Word Error Rate (WER) — poorly predicts how usable the resulting captions actually are…
captioning · automatic speech recognition · deaf and hard of hearing · evaluation methods · natural language processing
Deaf and Hard-of-Hearing Perspectives on Imperfect Automatic Speech Recognition for Captioning One-on-One Meetings
Larwan Berke, Christopher Caulfield, Matt Huenerfauth · 2017 · Proceedings of the 19th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '17)
This paper investigates whether and how to display word-level confidence information from Automatic Speech Recognition (ASR) systems in real-time captions for Deaf and Hard-of-Hearing (DHH) users during one-on-one meetings with hearing people. ASR engines assign confidence…
deaf accessibility · automatic speech recognition · captioning · communication · user research
Leveraging Complementary Contributions of Different Workers for Efficient Crowdsourcing of Video Captions
Yun Huang, Yifeng Huang, Na Xue, Jeffrey P. Bigham · 2017 · CHI Conference on Human Factors in Computing Systems
This paper presents BandCaption, a crowdsourcing system that combines automatic speech recognition (ASR) with input from diverse crowd workers to efficiently correct video captions. The key insight is that different groups of people — hearing-impaired users, second-language…
captioning · crowdsourcing · video accessibility · speech recognition · deaf and hard of hearing
The Effects of Automatic Speech Recognition Quality on Human Transcription Latency
Yashesh Gaur, Walter S. Lasecki, Florian Metze, Jeffrey P. Bigham · 2016 · Proceedings of the 13th International Web for All Conference (W4A)
This paper from Carnegie Mellon University and the University of Michigan empirically investigates when automatic speech recognition (ASR) output helps or hinders human transcriptionists producing captions for deaf and hard of hearing people. Manual transcription remains…
speech recognition · captioning · deaf and hard of hearing · crowdsourcing · human computation
Improving Real-Time Captioning Experiences for Deaf and Hard of Hearing Students
Saba Kawas, George Karalis, Tzu Wen, Richard E. Ladner · 2016 · Proceedings of the 18th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '16)
This paper takes a holistic, qualitative approach to understanding deaf and hard of hearing (DHH) university students' experiences with real-time captioning in mainstream classrooms, examining both human-based captioning (CART — Communication Access Realtime Translation) and…
deaf and hard of hearing · real-time captioning · CART · automatic speech recognition · education
The Effects of Automatic Speech Recognition Quality on Human Transcription Latency
Yashesh Gaur · 2015 · ASSETS '15: Proceedings of the 17th International ACM SIGACCESS Conference on Computers & Accessibility
This paper investigates a practical question for accessibility: when does providing automatic speech recognition (ASR) output help human captionists work faster, and when does it slow them down? Converting speech to text is fundamental for making audio content accessible to deaf…
deaf · hard of hearing · automatic speech recognition · ASR · captioning
Evaluating Alternatives for Better Deaf Accessibility to Selected Web-Based Multimedia
Brent N. Shiver, Rosalee J. Wolfe · 2015 · ASSETS '15: Proceedings of the 17th International ACM SIGACCESS Conference on Computers & Accessibility
This research addresses the accessibility gap created by the proliferation of uncaptioned video content online—a particular problem for deaf adults who use American Sign Language as their primary language and view English as a second language. While television captioning has…
deaf accessibility · captions · automatic speech recognition · ASR · multimedia accessibility
Perspectives on Speech and Language Interaction for Daily Assistive Technology: Introduction to Part 1 of the Special Issue
Heidi Christensen, Frank Rudzicz, François Portet, Jan Alexandersson · 2015 · ACM Transactions on Accessible Computing (TACCESS)
This editorial introduces the first part of a TACCESS special issue on speech and language interaction for daily assistive technology, emerging from the 2013 SLPAT (Speech and Language Processing for Assistive Technologies) workshop. The editors frame speech and natural language…
speech recognition · disordered speech · dysarthria · speech intelligibility · assistive technology
Automatic Assessment of Speech Capability Loss in Disordered Speech
Thomas Pellegrini, Lionel Fontan, Julie Mauclair, Jérôme Farinas, Charlotte Alazard-Guiu, Marina Robert, Peggy Gatignol · 2015 · ACM Transactions on Accessible Computing
This paper investigates whether the Goodness of Pronunciation (GOP) algorithm, originally developed for computer-assisted language learning to detect non-native speaker mispronunciations, can be repurposed to assess speech capability loss in people with speech disorders. The…
disordered speech · speech assessment · automatic speech recognition · facial palsy · pronunciation assessment
Introducing Game Elements in Crowdsourced Video Captioning by Non-Experts
Hernisa Kacorri, Kaoru Shinkawa, Shin Saito · 2014 · Proceedings of the 11th Web for All Conference (W4A)
This paper from CUNY Graduate Center and IBM Research Tokyo presents a gamified crowdsourcing platform for video captioning that combines ASR output with non-expert human transcription to improve caption accuracy without monetary rewards. The system builds on the Collaborative…
captioning · crowdsourcing · deaf and hard of hearing · gamification · automatic speech recognition
Enhancing Learning Accessibility through Fully Automatic Captioning
Maria Federico, Marco Furini · 2012 · Proceedings of the International Cross-Disciplinary Conference on Web Accessibility (W4A)
This paper proposes an architecture for automatically generating synchronized captions for video lectures using off-the-shelf automatic speech recognition (ASR) software, aimed at making educational content accessible to hearing impaired students, dyslexic students, ESL (English…
captioning · speech recognition · education accessibility · deaf and hard of hearing · automatic speech recognition
Online Quality Control for Real-Time Crowd Captioning
Walter S. Lasecki, Jeffrey P. Bigham · 2012 · Proceedings of the 14th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2012)
This paper addresses quality control in Legion:Scribe, a system that provides real-time captioning by having multiple non-expert crowd workers simultaneously type what they hear, then automatically merging their partial transcriptions into a single caption stream. Real-time…
real-time captioning · crowdsourcing · deaf and hard of hearing · automatic speech recognition · human computation
Crowdsourcing Correction of Speech Recognition Captioning Errors
M. Wald · 2011 · Proceedings of the International Cross-Disciplinary Conference on Web Accessibility (W4A)
This paper describes tools built around Synote, an award-winning web-based application from the University of Southampton, that enable crowdsourced correction of automatic speech recognition (ASR) captioning errors to make video content accessible at scale. The author frames the…
captioning · speech recognition · crowdsourcing · deaf and hard of hearing · video accessibility
Web Educational Services for All: The APEINTA Project
Ana Iglesias, Lourdes Moreno, Belén Ruiz, José Luis Pajares, Javier Jiménez, Juan Francisco López, Pablo Revuelta, Julián Hernández · 2011 · Proceedings of the International Cross-Disciplinary Conference on Web Accessibility (W4A)
This paper presents APEINTA, a Spanish educational project from Carlos III University of Madrid and the Spanish Centre of Captioning and Audio Description (CESyA), aimed at providing inclusive education through three cloud-based web services. Started in 2008, the project was…
education accessibility · captioning · text-to-speech · deaf and hard of hearing · speech disability
Enhancing Accessibility through Correction of Speech Recognition Errors
John-Mark Bell · 2007 · SIGACCESS Accessibility and Computing
This paper investigates methods for automatically correcting errors in speech recognition-generated captions of university lectures, aiming to improve accessibility for hearing-impaired students. The author notes that while ASR-based captioning can make lectures accessible by…
automatic speech recognition · captioning · deaf and hard of hearing · higher education · natural language processing
Comparing speaker-dependent and speaker-adaptive acoustic models for recognizing dysarthric speech
Frank Rudzicz · 2007 · Proceedings of the 9th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '07)
This short ASSETS 2007 poster from Frank Rudzicz at the University of Toronto compares two strategies for building automatic speech recognition (ASR) acoustic models that work for people with dysarthria — a set of motor speech disorders that produces speech with high intra- and…
dysarthria · automatic speech recognition · acoustic model · speaker adaptation · hidden Markov model

Reviews

Year

Tag

Search results

Towards More Robust Speech Interactions for Deaf and Hard of Hearing Users

Usability Evaluation of Captions for People Who Are Deaf or Hard of Hearing

Methods for Evaluation of Imperfect Captioning Tools by Deaf or Hard-of-Hearing Users at Different Reading Literacy Levels

Personal Perspectives on Using Automatic Speech Recognition to Facilitate Communication between Deaf Students and Hearing Customers

Deaf, Hard of Hearing, and Hearing Perspectives on Using Automatic Speech Recognition in Conversation

Evaluating the Usability of Automatically Generated Captions for People who are Deaf or Hard of Hearing

Deaf and Hard-of-Hearing Perspectives on Imperfect Automatic Speech Recognition for Captioning One-on-One Meetings

Leveraging Complementary Contributions of Different Workers for Efficient Crowdsourcing of Video Captions

The Effects of Automatic Speech Recognition Quality on Human Transcription Latency

Improving Real-Time Captioning Experiences for Deaf and Hard of Hearing Students

The Effects of Automatic Speech Recognition Quality on Human Transcription Latency

Evaluating Alternatives for Better Deaf Accessibility to Selected Web-Based Multimedia

Perspectives on Speech and Language Interaction for Daily Assistive Technology: Introduction to Part 1 of the Special Issue

Automatic Assessment of Speech Capability Loss in Disordered Speech

Introducing Game Elements in Crowdsourced Video Captioning by Non-Experts

Enhancing Learning Accessibility through Fully Automatic Captioning

Online Quality Control for Real-Time Crowd Captioning

Crowdsourcing Correction of Speech Recognition Captioning Errors

Web Educational Services for All: The APEINTA Project

Enhancing Accessibility through Correction of Speech Recognition Errors

Comparing speaker-dependent and speaker-adaptive acoustic models for recognizing dysarthric speech