Reviews

The literature-review database. Every paper Bob has reviewed (he has read many more), with a short summary, key findings, and tags. Browse, filter, search.

Search results

Capti-Speak: A Speech-Enabled Accessible Web Interface
Vikas Ashok · 2014 · Proceedings of the 16th International ACM SIGACCESS Conference on Computers & Accessibility (ASSETS)
This paper presents Capti-Speak, a speech-augmented screen reader interface for the web that allows blind users to issue voice commands alongside traditional keyboard shortcuts. Built on top of the Capti web browsing application (which provides a JAWS-like screen reader…
screen readers · speech recognition · voice interface · web accessibility · blindness
Improving Programming Interfaces for People with Limited Mobility Using Voice Recognition
Xiomara Figueroa Fontánez, Patricia Ordóñez · 2014 · Proceedings of the 16th International ACM SIGACCESS Conference on Computers & Accessibility (ASSETS)
This paper describes an effort to make programming more accessible to people with motor impairments by integrating voice recognition into an Integrated Development Environment (IDE). The work is motivated by the specific case of a computer scientist with spinal muscular atrophy…
programming accessibility · voice interface · speech recognition · motor disability · spinal muscular atrophy
How Voice Augmentation Supports Elderly Web Users
Daisuke Sato, Masatomo Kobayashi, Hironobu Takagi, Chieko Asakawa, Jiro Tanaka · 2011 · The Proceedings of the 13th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS)
This paper from IBM Research Tokyo investigates how voice-based augmentation, originally developed for blind screen reader users, can be adapted to support older adults using web applications. The research addresses two key barriers that prevent elderly users from engaging with…
aging · web accessibility · voice interface · cognitive accessibility · auditory interface
The Spoken Web Application Framework: User Generated Content and Service Creation through Low-End Mobiles
Arun Kumar, Sheetal K. Agarwal, Priyanka Manwani · 2010 · Proceedings of the 2010 International Cross Disciplinary Conference on Web Accessibility (W4A)
This paper from IBM Research India presents the Spoken Web Application Framework (SWAF), a platform that expands the definition of "Web" itself to include voice-based hyperlinked content accessible through ordinary telephone calls. At the time of publication, only 22% of the…
digital divide · voice interface · developing regions · digital inclusion · mobile accessibility
(Voice) website creation and access using phones
Arun Kumar, Sheetal K. Agarwal, Priyanka Manwani, Ketki Dhanesha · 2010 · Proceedings of the 2010 International Cross Disciplinary Conference on Web Accessibility (W4A)
This challenge paper from IBM Research India presents Spoken Web, a platform that enables people to create and access "voice websites" (VoiceSites) entirely through phone calls using speech and DTMF (touch-tone) input. The system takes a fundamentally different approach to web…
voice interface · developing countries · digital divide · low literacy · user-generated content
EPG: Speech Access to Program Guides for People with Disabilities
Michael Johnston, Amanda J. Stent · 2010 · Proceedings of the 12th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2010)
This demo paper from AT&T Labs Research presents an Electronic Program Guide (EPG) prototype that uses speech input and text-to-speech output to make television listing navigation accessible to people with visual disabilities or limited hand mobility. The authors identify that…
speech recognition · voice interface · television accessibility · visual impairment · motor impairment
TeleWeb: accessible service for web browsing via phone
Yevgen Borodin, Glenn Dausch, I. V. Ramakrishnan · 2009 · Proceedings of the 2009 International Cross-Disciplinary Conference on Web Accessibility (W4A)
This paper presents TeleWeb, a telephony service that enables web browsing via any standard telephone using speech and keypad input. Built on top of the HearSay non-visual web browser engine from Stony Brook University, TeleWeb allows users to call a phone number and then search…
screen readers · voice interface · visual impairment · blindness · telephony
Being Old Doesn't Mean Acting Old: How Older Users Interact with Spoken Dialog Systems
Maria Wolters, Kallirroi Georgila, Johanna D. Moore, Sarah E. MacPherson · 2009 · ACM Transactions on Accessible Computing
This study challenges the common practice of designing voice interfaces based on assumed age-related characteristics. Using a bottom-up approach rather than top-down age comparisons, the researchers analyzed 447 appointment scheduling dialogs between 50 users (26 older, aged…
aging · voice interface · spoken dialog systems · cognitive aging · user diversity
Automation of Repetitive Web Browsing Tasks with Voice-Enabled Macros
Yevgen Borodin · 2008 · Proceedings of the 10th International ACM SIGACCESS Conference on Computers and Accessibility (Assets '08)
This paper proposes an approach for automating repetitive web browsing tasks through personalized macros with a speech-enabled interface, implemented within the HearSay non-visual web browser at Stony Brook University. The core problem is that non-visual aural web browsing…
screen reader · web accessibility · non-visual browsing · web macro · voice interface
Humming Control Interface for Hand-held Devices
Sook Young Won, Dong-In Lee, Julius Smith · 2007 · Proceedings of the 9th International ACM SIGACCESS Conference on Computers and Accessibility (Assets '07)
This paper from Stanford University presents a control-by-humming interface that allows hands-free operation of portable devices such as cell phones and music players through subvocal humming detected by a Bluetooth-connected insertion earphone/microphone. The system converts…
alternative input · hands-free control · subvocal input · pitch detection · motor impairment
Demo of VJ-Voicebot: Control of Robotic Arm with the Vocal Joystick
Brandi House, Jon Malkin, Jeff Bilmes · 2007 · Proceedings of the 9th International ACM SIGACCESS Conference on Computers and Accessibility (Assets '07)
This demonstration paper from the University of Washington presents VJ-Voicebot, a system that allows individuals with motor disabilities to continuously control a 5 degrees-of-freedom robotic arm using non-verbal vocal sounds. The system builds on the Vocal Joystick (VJ)…
assistive robotics · voice interface · motor impairment · continuous voice control · robotic arm
VoiceDraw: a hands-free voice-driven drawing application for people with motor impairments
Susumu Harada, Jacob O. Wobbrock, James A. Landay · 2007 · Proceedings of the 9th International ACM SIGACCESS Conference on Computers and Accessibility (Assets '07)
Harada, Wobbrock, and Landay's Assets '07 paper introduces VoiceDraw, a hands-free digital painting application for people with severe motor impairments that uses *non-speech* vocalisations — continuously-held vowel sounds and short consonant clicks — rather than discrete speech…
motor accessibility · voice interface · non-speech vocalisation · speech recognition · vocal joystick
Comparing speaker-dependent and speaker-adaptive acoustic models for recognizing dysarthric speech
Frank Rudzicz · 2007 · Proceedings of the 9th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '07)
This short ASSETS 2007 poster from Frank Rudzicz at the University of Toronto compares two strategies for building automatic speech recognition (ASR) acoustic models that work for people with dysarthria — a set of motor speech disorders that produces speech with high intra- and…
dysarthria · automatic speech recognition · acoustic model · speaker adaptation · hidden Markov model
The Migratory Cursor: Accurate Speech-Based Cursor Movement by Moving Multiple Ghost Cursors Using Non-Verbal Vocalizations
Yoshiyuki Mihara, Etsuya Shibayama, Shin Takahashi · 2005 · Proceedings of the 7th International ACM SIGACCESS Conference on Computers and Accessibility (Assets '05)
This paper presents the migratory cursor, a novel voice-controlled cursor movement interface that combines two complementary techniques to achieve both speed and accuracy. The fundamental challenge with speech-based cursor control is that existing approaches are either fast but…
cursor control · voice interface · speech technology · motor accessibility · alternative input
Speech-Based Cursor Control
Azfar S. Karimullah, Andrew Sears · 2002 · Proceedings of the Fifth International ACM Conference on Assistive Technologies (Assets '02)
This paper from UMBC investigates the effectiveness of speech-based cursor control for navigating graphical user interfaces, comparing a standard cursor with a predictive cursor designed to compensate for speech recognition delays. While speech recognition is well-studied for…
speech recognition · cursor control · motor disability · pointing devices · assistive technology
Voice over Workplace (VoWP): Voice Navigation in a Complex Business GUI
Frankie James, Jeff Roelands · 2002 · Proceedings of the Fifth International ACM Conference on Assistive Technologies (Assets 02)
This paper explores the design of voice navigation interfaces for complex business GUIs, specifically SAP Workplace, to support physically disabled users who cannot use a mouse or keyboard. The authors conducted two user studies examining the fundamental trade-off in voice…
voice interface · speech recognition · physical disability · GUI accessibility · navigation
The Intelligent Voice-Interactive Interface
Christopher Schmandt, Eric A. Hulteen · 1982 · Proceedings of the 1982 Conference on Human Factors in Computing Systems (CHI '82)
Schmandt and Hulteen describe the 'Put That There' system built at MIT's Architecture Machine Group (the precursor to the Media Lab), one of the earliest working implementations of a conversational, multimodal human–computer interface. Seated in a chair ten feet from a…
speech recognition · voice interface · multimodal interaction · gesture recognition · historical
