← Writing · Glossary →

Reviews

The literature-review database. Every paper Bob has reviewed (he has read many more), with a short summary, key findings, and tags. Browse, filter, search.

Search results

  • Capti-Speak: A Speech-Enabled Accessible Web Interface

    Vikas Ashok · 2014 · Proceedings of the 16th International ACM SIGACCESS Conference on Computers & Accessibility (ASSETS)

    This paper presents Capti-Speak, a speech-augmented screen reader interface for the web that allows blind users to issue voice commands alongside traditional keyboard shortcuts. Built on top of the Capti web browsing application (which provides a JAWS-like screen reader…

    screen readers · speech recognition · voice interface · web accessibility · blindness

  • Improving Programming Interfaces for People with Limited Mobility Using Voice Recognition

    Xiomara Figueroa Fontánez, Patricia Ordóñez · 2014 · Proceedings of the 16th International ACM SIGACCESS Conference on Computers & Accessibility (ASSETS)

    This paper describes an effort to make programming more accessible to people with motor impairments by integrating voice recognition into an Integrated Development Environment (IDE). The work is motivated by the specific case of a computer scientist with spinal muscular atrophy…

    programming accessibility · voice interface · speech recognition · motor disability · spinal muscular atrophy

  • How Voice Augmentation Supports Elderly Web Users

    Daisuke Sato, Masatomo Kobayashi, Hironobu Takagi, Chieko Asakawa, Jiro Tanaka · 2011 · The Proceedings of the 13th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS)

    This paper from IBM Research Tokyo investigates how voice-based augmentation, originally developed for blind screen reader users, can be adapted to support older adults using web applications. The research addresses two key barriers that prevent elderly users from engaging with…

    aging · web accessibility · voice interface · cognitive accessibility · auditory interface

  • The Spoken Web Application Framework: User Generated Content and Service Creation through Low-End Mobiles

    Arun Kumar, Sheetal K. Agarwal, Priyanka Manwani · 2010 · Proceedings of the 2010 International Cross Disciplinary Conference on Web Accessibility (W4A)

    This paper from IBM Research India presents the Spoken Web Application Framework (SWAF), a platform that expands the definition of "Web" itself to include voice-based hyperlinked content accessible through ordinary telephone calls. At the time of publication, only 22% of the…

    digital divide · voice interface · developing regions · digital inclusion · mobile accessibility

  • (Voice) website creation and access using phones

    Arun Kumar, Sheetal K. Agarwal, Priyanka Manwani, Ketki Dhanesha · 2010 · Proceedings of the 2010 International Cross Disciplinary Conference on Web Accessibility (W4A)

    This challenge paper from IBM Research India presents Spoken Web, a platform that enables people to create and access "voice websites" (VoiceSites) entirely through phone calls using speech and DTMF (touch-tone) input. The system takes a fundamentally different approach to web…

    voice interface · developing countries · digital divide · low literacy · user-generated content

  • EPG: Speech Access to Program Guides for People with Disabilities

    Michael Johnston, Amanda J. Stent · 2010 · Proceedings of the 12th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2010)

    This demo paper from AT&T Labs Research presents an Electronic Program Guide (EPG) prototype that uses speech input and text-to-speech output to make television listing navigation accessible to people with visual disabilities or limited hand mobility. The authors identify that…

    speech recognition · voice interface · television accessibility · visual impairment · motor impairment

  • TeleWeb: accessible service for web browsing via phone

    Yevgen Borodin, Glenn Dausch, I. V. Ramakrishnan · 2009 · Proceedings of the 2009 International Cross-Disciplinary Conference on Web Accessibililty (W4A)

    This paper presents TeleWeb, a telephony service that enables web browsing via any standard telephone using speech and keypad input. Built on top of the HearSay non-visual web browser engine from Stony Brook University, TeleWeb allows users to call a phone number and then search…

    screen readers · voice interface · visual impairment · blindness · telephony

  • Being Old Doesn't Mean Acting Old: How Older Users Interact with Spoken Dialog Systems

    Maria Wolters, Kallirroi Georgila, Johanna D. Moore, Sarah E. MacPherson · 2009 · ACM Transactions on Accessible Computing

    This study challenges the common practice of designing voice interfaces based on assumed age-related characteristics. Using a bottom-up approach rather than top-down age comparisons, the researchers analyzed 447 appointment scheduling dialogs between 50 users (26 older, aged…

    aging · voice interface · spoken dialog systems · cognitive aging · user diversity

  • Automation of Repetitive Web Browsing Tasks with Voice-Enabled Macros

    Yevgen Borodin · 2008 · Proceedings of the 10th International ACM SIGACCESS Conference on Computers and Accessibility (Assets '08)

    This paper proposes an approach for automating repetitive web browsing tasks through personalized macros with a speech-enabled interface, implemented within the HearSay non-visual web browser at Stony Brook University. The core problem is that non-visual aural web browsing…

    screen reader · web accessibility · non-visual browsing · web macro · voice interface

  • Humming Control Interface for Hand-held Devices

    Sook Young Won, Dong-In Lee, Julius Smith · 2007 · Proceedings of the 9th International ACM SIGACCESS Conference on Computers and Accessibility (Assets '07)

    This paper from Stanford University presents a control-by-humming interface that allows hands-free operation of portable devices such as cell phones and music players through subvocal humming detected by a Bluetooth-connected insertion earphone/microphone. The system converts…

    alternative input · hands-free control · subvocal input · pitch detection · motor impairment

  • Demo of VJ-Voicebot: Control of Robotic Arm with the Vocal Joystick

    Brandi House, Jon Malkin, Jeff Bilmes · 2007 · Proceedings of the 9th International ACM SIGACCESS Conference on Computers and Accessibility (Assets '07)

    This demonstration paper from the University of Washington presents VJ-Voicebot, a system that allows individuals with motor disabilities to continuously control a 5 degrees-of-freedom robotic arm using non-verbal vocal sounds. The system builds on the Vocal Joystick (VJ)…

    assistive robotics · voice interface · motor impairment · continuous voice control · robotic arm

  • VoiceDraw: a hands-free voice-driven drawing application for people with motor impairments

    Susumu Harada, Jacob O. Wobbrock, James A. Landay · 2007 · Proceedings of the 9th International ACM SIGACCESS Conference on Computers and Accessibility (Assets '07)

    Harada, Wobbrock, and Landay's Assets '07 paper introduces VoiceDraw, a hands-free digital painting application for people with severe motor impairments that uses *non-speech* vocalisations — continuously-held vowel sounds and short consonant clicks — rather than discrete speech…

    motor accessibility · voice interface · non-speech vocalisation · speech recognition · vocal joystick

  • Comparing speaker-dependent and speaker-adaptive acoustic models for recognizing dysarthric speech

    Frank Rudzicz · 2007 · Proceedings of the 9th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '07)

    This short ASSETS 2007 poster from Frank Rudzicz at the University of Toronto compares two strategies for building automatic speech recognition (ASR) acoustic models that work for people with dysarthria — a set of motor speech disorders that produces speech with high intra- and…

    dysarthria · automatic speech recognition · acoustic model · speaker adaptation · hidden Markov model

  • The Migratory Cursor: Accurate Speech-Based Cursor Movement by Moving Multiple Ghost Cursors Using Non-Verbal Vocalizations

    Yoshiyuki Mihara, Etsuya Shibayama, Shin Takahashi · 2005 · Proceedings of the 7th International ACM SIGACCESS Conference on Computers and Accessibility (Assets '05)

    This paper presents the migratory cursor, a novel voice-controlled cursor movement interface that combines two complementary techniques to achieve both speed and accuracy. The fundamental challenge with speech-based cursor control is that existing approaches are either fast but…

    cursor control · voice interface · speech technology · motor accessibility · alternative input

  • Speech-Based Cursor Control

    Azfar S. Karimullah, Andrew Sears · 2002 · Proceedings of the Fifth International ACM Conference on Assistive Technologies (Assets '02)

    This paper from UMBC investigates the effectiveness of speech-based cursor control for navigating graphical user interfaces, comparing a standard cursor with a predictive cursor designed to compensate for speech recognition delays. While speech recognition is well-studied for…

    speech recognition · cursor control · motor disability · pointing devices · assistive technology

  • Voice over Workplace (VoWP): Voice Navigation in a Complex Business GUI

    Frankie James, Jeff Roelands · 2002 · Proceedings of the Fifth International ACM Conference on Assistive Technologies (Assets 02)

    This paper explores the design of voice navigation interfaces for complex business GUIs, specifically SAP Workplace, to support physically disabled users who cannot use a mouse or keyboard. The authors conducted two user studies examining the fundamental trade-off in voice…

    voice interface · speech recognition · physical disability · GUI accessibility · navigation

  • The Intelligent Voice-Interactive Interface

    Christopher Schmandt, Eric A. Hulteen · 1982 · Proceedings of the 1982 Conference on Human Factors in Computing Systems (CHI '82)

    Schmandt and Hulteen describe the 'Put That There' system built at MIT's Architecture Machine Group (the precursor to the Media Lab), one of the earliest working implementations of a conversational, multimodal human–computer interface. Seated in a chair ten feet from a…

    speech recognition · voice interface · multimodal interaction · gesture recognition · historical