SUITEKeys: A Speech Understanding Interface for the Motor-Control Challenged

Bill Manaris, Alan Harkreader · 1998 · Proceedings of the Third International ACM Conference on Assistive Technologies (Assets '98) · doi:10.1145/274497.274517

Summary

This paper from the University of Southwestern Louisiana presents SUITEKeys, a continuous speech understanding interface that provides complete computer access to users with motor-control impairments by modelling all interaction at the physical keyboard and mouse level. Unlike application-specific speech interfaces that only work within particular programs, SUITEKeys operates at the operating system level, translating spoken commands into virtual keyboard and mouse events that work with any application. The system architecture combines symbolic, statistical, and connectionist components: a Feature Extractor creates parametric representations of speech, a Phoneme Probability Estimator uses a neural network to classify sounds, a Lexical Analyzer matches against a vocabulary, and a Code Generator produces the corresponding hardware events. The interface is formally modelled across five levels — conceptual, semantic, syntactic, lexical, and acoustic — providing a rigorous framework for the speech-to-action translation. Users speak natural language commands like "press alpha" for keystrokes, "move mouse left" for cursor movement, and can use either alphabetic or military phonetic pronunciations to disambiguate similar-sounding letters. The system was developed using a user-centred approach with iterative feedback from motor-control challenged users throughout the design process, built on the SUITE framework and CSLU Toolkit for speech processing and NALIGE for natural language understanding.

Key findings

A pilot study with three university students who had upper-body motor-control impairments evaluated SUITEKeys against each subject's preferred input method. Subjects performed a document creation task — typing a short paragraph into a text editor and saving it — using both their preferred method and the SUITEKeys prototype. With minimal training (less than ten minutes), subjects achieved a mean completion rate of 99.3%, a mean typing rate of 1.44 characters per second, and a mean error rate of 3.3%. One subject who could not directly manipulate the keyboard or mouse verbally directed an assistant, another used a mouthstick with StickyKeys, and the third used a trackball. While the results were not statistically significant due to the small sample size, they suggested that speech is a viable and effective alternative input modality for motor-control challenged users. Notably, one subject's performance was lower because she initially tried to dictate text using discrete speech rather than the continuous command-based approach SUITEKeys uses, highlighting the importance of matching the interaction model to user expectations. The subjects had considerable training on their preferred modalities, suggesting that with equivalent training on SUITEKeys, speech input performance could improve further.

Relevance

This 1998 paper addresses a fundamental accessibility challenge that remains relevant today: providing full computer access to people who cannot use standard keyboards and mice. The key architectural decision — modelling speech input at the physical device level rather than the application level — meant SUITEKeys could work with any software without special integration, a principle that modern voice control systems like Dragon NaturallySpeaking, Windows Voice Access, and macOS Voice Control have adopted. The formal multi-level model of the interface (conceptual through acoustic) provides a useful framework for thinking about the design of any speech-based assistive system. For practitioners, the study highlights important design considerations: the need for disambiguation strategies for similar-sounding inputs, the value of supporting both novice-friendly and expert-efficient command vocabularies, and the critical role of user-centred design in assistive technology development. The finding that users could achieve near-complete task accuracy with minimal training supports the ongoing investment in voice-based computer access as a primary assistive technology strategy for motor impairments.

Tags: speech recognition · motor disabilities · voice input · alternative input · natural language interface · keyboard emulation · mouse emulation · assistive technology