Glossary

Terms used in accessibility research and practice. Each entry has a definition, common aliases, and category tags.

Filter

Search results

Talking-Head Video(also: Talking Head): A common educational video format in which a presenter speaks directly to the camera, typically filling the frame, with no or few accompanying visuals. For d/Deaf and Hard-of-Hearing learners, talking-head videos are often low in useful visual content - the speaker's face must…
Text-to-Sound(also: Text-to-Audio, TTA, Sound Generation from Text): A class of generative AI models that synthesize non-speech audio - sound effects, ambient environments, foley, or short music clips - from a natural-language description such as 'a door creaking shut' or 'cloth ruffling as a coat is removed'. Distinct from text-to-speech, which…
Text-to-Video(also: T2V, Text-to-Video Generation): A class of generative AI models that produces short video clips from natural-language prompts (and sometimes reference images). Examples at the time of writing include Runway Gen, OpenAI Sora, Google Veo, and Pika. For accessibility, text-to-video raises both opportunities —…
Tracked Captions(also: Speaker-following captions, Dynamic captions): Captions that move dynamically within the video frame to stay near the current speaker's face or mouth, rather than remaining anchored at a fixed position (typically the bottom of the video). Tracked captions reduce the visual effort required for Deaf and Hard-of-Hearing viewers…

4 results.