Glossary

Terms used in accessibility research and practice. Each entry has a definition, common aliases, and category tags.

Search results

Text-to-Audio (also: Text-to-Audio Generation, TTA): A class of generative AI models that synthesize non-speech sound (environmental sounds, sound effects, music stems) from a text prompt, for example producing the sound of 'leaves rustling in wind' or 'church bells ringing'. Distinct from text-to-speech, which produces spoken…
Text-to-Image Model (also: T2I Model, T2I, Text-to-Image Generator): A generative AI system that produces images from natural-language prompts. Prominent examples include DALL-E, Stable Diffusion, and Midjourney. In accessibility contexts, text-to-image models have been shown to replicate and amplify disability stereotypes, for example,…
Text-to-Sound (also: Text-to-Audio, TTA, Sound Generation from Text): A class of generative AI models that synthesize non-speech audio (sound effects, ambient environments, foley, or short music clips) from a natural-language description such as 'a door creaking shut' or 'cloth ruffling as a coat is removed'. Distinct from text-to-speech, which…
Text-to-Video (also: T2V, Text-to-Video Generation): A class of generative AI models that produce short video clips from natural-language prompts (and sometimes reference images). Examples at the time of writing include Runway Gen, OpenAI Sora, Google Veo, and Pika. For accessibility, text-to-video raises both opportunities…
