← All terms

Scene Description

Also known as: SD, Visual Description

A textual description of the visual elements in a video scene — including objects, people, settings, actions, and visual cues — that can be converted into audio through text-to-speech technology. Scene descriptions serve as the basis for audio descriptions, making video content accessible to people with visual impairments. Quality scene descriptions should be descriptive (conveying visual properties), objective (avoiding speculation), succinct (fitting within natural pauses in dialogue), sufficient (covering important visual information), referable (avoiding vague pronouns like "this" or "that"), and clear (introducing characters and objects before describing their actions). Professional scene description is costly and time-consuming, prompting research into collaborative and crowdsourced authoring approaches.

Category: accessibility · media accessibility

Related: Audio Description · Video Accessibility · Text-to-Speech · Alt Text

Sources