← All terms

Voice Activity Detection

Also known as: VAD, Speech Detection

A signal processing technique that automatically determines whether a segment of audio contains human speech or not. In accessibility applications, voice activity detection is used in audio description timing systems to identify non-speech segments where descriptions can be inserted without interrupting dialogue. VAD models analyze audio features to classify each segment as speech or non-speech, enabling automated workflows that were previously done manually by listening through entire audio tracks.

Category: technology · speech processing

Related: Silent Gap Detection · AD Timing · Speech Recognition

Sources