Voice Activity Detection
Also known as: VAD, Speech Detection
A signal processing technique that automatically determines whether a segment of audio contains human speech or not. In accessibility applications, voice activity detection is used in audio description timing systems to identify non-speech segments where descriptions can be inserted without interrupting dialogue. VAD models analyze audio features to classify each segment as speech or non-speech, enabling automated workflows that were previously done manually by listening through entire audio tracks.
Category: technology · speech processing
Related: Silent Gap Detection · AD Timing · Speech Recognition