Glossary

Terms used in accessibility research and practice. Each entry has a definition, common aliases, and category tags.

Filter

Search results

Wav2Vec(also: Wav2Vec2, Wav2Vec 2.0): A family of self-supervised speech representation models from Meta AI that learn rich acoustic embeddings directly from raw waveform audio without requiring transcribed training data. Wav2Vec 2.0, introduced in 2020, became a backbone for low-resource automatic speech…
Whisper(also: OpenAI Whisper, Whisper ASR): An open-source automatic speech recognition (ASR) model released by OpenAI in 2022, trained on 680,000 hours of multilingual and multitask supervised audio data. Whisper supports transcription in dozens of languages, translation into English, language identification, and…

2 results.