Describing online videos with text-to-speech narration
Masatomo Kobayashi, Tohru Nagano, Kentarou Fukuda, Hironobu Takagi · 2010 · Proceedings of the 2010 International Cross Disciplinary Conference on Web Accessibility (W4A)
This paper from IBM Research Tokyo presents a technology platform that uses text-to-speech (TTS) synthesis to add audio descriptions (AD) to online videos at minimal cost. The system addresses the two main barriers that prevent most online video creators from providing audio…
audio description · text-to-speech · video accessibility · speech synthesis · external metadata