DocumentCode
672395
Title
Lightly supervised automatic subtitling of weather forecasts
Author
Driesen, Johan ; Renals, Steve
Author_Institution
Center for Speech Technol. Res., Univ. of Edinburgh, Edinburgh, UK
fYear
2013
fDate
8-12 Dec. 2013
Firstpage
452
Lastpage
457
Abstract
Since subtitling television content is a costly process, there are large potential advantages to automating it, using automatic speech recognition (ASR). However, training the necessary acoustic models can be a challenge, since the available training data usually lacks verbatim orthographic transcriptions. If there are approximate transcriptions, this problem can be overcome using light supervision methods. In this paper, we perform speech recognition on broadcasts of Weatherview, BBC´s daily weather report, as a first step towards automatic subtitling. For training, we use a large set of past broadcasts, using their manually created subtitles as approximate transcriptions. We discuss and and compare two different light supervision methods, applying them to this data. The best training set finally obtained with these methods is used to create a hybrid deep neural network-based recognition system, which yields high recognition accuracies on three separate Weatherview evaluation sets.
Keywords
learning (artificial intelligence); neural nets; speech recognition; weather forecasting; ASR; Weatherview; approximate transcriptions; automatic speech recognition; daily weather report; hybrid deep neural network-based recognition system; light supervision methods; supervised automatic subtitling; weather forecasts; Acoustics; Speech; Speech recognition; Training; Training data; Weather forecasting; Acoustic Model Training; Light supervision; Segmentation; Subtitling; Transcription;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on
Conference_Location
Olomouc
Type
conf
DOI
10.1109/ASRU.2013.6707772
Filename
6707772
Link To Document