DocumentCode :
1749694
Title :
Investigating lightly supervised acoustic model training
Author :
Lamel, Lori ; Gauvain, Jean-Luc ; Adda, Gilles
Author_Institution :
LIMSI, CNRS, Orsay, France
Volume :
1
fYear :
2001
fDate :
2001
Firstpage :
477
Abstract :
The last decade has witnessed substantial progress in speech recognition technology, with todays state-of-the-art systems being able to transcribe broadcast audio data with a word error of about 20%. However, acoustic model development for the recognizers requires large corpora of manually transcribed training data. Obtaining such data is both time-consuming and expensive, requiring trained human annotators with substantial amounts of supervision. We describe some experiments using different levels of supervision for acoustic model training in order to reduce the system development cost. The experiments have been carried out using the DARPA TDT-2 corpus (also used in the SDR99 and SDR00 evaluations). Our experiments demonstrate that light supervision is sufficient for acoustic model development, drastically reducing the development cost
Keywords :
hidden Markov models; learning (artificial intelligence); speech recognition; DARPA TDT-2 corpus; broadcast audio data; lightly supervised acoustic model training; speech recognition technology; supervision; Broadcast technology; Costs; Hidden Markov models; Humans; Natural languages; Radio broadcasting; Speech recognition; TV broadcasting; Training data; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
ISSN :
1520-6149
Print_ISBN :
0-7803-7041-4
Type :
conf
DOI :
10.1109/ICASSP.2001.940871
Filename :
940871
Link To Document :
بازگشت