DocumentCode
3244348
Title
Transcribing Mandarin broadcast news
Author
Chen, Langzhou ; Lame, Lori ; Gauvain, Jean-Luc
Author_Institution
Spoken Language Process. Group, LIMSI-CNRS, Orsay, France
fYear
2003
fDate
30 Nov.-3 Dec. 2003
Firstpage
99
Lastpage
104
Abstract
The paper describes improvements to the LIMSI broadcast news transcription system for the Mandarin language in preparation for the DARPA/NIST Rich Transcription 2003 (RT´03) evaluation. The transcription system has been substantially updated to deal with the varied acoustic and linguistic characteristics of the RT´03 test conditions. The major improvements come from the use of lightly supervised acoustic model training in order to benefit from unannotated audio data, the use of source specific language models, and MDI adaptation to tune the language models for sources with limited amounts of training data. The character error rate on the development data has been reduced from 34.5% with the baseline system to 22.6% with the evaluation system.
Keywords
acoustics; error statistics; learning (artificial intelligence); linguistics; natural languages; speech recognition; DARPA/NIST Rich Transcription 2003; MDI adaptation; Mandarin language; acoustic characteristics; acoustic model training; broadcast news transcription system; character error rate; linguistic characteristics; source specific language models; unannotated audio data; Acoustic testing; Adaptation model; Broadcasting; Error analysis; NIST; Natural languages; Speech analysis; Speech recognition; System testing; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
Print_ISBN
0-7803-7980-2
Type
conf
DOI
10.1109/ASRU.2003.1318411
Filename
1318411
Link To Document