Title :
Transcribing Mandarin broadcast news
Author :
Chen, Langzhou ; Lame, Lori ; Gauvain, Jean-Luc
Author_Institution :
Spoken Language Process. Group, LIMSI-CNRS, Orsay, France
fDate :
30 Nov.-3 Dec. 2003
Abstract :
The paper describes improvements to the LIMSI broadcast news transcription system for the Mandarin language in preparation for the DARPA/NIST Rich Transcription 2003 (RT´03) evaluation. The transcription system has been substantially updated to deal with the varied acoustic and linguistic characteristics of the RT´03 test conditions. The major improvements come from the use of lightly supervised acoustic model training in order to benefit from unannotated audio data, the use of source specific language models, and MDI adaptation to tune the language models for sources with limited amounts of training data. The character error rate on the development data has been reduced from 34.5% with the baseline system to 22.6% with the evaluation system.
Keywords :
acoustics; error statistics; learning (artificial intelligence); linguistics; natural languages; speech recognition; DARPA/NIST Rich Transcription 2003; MDI adaptation; Mandarin language; acoustic characteristics; acoustic model training; broadcast news transcription system; character error rate; linguistic characteristics; source specific language models; unannotated audio data; Acoustic testing; Adaptation model; Broadcasting; Error analysis; NIST; Natural languages; Speech analysis; Speech recognition; System testing; Training data;
Conference_Titel :
Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
Print_ISBN :
0-7803-7980-2
DOI :
10.1109/ASRU.2003.1318411