DocumentCode :
394233
Title :
Unsupervised language model adaptation for broadcast news
Author :
Chen, Lungzhou ; Gauvain, Jean-Luc ; Lamel, Lori ; Adda, Gilles
Author_Institution :
Spoken Language Process. Group, LIMSI-CNRS, Orsay, France
Volume :
1
fYear :
2003
fDate :
6-10 April 2003
Abstract :
Unsupervised language model adaptation for speech recognition is challenging, particularly for complicated tasks such the transcription of broadcast news (BN) data. This paper presents an unsupervised adaptation method for language modeling based on information retrieval techniques. The method is designed for the broadcast news transcription task where the topics of the audio data cannot be predicted in advance. Experiments are carried out using the LIMSI American English BN transcription system and the NIST 1999 BN evaluation sets. The unsupervised adaptation method reduces the perplexity by 7% relative to the baseline LM and yields a 2% relative improvement for a 10xRT system.
Keywords :
information retrieval; natural languages; speech recognition; LIMSI American English BN transcription system; broadcast news; information retrieval techniques; language modeling; perplexity; speech recognition; unsupervised language model adaptation; Adaptation model; Broadcasting; Data mining; Decoding; Design methodology; Natural languages; Speech recognition; TV; Text recognition; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1198757
Filename :
1198757
Link To Document :
بازگشت