Title :
Data-driven lexicon expansion for Mandarin broadcast news and conversation speech recognition
Author :
Lei, Xin ; Wang, Wen ; Stolcke, Andreas
Author_Institution :
Speech Technol. & Res. Lab., SRI Int., Menlo Park, CA
Abstract :
We present a data-driven framework for expanding the lexicon to improve Mandarin broadcast news and conversation speech recognition. The lexicon expansion includes the generation of pronunciation variants for frequent words and vocabulary augmentation with new words and phrases derived from the training data. To learn multiple pronunciations, we first generate all possible pronunciation candidates for a word from its character pronunciation network. The top pronunciation variants are then selected from forced alignment statistics. To augment the acoustic vocabulary, we propose an efficient algorithm that derives new words based on N-gram statistics. Experiments show that a dictionary expanded in this manner yields significant improvements on a Mandarin broadcast speech recognition task.
Keywords :
broadcasting; speech recognition; statistics; Mandarin broadcast news; N-gram statistics; acoustic vocabulary; character pronunciation network; conversation speech recognition; data-driven lexicon expansion; vocabulary augmentation; Automatic speech recognition; Broadcast technology; Broadcasting; Character generation; Dictionaries; Natural languages; Speech recognition; Statistics; Training data; Vocabulary; Mandarin speech recognition; Pronunciation learning; vocabulary expansion;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4960587