Title :
Word Topical Mixture Models for Dynamic Language Model Adaptation
Author :
Hsuan-Sheng Chin ; Chen, Bing
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Taipei Univ., Taiwan
Abstract :
This paper considers dynamic language model adaptation for Mandarin broadcast news recognition. A word topical mixture model (TMM) is proposed to explore the co-occurrence relationship between words, as well as the long-span latent topical information, for language model adaptation. The search history is modeled as a composite word TMM model for predicting the decoded word. The underlying characteristics and different kinds of model structures were extensively investigated, while the performance of word TMM was analyzed and verified by comparison with the conventional probabilistic latent semantic analysis-based language model (PLSALM) and trigger-based language model (TBLM) adaptation approaches. The large vocabulary continuous speech recognition (LVCSR) experiments were conducted on the Mandarin broadcast news collected in Taiwan. Very promising results in perplexity as well as character error rate reductions were initially obtained.
Keywords :
decoding; natural language processing; speech coding; speech recognition; Mandarin broadcast news recognition; decoded word; dynamic language model adaptation; large vocabulary continuous speech recognition; long-span latent topical information; word topical mixture models; Adaptation model; Broadcasting; Decoding; Error analysis; History; Natural languages; Performance analysis; Predictive models; Speech recognition; Vocabulary; language model adaptation; probabilistic latent semantic analysis; speech recognition; trigger-based language model; word topical mixture model;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.367190