DocumentCode :
2788430
Title :
Latent topic modeling of word vicinity information for speech recognition
Author :
Chen, Kuan-Yu ; Chiu, Hsuan-Sheng ; Chen, Berlin
Author_Institution :
Nat. Taiwan Normal Univ., Taipei, Taiwan
fYear :
2010
fDate :
14-19 March 2010
Firstpage :
5394
Lastpage :
5397
Abstract :
Topic language models, mostly revolving around the discovery of “word-document” co-occurrence dependence, have attracted significant attention and shown good performance in a wide variety of speech recognition tasks over the years. In this paper, a new topic language model, named word vicinity model (WVM), is proposed to explore the co-occurrence relationship between words, as well as the long-span latent topical information for language model adaptation. A search history is modeled as a composite WVM model for predicting a decoded word. The underlying characteristics and different kinds of model structures are extensively investigated, while the performance of WVM is thoroughly analyzed and verified by comparison with a few existing topic language models. Moreover, we also present a new modeling approach to our recently proposed word topic model (WTM), and design an efficient way to simultaneously extract “word-document” and “word-word” co-occurrence characteristics through the sharing of the same set of latent topics. Experiments on broadcast news transcription seem to demonstrate the utility of the presented models.
Keywords :
natural language processing; speech recognition; latent topic modeling; long span latent topical information; speech recognition; topic language models; word vicinity information; word vicinity model; Adaptation model; Broadcasting; Decoding; Frequency; History; Linear discriminant analysis; Natural languages; Predictive models; Speech analysis; Speech recognition; broadcast news transcription; speech recognition; topic language model; word vicinity model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
ISSN :
1520-6149
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2010.5494942
Filename :
5494942
Link To Document :
بازگشت