DocumentCode :
1696183
Title :
Comparison of a bigram PLSA and a novel context-based PLSA language model for speech recognition
Author :
Haidar, Md Akmal ; O´Shaughnessy, D.
Author_Institution :
INRS-EMT, Montreal, QC, Canada
fYear :
2013
Firstpage :
8440
Lastpage :
8444
Abstract :
We propose a novel context-based probabilistic latent semantic analysis (PLSA) language model for speech recognition. In this model, the topic is conditioned on the immediate history context and the document in the original PLSA model. This allows computing all the possible bigram probabilities of the seen history context using the model. It properly computes the topic probability of an unseen document for each history context present in the document. We compare our approach with a recently proposed unsmoothed bigram PLSA model where only the seen bigram probabilities are calculated, which causes computing the incorrect topic probability for the present history context of the unseen document. The proposed model requires a significantly less amount of computation time and memory space requirements than the unsmoothed bigram PLSA model. We carried out experiments on a continuous speech recognition (CSR) task using theWall Street Journal (WSJ) corpus. The proposed approach shows significant reduction in both perplexity and word error rate (WER) measurements over the other approach.
Keywords :
error statistics; probability; speech recognition; CSR; WER; Wall Street Journal corpus; bigram probabilities; context-based PLSA language model; continuous speech recognition; history context; probabilistic latent semantic analysis; statistical language model; unsmoothed bigram PLSA model; word error rate; Adaptation models; Computational modeling; Context; Context modeling; History; Mathematical model; Training; Topic models; bigram PLSA models; speech recognition; statistical language model; word co-occurrence;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6639312
Filename :
6639312
Link To Document :
بازگشت