DocumentCode :
2700487
Title :
Correlated Latent Semantic Model for Unsupervised LM Adaptation
Author :
Yik-Cheung Tam ; Schultz, Tanja
Author_Institution :
Inst. of Language Technol., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
We propose a latent Dirichlet-tree allocation (LDTA) model - a correlated latent semantic model - for unsupervised language model adaptation. The LDTA model extends the latent Dirichlet allocation (LDA) model by replacing a Dirichlet prior with a Dirichlet-tree prior over the topic proportions. Latent topics under the same subtree are expected to be more correlated than topics under different subtrees. The LDTA model falls back to the LDA model using a depth-one Dirichlet-tree, and the model fits to the variational Bayes inference framework employed in the LDA model. Empirical results show that the LDTA model has a faster training convergence than the LDA model with the same initial flat model. Experimental results show that LDTA-adapted LM performed better than LDA-adapted LM on the Mandarin RT04-eval set when the models were trained using a small text corpus, while both models had the same recognition performance when the models were trained using a big text corpus. We observed 0.4% absolute CER reduction after LM adaptation using LSA marginals.
Keywords :
Bayes methods; natural language processing; trees (mathematics); Mandarin RT04-eval set; correlated latent semantic model; latent Dirichlet-tree allocation; subtree; unsupervised LM adaptation; unsupervised language model adaptation; variational Bayes inference framework; Adaptation model; Automatic speech recognition; Bayesian methods; Convergence; Gaussian distribution; Humans; Linear discriminant analysis; Machine learning; Sampling methods; Text recognition; Dirichlet-Tree; LSA; correlated topics; unsupervised LM adaptation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.367158
Filename :
4218032
Link To Document :
بازگشت