DocumentCode
3426550
Title
Hierarchical linear discounting class N-gram language models: A multilevel class hierarchy approach
Author
Zitouni, Imed ; Zhou, Qiru
Author_Institution
TJ. Watson Res. Center, IBM, Yorktown Heights, NY
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
4917
Lastpage
4920
Abstract
We introduce in this paper a hierarchical linear discounting class n-gram language modeling technique that has the advantage of combining several language models, trained at different nodes in a class hierarchy. The approach hierarchically clusters the word vocabulary into a word-tree. The closer a tree node is to the leaves, the more specific the corresponding word class is. The tree is used to balance generalization ability and word specificity when estimating the likelihood of an n-gram event. Experiments are conducted on Wall Street Journal corpus using a vocabulary of 20,000 words. Results show a reduction on the test perplexity over the standard n-gram approaches by 10%. We also report considerable improvement in the accuracy of the speech recognition task.
Keywords
speech recognition; hierarchical linear discounting class n-gram language models; multilevel class hierarchy approach; speech recognition task; word tree; Frequency; Interpolation; Smoothing methods; Speech recognition; Testing; Vocabulary; Class Hierarchy; Language Modeling; Linear Distortion; n-gram;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location
Las Vegas, NV
ISSN
1520-6149
Print_ISBN
978-1-4244-1483-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2008.4518760
Filename
4518760
Link To Document