Title :
Word Activation Forces-Based Language Modeling and Smoothing
Author :
Min Qin ; Gang Liu ; Baoxiang Li ; Yueming Lu
Author_Institution :
Sch. of Inf. & Commun. Eng., BUPT, Beijing, China
Abstract :
N-gram language models are useful for modeling the local dependencies of word occurrences but not for capturing global word dependencies. When the window size n is limited, the n-gram is weak in terms of capturing long distance dependencies. Long-distance Dependency information has long been proven useful in language model. However, the improved performance of long-distance LMs over conventional n-gram models generally comes at the cost of increased decoding complexity and model size. Word Activation Forces has been proven a simple and human-comparable accurate measure to identify word closest associates. In this paper, Word Activation Forces-Based language model is proposed to capture the long distance dependency between words, but which is as fast for decoding as a conventional word n-gram. As shown by experiments on broadcast news, the proposed language modeling and smoothing can significantly reduce the perplexity of language models and word error rate with moderate computational cost.
Keywords :
computational complexity; language translation; natural language processing; probability; smoothing methods; speech recognition; statistical analysis; N-gram language models; automatic speech recognition; computational cost; decoding complexity; human-comparable accurate measure; long-distance dependency information; model size; perplexity reduction; window size; word activation forces-based language modeling; word activation forces-based language smoothing; word closest associate identification; word error rate; word occurrence local dependencies; Computational modeling; Error analysis; History; Interpolation; Semantics; Smoothing methods; Training; Word Activation Forces; language model; long-distance dependency;
Conference_Titel :
Intelligent Human-Machine Systems and Cybernetics (IHMSC), 2013 5th International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-0-7695-5011-4
DOI :
10.1109/IHMSC.2013.140