DocumentCode :
353700
Title :
A unified context-free grammar and n-gram model for spoken language processing
Author :
Wang, Ye-Yi ; Mahajan, Milind ; Huang, Xuedong
Author_Institution :
Speech Technol. Group, Microsoft Corp., Redmond, WA, USA
Volume :
3
fYear :
2000
fDate :
2000
Firstpage :
1639
Abstract :
While context-free grammars (CFGs) remain as one of the most important formalisms for interpreting natural language, word n-gram models are surprisingly powerful for domain-independent applications. We propose to unify these two formalisms for both speech recognition and spoken language understanding (SLU). With portability as the major problem, we incorporated domain-specific CFGs into a domain-independent n-gram model that can improve the generalizability of the CFG and the specificity of the n-gram. In our experiments, the unified model can significantly reduce the test set perplexity from 378 to 90 in comparison with a domain-independent word trigram. The unified model converges well when domain-specific data becomes available. The perplexity can be further reduced from 90 to 65 with a limited amount of domain-specific data. While we have demonstrated excellent portability, the full potential of our approach lies in its unified recognition and understanding that we are investigating
Keywords :
context-free grammars; natural languages; nomograms; speech processing; speech recognition; convergence; domain-independent applications; domain-independent n-gram model; domain-independent word trigram; domain-specific context-free grammars; generalizability; natural language interpretation; portability; specificity; speech recognition; spoken language processing; spoken language understanding; test set perplexity; unified model; word n-gram models; Context modeling; Decoding; Equations; Natural languages; Predictive models; Signal generators; Speech processing; Speech recognition; Testing; Unified modeling language;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
ISSN :
1520-6149
Print_ISBN :
0-7803-6293-4
Type :
conf
DOI :
10.1109/ICASSP.2000.862062
Filename :
862062
Link To Document :
بازگشت