Title :
Embedded machine learning systems for robust spoken language parsing
Author :
Wu, Wei-Lin ; Duan, Jian-Yong ; Lu, Ru-Zhan ; Gao, Feng ; Chen, Yu-Quan
Author_Institution :
Dept. of Comput. Sci. & Eng., Shanghai Jiao Tong Univ., China
fDate :
30 Oct.-1 Nov. 2005
Abstract :
In processing ill-formed spontaneous spoken utterance, many state-of-the-art robust parsers achieve robustness by allowing skipping of words and rule symbols. The parser´s ability to skip words and rule symbols, however, results in a much bigger search space and greatly increases the parse ambiguity. Previous approaches resolved these issues through manually labeling the types of rule symbols, or by utilizing heuristic scores or statistical probabilities. However, these approaches have certain drawbacks. This paper proposes to exploit embedded machine learning techniques to help with pruning and disambiguation in robust parsers. An embedded machine learning system is integrated with the heuristic score and the strategy of basing the types of rule symbols upon their correspondence to the domain model. This integration can considerably relieve the reliance of robust parser development on linguistic expert handcrafting. Our experiments show that this integration offers stronger capability in ambiguity resolution, thereby enabling the robust parser to achieve better parsing accuracy.
Keywords :
embedded systems; grammars; learning (artificial intelligence); natural languages; disambiguation method; embedded machine learning system; linguistic expert handcrafting; pruning method; robust spoken language parsing; spontaneous spoken utterance; Computer science; Labeling; Learning systems; Machine learning; Natural languages; Probability; Robustness; Speech; Statistical analysis; Training data;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN :
0-7803-9361-9
DOI :
10.1109/NLPKE.2005.1598729