Title :
Parsing Chinese Text Based on Semantic Class
Author :
Ding, Hua-Fu ; Zhao, Tie-jun ; Li, Sheng
Author_Institution :
Harbin Inst. of Technol., Harbin
Abstract :
This paper proposes a novel Chinese syntactic parsing model based on semantic class, which is a variant of normal lexicalized statistical model. It attempts to make use of the syntactic and semantic similarity between Chinese words and then produces a more knowledgeable estimate of the probability of grammar rules. A simple but effective unsupervised method is designed to determine the proper semantic class of given words. Semantic class is used to improve the performance of parsing model. We evaluate our methods on the widely used Penn Chinese Treebank. Experimental results show that it outperforms a famous lexicalized model significantly on appropriate semantic class levels.
Keywords :
grammars; natural language processing; text analysis; unsupervised learning; Chinese text parsing; Penn Chinese Treebank; effective unsupervised method; grammar rules; normal lexicalized statistical model; semantic class; semantic similarity; syntactic similarity; Context modeling; Cybernetics; Design methodology; Information processing; Laboratories; Machine learning; Natural language processing; Natural languages; Probability; Speech processing; Chinese information processing; Parsing; Semantic class;
Conference_Titel :
Machine Learning and Cybernetics, 2007 International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-0973-0
Electronic_ISBN :
978-1-4244-0973-0
DOI :
10.1109/ICMLC.2007.4370731