Title :
Parsing Chinese with head-driven model
Author :
Cao, Hai-Long ; Zhao, Tie-jun ; Yang, Mu-yun ; Li, Sheng
Author_Institution :
Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., China
Abstract :
Great progress has been made in parsing the Wall Street Journal portion of the Penn Treebank. Now parsing languages other than English is an intensive research area. Head-driven model is one of the best English parsing models. It has been successfully applied to Czech but failed to outperform a base-line model in parsing German. This paper attempts to parse Chinese with head-driven model. Promising experimental results demonstrate that head-driven model works well for Chinese. We propose a hybrid parsing strategy, which combines head-driven model with a Chinese base phrases parsing model. The combined model not only improves the performance but also makes the parser space and time efficient. We evaluate our method in PARSEVAL measures, and the combined model performances are at 79.88% precision, 81.97% recall.
Keywords :
grammars; natural language interfaces; natural languages; statistical analysis; Chinese base phrase parsing; head-driven model; hybrid parsing strategy; language parsing; Context modeling; Decoding; Dictionaries; Length measurement; Machine learning; Speech; Testing;
Conference_Titel :
Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
Print_ISBN :
0-7803-8403-2
DOI :
10.1109/ICMLC.2004.1382246