DocumentCode
2909761
Title
Improving Chinese Dependency Parsing with Self-Disambiguating Patterns
Author
Qiu, Likun ; Wu, Lei ; Zhao, Kai ; Hu, Changjian ; Kong, Lingpeng
Author_Institution
Key Lab. of Comput. Linguistics, Peking Univ., Beijing, China
fYear
2011
fDate
15-17 Nov. 2011
Firstpage
7
Lastpage
10
Abstract
To solve the data sparseness problem in dependency parsing, most previous studies used features constructed from large-scale auto-parsed data. Unlike previous work, we propose a new approach to improve dependency parsing with context-free dependency triples (CDT) extracted by using self-disambiguating patterns (SDP). The use of SDP makes it possible to avoid the dependency on a baseline parser and explore the influence of different types of substructures one by one. Additionally, taking the available CDTs as seeds, a label propagation process is used to tag a large number of unlabeled word pairs as CDTs. Experiments show that, when CDT features are integrated into a maximum spanning tree (MST) dependency parser, the new parser improves significantly over the baseline MST parser. Comparative results also show that CDTs with dependency relation labels perform much better than CDT without dependency relation label.
Keywords
context-free grammars; natural language processing; trees (mathematics); CDT; MST; SDP; context-free dependency triples; data sparseness problem; dependency parsing; label propagation process; maximum spanning tree; self-disambiguating patterns; Context; Data mining; Earth Observing System; Feature extraction; Syntactics; Tagging; Training; Dependency parsing; raw corpus; self-disambiguating pattern;
fLanguage
English
Publisher
ieee
Conference_Titel
Asian Language Processing (IALP), 2011 International Conference on
Conference_Location
Penang
Print_ISBN
978-1-4577-1733-8
Type
conf
DOI
10.1109/IALP.2011.36
Filename
6121457
Link To Document