• DocumentCode
    2909761
  • Title

    Improving Chinese Dependency Parsing with Self-Disambiguating Patterns

  • Author

    Qiu, Likun ; Wu, Lei ; Zhao, Kai ; Hu, Changjian ; Kong, Lingpeng

  • Author_Institution
    Key Lab. of Comput. Linguistics, Peking Univ., Beijing, China
  • fYear
    2011
  • fDate
    15-17 Nov. 2011
  • Firstpage
    7
  • Lastpage
    10
  • Abstract
    To solve the data sparseness problem in dependency parsing, most previous studies used features constructed from large-scale auto-parsed data. Unlike previous work, we propose a new approach to improve dependency parsing with context-free dependency triples (CDT) extracted by using self-disambiguating patterns (SDP). The use of SDP makes it possible to avoid the dependency on a baseline parser and explore the influence of different types of substructures one by one. Additionally, taking the available CDTs as seeds, a label propagation process is used to tag a large number of unlabeled word pairs as CDTs. Experiments show that, when CDT features are integrated into a maximum spanning tree (MST) dependency parser, the new parser improves significantly over the baseline MST parser. Comparative results also show that CDTs with dependency relation labels perform much better than CDT without dependency relation label.
  • Keywords
    context-free grammars; natural language processing; trees (mathematics); CDT; MST; SDP; context-free dependency triples; data sparseness problem; dependency parsing; label propagation process; maximum spanning tree; self-disambiguating patterns; Context; Data mining; Earth Observing System; Feature extraction; Syntactics; Tagging; Training; Dependency parsing; raw corpus; self-disambiguating pattern;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Asian Language Processing (IALP), 2011 International Conference on
  • Conference_Location
    Penang
  • Print_ISBN
    978-1-4577-1733-8
  • Type

    conf

  • DOI
    10.1109/IALP.2011.36
  • Filename
    6121457