• DocumentCode
    1796925
  • Title

    Improved parsing with taxonomy of conjunctions

  • Author

    Dongchen Li ; Xiantao Zhang ; Xihong Wu

  • Author_Institution
    Key Lab. of Machine Perception & Intell., Peking Univ., Beijing, China
  • fYear
    2014
  • fDate
    9-13 July 2014
  • Firstpage
    47
  • Lastpage
    51
  • Abstract
    Incorporating knowledge when training a parser has been shown to remedy the weaknesses of probabilistic context-free grammars. Previous parsing systems have exploited semantic resources for content words and word-formation knowledge. However, they do not take into account conjunction category refinement, which turns out to be helpful in predicting syntactic structure and syntactic labels in Chinese. We define a conjunction taxonomy representing intrinsic syntactic constraints, and show that refined categories in the taxonomy for conjunctions contribute to improved parsing performance. The taxonomy is used to supervise the splitting of these refined tags, and the automatic hierarchical state-split approach is employed to compensate for the taxonomy's limited scope and degree of refinement. Experiments on the Penn Chinese Treebank show that our method improves parsing performance significantly.
  • Keywords
    context-free grammars; Penn Chinese treebank; automatic hierarchical state-split approach; conjunction taxonomy; context-free grammar; intrinsic syntactic constraints; parsing performance; taxonomy refinement degree; taxonomy scope limitation; Computational linguistics; Grammar; Manuals; Pragmatics; Syntactics; Taxonomy; Training; conjunction; parsing; refinement;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal and Information Processing (ChinaSIP), 2014 IEEE China Summit & International Conference on
  • Conference_Location
    Xi'an
  • Print_ISBN
    978-1-4799-5401-8
  • Type

    conf

  • DOI
    10.1109/ChinaSIP.2014.6889199
  • Filename
    6889199