• DocumentCode
    1796958
  • Title

    Learning latent variable grammars from complementary perspectives

  • Author

    Dongchen Li ; Xiantao Zhang ; Xihong Wu

  • Author_Institution
    Key Lab. of Machine Perception & Intell., Peking Univ., Beijing, China
  • fYear
    2014
  • fDate
    9-13 July 2014
  • Firstpage
    124
  • Lastpage
    128
  • Abstract
    The corpus for training a parser consists of sentences of heterogeneous grammar usages. Previous parser domain adaptation work has concentrated on adaptation to the shifts in vocabulary rather than grammar usage. In this paper, we focus on exploiting the diversity of training date separately and then accumulates their advantages. We propose an approach that grammar is biased toward relevant syntactic style, and the complementary grammar usage are combined for inference. Multiple grammars with partly complementary points of strength are induced individually. They capture complementary data representation, and we accumulates their advantages in a joint model to assemble the complementary depicting powers. Despite its compatibility with many other methods, out product model achieves 85.20% F1 score on Penn Chinese Treebank, higher than previous systems.
  • Keywords
    grammars; learning (artificial intelligence); natural language processing; F1 score; Penn Chinese Treebank; complementary data representation; complementary grammar usage; complementary perspectives; heterogeneous grammar usage sentences; inference; joint model; latent variable grammar learning; parser training corpus; partly-complementary points; syntactic style; training date; Analytical models; Computational linguistics; Grammar; Mathematical model; Merging; Pragmatics; Syntactics; PCFGLA; Parsing; sparsity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal and Information Processing (ChinaSIP), 2014 IEEE China Summit & International Conference on
  • Conference_Location
    Xi´an
  • Print_ISBN
    978-1-4799-5401-8
  • Type

    conf

  • DOI
    10.1109/ChinaSIP.2014.6889215
  • Filename
    6889215