• DocumentCode
    12632
  • Title

    Distributed Feature Representations for Dependency Parsing

  • Author

    Wenliang Chen ; Min Zhang ; Yue Zhang

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Soochow Univ., Suzhou, China
  • Volume
    23
  • Issue
    3
  • fYear
    2015
  • fDate
    Mar-15
  • Firstpage
    451
  • Lastpage
    460
  • Abstract
    This paper presents an approach to automatically learning distributed representations for features to address the feature sparseness problem for dependency parsing. Borrowing terminologies from word embeddings, we call the feature representation feature embeddings. In our approach, the feature embeddings are inferred from large amounts of auto-parsed data. First, the sentences in raw data are parsed by a baseline system and we obtain dependency trees. Then, we represent each model feature using the surrounding features on the dependency trees. Based on the representation of surrounding context, we proposed two learning methods to infer feature embeddings. Finally, based on feature embeddings, we present a set of new features for graph-based dependency parsing models. The new parsers can not only make full use of well-established hand-designed features but also benefit from the hidden-class representations of features. Experiments on the standard Chinese and English data sets show that the new parser achieves significant performance improvements over a strong baseline.
  • Keywords
    grammars; learning (artificial intelligence); natural language processing; trees (mathematics); Chinese data sets; English data sets; baseline system; borrowing terminologies; dependency trees; distributed feature representations; distributed representation learning; feature embeddings; feature sparseness problem; graph-based dependency parsing models; hidden-class representations; sentence parsing; surrounding context representation; word embeddings; Adaptation models; Context; Context modeling; Data models; Predictive models; Training; Vectors; Natural language processing; dependency parsing; feature embeddings; semi-supervised approach;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    2329-9290
  • Type

    jour

  • DOI
    10.1109/TASLP.2014.2365359
  • Filename
    6936866