DocumentCode
12632
Title
Distributed Feature Representations for Dependency Parsing
Author
Wenliang Chen ; Min Zhang ; Yue Zhang
Author_Institution
Sch. of Comput. Sci. & Technol., Soochow Univ., Suzhou, China
Volume
23
Issue
3
fYear
2015
fDate
Mar-15
Firstpage
451
Lastpage
460
Abstract
This paper presents an approach to automatically learning distributed representations for features to address the feature sparseness problem for dependency parsing. Borrowing terminologies from word embeddings, we call the feature representation feature embeddings. In our approach, the feature embeddings are inferred from large amounts of auto-parsed data. First, the sentences in raw data are parsed by a baseline system and we obtain dependency trees. Then, we represent each model feature using the surrounding features on the dependency trees. Based on the representation of surrounding context, we proposed two learning methods to infer feature embeddings. Finally, based on feature embeddings, we present a set of new features for graph-based dependency parsing models. The new parsers can not only make full use of well-established hand-designed features but also benefit from the hidden-class representations of features. Experiments on the standard Chinese and English data sets show that the new parser achieves significant performance improvements over a strong baseline.
Keywords
grammars; learning (artificial intelligence); natural language processing; trees (mathematics); Chinese data sets; English data sets; baseline system; borrowing terminologies; dependency trees; distributed feature representations; distributed representation learning; feature embeddings; feature sparseness problem; graph-based dependency parsing models; hidden-class representations; sentence parsing; surrounding context representation; word embeddings; Adaptation models; Context; Context modeling; Data models; Predictive models; Training; Vectors; Natural language processing; dependency parsing; feature embeddings; semi-supervised approach;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
Publisher
ieee
ISSN
2329-9290
Type
jour
DOI
10.1109/TASLP.2014.2365359
Filename
6936866
Link To Document