DocumentCode
1796958
Title
Learning latent variable grammars from complementary perspectives
Author
Dongchen Li ; Xiantao Zhang ; Xihong Wu
Author_Institution
Key Lab. of Machine Perception & Intell., Peking Univ., Beijing, China
fYear
2014
fDate
9-13 July 2014
Firstpage
124
Lastpage
128
Abstract
The corpus for training a parser consists of sentences of heterogeneous grammar usages. Previous parser domain adaptation work has concentrated on adaptation to the shifts in vocabulary rather than grammar usage. In this paper, we focus on exploiting the diversity of training date separately and then accumulates their advantages. We propose an approach that grammar is biased toward relevant syntactic style, and the complementary grammar usage are combined for inference. Multiple grammars with partly complementary points of strength are induced individually. They capture complementary data representation, and we accumulates their advantages in a joint model to assemble the complementary depicting powers. Despite its compatibility with many other methods, out product model achieves 85.20% F1 score on Penn Chinese Treebank, higher than previous systems.
Keywords
grammars; learning (artificial intelligence); natural language processing; F1 score; Penn Chinese Treebank; complementary data representation; complementary grammar usage; complementary perspectives; heterogeneous grammar usage sentences; inference; joint model; latent variable grammar learning; parser training corpus; partly-complementary points; syntactic style; training date; Analytical models; Computational linguistics; Grammar; Mathematical model; Merging; Pragmatics; Syntactics; PCFGLA; Parsing; sparsity;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal and Information Processing (ChinaSIP), 2014 IEEE China Summit & International Conference on
Conference_Location
Xi´an
Print_ISBN
978-1-4799-5401-8
Type
conf
DOI
10.1109/ChinaSIP.2014.6889215
Filename
6889215
Link To Document