DocumentCode :
2348886
Title :
A reranking method for syntactic parsing with heterogeneous treebanks
Author :
Ding, Haibo ; Zhu, Muhua ; Zhu, Jingbo
Author_Institution :
Natural Language Process. Lab., Northeastern Univ., Shenyang, China
fYear :
2010
fDate :
21-23 Aug. 2010
Firstpage :
1
Lastpage :
4
Abstract :
In the field of natural language processing (NLP), there often exist multiple corpora with different annotation standards for the same task. In this paper, we take syntactic parsing as a case study and propose a reranking method which is able to make direct use of disparate treebanks simultaneously without using techniques such as treebank conversion. The method proceeds in three steps: 1) build parsers on individual treebanks; 2) use parsers independently to generate n-best lists for each sentence in test set; 3) rerank individual n-best lists which correspond to the same sentence by using consensus information exchanged among these n-best lists. Experimental results on two open Chinese treebanks show that our method significantly outperforms the baseline system by 0.84% and 0.53% respectively.
Keywords :
natural language processing; tree data structures; Chinese treebanks; annotation standards; disparate treebanks; heterogeneous treebanks; natural language processing; reranking method; syntactic parsing; treebank conversion; Accuracy; Artificial neural networks; Equations; Standards; Syntactic parsing; heterogeneous treebanks; reranking;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Language Processing and Knowledge Engineering (NLP-KE), 2010 International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-6896-6
Type :
conf
DOI :
10.1109/NLPKE.2010.5587842
Filename :
5587842
Link To Document :
بازگشت