DocumentCode :
677182
Title :
Building a treebank for Vietnamese dependency parsing
Author :
Luong Nguyen Thi ; Linh Ha My ; Hung Nguyen Viet ; Huyen Nguyen Thi Minh ; Phuong Le Hong
Author_Institution :
Dalat Univ., Lamdong, Vietnam
fYear :
2013
fDate :
10-13 Nov. 2013
Firstpage :
147
Lastpage :
151
Abstract :
The problem of Vietnamese syntactic parsing, especially constituency parsing, has recently been tackled by several research groups. A common effort of the Vietnamese language processing community has allowed the creation of VietTreebank, a reference parsed corpus containing about 10,000 sentences for the constituency parsing task. In this paper, we present our work to build a reference treebank, based on VietTreebank, for the dependency parsing task, which has not yet been very well studied for Vietnamese. First we define a dependency label set by adapting the dependency schema developed by the NLP group at Stanford university and taking into account the particularities of Vietnamese grammar. Then we propose an algorithm to convert a constituency treebank to a dependency one. The algorithm is tested on a set of 100 sentences of VietTreebank corpus and gives very good results. Finally, we carry out an experiment on Vietnamese dependency parsing using MaltParser tool and the dependency treebank converted from VietTreebank.
Keywords :
grammars; natural language processing; MaltParser tool; NLP group; VietTreebank corpus; Vietnamese dependency parsing; Vietnamese grammar; Vietnamese language processing community; Vietnamese syntactic parsing; constituency parsing task; dependency label set; dependency parsing task; dependency treebank; reference parsed corpus; reference treebank; Accuracy; Bills of materials; Educational institutions; Grammar; Magnetic heads; Syntactics; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computing and Communication Technologies, Research, Innovation, and Vision for the Future (RIVF), 2013 IEEE RIVF International Conference on
Conference_Location :
Hanoi
Print_ISBN :
978-1-4799-1349-7
Type :
conf
DOI :
10.1109/RIVF.2013.6719884
Filename :
6719884
Link To Document :
بازگشت