DocumentCode :
3102589
Title :
A Modular Cascaded Approach to Complete Parsing
Author :
Husain, Samar ; Gadde, Phani ; Ambati, Bharat ; Sharma, Dipti Misra ; Sangal, Rajeev
Author_Institution :
Language Technol. Res. Centre, IIIT, Hyderabad, India
fYear :
2009
fDate :
7-9 Dec. 2009
Firstpage :
141
Lastpage :
146
Abstract :
In this paper, we propose a modular cascaded approach to data driven dependency parsing. Each module or layer leading to the complete parse produces a linguistically valid partial parse. We do this by introducing an artificial root node in the dependency structure of a sentence and by catering to distinct dependency label sets that reflect the function of the set internal labels vis-a¿-vis a distinct and identifiable linguistic unit, at different layers. The linguistic unit in our approach is a clause. Output (partial parse) from each layer can be accessed independently. We applied this approach to Hindi, a morphologically rich free word order language using MST parser. We did all our experiments on a part of Hyderabad Dependency Treebank. The final results show an increase of 1.35% in unlabeled attachment and 1.36% in labeled attachment accuracies over state-of-the-art data driven Hindi parser.
Keywords :
grammars; linguistics; natural language processing; MST parser; artificial root node; data driven dependency parsing; dependency label sets; hyderabad dependency treebank; identifiable linguistic unit; modular cascaded approach; morphologically rich free word order language; state-of-the-art data driven Hindi parser; Availability; Data mining; Information analysis; Natural languages; Robustness; Speech analysis; Speech recognition; Tagging;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Asian Language Processing, 2009. IALP '09. International Conference on
Conference_Location :
Singapore
Print_ISBN :
978-0-7695-3904-1
Type :
conf
DOI :
10.1109/IALP.2009.37
Filename :
5380786
Link To Document :
بازگشت