DocumentCode
3659668
Title
A synchronised tree adjoining Grammar for English to Tamil Machine Translation
Author
Vijay Krishna Menon; Rajendran S; Soman K P
Author_Institution
Centre for Excellence in Computational Engineering and Networking, Amrita Vishwa Vidyapeetham, Coimbatore, India
fYear
2015
Firstpage
1497
Lastpage
1501
Abstract
Tree Adjoining Grammar (TAG) is a rich formalism for capturing the syntax and some limited semantics of natural languages. The XTAG project has contributed a very comprehensive TAG for the English language. Although TAGs were proposed nearly 40 years ago (Joshi et al., 1975), their use and application in Indian languages have been very rare, predominantly due to their complexity and a lack of resources. In this paper we discuss a new TAG system and development methodology for the Tamil language that can be extended to other Indian languages. The trees are developed synchronously with a minimalistic grammar obtained by careful pruning of the XTAG English grammar. We also apply Chomskian minimalism to these TAG trees to make them simple and easily parsable. Furthermore, we have developed a parser that can parse simple sentences using this grammar and generate a TAG derivation that can be used for dependency resolution. Owing to the synchronous nature of these TAG pairs, they can be readily adapted for formalism-based machine translation (MT) from English to Tamil and vice versa.
Keywords
"Grammar","Syntactics","Semantics","Informatics","Computational linguistics","Natural languages","Complexity theory"
Publisher
IEEE
Conference_Titel
Advances in Computing, Communications and Informatics (ICACCI), 2015 International Conference on
Print_ISBN
978-1-4799-8790-0
Type
conf
DOI
10.1109/ICACCI.2015.7275824
Filename
7275824