• DocumentCode
    3659668
  • Title

    A synchronised tree adjoining Grammar for English to Tamil Machine Translation

  • Author

    Vijay Krishna Menon; Rajendran S; Soman K P

  • Author_Institution
    Centre for Excellence in Computational Engineering and Networking, Amrita Vishwa Vidyapeetham, Coimbatore, India
  • fYear
    2015
  • Firstpage
    1497
  • Lastpage
    1501
  • Abstract
    Tree Adjoining Grammar (TAG) is a rich formalism for capturing the syntax and some limited semantics of natural languages. The XTAG project has contributed a very comprehensive TAG for the English language. Although TAGs were proposed nearly 40 years ago (Joshi et al., 1975), their use in Indian languages has been very rare, predominantly due to their complexity and a lack of resources. In this paper we discuss a new TAG system and development methodology for the Tamil language that can be extended to other Indian languages. The trees are developed synchronously with a minimalistic grammar obtained by careful pruning of the XTAG English grammar. We also apply Chomskian minimalism to these TAG trees to make them simple and easily parsable. Furthermore, we have developed a parser that can parse simple sentences using the above-mentioned grammar and generate a TAG derivation that can be used for dependency resolution. Owing to the synchronous nature of these TAG pairs, they can be readily adapted for formalism-based Machine Translation (MT) from English to Tamil and vice versa.
  • Keywords
    "Grammar","Syntactics","Semantics","Informatics","Computational linguistics","Natural languages","Complexity theory"
  • Publisher
    IEEE
  • Conference_Titel
    Advances in Computing, Communications and Informatics (ICACCI), 2015 International Conference on
  • Print_ISBN
    978-1-4799-8790-0
  • Type

    conf

  • DOI
    10.1109/ICACCI.2015.7275824
  • Filename
    7275824