• DocumentCode
    3182383
  • Title
    Issues in parsing for Machine Aided Translation from English to Hindi
  • Author
    Sugandhi, Rekha S. ; Shekhar, Ritika ; Agarwal, Tarun ; Bedi, Rajneesh K. ; Wadhai, Vijay M.
  • Author_Institution
    Dept. of Comput. Eng., MIT Coll. of Eng., Pune, India
  • fYear
    2011
  • fDate
    11-14 Dec. 2011
  • Firstpage
    754
  • Lastpage
    759
  • Abstract
    Resolving ambiguity that arises from differences in the syntactic form of grammatical rules in the source and target languages is a major challenge in the development of a Machine Aided Translation (MAT) system. This paper focuses primarily on translation of text from English to Hindi, one of the most popular Indian languages. Producing an unambiguous parse tree is extremely difficult when the target is an Indian language, because of their free word order. This paper presents a sub-model of a parser that identifies the syntactic elements in English sentences and suggests their Hindi translation, taking into account the various grammatical forms of Hindi. The parser thus focuses on the grammatical structure of the sentence, especially gender, karaka (case) and context. The Subject-Verb-Object approach is used to break down the sentences.
  • Keywords
    grammars; language translation; natural language processing; trees (mathematics); English; Hindi; Indian languages; grammatical rules; machine aided translation system; parse tree; parsing; source languages; subject-verb-object approach; target languages; Context; Databases; Dictionaries; Grammar; Production; Random access memory; Syntactics; Machine Translation; Multi-lingual Machine Translation; Natural Language Processing; Parsers; Text Mining; Word Sense Disambiguation;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Information and Communication Technologies (WICT), 2011 World Congress on
  • Conference_Location
    Mumbai
  • Print_ISBN
    978-1-4673-0127-5
  • Type
    conf
  • DOI
    10.1109/WICT.2011.6141341
  • Filename
    6141341