• DocumentCode
    3222780
  • Title

    VAASAANUBAADA: automatic machine translation of bilingual Bengali-Assamese news texts

  • Author

    Vijayanand, Kommaluri ; Choudhury, S.I. ; Ratna, Pranab

  • Author_Institution
    Assam Univ., Silchar, India
  • fYear
    2002
  • fDate
    13-15 Dec. 2002
  • Firstpage
    183
  • Lastpage
    188
  • Abstract
    This paper presents a project for translating bilingual Bengali-Assamese news texts using an example-based machine translation technique. The work involves machine translation of bilingual texts at sentence level. In addition, the work also includes preprocessing and post-processing tasks. The work is unique because of the language pair that is chosen for experimentation. We constructed and aligned the bilingual corpus manually by feeding real examples using pseudo code. The longer input sentence is fragmented at punctuations, which resulted in high quality translation. Backtracking is used when an exact match is not found at the sentence/fragment level, leading to further fragmentation of the sentence. Since bilingual Bengali-Assamese languages belong to the Magadha Prakrit group, the grammatical form of sentences is very similar and has no lexical word groups. The results when tested are fascinating with quality translation.
  • Keywords
    backtracking; document handling; grammars; language translation; learning by example; Magadha Prakrit group; automatic machine translation; backtracking; bilingual Bengali-Assamese news texts; bilingual corpus; example-based machine translation; grammatical form; input sentence fragmentation; post-processing; preprocessing; pseudo code; punctuations; sentence level; Application software; Computer science; Concrete; Databases; Humans; Knowledge representation; Probability; Testing; Turning; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Language Engineering Conference, 2002. Proceedings
  • Print_ISBN
    0-7695-1885-0
  • Type

    conf

  • DOI
    10.1109/LEC.2002.1182307
  • Filename
    1182307