• DocumentCode
    3595768
  • Title

    Machine translation using corpus-based acquisition of transfer rules

  • Author

    Winiwarter, Werner

  • Author_Institution
    Dept. of Sci. Comput., Vienna Univ., Vienna
  • Volume
    1
  • fYear
    2007
  • Firstpage
    345
  • Lastpage
    350
  • Abstract
    In this paper we present a Japanese-English transfer-based machine translation system. Our main research contribution is that the transfer rules are not handcrafted but are learnt automatically from a parallel corpus. We learn specific transfer rules from sentence pairs, which are then generalized in a consolidation phase to avoid overtraining. The system has been implemented in Amzi! Prolog, which offers scalability for large rule bases, full Unicode support for Japanese characters, and several APIs for the seamless integration of the translation functionality into common office environments, e.g. Microsoft Word. The dynamic nature of our system allows for easy customization of the rule base according to the user's personal preferences.
  • Keywords
    PROLOG; language translation; natural language processing; Amzi! Prolog; Japanese characters; corpus-based acquisition; machine translation; parallel corpus; transfer rules; Computer languages; Concrete; Dictionaries; Humans; Knowledge acquisition; Natural languages; Scalability; Scientific computing; Space technology; Vocabulary
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Digital Information Management, 2007. ICDIM '07. 2nd International Conference on
  • Print_ISBN
    978-1-4244-1475-8
  • Electronic_ISBN
    978-1-4244-1476-5
  • Type

    conf

  • DOI
    10.1109/ICDIM.2007.4444247
  • Filename
    4444247