• DocumentCode
    1910019
  • Title

    A Cascaded Framework for Statistical Machine Translation System Combination

  • Author

    Du, Jinhua ; Wei, Wei ; Yang, Zhendong ; Xu, Bo

  • Author_Institution
    Chinese Acad. of Sci., Beijing
  • fYear
    2007
  • fDate
    Aug. 30 2007-Sept. 1 2007
  • Firstpage
    285
  • Lastpage
    292
  • Abstract
    This paper investigates an extensive evaluation of combination techniques and presents a cascaded framework for combining multiple machine translation (MT) system outputs. A word transition network (WTN) is constructed from an N -best list by aligning the hypotheses against an alignment reference, where the alignment is based on minimising an modified translation edit rate (TER) with word or phrase reordering. The minimum Bayes risk (MBR) decoding techinque is inverstigated for the selection of an appropriate alignment reference. Pairwise word alignment is created by an enhanced statistical alignment algorithm that explicitly models word reordering. Experimental results are presented based on three MT systems of Chinese-English translation outputs. It is shown that worthwhile improvements in translation performance can be obtained using the proposed framework.
  • Keywords
    Bayes methods; decoding; language translation; natural languages; Chinese-English translation; MBR decoding techinque; TER; WTN; cascaded framework; combination techniques; minimum Bayes risk; modified translation edit rate; pairwise word alignment; phrase reordering; statistical alignment algorithm; statistical machine translation system; word transition network; Algorithm design and analysis; Automatic speech recognition; Automation; Decoding; Error analysis; Surface-mount technology; System performance;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2007. NLP-KE 2007. International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-1611-0
  • Electronic_ISBN
    978-1-4244-1611-0
  • Type

    conf

  • DOI
    10.1109/NLPKE.2007.4368045
  • Filename
    4368045