• DocumentCode
    1619042
  • Title

    Discriminative reranking for SMT using various global features

  • Author

    Goh, Chooi-Ling ; Watanabe, Taro ; Finch, Andrew ; Sumita, Eiichiro

  • Author_Institution
    MASTAR Project, Nat. Inst. of Inf. & Commun. Technol., Keihanna Science City, Japan
  • fYear
    2010
  • Firstpage
    8
  • Lastpage
    14
  • Abstract
    In this paper, we propose to use various global features for discriminative reranking in an SMT framework. We employ an online large-margin based training algorithm for the structural output support vector machines based on the margin infused relaxed algorithm. Besides the standard features used, such as decoder´s scores, source and target sentences, alignments and part-of-speech tags, we include sentence type probabilities, posterior probabilities and back translation features for reranking. These features have been proved to be useful in other approaches in statistical machine translation but it is the first attempt to apply them in reranking. Our experimental results using 160K BTEC corpus show an improvement of 1-4 BLEU percentage points on Japanese/Chinese to English translation.
  • Keywords
    language translation; learning (artificial intelligence); natural language processing; statistical analysis; support vector machines; 160K BTEC corpus; BLEU percentage points; back translation features; discriminative reranking; part-of-speech tags; posterior probabilities; sentence type probabilities; statistical machine translation; structural output support vector machines; training algorithm; Data models; Feature extraction; Hidden Markov models; Probability; Support vector machines; Training; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Universal Communication Symposium (IUCS), 2010 4th International
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-7821-7
  • Type

    conf

  • DOI
    10.1109/IUCS.2010.5666776
  • Filename
    5666776