• DocumentCode
    3489925
  • Title

    A Systematic Comparison of SVM and Maximum Entropy Classifiers for Translation Error Detection

  • Author

    Jinhua Du ; Sha Wang

  • Author_Institution
    Sch. of Autom. & Inf. Eng., Xi´an Univ. of Technol., Xi´an, China
  • fYear
    2012
  • fDate
    13-15 Nov. 2012
  • Firstpage
    125
  • Lastpage
    128
  • Abstract
    In recent years, the translation error detection or confidence estimation for SMT has been becoming a hot question, especially in the localization industry. This paper mainly focuses on a systematic comparison on two different classifiers Maximum Entropy (MaxEnt) and SVM over different features to illustrate their error detection capabilities. Three typical word posterior probabilities (WPP) and three linguistic features are introduced and fairly compared over two classifiers on Chinese to-English NIST datasets. Experimental results show that the combination of WPP with linguistic features can significantly reduce the CER, and the SVM classifier outperforms the MaxEnt classifier in terms of the CER and F measure.
  • Keywords
    language translation; maximum entropy methods; natural language processing; pattern classification; probability; support vector machines; CER; Chinese to-English NIST datasets; F measure; MaxEnt; SM; SVM; WPP; confidence estimation; error detection capabilities; localization industry; maximum entropy classifiers; statistical machine translation; systematic comparison; translation error detection; word posterior probabilities; Estimation; Feature extraction; NIST; Pragmatics; Support vector machines; Syntactics; Systematics; Error Detection; Linguistic Features; Maximum Entropy Classifier; SVM classifier; Word Posterior Probability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Asian Language Processing (IALP), 2012 International Conference on
  • Conference_Location
    Hanoi
  • Print_ISBN
    978-1-4673-6113-2
  • Electronic_ISBN
    978-0-7695-4886-9
  • Type

    conf

  • DOI
    10.1109/IALP.2012.20
  • Filename
    6473712