• DocumentCode
    2281943
  • Title

    Employing topic modeling for statistical machine translation

  • Author

    Zhengxian, Gong ; Guodong, Zhou

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Soochow Univ., Suzhou, China
  • Volume
    4
  • fYear
    2011
  • fDate
    10-12 June 2011
  • Firstpage
    24
  • Lastpage
    28
  • Abstract
    The mixture modeling approaches have dominated the research of domain adaptation in Statistical Machine Translation (SMT). Such approaches construct a general model and several sub-models in advance and focus on the way of determining the relative importance of all the models. In this paper, we propose a simple yet effective approach for better domain adaptation in phrase-based SMT via topic modeling. Different from existing approaches, our topic modeling approach employs one additional feature function to capture the topic inherent in the source phrase and help the decoder dynamically choose related target phrases according to the specific topic of the source phrase. Evaluation on a conversation corpus shows very encouraging results.
  • Keywords
    language translation; statistical analysis; conversation corpus; mixture modeling approaches; phrase-based SMT; source phrase; statistical machine translation; topic modeling approach; Adaptation models; Analytical models; Business; Computational modeling; Decoding; Hidden Markov models; Training; domain adaptation; statistical macine translation; topic modeling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Automation Engineering (CSAE), 2011 IEEE International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4244-8727-1
  • Type

    conf

  • DOI
    10.1109/CSAE.2011.5952796
  • Filename
    5952796