• DocumentCode
    1858785
  • Title

    FOLSOM: a fast and memory-efficient phrase-based approach to statistical machine translation

  • Author

    Bowen Zhou ; Chen, S.F. ; Yuqing Gao

  • Author_Institution
    IBM T. J. Watson Res. Center, Yorktown Heights, NY
  • fYear
    2006
  • fDate
    10-13 Dec. 2006
  • Firstpage
    226
  • Lastpage
    229
  • Abstract
    In this work, we propose a novel framework for performing phrase-based statistical machine translation using weighted finite-state transducers (WFST´s) that is significantly faster than existing frameworks while also being memory-efficient. In particular, we represent the entire translation model with a single WFST that is statically optimized, in contrast to previous work that represents the translation model as multiple WFST´s that must be composed on the fly. We describe a new search algorithm that conveniently and efficiently combines multiple knowledge sources during decoding. The proposed approach is particularly suitable for converged real-time speech translation on scalable computing devices. We were able to develop a SMT system that can translate more than 3000 words/second while still retaining excellent accuracy.
  • Keywords
    decoding; finite state machines; language translation; speech processing; FOLSOM; decoding; phrase-based approach; real-time speech translation; search algorithm; statistical machine translation; weighted finite-state transducers; Decoding; Delay; Encoding; Handheld computers; Lattices; Personal digital assistants; Speech recognition; Surface-mount technology; Transducers; Viterbi algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop, 2006. IEEE
  • Conference_Location
    Palm Beach
  • Print_ISBN
    1-4244-0872-5
  • Type

    conf

  • DOI
    10.1109/SLT.2006.326796
  • Filename
    4123403