• DocumentCode
    498897
  • Title

    English-Chinese OOV translation based on PAT Tree

  • Author

    Wang, Yang ; Zhang, Yue-jie ; Zhang, Tao

  • Author_Institution
    Shanghai Key Lab. of Intell. Inf. Process., Fudan Univ., Shanghai, China
  • Volume
    3
  • fYear
    2009
  • fDate
    12-15 July 2009
  • Firstpage
    1732
  • Lastpage
    1736
  • Abstract
    In Cross-Language Information Retrieval (CLIR) process, Out-Of-Vocabulary (OOV) or the unknown word translation is a significant and challenging issue. Specifically, for English-Chinese OOV translation, OOV term detection and extraction of translation pair still remain to be key problems. In this paper, an English-Chinese OOV translation pattern based on PAT-Tree is proposed. Web-mining is utilized as the corpus source to collect translation pairs, and translation candidates are acquired by Chinese OOV term extraction based on PAT-Tree. The experimental results show that the proposed approach can outperform some of the current translation engines, and is especially efficient in English-Chinese OOV translation.
  • Keywords
    Internet; data mining; information retrieval; language translation; search engines; English-Chinese out-of-vocabulary translation; OOV term detection; OOV term extraction; Web-mining; cross-language information retrieval; translation engines; Cybernetics; Machine learning; Cross-Language Information Retrieval (CLIR); English-Chinese OOV translation; Out-of-Vocabulary (OOV); PAT-Tree; term extraction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics, 2009 International Conference on
  • Conference_Location
    Baoding
  • Print_ISBN
    978-1-4244-3702-3
  • Electronic_ISBN
    978-1-4244-3703-0
  • Type

    conf

  • DOI
    10.1109/ICMLC.2009.5212280
  • Filename
    5212280