• DocumentCode
    3278040
  • Title
    Multi-feature representation for Web-based English-Chinese OOV term translation
  • Author
    Zhang, Yue-jie; Su, Yan-xia; Jin, Cheng; Zhang, Tao
  • Author_Institution
    Sch. of Comput. Sci., Fudan Univ., Shanghai, China
  • Volume
    4
  • fYear
    2011
  • fDate
    10-13 July 2011
  • Firstpage
    1515
  • Lastpage
    1519
  • Abstract
    This paper addresses Web-based English-Chinese Out-of-Vocabulary (OOV) term translation, with particular emphasis on a candidate selection strategy that uses a multi-feature representation for translation evaluation. Three kinds of features (local, global, and Boolean) are extracted from the translation candidates and combined through a multi-feature fusion strategy. Experiments on the CoNLL 2003 corpus for the English Named Entity Recognition (NER) task, a standard data source, show promising results. The resulting multi-feature representation mechanism for the English-Chinese OOV term translation model is better able to “filter” for the most probable translation candidate.
  • Keywords
    Internet; computational linguistics; feature extraction; language translation; pattern recognition; sensor fusion; vocabulary; CoNLL 2003 corpus; Web-based English-Chinese OOV term translation; local-global-Boolean feature extraction; multifeature representation; named entity recognition; out-of-vocabulary; selection strategy; standard data source; Accuracy; Cybernetics; Feature extraction; Machine learning; Semantics; Web mining; Boolean feature; Out-of-Vocabulary (OOV) term translation; global feature; local feature; multi-feature representation;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    Machine Learning and Cybernetics (ICMLC), 2011 International Conference on
  • Conference_Location
    Guilin
  • ISSN
    2160-133X
  • Print_ISBN
    978-1-4577-0305-8
  • Type
    conf
  • DOI
    10.1109/ICMLC.2011.6016971
  • Filename
    6016971