• DocumentCode
    3309147
  • Title

    Semi-supervised learning for word sense disambiguation using parallel corpora

  • Author

    Mo Yu ; Shu Wang ; Conghui Zhu ; Tiejun Zhao

  • Author_Institution
    MOE-MS Key Lab. of Natural Language Process. & Speech, Harbin Inst. of Technol., Harbin, China
  • Volume
    3
  • fYear
    2011
  • fDate
    26-28 July 2011
  • Firstpage
    1490
  • Lastpage
    1494
  • Abstract
    The Application of word sense disambiguation (WSD) methods based on supervised machine learning are limited by the difficulties in defining sense tags and acquiring labeled data for training. In this paper, the two problems of WSD are solved in a semi-supervised learning framework with the help of parallel corpora. The sense tags are defined automatically according to the results of word alignment on the parallel corpora. And label propagation, a graph-based semi-supervised algorithm, is employed. The experiments show that our method achieves great improvement on Chinese WSD tasks and the performances get significant growth when the scale of monolingual sentences is increasing.
  • Keywords
    computational linguistics; learning (artificial intelligence); monolingual sentences; parallel corpora; semisupervised learning; supervised machine learning; word alignment; word sense disambiguation; Accuracy; Computational linguistics; Computers; Conferences; Support vector machines; Training; Training data; label propagation; parallel corpora; semi-supervised learning; word sense disambiguation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery (FSKD), 2011 Eighth International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-61284-180-9
  • Type

    conf

  • DOI
    10.1109/FSKD.2011.6019785
  • Filename
    6019785