• DocumentCode
    3212583
  • Title

    A Semi-Supervised method for Persian homograph Disambiguation

  • Author

    Riahi, Noushin ; Sedghi, Fatemeh

  • Author_Institution
    Comput. Eng. Dept., Alzahra Univ., Tehran, Iran
  • fYear
    2012
  • fDate
    15-17 May 2012
  • Firstpage
    748
  • Lastpage
    751
  • Abstract
    One of the major challenges in the most natural languages processing (NLP) tasks such as machine translation, text to speech and text mining is Word Sense Disambiguation (WSD). Supervised methods are the most common solutions for WSD. However, they need large tagged corpuses which are not available in some languages such as Persian. The Semi-Supervised methods can solve this problem by using small tagged corpus and large untagged corpus. This paper presents a coarse-grained work in WSD that uses tri-training as the semi-supervised method and decision list as supervised classifier for training. The proposed method was evaluated on a corpus. The results show that the proposed method is more precise than the conventional Decision list when the tagged corpus is small.
  • Keywords
    natural language processing; pattern classification; NLP tasks; Persian homograph disambiguation; WSD; decision list; large untagged corpus; machine translation; natural languages processing task; semisupervised method; small tagged corpus; supervised classifier; text mining; text to speech; tri-training; word sense disambiguation; Accuracy; decision list; homograph disambiguation; semi-supervised; tritraining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical Engineering (ICEE), 2012 20th Iranian Conference on
  • Conference_Location
    Tehran
  • Print_ISBN
    978-1-4673-1149-6
  • Type

    conf

  • DOI
    10.1109/IranianCEE.2012.6292453
  • Filename
    6292453