• DocumentCode
    2664338
  • Title

    Feature expansion for word sense disambiguation

  • Author

    Tsao, Nai-Lung ; Wible, David ; Kuo, Chin-Hwa

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Tamkang Univ., Taiwan, China
  • fYear
    2003
  • fDate
    26-29 Oct. 2003
  • Firstpage
    126
  • Lastpage
    131
  • Abstract
    One of the most serious obstacles in research on word sense disambiguation (WSD) is sparseness of training data. We describe and motivate a method of feature expansion as a means of resolving the data sparseness problem in supervised corpus-based WSD. The expanded features are extracted from an existing corpus and WordNet automatically. We use our method to supplement the feature expansion approach of [Leacock and Chodorow 1998]. In the experiments, the addition of our method more than doubled the precision improvement over baseline that was obtained by using Leacock and Chodorow´s approach alone.
  • Keywords
    computational linguistics; feature extraction; word processing; WordNet; data sparseness; feature expansion; feature extraction; supervised corpus-based word sense disambiguation; training data; Bayesian methods; Computer science; Data engineering; Data mining; Data structures; Feature extraction; Information science; Tagging; Testing; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on
  • Conference_Location
    Beijing, China
  • Print_ISBN
    0-7803-7902-0
  • Type

    conf

  • DOI
    10.1109/NLPKE.2003.1275882
  • Filename
    1275882