Title :
Clustering-based feature selection for verb sense disambiguation
Author :
Chen, Jinying ; Palmer, Martha
Author_Institution :
Dept. of Comput. & Inf. Sci., Pennsylvania Univ., Philadelphia, PA, USA
fDate :
30 Oct.-1 Nov. 2005
Abstract :
This paper presents a novel feature selection algorithm for supervised verb sense disambiguation. The algorithm disambiguates and aggregates WordNet synsets of a verb´s noun phrase (NP) arguments in the training data. It was then used to filter out irrelevant WordNet semantic features introduced by the ambiguity of verb NP arguments. Experimental results showed that our new feature selection method boosted our system´s performance on verbs whose meanings depended heavily on their NP arguments. Furthermore, our method outperformed two standard feature selection methods, indicating its effectiveness and advantages, especially for small-sample machine learning tasks like supervised WSD.
Keywords :
feature extraction; learning (artificial intelligence); natural languages; WordNet semantic features; clustering-based feature selection algorithm; machine learning; supervised WSD; supervised verb sense disambiguation; training data set; verb NP argument; verb noun phrase; Aggregates; Clustering algorithms; Filters; Frequency; Information science; Machine learning; Machine learning algorithms; Smoothing methods; System performance; Training data;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN :
0-7803-9361-9
DOI :
10.1109/NLPKE.2005.1598703