DocumentCode :
1724202
Title :
Using features extracted from Wikipedia for the task of Word Sense Disambiguation
Author :
Bawakid, Abdullah ; Oussalah, Mourad
Author_Institution :
Dept. of Electron., Electr. & Comput. Eng., Univ. of Birmingham, Birmingham, UK
fYear :
2010
Firstpage :
1
Lastpage :
6
Abstract :
In this paper, a method using features extracted from Wikipedia for the task of Word Sense Disambiguation (WSD) is presented and evaluated. A term-concepts table constructed from Wikipedia and the redirect links is described. With its help, the Wikipedia internal links along with the categories structure are used to compute the relatedness between any two concepts through a two-level process: a term-concepts expansion followed by a links-based expansion. The result is a ranked list of concepts which are most related to the ambiguous term given the context it exists in. For the evaluation experiment, the benchmark is constructed from a segment of the internal links of Wikipedia. The evaluation results obtained suggest that introducing links analysis and the categories structure to the built term-concepts table provide improvement to the accuracy of the method in the WSD task.
Keywords :
Internet; Web sites; feature extraction; natural language processing; WSD; Wikipedia; feature extraction; word sense disambiguation; Boosting; Context; Electronic publishing; Encyclopedias; Feature extraction; Internet; Categories; WSD; Wikipedia; Word Sense Disambiguation; links analysis; strong links;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cybernetic Intelligent Systems (CIS), 2010 IEEE 9th International Conference on
Conference_Location :
Reading
Print_ISBN :
978-1-4244-9023-3
Electronic_ISBN :
978-1-4244-9024-0
Type :
conf
DOI :
10.1109/UKRICIS.2010.5898147
Filename :
5898147
Link To Document :
بازگشت