Title :
Named Entity Disambiguation Leveraging Multi-aspect Information
Author :
Quanlong Zhang;Feng Li;Fang Wang;Zhoujun Li
Author_Institution :
State Key Lab. of Software Dev. Environ., Beihang Univ., Beijing, China
Abstract :
Named Entity Disambiguation (NED) aims at disambiguating named entity mentions in a text to their corresponding entries in a knowledge base such as Wikipedia. Itis a fundamental task in Natural Language Processing (NLP)and has many applications such as information extraction, information retrieval, and knowledge acquisition. In the past decade, a number of methods have been proposed for the NED task. However, most of existing work focuses on exploring many more useful information to help tackle this problem. The effectiveness of different features proposed for the task are not well-studied in a same platform. In this paper, we extract various remarkable features by leveraging statistical, textual and semantic information, and evaluate various combinations of the multiaspect features for the disambiguation task in the same platform. Specifically, we utilize two learning to rank methods to combine different features, train and test the combined methods on several standard data sets. Through extensive experiments, we investigate the effects on the quality of the disambiguation of exploiting different features and show which combinations of features are the best choices for disambiguation.
Keywords :
"Encyclopedias","Electronic publishing","Internet","Feature extraction","Context","Semantics"
Conference_Titel :
Data Mining Workshop (ICDMW), 2015 IEEE International Conference on
Electronic_ISBN :
2375-9259
DOI :
10.1109/ICDMW.2015.35