Title :
A Model for Ranking Entities and Its Application to Wikipedia
Author :
Demartini, Gianluca ; Firan, Claudiu S. ; Iofciu, Tereza ; Krestel, Ralf ; Nejdl, Wolfgang
Author_Institution :
L3S Res. Center, Leibniz Univ. Hannover, Hannover
Abstract :
Entity Ranking (ER) is a recently emerging search task in Information Retrieval, where the goal is not finding documents matching the query words, but instead finding entities which match types and attributes mentioned in the query. In this paper we propose a formal model to define entities as well as a complete ER system, providing examples of its application to enterprise, Web, and Wikipedia scenarios. Since searching for entities on Web scale repositories is an open challenge as the effectiveness of ranking is usually not satisfactory, we present a set of algorithms based on our model and evaluate their retrieval effectiveness. The results show that combining simple Link Analysis, Natural Language Processing, and Named Entity Recognition methods improves retrieval performance of entity search by over 53% for P@10 and 35% for MAP.
Keywords :
Internet; pattern matching; query processing; Web scale repository; attribute matching; entity ranking; formal model; information retrieval; query word; search task; type matching; wikipedia; Algorithm design and analysis; Data mining; Erbium; Information retrieval; Natural language processing; Performance analysis; Search engines; Testing; Web pages; Wikipedia; Wikipedia; entity ranking; evaluation; model;
Conference_Titel :
Web Conference, 2008. LA-WEB '08., Latin American
Conference_Location :
Espfrito Santo
Print_ISBN :
978-0-7695-3397-1
Electronic_ISBN :
978-0-7695-3397-1
DOI :
10.1109/LA-WEB.2008.8