Title :
Document Indexing and Retrieval Using Wikipedia
Author :
Chahine, Carlo Abi ; Chaignaud, Nathalie ; Kotowicz, Jean-Philippe ; Pécuchet, Jean-Pierre
Author_Institution :
LITIS, INSA Rouen, Rouen, France
Date :
Nov. 28 2011-Dec. 1 2011
Abstract :
This paper introduces a framework for conceptual indexing and retrieval of documents. It uses a graph composed of terms from a text and elements of the Wikipedia Category Network. Conceptual indexing consists of finding the relevant Wikipedia articles and categories that can be used to describe the text. Conceptual retrieval consists of using these articles and categories to return the documents relevant to a user query. Finally, a proof-of-concept prototype is presented.
Keywords :
Web sites; indexing; information retrieval; Wikipedia Category Network; conceptual retrieval; document indexing; document retrieval; user query; Electronic publishing; Encyclopedias; Internet; Java; Linux; Wikipedia
Conference_Title :
Signal-Image Technology and Internet-Based Systems (SITIS), 2011 Seventh International Conference on
Conference_Location :
Dijon
Print_ISBN :
978-1-4673-0431-3
DOI :
10.1109/SITIS.2011.52