Title :
An Adaptation of the Vector-Space Model for Ontology-Based Information Retrieval
Author :
Castells, Pablo ; Fernández, Miriam ; Vallet, David
Author_Institution :
Escuela Politecnica Superior, Univ. Autonoma de Madrid
Abstract :
Semantic search has been one of the motivations of the semantic Web since it was envisioned. We propose a model for the exploitation of ontology-based knowledge bases to improve search over large document repositories. In our view of information retrieval on the semantic Web, a search engine returns documents rather than, or in addition to, exact values in response to user queries. For this purpose, our approach includes an ontology-based scheme for the semiautomatic annotation of documents and a retrieval system. The retrieval model is based on an adaptation of the classic vector-space model, including an annotation weighting algorithm, and a ranking algorithm. Semantic search is combined with conventional keyword-based retrieval to achieve tolerance to knowledge base incompleteness. Experiments are shown where our approach is tested on corpora of significant scale, showing clear improvements with respect to keyword-based search
Keywords :
information retrieval; knowledge based systems; ontologies (artificial intelligence); search engines; semantic Web; text analysis; annotation weighting algorithm; information retrieval models; keyword-based retrieval; large document repository; ontology languages; ontology-based information retrieval; ontology-based knowledge bases; ranking algorithm; retrieval system; search engine; semantic Web; semantic search; semiautomatic annotation; user query; vector-space model; Costs; Helium; Information retrieval; Knowledge management; Ontologies; Search engines; Semantic Web; Taxonomy; Testing; Information retrieval models; ontology languages; semantic Web.; semantic search;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
DOI :
10.1109/TKDE.2007.22