DocumentCode
1865997
Title
Semantic search using a similarity graph
Author
Stanchev, Lubomir
Author_Institution
Comput. Sci. Dept., Indiana Univ. - Purdue Univ. Fort Wayne, Fort Wayne, IN, USA
fYear
2015
fDate
7-9 Feb. 2015
Firstpage
93
Lastpage
100
Abstract
Given a set of documents and an input query that is expressed in a natural language, the problem of document search is retrieving the most relevant documents. Unlike most existing systems that perform document search based on keywords matching, we propose a search method that considers the meaning of the words in the query and the document. As a result, our algorithm can return documents that have no words in common with the input query as long as the documents are relevant. For example, a document that contains the words “Ford”, “Chrysler” and “General Motors” multiple times is surely relevant for the query “car” even if the word “car” does not appear in the document. Our semantic search algorithm is based on a similarity graph that contains the degree of semantic similarity between terms, where a term can be a word or a phrase. We experimentally validate our algorithm on the Cranfield benchmark that contains 1400 documents and 225 natural language queries. The benchmark also contains the relevant documents for every query as determined by human judgment. We show that our semantic search algorithm produces a higher value for the mean average precision (MAP) score than a keywords matching algorithm. This shows that our approach can improve the quality of the result because the meaning of the words and phrases in the documents and the queries is taken into account.
Keywords
natural language processing; query processing; semantic Web; MAP score; cranfield benchmark; document search; infonnation retrieval system; keyword matching algorithm; mean average precision score; natural language queries; semantic search algorithm; semantic similarity degree; similarity graph; Electronic publishing; Encyclopedias; Information retrieval; OWL; Ontologies;
fLanguage
English
Publisher
ieee
Conference_Titel
Semantic Computing (ICSC), 2015 IEEE International Conference on
Conference_Location
Anaheim, CA
Type
conf
DOI
10.1109/ICOSC.2015.7050785
Filename
7050785
Link To Document