DocumentCode :
3323196
Title :
NAGA: Searching and Ranking Knowledge
Author :
Kasneci, Gjergji ; Suchanek, Fabian M. ; Ifrim, Georgiana ; Ramanath, Maya ; Weikum, Gerhard
Author_Institution :
Max Planck Institute for Informatics, Saarbrücken
fYear :
2008
fDate :
7-12 April 2008
Firstpage :
953
Lastpage :
962
Abstract :
The Web has the potential to become the world's largest knowledge base. In order to unleash this potential, the wealth of information available on the Web needs to be extracted and organized. There is a need for new querying techniques that are simple and yet more expressive than those provided by standard keyword-based search engines. Searching for knowledge rather than Web pages needs to consider inherent semantic structures like entities (person, organization, etc.) and relationships (isA, locatedIn, etc.). In this paper, we propose NAGA, a new semantic search engine. NAGA builds on a knowledge base, which is organized as a graph with typed edges, and consists of millions of entities and relationships extracted from Web-based corpora. A graph-based query language enables the formulation of queries with additional semantic information. We introduce a novel scoring model, based on the principles of generative language models, which formalizes several notions such as confidence, informativeness and compactness and uses them to rank query results. We demonstrate NAGA's superior result quality over state-of-the-art search engines and question answering systems.
Keywords :
Internet; Web sites; graph theory; knowledge based systems; query languages; query processing; search engines; NAGA; Web pages; Web-based corpora; World Wide Web; generative language models; graph-based query language; keyword-based search engines; knowledge base; querying techniques; question answering systems; ranking knowledge; searching knowledge; semantic search engine; state-of-the-art search engines; Biological system modeling; Cultural differences; Data mining; Database languages; Informatics; Information retrieval; Instruction sets; Search engines; Web pages; Web sites
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on
Conference_Location :
Cancun
Print_ISBN :
978-1-4244-1836-7
Electronic_ISBN :
978-1-4244-1837-4
Type :
conf
DOI :
10.1109/ICDE.2008.4497504
Filename :
4497504