Title :
Knowledge harvesting from text and Web sources
Author :
Suchanek, F. ; Weikum, G.
Author_Institution :
Max Planck Inst. for Inf., Saarbrucken, Germany
Abstract :
The proliferation of knowledge-sharing communities such as Wikipedia and the progress in scalable information extraction from Web and text sources has enabled the automatic construction of very large knowledge bases. Recent endeavors of this kind include academic research projects such as DBpedia, KnowItAll, Probase, ReadTheWeb, and YAGO, as well as industrial ones such as Freebase and Trueknowledge. These projects provide automatically constructed knowledge bases of facts about named entities, their semantic classes, and their mutual relationships. Such world knowledge in turn enables cognitive applications and knowledge-centric services like disambiguating natural-language text, deep question answering, and semantic search for entities and relations in Web and enterprise data. Prominent examples of how knowledge bases can be harnessed include the Google Knowledge Graph and the IBM Watson question answering system. This tutorial presents state-of-the-art methods, recent advances, research opportunities, and open challenges along this avenue of knowledge harvesting and its applications.
Keywords :
Internet; knowledge based systems; natural language processing; text analysis; DBpedia; Freebase; Google knowledge graph; IBM Watson question answering system; KnowItAll; Probase; ReadTheWeb; Web data; Web sources; Wikipedia; YAGO; automatic construction; cognitive applications; deep question answering; enterprise data; knowledge bases; knowledge centric services; knowledge harvesting; knowledge sharing communities; mutual relationships; natural-language text; scalable information extraction; semantic classes; semantic search; text sources; Electronic publishing; Encyclopedias; Information retrieval; Internet; Knowledge based systems; Semantics;
Conference_Titel :
Data Engineering (ICDE), 2013 IEEE 29th International Conference on
Conference_Location :
Brisbane, QLD
Print_ISBN :
978-1-4673-4909-3
Electronic_ISBN :
1063-6382
DOI :
10.1109/ICDE.2013.6544916