Title :
Keyword Search Over Probabilistic RDF Graphs
Author :
Xiang Lian ; Lei Chen ; Zi Huang
Author_Institution :
Dept. of Comput. Sci., Univ. of Texas-Pan American, Edinburg, TX, USA
Abstract :
In many real applications, RDF (Resource Description Framework) has been widely used as a W3C standard to describe data in the Semantic Web. In practice, RDF data may often suffer from the unreliability of their data sources, and exhibit errors or inconsistencies. In this paper, we model such unreliable RDF data by probabilistic RDF graphs, and study an important problem, keyword search query over probabilistic RDF graphs (namely, the pg-KWS query). To retrieve meaningful keyword search answers, we design the score rankings for subgraph answers specific for RDF data. Furthermore, we propose effective pruning methods (via offline pre-computed score bounds and probabilistic threshold) to quickly filter out false alarms. We construct an index over the pre-computed data for RDF, and present an efficient query answering approach through the index. Extensive experiments have been conducted to verify the effectiveness and efficiency of our proposed approaches.
Keywords :
graph theory; query processing; semantic Web; W3C standard; data source unreliability; keyword search; pg-KWS query; probabilistic RDF graphs; pruning methods; query answering approach; resource description framework; score rankings; semantic Web; Data models; Entropy; Keyword search; Probabilistic logic; Resource description framework; Semantics; Vectors; Probabilistic RDF graph; keyword search; pg-KWS;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
DOI :
10.1109/TKDE.2014.2365791