• DocumentCode
    19123
  • Title

    Keyword Search Over Probabilistic RDF Graphs

  • Author

    Xiang Lian ; Lei Chen ; Zi Huang

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Texas-Pan American, Edinburg, TX, USA
  • Volume
    27
  • Issue
    5
  • fYear
    2015
  • fDate
    May 1 2015
  • Firstpage
    1246
  • Lastpage
    1260
  • Abstract
    In many real applications, RDF (Resource Description Framework) has been widely used as a W3C standard to describe data in the Semantic Web. In practice, RDF data may often suffer from the unreliability of their data sources, and exhibit errors or inconsistencies. In this paper, we model such unreliable RDF data by probabilistic RDF graphs, and study an important problem, keyword search query over probabilistic RDF graphs (namely, the pg-KWS query). To retrieve meaningful keyword search answers, we design the score rankings for subgraph answers specific for RDF data. Furthermore, we propose effective pruning methods (via offline pre-computed score bounds and probabilistic threshold) to quickly filter out false alarms. We construct an index over the pre-computed data for RDF, and present an efficient query answering approach through the index. Extensive experiments have been conducted to verify the effectiveness and efficiency of our proposed approaches.
  • Keywords
    graph theory; query processing; semantic Web; W3C standard; data source unreliability; keyword search; pg-KWS query; probabilistic RDF graphs; pruning methods; query answering approach; resource description framework; score rankings; semantic Web; Data models; Entropy; Keyword search; Probabilistic logic; Resource description framework; Semantics; Vectors; Probabilistic RDF graph; keyword search; pg-KWS;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2014.2365791
  • Filename
    6940261