• DocumentCode
    2059982
  • Title

    Detecting Abnormal Semantic Web Data Using Semantic Dependency

  • Author

    Yu, Yang ; Li, Yingjie ; Heflin, Jeff

  • fYear
    2011
  • fDate
    18-21 Sept. 2011
  • Firstpage
    154
  • Lastpage
    157
  • Abstract
    Data quality is a critical problem for the Semantic Web. We propose that the degree to which a triple deviates from similar triples can be an important heuristic for identifying errors. Inspired by data dependency, which has shown promise in database data quality research, we introduce Semantic Dependency to assess quality of Semantic Web data. The system first builds a summary graph for finding candidate semantic dependencies. Each semantic dependency has a probability according to its instantiations and is subsequently adjusted based on the inconsistencies among them. Then triples can get a posterior probability of normality based on what semantic dependencies can support each of them. Repeating the iteration above, the proposed approach detects abnormal Semantic Web data. Experiments have shown that the system is efficient on data set with 10M triples and has more than a ten percent F-score improvement over our previous system.
  • Keywords
    database management systems; semantic Web; F-score improvement; abnormal semantic Web data; data dependency; database data quality research; semantic dependency; summary graph; Databases; Ontologies; Probabilistic logic; Resource description framework; Semantics; USA Councils; Detecting Abnormal Semantic Web Data; Semantic Dependency;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Semantic Computing (ICSC), 2011 Fifth IEEE International Conference on
  • Conference_Location
    Palo Alto, CA
  • Print_ISBN
    978-1-4577-1648-5
  • Electronic_ISBN
    978-0-7695-4492-2
  • Type

    conf

  • DOI
    10.1109/ICSC.2011.81
  • Filename
    6061425