• DocumentCode
    1900881
  • Title

    XML Semantic Filtering via Ontology Reasoning

  • Author

    Baggi, M. ; Falaschi, M. ; Ballis, D.

  • Author_Institution
    Dip. di Sci. Matematiche e Inf., Univ. Siena, Siena
  • fYear
    2008
  • fDate
    8-13 June 2008
  • Firstpage
    482
  • Lastpage
    487
  • Abstract
    In this paper, we present an extension of PHIL, a declarative language for filtering information from XML data. The proposed approach allows us to extract relevant data as well as to exclude useless and misleading contents from an XML document. Essentially, it combines ontology reasoning with an approximate pattern-matching engine which searches for patterns in a flexible way (i.e. modulo renaming, insertion, and deletion of XML items) and ranks the results w.r.t. their cost. The filtering process is guided by the syntax as well as the semantics of the XML documents, since it relies on both the document structure and the ontological information to which the document is related. Such information is retrieved by querying (possibly remote) ontology reasoners. Our methodology has been implemented in the XPHIL system, which is written in Haskell. By using the XML benchmarking tool xmlgen, we have developed some scalable experiments which demonstrate the usefulness of our approach.
  • Keywords
    XML; ontologies (artificial intelligence); XML data; XML document; XML semantic filtering; declarative language; information filtering; ontological information; ontology reasoning; pattern-matching engine; Data mining; Databases; Information filters; Information retrieval; Matched filters; Ontologies; Pattern matching; Web and internet services; query language
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Internet and Web Applications and Services, 2008. ICIW '08. Third International Conference on
  • Conference_Location
    Athens
  • Print_ISBN
    978-0-7695-3163-2
  • Electronic_ISBN
    978-0-7695-3163-2
  • Type

    conf

  • DOI
    10.1109/ICIW.2008.89
  • Filename
    4545659