• DocumentCode
    247102
  • Title

    An Automatic Method for Deriving OWL Ontologies from XML Documents

  • Author

    Minutolo, A. ; Esposito, Anna ; Ciampi, Mario ; Esposito, M. ; Cassetti, G.

  • Author_Institution
    Inst. for High-Performance Comput. & Networking, Naples, Italy
  • fYear
    2014
  • fDate
    8-10 Nov. 2014
  • Firstpage
    426
  • Lastpage
    431
  • Abstract
    In the last decade, the field of Big Data Analytics has become increasingly important in both the academic and the business communities. Typically, data are mostly structured, collected by different actors through various heterogeneous and distributed information sources, and stored and managed often directly in XML. In order to enable large volume of data to be described in such a way that their meaning can be exploited by machines and, thus, semantic queries and automatic inferential procedures can be enabled, this paper presents an automatic method to derive OWL ontologies from XML schemas. The main contribution of this method relies on the possibility of producing a target ontology starting from multiple XML schemas, by discriminating between domain and cross-domain entities and, contextually, simplifying the overall structure of the final ontology generated, i.e. By eliminating not-used cross-domain entities. This method has been applied to a concrete application case in the healthcare domain, with the goal of generating an ontological model from the XML schemas implementing the HL7 Version 3 Clinical Document Architecture Release 2.
  • Keywords
    Big Data; XML; data analysis; document handling; knowledge representation languages; ontologies (artificial intelligence); Big Data analytics; HL7 Clinical Document Architecture Release; OWL ontology derivation; Web ontology language; XML documents; automatic inferential procedure; data collection; data management; data storage; extensible markup language; health care domain; information sources; ontology production; semantic queries; Medical services; OWL; Ontologies; Optimization; Resource description framework; Semantics; XML; HL7 CDA; OWL; Ontologies; Ontology generation; XML Schema;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC), 2014 Ninth International Conference on
  • Conference_Location
    Guangdong
  • Type

    conf

  • DOI
    10.1109/3PGCIC.2014.88
  • Filename
    7024622