• DocumentCode
    244543
  • Title

    ART Lab infrastructure for semantic Big Data processing

  • Author

    Fiorelli, Manuel ; Pazienza, Maria Teresa ; Stellato, A. ; Turbati, Andrea

  • Author_Institution
    Dept. of Enterprise Eng. (DII), ART Res. Group, Univ. of Rome "Tor Vergata", Rome, Italy
  • fYear
    2014
  • fDate
    21-25 July 2014
  • Firstpage
    327
  • Lastpage
    334
  • Abstract
    In this paper we briefly describe the ART Lab infrastructure for semantic Big Bata processing. Our most relevant contribution is the definition of an architecture supporting ontology development driven by knowledge acquired from heterogeneous resources, such as documents and web pages. The overall perspective is to propose a gluing architecture driving and supporting the entire flow of information, from data acquisition from external heterogeneous resources to their exploitation for RDF triplification. In such an architecture, the unstructured content analysis capabilities of frameworks such as UIMA are integrated in a coordinated environment supporting the processing, transformation and projection of produced metadata into RDF semantic repositories, which are managed by Semantic Turkey, our platform for Knowledge Acquisition and Management. Further contributions relate to the possibility of easily managing high dimension repositories (e.g., thesauri, vocabularies, etc.), and supporting end users for sharing the “logics” under the reasoning processes!
  • Keywords
    Big Data; Web sites; data acquisition; knowledge acquisition; ontologies (artificial intelligence); semantic Web; ART lab infrastructure; RDF semantic repository; RDF triplification; Semantic Turkey; UIMA; Web pages; architecture supporting ontology development; coordinated environment; data acquisition; gluing architecture; heterogeneous resource; knowledge acquisition and management; reasoning process; relevant contribution; semantic Big Data processing; unstructured content analysis capability; Big data; Knowledge acquisition; Ontologies; Resource description framework; Semantics; Subspace constraints; Vocabulary; architecture; big data; semantic processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing & Simulation (HPCS), 2014 International Conference on
  • Conference_Location
    Bologna
  • Print_ISBN
    978-1-4799-5312-7
  • Type

    conf

  • DOI
    10.1109/HPCSim.2014.6903704
  • Filename
    6903704