• DocumentCode
    2406260
  • Title

    Bridging the Gap between Heterogeneous and Semantically Diverse Content of Different Disciplines

  • Author

    Bykau, Siarhei ; Kiyavitskaya, Nadzeya ; Tsinaraki, Chrisa ; Velegrakis, Yannis

  • fYear
    2010
  • fDate
    Aug. 30 2010-Sept. 3 2010
  • Firstpage
    305
  • Lastpage
    309
  • Abstract
    The Web has been flooded with highly heterogeneous data sources that freely offer their data to the public. Careful design and compliance with standards are one way to cope with this heterogeneity. However, agreement and compliance are hard to achieve in practice across different communities. In this work we describe a framework that enables the exploitation of content across different scientific disciplines. Our approach combines several novel techniques at the syntactic, structural and semantic levels. In particular, we advocate that integration should take place at a much higher level, factoring out any syntactic discrepancies and facilitating the exchange of information. We show how a novel technique for data annotation using intentional attributes can cope with data associations in high data volumes, we present a way to overcome the multilingualism barrier, and we describe a new kind of database that treats data evolution as a first-class citizen, with the additional ability to annotate free text.
  • Keywords
    Internet; data analysis; distributed databases; World Wide Web; data annotation; data associations; data evolution; data volumes; heterogeneous data sources; intentional attributes; multilingualism barrier; semantically diverse content; syntactic discrepancy; Biotechnology; Context; Data models; Data structures; Databases; Semantics
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Database and Expert Systems Applications (DEXA), 2010 Workshop on
  • Conference_Location
    Bilbao
  • ISSN
    1529-4188
  • Print_ISBN
    978-1-4244-8049-4
  • Type

    conf

  • DOI
    10.1109/DEXA.2010.67
  • Filename
    5591174