• Title of article

    Integrating domain heterogeneous data sources using decomposition aggregation queries

  • Author/Authors

    Jian Xu، نويسنده , , Rachel Pottinger، نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2014
  • Pages
    28
  • From page
    80
  • To page
    107
  • Abstract
    The decomposition aggregation query (DAQ) we introduce in this paper extends semantic integration queries by allowing query translation to create aggregate queries based on the DAQʹs novel three role structure. We describe the application of DAQs in integrating domain heterogeneous data sources, the new semantics of DAQ answers and the query translation algorithm called “aggregation rewriting”. A central problem of optimizing DAQ processing requires determining the data sources towards which the DAQ is translated. Our source selection algorithm has cover-finding and partitioning steps which are optimized to 1. lower the processing overhead while speeding up query answering and 2. eliminate duplicates with minimal overhead. We establish connections between source selection optimizations and classic NP-hard optimizations and resolve the optimization problems with efficient solvers. We empirically study both the DAQ query translation and the source selection algorithms using real-world and synthetic data sets; the results show satisfying scalability both in size of aggregations and data sources for the query translation algorithms and the source selection algorithms save a good amount of computational resources.
  • Keywords
    Semantic integration , Aggregation , Query Optimization
  • Journal title
    Information Systems
  • Serial Year
    2014
  • Journal title
    Information Systems
  • Record number

    1230362