• DocumentCode
    2381575
  • Title

    DM_Integration: A Framework for Iterative Large Volume Data Integration

  • Author

    Li, Junkui ; Wang, Yuanzhen ; Li, Zhuan

  • fYear
    2007
  • fDate
    1-3 Nov. 2007
  • Firstpage
    68
  • Lastpage
    73
  • Abstract
    In this work, we consider the problem of large volume data integration in a relative compacted memory system. Previous work on data integration mainly focused on the difference elimination between data sources, and performed integration in memory in one pass. However, we see a new aspect of the problem: the data volume. We present a general framework called DM Integration for large volume data integration. The approach, instead of running one task to finish the whole integration, decomposes the integration task into several subtasks, each subtask works on a rela- tive small fraction of the data, and the subtasks run concur- rently. We show this method can be applied in novel ways for large volume data integration in a configurable mode. Keywords: Data Integration, Large Volume Data, Task Decomposition
  • Keywords
    Acceleration; Computer science; Data mining; Data privacy; Databases; Delta modulation; Dispatching; Educational institutions; Web pages; Web sites;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data, Privacy, and E-Commerce, 2007. ISDPE 2007. The First International Symposium on
  • Conference_Location
    Chengdu
  • Print_ISBN
    978-0-7695-3016-1
  • Type

    conf

  • DOI
    10.1109/ISDPE.2007.10
  • Filename
    4402641