• DocumentCode
    2396869
  • Title

    Integration of relational databases and record-based legacy systems for populating data warehouses

  • Author

    Miller, L.L. ; Yu, Xin ; Nilakanta, Sree

  • Author_Institution
    Dept. of Comput. Sci., Iowa State Univ., Ames, IA, USA
  • fYear
    2002
  • fDate
    7-10 Jan. 2002
  • Firstpage
    3033
  • Lastpage
    3041
  • Abstract
    The number of data sources that an organization has to deal with continues to be nontrivial. Integrating this data is a growing problem. A great deal of research has been done to solve the general problem. Work on topics like multi-databases, mediators and ontologies has been directed at solving the general data integration problem. While all of this activity has been useful, the general problem of integrating heterogeneous data sources remains only partially solved. This paper looks at the solution of a sub-problem of the general problem where the data sources are restricted to relational databases and record-based legacy systems owned by the same organization. For many organizations, this restriction precisely defines their integration problem. For example, data warehouses typically have a tuple as their storage format, whether they are table- or cube-oriented. The task of defining the tuple requires integrating the organization´s existing relational and/or record-based legacy systems. Populating the data warehouse can be accomplished by querying the integrated data sources. Specifically, we provide a mechanism for developing a relational model for the set of data sources, provide a method for generating correct queries over the model, and create an architecture for executing the queries based on the mobile agent paradigm. A prototype of the system has been designed and implemented.
  • Keywords
    data warehouses; distributed programming; integrated software; merging; query processing; records management; relational databases; software agents; software architecture; software prototyping; correct query generation; data warehouse population; general data integration problem; heterogeneous data sources; integrated data source querying; mediators; mobile agent paradigm; multi-databases; ontologies; prototype system; query execution architecture; record-based legacy systems; relational databases; relational model; sub-problem; tuple storage format; Computer science; Data warehouses; Distributed databases; Educational institutions; File systems; Information systems; Mobile agents; Ontologies; Prototypes; Relational databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    System Sciences, 2002. HICSS. Proceedings of the 35th Annual Hawaii International Conference on
  • Print_ISBN
    0-7695-1435-9
  • Type

    conf

  • DOI
    10.1109/HICSS.2002.994286
  • Filename
    994286