DocumentCode :
3305800
Title :
A data preparation framework based on a multidatabase language
Author :
Sattler, Kai-Uwe ; Schallehn, Eike
Author_Institution :
Dept. of Comput. Sci., Magdeburg Univ. of Technol., Germany
fYear :
2001
fDate :
2001
Firstpage :
219
Lastpage :
228
Abstract :
Integration and analysis of data from different sources have to deal with several problems resulting from potential heterogeneities. The activities addressing these problems are called data preparation and are supported by various available tools. However, these tools process mostly in a batch-like manner, not supporting the iterative and explorative nature of the integration and analysis process. The authors present a framework for important data preparation tasks based on a multidatabase language. This language offers features for solving common integration and cleaning problems as part of query processing. Combining data preparation mechanisms and multidatabase query facilities permits applying and evaluating different integration and cleaning strategies without explicit loading and materialization of data. The paper introduces the language concepts and discusses their application for individual tasks of data preparation
Keywords :
data analysis; data preparation; distributed databases; query languages; query processing; cleaning problems; data analysis; data cleaning strategies; data integration; data preparation framework; data preparation tasks; heterogeneities; language concepts; multidatabase language; multidatabase query facilities; query processing; Cleaning; Computer science; Data analysis; Data mining; Database languages; Database systems; Memory management; Query processing; Read-write memory; Warehousing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Database Engineering and Applications, 2001 International Symposium on.
Conference_Location :
Grenoble
Print_ISBN :
0-7695-1140-6
Type :
conf
DOI :
10.1109/IDEAS.2001.938088
Filename :
938088
Link To Document :
بازگشت