Title :
D3G: novel approaches to data statistics, understanding and preprocessing on the grid
Author :
Wöhrer, Alexander ; Nováková, Lenka ; Brezany, Peter ; Tjoa, A. Min
Author_Institution :
Inst. of Sci. Comput., Univ. of Vienna, Austria
Abstract :
Relocating the code for data preprocessing (DPP) closer towards the data source is the overall task of the D3G framework (data statistics, data understanding, data preprocessing on the grid), developed within a joint project of the University of Vienna, the Vienna University of Technology and the Czech Technical University. This paper presents the data service side architecture to gather data statistics on-the-fly and use them in remote DPP methods on query results as well as an approach to gather exact continuous data statistics for whole tables in a database on the grid. The performance results of our prototype implementation are showing low running costs for the continuous data statistics inside the database and also the feasibility of our proposed data service side functionality.
Keywords :
data analysis; data mining; grid computing; query processing; relational databases; D3G framework; RDBMS; data mining; data preprocessing; data service side architecture; data statistics; data understanding; grid computing; query processing; remote DPP methods; Cost function; Cybernetics; Data mining; Data preprocessing; Databases; Interactive systems; Prototypes; Scientific computing; Statistical analysis; Statistics;
Conference_Titel :
Advanced Information Networking and Applications, 2006. AINA 2006. 20th International Conference on
Print_ISBN :
0-7695-2466-4
DOI :
10.1109/AINA.2006.137