Title :
Using an out-of-core technique for clustering large data sets
Author :
Masciari, Elio ; Pizzuti, Clara ; Raimondo, G. ; Talia, Domenico
Author_Institution :
DEIS, Calabria Univ., Rende, Italy
Abstract :
Data mining algorithms generally deal with very large data sets that do not fit in main memory. Therefore, techniques that manage huge data sets need to be developed. Any algorithm that is proposed for mining data should account for out-of-core data structures. However, most of the existing algorithms have not yet addressed this issue. In this paper we describe the implementation of an out-of-core technique for the data analysis of very large data sets with the sequential and parallel version of the clustering algorithm AutoClass. We discuss the out-of-core technique and show the performance results in terms of execution time and speed up
Keywords :
data analysis; data mining; data structures; AutoClass; clustering algorithm; data analysis; data mining; out-of-core data structures; very large data sets; Algorithm design and analysis; Bayesian methods; Clustering algorithms; Data analysis; Data mining; Data structures; Databases; Delta modulation; Information analysis; Partitioning algorithms;
Conference_Titel :
Database and Expert Systems Applications, 2001. Proceedings. 12th International Workshop on
Conference_Location :
Munich
Print_ISBN :
0-7695-1230-5
DOI :
10.1109/DEXA.2001.953053