DocumentCode :
3522024
Title :
Using an out-of-core technique for clustering large data sets
Author :
Masciari, Elio ; Pizzuti, Clara ; Raimondo, G. ; Talia, Domenico
Author_Institution :
DEIS, Calabria Univ., Rende, Italy
fYear :
2001
fDate :
2001
Firstpage :
133
Lastpage :
137
Abstract :
Data mining algorithms generally deal with very large data sets that do not fit in main memory. Therefore, techniques that manage huge data sets need to be developed. Any algorithm that is proposed for mining data should account for out-of-core data structures. However, most of the existing algorithms have not yet addressed this issue. In this paper we describe the implementation of an out-of-core technique for the data analysis of very large data sets with the sequential and parallel version of the clustering algorithm AutoClass. We discuss the out-of-core technique and show the performance results in terms of execution time and speed up
Keywords :
data analysis; data mining; data structures; AutoClass; clustering algorithm; data analysis; data mining; out-of-core data structures; very large data sets; Algorithm design and analysis; Bayesian methods; Clustering algorithms; Data analysis; Data mining; Data structures; Databases; Delta modulation; Information analysis; Partitioning algorithms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Database and Expert Systems Applications, 2001. Proceedings. 12th International Workshop on
Conference_Location :
Munich
Print_ISBN :
0-7695-1230-5
Type :
conf
DOI :
10.1109/DEXA.2001.953053
Filename :
953053
Link To Document :
بازگشت