Title :
Averaging refined sample centers for faster out-of-core clustering
Author :
Pakhira, Malay K.
Author_Institution :
Kalyani Gov. Eng. Coll., Kalyani
Abstract :
A novel sample based clustering technique has been developed in this paper. Since traditional k-means algorithm is very time consuming for large disk resident data, sample based, out-of-core clustering techniques have gained high popularity recently. We have used the concept of elimination of measurement errors by averaging over a number of samples. Here, samples or original data set are chosen randomly, they are clustered individually, in association with a refinement technique, to produce a number of sets of refined cluster centers. Average of these refined centers are expected to form a near approximation of the true centers of the original data set. A comparison of the proposed method with some existing ones proves the efficiency and usefulness of the former.
Keywords :
measurement errors; pattern clustering; faster out-of-core clustering; large disk resident data; measurement errors; refined sample centers; Clustering algorithms; Educational institutions; Government; Measurement errors; Memory management; Multidimensional systems; Partitioning algorithms; Refining; Sampling methods; Testing; Averaging; Refined centers; Sample based clustering;
Conference_Titel :
Computing, Communication and Networking, 2008. ICCCn 2008. International Conference on
Conference_Location :
St. Thomas, VI
Print_ISBN :
978-1-4244-3594-4
Electronic_ISBN :
978-1-4244-3595-1
DOI :
10.1109/ICCCNET.2008.4787716