DocumentCode :
2598657
Title :
Generating Optimum Number of Clusters Using Median Search and Projection Algorithms
Author :
Suresh, Lalith ; Simha, Jay B. ; Veluru, Rajappa
Author_Institution :
CSE Dept., CITech, Bangalore, India
fYear :
2010
fDate :
20-23 April 2010
Firstpage :
97
Lastpage :
102
Abstract :
K-means Clustering is an important algorithm for identifying the structure in data. Kmeans is the simplest clustering algorithm. This algorithm takes a predefined number of clusters as input. Mean stands for an average, an average location of all the members of a particular cluster. This algorithm is based on random selection of cluster centers and iteratively improving the results. In this work, a novel approach to seeding the clusters with the latent data structure is proposed. This is expected to minimize: The need for number of clusters apriory Time for convergence by providing near optimal cluster centers. Also these algorithms are tested on the latest standards for data warehouses - the column store databases.
Keywords :
data structures; data warehouses; pattern clustering; clustering algorithm; column store databases; data structure; data warehouses; k-means clustering; median search; near optimal cluster centers; projection algorithms; Clustering algorithms; Clustering methods; Conferences; Convergence; Data structures; Iterative algorithms; Partitioning algorithms; Performance analysis; Projection algorithms; Testing; Clustering; DBMS; Median Projection; Median Selection; k-means Algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Information Networking and Applications Workshops (WAINA), 2010 IEEE 24th International Conference on
Conference_Location :
Perth, WA
Print_ISBN :
978-1-4244-6701-3
Type :
conf
DOI :
10.1109/WAINA.2010.196
Filename :
5480848
Link To Document :
بازگشت