Title :
Improved K-means clustering algorithm based on the optimized initial centriods
Author_Institution :
Dept. of Comput. Sci. & Technol, Langfang Teachers Coll., Langfang, China
Abstract :
K-means clustering algorithm is one of the most widely used clustering algorithms and has been applied in many fields of science and technology. A major problem of the k-means clustering algorithm is that the results in different types of clusters depending on the initial centroid which choose at random. At the same time, many feature values are taked into consideration, it leads to severe degradation in the performance. In this paper, an improved k-means clustering algorithm with variance is proposed. It selects the initial centriods using the Huffman tree structure. In order to solve the high-dimensional problem, principal component analysis based on variance is adopted. The experimental results confirm that the proposed algorithm is an efficient algorithm with better clustering accuracy and very less execution time.
Keywords :
pattern clustering; principal component analysis; trees (mathematics); Huffman tree structure; clustering accuracy; feature values; k-means clustering algorithm; optimized initial centriods; principal component analysis; variance; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Machine learning algorithms; Partitioning algorithms; Principal component analysis; Standards; Huffman tree; dissimilarity matrix; high-dimensional data; initial centriods; k-means; principal component analysis; variance;
Conference_Titel :
Computer Science and Network Technology (ICCSNT), 2013 3rd International Conference on
Conference_Location :
Dalian
DOI :
10.1109/ICCSNT.2013.6967151