Title :
K-means clustering algorithm based on coefficient of variation
Author :
Ren, Shuhua ; Fan, Alin
Author_Institution :
Sch. of Inf. Sci. & Eng., Dalian Polytech. Univ., Dalian, China
Abstract :
The performance of k-means clustering algorithm depends on the selection of distance metrics. The Euclid distance is commonly chosen as the similarity measure in k-means clustering algorithm, which treats all features equally and does not accurately reflect the similarity among samples. K-means clustering algorithm based on coefficient of variation (CV-k-means) is proposed in this paper to solve this problem. The CV-k-means clustering algorithm uses variation coefficient weight vector to decrease the affects of irrelevant features. The experimental results show that the proposed algorithm can generate better clustering results than k-means algorithm do.
Keywords :
pattern clustering; CV-k-means clustering algorithm; Euclid distance; distance metrics; variation coefficient weight vector; Accuracy; Algorithm design and analysis; Clustering algorithms; Equations; Mathematical model; Minimization; Signal processing algorithms; coefficient of variation; k-means clustering; similarity metrics; weighting;
Conference_Titel :
Image and Signal Processing (CISP), 2011 4th International Congress on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-9304-3
DOI :
10.1109/CISP.2011.6100578