Title :
A new information theoretic clustering algorithm using k-nn
Author :
Vikjord, Vidar ; Jenssen, Robert
Author_Institution :
Dept. of Phys. & Technol., Univ. of Tromso, Tromso, Norway
Abstract :
We develop a new non-parametric hierarchical information theoretic clustering algorithm based on implicit estimation of cluster densities using k-nearest neighbors (k-nn). Compared to a kernel-based procedure, our k-nn approach is very robust with respect to the parameter choices, with a key ability to detect clusters of vastly different scales. Of particular importance is the use of two different values of k, depending on the evaluation of within-cluster entropy or a cross-cluster cross-entropy in order to obtain the final clustering. We conduct clustering experiments, and report promising results, focusing in particular on the proposed algorithm´s robustness to scale.
Keywords :
data analysis; entropy; pattern clustering; K-NN clustering approach; cross-cluster cross-entropy; data analysis; k-nearest neighbor clustering; kernel-based procedure; nonparametric hierarchical information theoretic clustering algorithm; within-cluster entropy evaluation; Clustering algorithms; Cost function; Entropy; Estimation; Kernel; Robustness; Signal processing algorithms; Information theoretic clustering; Parzen windowing; k-nearest neighbors; kernel density estimation; robustness to scale;
Conference_Titel :
Machine Learning for Signal Processing (MLSP), 2013 IEEE International Workshop on
Conference_Location :
Southampton
DOI :
10.1109/MLSP.2013.6661968