Title :
Max-margin clustering: Detecting margins from projections of points on lines
Author :
Gopalan, Raghuraman ; Sankaranarayanan, Jagan
Author_Institution :
Center for Autom. Res., Univ. of Maryland, College Park, MD, USA
Abstract :
Given a unlabelled set of points X ϵ ℝN belonging to k groups, we propose a method to identify cluster assignments that provides maximum separating margin among the clusters. We address this problem by exploiting sparsity in data points inherent to margin regions, which a max-margin classifier would produce under a supervised setting to separate points belonging to different groups. By analyzing the projections of X on the set of all possible lines L in ℝN, we first establish some basic results that are satisfied only by those line intervals lying outside a cluster, under assumptions of linear separability of clusters and absence of outliers. We then encode these results into a pair-wise similarity measure to determine cluster assignments, where we accommodate non-linearly separable clusters using the kernel trick. We validate our method on several UCI datasets and on some computer vision problems, and empirically show its robustness to outliers, and in cases where the exact number of clusters is not available. The proposed approach offers an improvement in clustering accuracy of about 6% on the average, and up to 15% when compared with several existing methods.
Keywords :
pattern classification; pattern clustering; UCI datasets; cluster assignment identification; cluster linear separability; computer vision problems; data point sparsity exploitation; kernel trick; line point projection; margin detection; max-margin classifier; max-margin clustering; outlier absence; pair-wise similarity measure; Algorithm design and analysis; Clustering algorithms; Decision making; Kernel; Principal component analysis; Silicon; Support vector machines;
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on
Conference_Location :
Providence, RI
Print_ISBN :
978-1-4577-0394-2
DOI :
10.1109/CVPR.2011.5995485