Title :
Two semi-supervised locality sensitive k-means clustering algorithms by seeding
Author_Institution :
JiangSu Province Support Software Eng. R&D Center for Modern Inf. Technol. Applic. in Enterprise, Suzhou, China
Abstract :
Semi-supervised clustering takes advantage of a small amount of labeled data to bring a great benefit to the clustering of unlabeled data. Based on a locality sensitive k-means clustering method, this paper presents two novel semi-supervised clustering algorithms inspired by the semi-supervised variants of the k-means clustering by seeding. To investigate the effectiveness of our approaches, experiments are done on one artificial dataset and three real datasets. Experimental results show that two proposed methods can improve the clustering performance significantly compared to other unsupervised and semi-supervised clustering algorithms.
Keywords :
data handling; learning (artificial intelligence); pattern clustering; artificial dataset; clustering performance imrpovement; labeled data; real datasets; seeding; semi-supervised locality sensitive k-means clustering algorithms; semi-supervised variants; unlabeled data clustering; Accuracy; Clustering algorithms; Clustering methods; Information technology; Machine learning; Software engineering;
Conference_Titel :
Advanced Computational Intelligence (ICACI), 2012 IEEE Fifth International Conference on
Conference_Location :
Nanjing
Print_ISBN :
978-1-4673-1743-6
DOI :
10.1109/ICACI.2012.6463172