Title :
Unsupervised learning via iteratively constructed clustering ensemble
Author :
Yang, Yun ; Chen, Ke
Author_Institution :
Sch. of Comput. Sci., Univ. of Manchester, Manchester, UK
Abstract :
Unsupervised classification or clustering is an important data analysis technique demanded in various fields including machine learning, data mining, pattern recognition, image analysis and bioinformatics. Recently a large number of studies have attempted to improve clustering by combing multiple clustering solutions into a single consolidated clustering ensemble that has the best performance among given clustering solutions. However, the different clustering ensembles have their own behaviors on data of various characteristics. In this paper, we propose a novel approach to data clustering by constructing a clustering ensemble iteratively based on partitions generated on training subsets sampled from the original dataset. To yield a robust clustering ensemble our approach employs a hybrid sampling scheme inspired by both boosting and bagging techniques originally proposed for supervised learning. Our approach has been evaluated on synthetic data and real-world motion trajectory data sets, and experimental results demonstrate that it yields satisfactory performance for a variety of clustering tasks.
Keywords :
data analysis; iterative methods; pattern clustering; sampling methods; unsupervised learning; bagging technique; boosting technique; clustering ensemble; data analysis technique; hybrid sampling scheme; iterative methods; unsupervised learning; Watches;
Conference_Titel :
Neural Networks (IJCNN), The 2010 International Joint Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4244-6916-1
DOI :
10.1109/IJCNN.2010.5596577