Title :
Robust clustering algorithm for high dimensional data classification based on multiple supports
Author :
Lam, Benson S Y ; Yan, Hong
Author_Institution :
Dept. of Electron. Eng., City Univ. of Hong Kong, Kowloon
Abstract :
High dimensionality, noisy features and outliers can cause problems in cluster analysis. Many existing methods can handle one of the problems well but not the others. In this paper, we propose a new clustering algorithm to solve these problems. The basic idea is to control the support of the optimization procedure so that the effect produced by those contaminated samples and dimensions is greatly reduced. This is achieved by using multiple supports. Initially, a large support is used and then its size is reduced and eventually only a subgroup of data samples is considered for clustering. This procedure can filter out lots of contaminated information. Experiment results show that the proposed method effectively resolves all these problems. It outperforms existing ones for real world high dimensional datasets.
Keywords :
fuzzy set theory; optimisation; pattern classification; pattern clustering; statistical analysis; unsupervised learning; cluster analysis; fuzzy c-means clustering; high dimensional data classification; multiple support; noise feature; optimization; robust clustering algorithm; unsupervised learning; Clustering algorithms; Clustering methods; Degradation; Image analysis; Machine learning algorithms; Partitioning algorithms; Pattern analysis; Pollution measurement; Robustness; Unsupervised learning;
Conference_Titel :
Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-1820-6
Electronic_ISBN :
1098-7576
DOI :
10.1109/IJCNN.2008.4634068