DocumentCode :
3118295
Title :
Clustering based on Generalized Inverse Transformation
Author :
Chen, Yu-Chen ; Shih, Hong-Jie ; Jheng, Yu-Siang ; Shen, Sih-Yin ; Guo, Meng-Di ; Wang, Jung-Hua
Author_Institution :
Electr. Eng. Dept., Nat. Taiwan Ocean Univ., Keelung
fYear :
2008
fDate :
12-15 Oct. 2008
Firstpage :
3045
Lastpage :
3050
Abstract :
This paper presents a novel approach which incorporates dimension extension and generalized inverse transformation (DEGIT) to realize data clustering. Unlike k-means algorithm, DEGIT needs not pre-specify the number of clusters k, centroid locations are updated and redundant centroids eliminated automatically during iterative training process. The essence of DEGIT is that clustering is performed by generalized inverse transforming the input data such that each data point is represented by a linear combination of bases with extended dimension, with each basis corresponding to a centroid and its coefficient representing the closeness between the data point and the basis. Issue of clustering validation is also addressed in this paper. First, principal component analysis is applied to detect if there exists a dominated dimension, if so, the original input data will be rotated by a certain angle w.r.t. a defined center of mass, and the resulting data undergo another run of iterative training process. After plural runs of rotation and iterative process, the labeled results from various runs are compared, a data point labeled to a centroid more times than others will be labeled to the class indexed by that wining centroid.
Keywords :
iterative methods; pattern clustering; principal component analysis; DEGIT; centroid locations; clustering validation; data clustering; dimension extension; generalized inverse transformation; iterative training process; principal component analysis; wining centroid; Bioinformatics; Clustering algorithms; Convergence; Data mining; Euclidean distance; Iterative algorithms; Oceans; Performance evaluation; Principal component analysis; Robustness; centroids; clustering; dimension extension; generalized inverse transformation; principal component analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man and Cybernetics, 2008. SMC 2008. IEEE International Conference on
Conference_Location :
Singapore
ISSN :
1062-922X
Print_ISBN :
978-1-4244-2383-5
Electronic_ISBN :
1062-922X
Type :
conf
DOI :
10.1109/ICSMC.2008.4811762
Filename :
4811762
Link To Document :
بازگشت