DocumentCode
939088
Title
General C-means clustering model
Author
Yu, Jian
Author_Institution
Sch. of Comput. Sci. & Inf. Technol., Beijing Jiaotong Univ., China
Volume
27
Issue
8
fYear
2005
Firstpage
1197
Lastpage
1211
Abstract
Partitional clustering is an important part of cluster analysis. Based on various theories, numerous clustering algorithms have been developed, and new clustering algorithms continue to appear in the literature. It is known that Occam´s razor plays a pivotal role in data-based models, and partitional clustering is categorized as a data-based model. However, no relation had previously been discovered between Occam´s razor and partitional clustering, as we discuss in this paper. The three main contributions of this paper can be summarized as follows: (1) according to a novel definition of the mean, a unifying generative framework for partitional clustering algorithms, called a general c-means clustering model (GCM), is presented and studied; and, (2) based on the local optimality test of the GCM, the connection between Occam´s razor and partitional clustering is established for the first time. As its application, a comprehensive review of the existing objective function-based clustering algorithms is presented based on GCM. 3) Under a common assumption about partitional clustering, a theoretical guide for devising and implementing clustering algorithm is discovered. These conclusions are verified by numerical experimental results.
Keywords
Hessian matrices; Occam; pattern clustering; statistical analysis; Occam razor; cluster analysis; data-based models; general C-means clustering model; local optimality test; partitional clustering; Clustering algorithms; Convergence; Data mining; Image analysis; Partitioning algorithms; Pattern analysis; Pattern recognition; Prototypes; Remote sensing; Testing; Hessian matrix.; Index Terms- Partitional clustering; Occam´s razor; cluster validity; density estimator; fixed point; mean; optimality test; Algorithms; Artificial Intelligence; Cluster Analysis; Computer Simulation; Information Storage and Retrieval; Models, Statistical; Numerical Analysis, Computer-Assisted; Pattern Recognition, Automated; Signal Processing, Computer-Assisted;
fLanguage
English
Journal_Title
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher
ieee
ISSN
0162-8828
Type
jour
DOI
10.1109/TPAMI.2005.160
Filename
1453509
Link To Document