Title :
Hierarchical Stability-Based Model Selection for Clustering Algorithms
Author :
Yin, Bing ; Hamerly, Greg
Author_Institution :
Comput. Sci. Dept., Baylor Univ., Waco, TX, USA
Abstract :
We present an algorithm called HS-means which is able to learn the number of clusters in a mixture model. Our method extends the concept of clustering stability to a concept of hierarchical stability. The method chooses a model for the data based on analysis of clustering stability; it then analyzes the stability of each component in the estimated model and chooses a stable model for this component. It continues this recursive stability analysis until all the estimated components are unimodal. In so doing, the method is able to handle hierarchical and symmetric data that existing stability-based algorithms have difficulty with. We test our algorithm on both synthetic datasets and real world datasets. The results show that HS-means outperforms a popular stability-based model selection algorithm, both in terms of handling symmetric data and finding high-quality clusterings in the task of predicting CPU performance.
Keywords :
learning (artificial intelligence); pattern clustering; stability; HS-means algorithm; clustering algorithm; hierarchical stability; model selection algorithm; recursive stability analysis; Application software; Clustering algorithms; Computer science; Data analysis; Machine learning; Machine learning algorithms; Predictive models; Recursive estimation; Stability analysis; Testing; clustering; expectation-maximization; k-means; model selection; stability;
Conference_Titel :
Machine Learning and Applications, 2009. ICMLA '09. International Conference on
Conference_Location :
Miami Beach, FL
Print_ISBN :
978-0-7695-3926-3
DOI :
10.1109/ICMLA.2009.64