DocumentCode :
177918
Title :
Bayesian Non-parametric Parsimonious Gaussian Mixture for Clustering
Author :
Chamroukhi, F. ; Bartcus, M. ; Glotin, H.
Author_Institution :
ENSAM, Aix Marseille Univ., Marseille, France
fYear :
2014
fDate :
24-28 Aug. 2014
Firstpage :
1460
Lastpage :
1465
Abstract :
Clustering is one of the essential tasks in machine learning and statistical pattern recognition. One of the most popular approaches in cluster analysis is the one based on the parametric finite mixture model. However, often, parametric models are not well adapted to represent complex and realistic data sets. Another issue in the finite mixture model-based clustering approach is the one of selecting the number of mixture components. The Bayesian non-parametric statistical methods for clustering provide a principled way to overcome these issues. This paper proposes a new Bayesian non-parametric approach for clustering. It relies on an Infinite Gaussian mixture model with an Eigen value decomposition of the covariance matrix of each cluster, and a Chinese Restaurant Process (CRP) prior over the hidden partition. The CRP prior allows to control the model complexity in a principled way, and to automatically learn the number of clusters from the data. The covariance matrix decomposition allows to fit various flexible models going from simplest spherical ones to the more complex general one. We develop a Gibbs sampler to learn the various models and apply it to simulated data and benchmarks, and a real-world data issued from a challenging problem of whale song decomposition. The obtained results highlight the interest of the proposed non-parametric parsimonious mixture model for clustering.
Keywords :
Bayes methods; Gaussian processes; covariance matrices; data analysis; eigenvalues and eigenfunctions; learning (artificial intelligence); mixture models; pattern clustering; statistical analysis; Bayesian nonparametric parsimonious Gaussian mixture; CRP; Chinese restaurant process; Gibbs sampler; clustering approach; complex data sets; covariance matrix decomposition; eigenvalue decomposition; infinite Gaussian mixture model; machine learning; parametric finite mixture model; realistic data sets; statistical pattern recognition; Adaptation models; Bayes methods; Covariance matrices; Data models; Eigenvalues and eigenfunctions; Iris; Numerical models;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2014 22nd International Conference on
Conference_Location :
Stockholm
ISSN :
1051-4651
Type :
conf
DOI :
10.1109/ICPR.2014.260
Filename :
6976970
Link To Document :
بازگشت