DocumentCode
2859898
Title
MML-Based Approach for High-Dimensional Unsupervised Learning Using the Generalized Dirichlet Mixture
Author
Bouguila, Nizar ; Ziou, Djemel
Author_Institution
Universite de Sherbrooke
fYear
2005
fDate
25-25 June 2005
Firstpage
53
Lastpage
53
Abstract
We consider the problem of determining the structure of high-dimensional data, without prior knowledge of the number of clusters. Data are represented by a finite mixture model based on the generalized Dirichlet distribution. The generalized Dirichlet distribution has a more general covariance structure than the Dirichlet distribution and offers high flexibility and ease of use for the approximation of both symmetric and asymmetric distributions. In addition, the mathematical properties of this distribution allow highdimensional modeling without requiring dimensionality reduction and thus without a loss of information. The number of clusters is determined using the Minimum Message length (MML) principle. Parameters estimation is done by a hybrid stochastic expectation-maximization (HSEM) algorithm. The model is compared with results obtained by other selection criteria (AIC, MDL and MMDL). The performance of our method is tested by real data clustering and by applying it to an image object recognition problem.
Keywords
Clustering algorithms; Computer vision; Covariance matrix; Face detection; Mathematical model; Parameter estimation; Pattern recognition; Stochastic processes; Support vector machines; Unsupervised learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition - Workshops, 2005. CVPR Workshops. IEEE Computer Society Conference on
Conference_Location
San Diego, CA, USA
ISSN
1063-6919
Print_ISBN
0-7695-2372-2
Type
conf
DOI
10.1109/CVPR.2005.493
Filename
1565354
Link To Document