• DocumentCode
    2859898
  • Title

    MML-Based Approach for High-Dimensional Unsupervised Learning Using the Generalized Dirichlet Mixture

  • Author

    Bouguila, Nizar ; Ziou, Djemel

  • Author_Institution
    Universite de Sherbrooke
  • fYear
    2005
  • fDate
    25-25 June 2005
  • Firstpage
    53
  • Lastpage
    53
  • Abstract
    We consider the problem of determining the structure of high-dimensional data, without prior knowledge of the number of clusters. Data are represented by a finite mixture model based on the generalized Dirichlet distribution. The generalized Dirichlet distribution has a more general covariance structure than the Dirichlet distribution and offers high flexibility and ease of use for the approximation of both symmetric and asymmetric distributions. In addition, the mathematical properties of this distribution allow highdimensional modeling without requiring dimensionality reduction and thus without a loss of information. The number of clusters is determined using the Minimum Message length (MML) principle. Parameters estimation is done by a hybrid stochastic expectation-maximization (HSEM) algorithm. The model is compared with results obtained by other selection criteria (AIC, MDL and MMDL). The performance of our method is tested by real data clustering and by applying it to an image object recognition problem.
  • Keywords
    Clustering algorithms; Computer vision; Covariance matrix; Face detection; Mathematical model; Parameter estimation; Pattern recognition; Stochastic processes; Support vector machines; Unsupervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition - Workshops, 2005. CVPR Workshops. IEEE Computer Society Conference on
  • Conference_Location
    San Diego, CA, USA
  • ISSN
    1063-6919
  • Print_ISBN
    0-7695-2372-2
  • Type

    conf

  • DOI
    10.1109/CVPR.2005.493
  • Filename
    1565354