• DocumentCode
    1408044
  • Title

    Learning Gaussian Mixture Models With Entropy-Based Criteria

  • Author

    Benavent, Antonio Peñalver ; Ruiz, Francisco Escolano ; Sáez, Juan Manuel

  • Author_Institution
    Dept. de Estadistica, Mat. e Inf., Univ. Miguel Hernandez, Elche, Spain
  • Volume
    20
  • Issue
    11
  • fYear
    2009
  • Firstpage
    1756
  • Lastpage
    1771
  • Abstract
    In this paper, we address the problem of estimating the parameters of Gaussian mixture models. Although the expectation-maximization (EM) algorithm yields the maximum-likelihood (ML) solution, its sensitivity to the selection of the starting parameters is well-known and it may converge to the boundary of the parameter space. Furthermore, the resulting mixture depends on the number of selected components, but the optimal number of kernels may be unknown beforehand. We introduce the use of the entropy of the probability density function (pdf) associated to each kernel to measure the quality of a given mixture model with a fixed number of kernels. We propose two methods to approximate the entropy of each kernel and a modification of the classical EM algorithm in order to find the optimum number of components of the mixture. Moreover, we use two stopping criteria: a novel global mixture entropy-based criterion called Gaussianity deficiency (GD) and a minimum description length (MDL) principle-based one. Our algorithm, called entropy-based EM (EBEM), starts with a unique kernel and performs only splitting by selecting the worst kernel attending to GD. We have successfully tested it in probability density estimation, pattern classification, and color image segmentation. Experimental results improve the ones of other state-of-the-art model order selection methods.
  • Keywords
    Gaussian processes; entropy; expectation-maximisation algorithm; image colour analysis; image segmentation; maximum likelihood estimation; pattern classification; EM algorithm; Gaussian mixture model learning; color image segmentation; entropy-based criteria; expectation-maximization algorithm; global mixture entropy-based criterion; maximum-likelihood solution; minimum description length principle; model order selection method; pattern classification; probability density function; Clustering; EM algorithm; entropy estimation; minimum description length (MDL) criterion; mixture models; model order selection; Algorithms; Artificial Intelligence; Data Interpretation, Statistical; Entropy; Models, Statistical; Neural Networks (Computer); Normal Distribution;
  • fLanguage
    English
  • Journal_Title
    Neural Networks, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9227
  • Type

    jour

  • DOI
    10.1109/TNN.2009.2030190
  • Filename
    5247027