• DocumentCode
    2039247
  • Title

    A Novel Finite Mixture Model for Count Data Modeling

  • Author

    Bouguila, Nizar

  • Author_Institution
    Concordia Inst. for Inf. Syst. Eng., Concordia Univ., Montreal, QC, Canada
  • fYear
    2007
  • fDate
    24-27 Nov. 2007
  • Firstpage
    17
  • Lastpage
    20
  • Abstract
    In this paper we examine the problem of count data clustering. We analyze this problem using finite mixtures of distributions. The multinomial and the multinomial Dirichlet distributions are widely accepted to model count data. We show that these two distributions cannot be the best choice in all the applications and we propose another model based on the selection of the generalized Dirichlet as a prior to the multinomial. The estimation of the parameters and the determination of the number of components in our model are based on the expectation-maximization approach and the minimum description length criterion, respectively. We compare our method to standard approaches to show its merits. The comparison involves spatial color image databases indexing.
  • Keywords
    expectation-maximisation algorithm; parameter estimation; pattern clustering; statistical distributions; count data clustering problem; count data modeling; expectation-maximization approach; finite mixture model; generalized Dirichlet distribution; minimum description length criterion; multinomial Dirichlet distribution; parameter estimation; Color; Context modeling; Data engineering; Image databases; Information systems; Parameter estimation; Signal processing; Signal processing algorithms; Spatial databases; Systems engineering and theory; Count data; finite mixture models; generalized Dirichlet; image databases; multinomial;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Communications, 2007. ICSPC 2007. IEEE International Conference on
  • Conference_Location
    Dubai
  • Print_ISBN
    978-1-4244-1235-8
  • Electronic_ISBN
    978-1-4244-1236-5
  • Type

    conf

  • DOI
    10.1109/ICSPC.2007.4728244
  • Filename
    4728244