Title :
A Novel Finite Mixture Model for Count Data Modeling
Author_Institution :
Concordia Inst. for Inf. Syst. Eng., Concordia Univ., Montreal, QC, Canada
Abstract :
In this paper we examine the problem of count data clustering. We analyze this problem using finite mixtures of distributions. The multinomial and the multinomial Dirichlet distributions are widely accepted to model count data. We show that these two distributions cannot be the best choice in all the applications and we propose another model based on the selection of the generalized Dirichlet as a prior to the multinomial. The estimation of the parameters and the determination of the number of components in our model are based on the expectation-maximization approach and the minimum description length criterion, respectively. We compare our method to standard approaches to show its merits. The comparison involves spatial color image databases indexing.
Keywords :
expectation-maximisation algorithm; parameter estimation; pattern clustering; statistical distributions; count data clustering problem; count data modeling; expectation-maximization approach; finite mixture model; generalized Dirichlet distribution; minimum description length criterion; multinomial Dirichlet distribution; parameter estimation; Color; Context modeling; Data engineering; Image databases; Information systems; Parameter estimation; Signal processing; Signal processing algorithms; Spatial databases; Systems engineering and theory; Count data; finite mixture models; generalized Dirichlet; image databases; multinomial;
Conference_Titel :
Signal Processing and Communications, 2007. ICSPC 2007. IEEE International Conference on
Conference_Location :
Dubai
Print_ISBN :
978-1-4244-1235-8
Electronic_ISBN :
978-1-4244-1236-5
DOI :
10.1109/ICSPC.2007.4728244