DocumentCode :
2039247
Title :
A Novel Finite Mixture Model for Count Data Modeling
Author :
Bouguila, Nizar
Author_Institution :
Concordia Inst. for Inf. Syst. Eng., Concordia Univ., Montreal, QC, Canada
fYear :
2007
fDate :
24-27 Nov. 2007
Firstpage :
17
Lastpage :
20
Abstract :
In this paper we examine the problem of count data clustering. We analyze this problem using finite mixtures of distributions. The multinomial and the multinomial Dirichlet distributions are widely accepted to model count data. We show that these two distributions cannot be the best choice in all the applications and we propose another model based on the selection of the generalized Dirichlet as a prior to the multinomial. The estimation of the parameters and the determination of the number of components in our model are based on the expectation-maximization approach and the minimum description length criterion, respectively. We compare our method to standard approaches to show its merits. The comparison involves spatial color image databases indexing.
Keywords :
expectation-maximisation algorithm; parameter estimation; pattern clustering; statistical distributions; count data clustering problem; count data modeling; expectation-maximization approach; finite mixture model; generalized Dirichlet distribution; minimum description length criterion; multinomial Dirichlet distribution; parameter estimation; Color; Context modeling; Data engineering; Image databases; Information systems; Parameter estimation; Signal processing; Signal processing algorithms; Spatial databases; Systems engineering and theory; Count data; finite mixture models; generalized Dirichlet; image databases; multinomial;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing and Communications, 2007. ICSPC 2007. IEEE International Conference on
Conference_Location :
Dubai
Print_ISBN :
978-1-4244-1235-8
Electronic_ISBN :
978-1-4244-1236-5
Type :
conf
DOI :
10.1109/ICSPC.2007.4728244
Filename :
4728244
Link To Document :
بازگشت