DocumentCode :
2447384
Title :
Mixture model clustering of binned uncertain data: the classification approach
Author :
Hamdan, Hani
Author_Institution :
STMS, UMR CNRS 9912, IRCAM, 1, place Igor Stravinsky, 75004 Paris, France. Hani.Hamdan@ircam.fr
Volume :
1
fYear :
2006
fDate :
24-28 April 2006
Firstpage :
1645
Lastpage :
1650
Abstract :
Basing cluster analysis on Gaussian mixture models is a powerful approach. In this context, two commonly used maximum likelihood approaches have been proposed: the mixture approach and the classification approach. Loosely speaking, the mixture approach aims to maximize the likelihood over the mixture parameters, whereas the classification approach aims to maximize the likelihood over the mixture parameters and over the identifying labels of the mixture component origin for each point. This paper addresses the problem of taking into account data imprecision in the mixture model clustering of binned data. Binning (or grouping) data is common in data analysis and machine learning. Recently, we developed an original method which fitted the binning data procedure to imprecise data. The idea was to model imprecise data by multivariate uncertainty zones and to assign each uncertainty zone to several bins with proportions proportional to its overlapping volumes with the bins. The experimental results of this method when it was associated with the binned-EM algorithm (mixture approach) were encouraging. However, the binned-EM algorithm has the disadvantage of being sometimes computationally expensive. To overcome this problem, we propose in this paper to apply our binning data procedure with the classification approach based on bin-EM-CEM algorithm which is much faster than the binned-EM algorithm. The paper concludes with a brief description of a flaw diagnosis application using acoustic emission.
Keywords :
Acoustic emission; Acoustic measurements; Clustering algorithms; Data analysis; Frequency; Histograms; Machine learning; Machine learning algorithms; Multidimensional systems; Uncertainty;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information and Communication Technologies, 2006. ICTTA '06. 2nd
Print_ISBN :
0-7803-9521-2
Type :
conf
DOI :
10.1109/ICTTA.2006.1684631
Filename :
1684631
Link To Document :
بازگشت