Title :
X-mHMM: an efficient algorithm for training mixtures of HMMs when the number of mixtures is unknown
Author :
Szamonek, Zoltán ; Szepesvári, Csaba
Author_Institution :
Comput. & Autom. Res. Inst. of the Hungarian Acad. of Sci., Budapest, Hungary
Abstract :
In this paper we consider sequence clustering problems and propose an algorithm for the estimation of the number of clusters based on the X-means algorithm. The sequences are modeled using mixtures of Hidden Markov Models. By means of experiments with synthetic data we analyze the proposed algorithm. This algorithm proved to be both computationally efficient and capable of providing accurate estimates of the number of clusters. Some results of experiments with real-world Web-log data are also given.
Keywords :
hidden Markov models; pattern clustering; X-mHMM algorithm; X-means algorithm; hidden Markov model; sequence clustering; Algorithm design and analysis; Application software; Automation; Biology computing; Chemistry; Clustering algorithms; Data analysis; Hidden Markov models; Partitioning algorithms; Sequences;
Conference_Titel :
Data Mining, Fifth IEEE International Conference on
Print_ISBN :
0-7695-2278-5
DOI :
10.1109/ICDM.2005.156