DocumentCode :
1665616
Title :
Sparse hidden Markov models for purer clusters
Author :
Bharadwaj, Samarth ; Hasegawa-Johnson, Mark ; Ajmera, Jitendra ; Deshmukh, Om ; Verma, A.
Author_Institution :
Dept. of Electr. Eng., Univ. of Illinois, Urbana, IL, USA
fYear :
2013
Firstpage :
3098
Lastpage :
3102
Abstract :
The hidden Markov model (HMM) is widely popular as the de facto tool for representing temporal data; in this paper, we add to its utility in the sequence clustering domain - we describe a novel approach that allows us to directly control purity in HMM-based clustering algorithms. We show that encouraging sparsity in the observation probabilities increases cluster purity and derive an algorithm based on lp regularization; as a corollary, we also provide a different and useful interpretation of the value of p in Renyi p-entropy. We test our method on the problem of clustering non-speech audio events from the BBC sound effects corpus. Experimental results confirm that our approach does learn purer clusters, with (unweighted) average purity as high as 0.88 - a considerable improvement over both the baseline HMM (0.72) and k-means clustering (0.69).
Keywords :
audio signal processing; data structures; hidden Markov models; pattern clustering; speech processing; HMM; Renyi p-entropy; de facto tool; k-means clustering; nonspeech audio events clustering; sequence clustering; sparse hidden Markov models; temporal data representation; Clustering algorithms; Entropy; Estimation; Hidden Markov models; Measurement; Speech; Vectors; Renyi entropy; cluster purity; hidden Markov model; sequence clustering; sparsity;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6638228
Filename :
6638228
Link To Document :
بازگشت