DocumentCode
1665616
Title
Sparse hidden Markov models for purer clusters
Author
Bharadwaj, Samarth ; Hasegawa-Johnson, Mark ; Ajmera, Jitendra ; Deshmukh, Om ; Verma, A.
Author_Institution
Dept. of Electr. Eng., Univ. of Illinois, Urbana, IL, USA
fYear
2013
Firstpage
3098
Lastpage
3102
Abstract
The hidden Markov model (HMM) is widely popular as the de facto tool for representing temporal data; in this paper, we add to its utility in the sequence clustering domain - we describe a novel approach that allows us to directly control purity in HMM-based clustering algorithms. We show that encouraging sparsity in the observation probabilities increases cluster purity and derive an algorithm based on lp regularization; as a corollary, we also provide a different and useful interpretation of the value of p in Renyi p-entropy. We test our method on the problem of clustering non-speech audio events from the BBC sound effects corpus. Experimental results confirm that our approach does learn purer clusters, with (unweighted) average purity as high as 0.88 - a considerable improvement over both the baseline HMM (0.72) and k-means clustering (0.69).
Keywords
audio signal processing; data structures; hidden Markov models; pattern clustering; speech processing; HMM; Renyi p-entropy; de facto tool; k-means clustering; nonspeech audio events clustering; sequence clustering; sparse hidden Markov models; temporal data representation; Clustering algorithms; Entropy; Estimation; Hidden Markov models; Measurement; Speech; Vectors; Renyi entropy; cluster purity; hidden Markov model; sequence clustering; sparsity;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location
Vancouver, BC
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2013.6638228
Filename
6638228
Link To Document