Title :
Stream derivation and clustering scheme for subspace distribution clustering hidden Markov model
Author :
Mak, Brian ; Bocchieri, Enrico ; Barnard, Etienne
Author_Institution :
AT&T Labs., Florham Park, NJ, USA
Abstract :
Bocchieri and Mak (Proc. Eurospeech, vol. 1, p. 107-10, 1997) introduced a novel subspace distribution clustering hidden Markov model (SDCHMM) as an approximation to a continuous-density HMM (CDHMM). Deriving SDCHMMs from CDHMMs requires a definition of multiple streams and a Gaussian clustering scheme. Previously, we have tried 4 and 13 streams, which are common but ad hoc choices. In this paper, we present a simple and coherent definition for streams of any dimension: the streams comprise the most correlated features. The new definition is shown to give better performance in two speech recognition tasks. The clustering scheme of Bocchieri and Mak is an O(n2) algorithm which can be slow when the number of Gaussians in the original CDHMMs is large. Now, we have devised a modified k-means clustering scheme using the Bhattacharyya distance as the distance measure between Gaussian clusters. Not only is the new clustering scheme faster but, when combined with the new stream definitions, we now obtain SDCHMMs which perform at least as well as the original CDHMMs (with better results in some cases)
Keywords :
Gaussian distribution; computational complexity; hidden Markov models; speech recognition; Bhattacharyya distance; Gaussian clustering scheme; continuous-density hidden Markov model approximation; correlated features; k-means clustering scheme; performance; speech recognition; stream derivation; subspace distribution clustering hidden Markov model; Clustering algorithms; Data compression; Encoding; Equations; Gaussian distribution; Hidden Markov models; Prototypes; Quantization; Training data;
Conference_Titel :
Automatic Speech Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop on
Conference_Location :
Santa Barbara, CA
Print_ISBN :
0-7803-3698-4
DOI :
10.1109/ASRU.1997.659109