Title :
Clustering acoustic prototypes with self organizing distortion measures
Author_Institution :
IBM Thomas J. Watson Research Center, Yorktown Heights, NY
Abstract :
Selection of acoustic prototypes is an important aspect of speech recognition systems. In this paper a new algorithm for obtaining a set of acoustic prototypes is described. Associated with each prototype is a distortion measure whose full quadratic distance form is optimized to achieve a local minimum for the average distortion. Using a K-means clustering strategy, it is shown that in each iteration the minimum is achieved when the eigenvectors of the weighting matrices of the qudratic distances are identical to those of the sample covariance matrices of their corresponding clusters. Under certain eigenvalue constraints, closed form solutions for finding the eigenvalues are provided. Furtheremore, it is shown that clustering schemes that assume a multivariate Gaussian mixture density for the data can be solved using the new technique. Thereby, a new derivation for the maximum likelihood estimate of their associated covariance matrices is presented. Finally, recognition results obtained by incorporating the new algorithm in the IBM Speech Recognition system are presented.
Keywords :
Acoustic distortion; Acoustic measurements; Closed-form solution; Clustering algorithms; Covariance matrix; Distortion measurement; Eigenvalues and eigenfunctions; Organizing; Prototypes; Speech recognition;
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '86.
DOI :
10.1109/ICASSP.1986.1169212