DocumentCode :
3008793
Title :
Clustering acoustic prototypes with self organizing distortion measures
Author :
Nahamoo, David
Author_Institution :
IBM Thomas J. Watson Research Center, Yorktown Heights, NY
Volume :
11
fYear :
1986
fDate :
31503
Firstpage :
753
Lastpage :
756
Abstract :
Selection of acoustic prototypes is an important aspect of speech recognition systems. In this paper a new algorithm for obtaining a set of acoustic prototypes is described. Associated with each prototype is a distortion measure whose full quadratic distance form is optimized to achieve a local minimum for the average distortion. Using a K-means clustering strategy, it is shown that in each iteration the minimum is achieved when the eigenvectors of the weighting matrices of the qudratic distances are identical to those of the sample covariance matrices of their corresponding clusters. Under certain eigenvalue constraints, closed form solutions for finding the eigenvalues are provided. Furtheremore, it is shown that clustering schemes that assume a multivariate Gaussian mixture density for the data can be solved using the new technique. Thereby, a new derivation for the maximum likelihood estimate of their associated covariance matrices is presented. Finally, recognition results obtained by incorporating the new algorithm in the IBM Speech Recognition system are presented.
Keywords :
Acoustic distortion; Acoustic measurements; Closed-form solution; Clustering algorithms; Covariance matrix; Distortion measurement; Eigenvalues and eigenfunctions; Organizing; Prototypes; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '86.
Type :
conf
DOI :
10.1109/ICASSP.1986.1169212
Filename :
1169212
Link To Document :
بازگشت