DocumentCode :
2323915
Title :
An optimal Bhattacharyya centroid algorithm for Gaussian clustering with applications in automatic speech recognition
Author :
Rigazio, Luca ; Tsakam, Brice ; Junqua, Jean-Claude
Author_Institution :
Speech Technol. Lab., Panasonic Technols Inc., Santa Barbara, CA, USA
Volume :
3
fYear :
2000
fDate :
2000
Firstpage :
1599
Abstract :
The problem of clustering Gaussian distributions can be effectively solved by standard vector quantization algorithms where the metric is defined by the Bhattacharyya distance. This paper presents a novel algorithm for computing the optimal centroid for a cluster of Gaussian distributions according to the Bhattacharyya metric. We show that this centroid maximizes an upper bound on the probability of representing the population modeled by the distributions associated with the cluster. The proposed method is evaluated in clustering distributions of hidden Markov model speech recognizers to reduce the overall memory consumption and runtime complexity of the decoding. Experimental results show that, depending on the task, the number of distributions can be reduced by a factor of 2 to 6 with an increase in recognition accuracy. When compared to a maximum likelihood centroid, the Bhattacharyya centroid provides a 13% error rate reduction in a 2k word recognition task
Keywords :
Gaussian distribution; computational complexity; decoding; hidden Markov models; optimisation; speech coding; speech recognition; vector quantisation; 2k word recognition task; Gaussian clustering; automatic speech recognition; clustering Gaussian distributions; clustering distributions; decoding; error rate reduction; hidden Markov model speech recognizers; memory consumption; optimal Bhattacharyya centroid algorithm; optimal centroid; probability; recognition accuracy; runtime complexity; standard vector quantization algorithms; upper bound; Clustering algorithms; Distributed computing; Gaussian distribution; Hidden Markov models; Maximum likelihood decoding; Runtime; Speech analysis; Speech recognition; Upper bound; Vector quantization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
ISSN :
1520-6149
Print_ISBN :
0-7803-6293-4
Type :
conf
DOI :
10.1109/ICASSP.2000.861998
Filename :
861998
Link To Document :
بازگشت