Title :
Speaker and gender normalization for continuous-density hidden Markov models
Author :
Acero, Alejandro ; Huang, Xuedong
Author_Institution :
Microsoft Corp., Redmond, WA, USA
Abstract :
We describe a speaker-cluster normalization algorithm that we applied to both gender-normalization and speaker-normalization. To achieve parameter sharing the acoustic space is partitioned into classes. A maximum likelihood approach has been proposed under which the data between the distribution mean and its corresponding acoustic class is mostly speaker-independent, whereas the means of the acoustic classes are mostly speaker-dependent. When applied to gender-normalization the error rate reduction approaches that of a gender-dependent system but with half the number of parameters. For a speaker-normalized system, a 30% decrease in error rate was obtained in a batch recognition experiment in a context-dependent continuous-density HMM system
Keywords :
acoustic signal processing; hidden Markov models; maximum likelihood estimation; speech processing; speech recognition; HMM system; acoustic class; acoustic space; batch recognition experiment; continuous-density hidden Markov models; distribution mean; error rate reduction; gender normalization; gender-dependent system; maximum likelihood approach; parameter sharing; speaker-cluster normalization algorithm; Acoustic noise; Cepstral analysis; Convergence; Error analysis; Hidden Markov models; Loudspeakers; Maximum likelihood estimation; Partitioning algorithms; Training data; Working environment noise;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
0-7803-3192-3
DOI :
10.1109/ICASSP.1996.541102