Title :
Fast robust inverse transform speaker adapted training using diagonal transformations
Author :
Jin, Hubert ; Matsoukas, Spyros ; Schwartz, Richard ; Kubala, Francis
Author_Institution :
BBN Technol., Cambridge, MA, USA
Abstract :
We present a new method of speaker adapted training (SAT) that is more robust, faster, and results in lower error rate than the previous methods. The method, called inverse transform SAT (IT-SAT) is based on removing the differences between speakers before training, rather than modeling the differences during training. We develop several methods to avoid the problems associated with inverting the transformation. In one method, we interpolate the transformation matrix with an identity or diagonal transformation. We also apply constraints to the matrix to avoid estimation problems. Finally, we show that the resulting method is much faster, requires much less disk space, and results in higher accuracy than the original SAT method
Keywords :
error statistics; interpolation; matrix inversion; maximum likelihood estimation; speech processing; speech recognition; transforms; IT-SAT; MAP estimation; accuracy; diagonal transformation; diagonal transformations; disk space; error rate; estimation problems; fast robust inverse transform; identity transformation; inverse transform SAT; matrix constraints; maximum a posteriori estimation; speaker adapted training; speech recognition; transformation matrix interpolation; Covariance matrix; Error analysis; Feedback; Gaussian distribution; Hidden Markov models; Robustness; Speech; Statistics; Testing; Training data;
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7803-4428-6
DOI :
10.1109/ICASSP.1998.675382