DocumentCode :
323775
Title :
Fast robust inverse transform speaker adapted training using diagonal transformations
Author :
Jin, Hubert ; Matsoukas, Spyros ; Schwartz, Richard ; Kubala, Francis
Author_Institution :
BBN Technol., Cambridge, MA, USA
Volume :
2
fYear :
1998
fDate :
12-15 May 1998
Firstpage :
785
Abstract :
We present a new method of speaker adapted training (SAT) that is more robust and faster, and results in a lower error rate, than previous methods. The method, called inverse transform SAT (IT-SAT), is based on removing the differences between speakers before training, rather than modeling the differences during training. We develop several methods to avoid the problems associated with inverting the transformation. In one method, we interpolate the transformation matrix with an identity or diagonal transformation. We also apply constraints to the matrix to avoid estimation problems. Finally, we show that the resulting method is much faster, requires much less disk space, and results in higher accuracy than the original SAT method.
Keywords :
error statistics; interpolation; matrix inversion; maximum likelihood estimation; speech processing; speech recognition; transforms; IT-SAT; MAP estimation; accuracy; diagonal transformation; diagonal transformations; disk space; error rate; estimation problems; fast robust inverse transform; identity transformation; inverse transform SAT; matrix constraints; maximum a posteriori estimation; speaker adapted training; transformation matrix interpolation; Covariance matrix; Error analysis; Feedback; Gaussian distribution; Hidden Markov models; Robustness; Speech; Statistics; Testing; Training data
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
ISSN :
1520-6149
Print_ISBN :
0-7803-4428-6
Type :
conf
DOI :
10.1109/ICASSP.1998.675382
Filename :
675382