Title :
Combining EigenVoices and structural MLLR for speaker adaptation
Author :
Lauri, Fubrice ; Mina, I. ; Fohr, Dominique
Author_Institution :
Speech Group, LORIA-INPL, Vandoeuvre-les-Nancy, France
Abstract :
This paper considers the problem of speaker adaptation of acoustic models in speech recognition. We have investigated four different possible methods which integrate the concepts of both Structural Maximum Likelihood Linear Regression (SMLLR) and EigenVoices-based technique (EV) to adapt the Gaussian means of the speaker independent models for a new speaker. The experiments were evaluated using the speech recognition engine ESPERE on the data of the corpus Resource Management. They show that all of the proposed methods can improve the performances of an automatic speech recognition system (ASRS) in supervised batch adaptation as efficiently as SMLLR and EigenVoices-based techniques whatever the amount of adaptation data is available. For an unsupervised incremental adaptation, only the approach SMLLR + SEV gives the best results.
Keywords :
Gaussian processes; maximum likelihood estimation; speech recognition; statistical analysis; ESPERE; EigenVoices; Gaussian means; Resource Management corpus; acoustic models; automatic speech recognition system; speaker adaptation; speaker independent models; speech recognition; speech recognition engine; structural MLLR; structural maximum likelihood linear regression; supervised batch adaptation; Acoustic testing; Automatic speech recognition; Automatic testing; Engines; Loudspeakers; Maximum likelihood linear regression; Parameter estimation; Resource management; Speech recognition; System testing;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1198847