DocumentCode :
3430815
Title :
Speaker adaptation using Maximum Likelihood General Regression
Author :
Bahari, Mohamad Hasan ; Van hamme, Hugo
Author_Institution :
Dept. of Electr. Eng. (ESAT), KU Leuven, Leuven, Belgium
fYear :
2012
fDate :
2-5 July 2012
Firstpage :
29
Lastpage :
34
Abstract :
In this paper, a new method called Maximum Likelihood General Regression (MLGR) is introduced for speaker adaptation. Gaussian means of a speaker independent (SI) model are adapted to the data of a new speaker by assuming a non-linear mapping from the SI Gaussian means to the adapted Gaussian means. MLGR performs a non-linear regression between ML estimates of the means and the SI means using General Regression Neural Network. The proposed method is evaluated on the Wall Street Journal database. Evaluation results show that the suggested scheme outperforms different conventional approaches in the case of short adaptation utterances. We also mathematically prove that the Gaussian means of the adapted model using the MLGR converges to their ML estimates in the case of long adaptation utterances.
Keywords :
Gaussian processes; maximum likelihood estimation; neural nets; regression analysis; speaker recognition; Gaussian means; Wall Street Journal database; general regression neural network; maximum likelihood general regression; non-linear mapping; non-linear regression; speaker adaptation; speaker independent model; Adaptation models; Data models; Function approximation; Hidden Markov models; Maximum likelihood estimation; Silicon; general regression neural networks; maximum likelihood; non-linear speaker adaptation; speaker adaptation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Science, Signal Processing and their Applications (ISSPA), 2012 11th International Conference on
Conference_Location :
Montreal, QC
Print_ISBN :
978-1-4673-0381-1
Electronic_ISBN :
978-1-4673-0380-4
Type :
conf
DOI :
10.1109/ISSPA.2012.6310564
Filename :
6310564
Link To Document :
بازگشت