مرکز منطقه ای اطلاع رساني علوم و فناوري - Speaker adaptation using combined transformation and Bayesian methods

DocumentCode :

1064959

Title :

Speaker adaptation using combined transformation and Bayesian methods

Author :

Digal, Vassilios V. ; Neumeyer, Leonardo G.

Author_Institution :

Dept. of Electron. & Comput. Eng., Tech. Univ. of Crete, Chania, Greece

Volume :

Issue :

fYear :

1996

fDate :

7/1/1996 12:00:00 AM

Firstpage :

294

Lastpage :

300

Abstract :

Adapting the parameters of a statistical speaker independent continuous-speech recognizer to the speaker and the channel can significantly improve the recognition performance and robustness of the system. In continuous mixture-density hidden Markov models the number of component densities is typically very large, and it may not be feasible to acquire a sufficient amount of adaptation data for robust maximum-likelihood estimates. To solve this problem, we have recently proposed a constrained estimation technique for Gaussian mixture densities. To improve the behavior of our adaptation scheme for large amounts of adaptation data, we combine it here with Bayesian techniques. We evaluate our algorithms on the large-vocabulary Wall Street Journal corpus for nonnative speakers of American English. The recognition error rate is approximately halved with only a small amount of adaptation data, and it approaches the speaker-independent accuracy achieved for native speakers

Keywords :

Bayes methods; Gaussian processes; adaptive estimation; hidden Markov models; maximum likelihood estimation; speech recognition; American English; Bayesian method; Gaussian mixture densities; algorithms; constrained estimation technique; continuous mixture-density hidden Markov models; large-vocabulary Wall Street Journal corpus; native speakers; nonnative speakers; recognition error rate; recognition performance; robust maximum-likelihood estimates; robustness; speaker adaptation; statistical speaker independent continuous-speech recognizer; transformation method; Bayesian methods; Degradation; Error analysis; Hidden Markov models; Maximum likelihood estimation; Natural languages; Robustness; Speech recognition; Testing; Training data;

fLanguage :

English

Journal_Title :

Speech and Audio Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1063-6676

Type :

jour

DOI :

10.1109/89.506933

Filename :

506933

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1064959