DocumentCode :
312134
Title :
Bayesian adaptation of speech recognizers to field speech data
Author :
Miglietta, C.G. ; Mokbel, Chajic ; Jouvet, Denis ; Monne, J.
Author_Institution :
CNET, Lannion, France
Volume :
2
fYear :
1996
fDate :
3-6 Oct 1996
Firstpage :
917
Abstract :
The article studies a Bayesian (or Maximum A Posteriori MAP) approach to the adaptation of continuous density hidden Markov models (CDHMMs) to a specific condition of a speech recognition application. In order to improve the model robustness, CDHMMs formerly trained from laboratory data are then adapted using context dependent field utterances. Two specific problems have to be faced when using the MAP approach: the estimation of the a priori distribution parameters and the lack of field adaptation data for some distributions of the CDHMM. To estimate the a priori distribution parameters, we need to identify different realizations of the model parameters. Three different solutions are proposed and evaluated. To overcome the lack of adaptation data, field acoustical training frames may be shared among similar distributions. This is performed using an acoustical tree, obtained by progressively clustering the model distributions. Recognition results show that MAP adapted models significantly outperform those trained by maximum likelihood (ML), specifically when the field data set is small
Keywords :
Bayes methods; acoustic signal processing; hidden Markov models; maximum likelihood estimation; speech recognition; Bayesian adaptation; CDHMMs; MAP adapted models; MAP approach; Maximum A Posteriori MAP; a priori distribution parameters; acoustical tree; context dependent field utterances; continuous density hidden Markov models; field acoustical training frames; field adaptation data; field data set; field speech data; maximum likelihood; model distributions; model parameters; model robustness; speech recognition application; speech recognizers; Adaptation model; Bayesian methods; Covariance matrix; Databases; Equations; Iterative algorithms; Laboratories; Random variables; Robustness; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
Type :
conf
DOI :
10.1109/ICSLP.1996.607751
Filename :
607751
Link To Document :
بازگشت