DocumentCode :
792277
Title :
Quasi-Bayes linear regression for sequential learning of hidden Markov models
Author :
Chien, Jen-Tzung
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
Volume :
10
Issue :
5
fYear :
2002
fDate :
7/1/2002 12:00:00 AM
Firstpage :
268
Lastpage :
278
Abstract :
This paper presents an online/sequential linear regression adaptation framework for hidden Markov model (HMM) based speech recognition. Our attempt is to sequentially improve speaker-independent speech recognition system to handle the nonstationary environments via the linear regression adaptation of HMMs. A quasi-Bayes linear regression (QBLR) algorithm is developed to execute the sequential adaptation where the regression matrix is estimated using QB theory. In the estimation, we specify the prior density of regression matrix as a matrix variate normal distribution and derive the pooled posterior density belonging to the same distribution family. Accordingly, the optimal regression matrix can be easily calculated. Also, the reproducible prior/posterior pair provides a meaningful mechanism for sequential learning of prior statistics. At each sequential epoch, only the updated prior statistics and the current observed data are required for adaptation. The proposed QBLR is a general framework with maximum likelihood linear regression (MLLR) and maximum a posteriori linear regression (MAPLR) as special cases. Experiments on supervised and unsupervised speaker adaptation demonstrate that the sequential adaptation using QBLR is efficient and asymptotical to batch learning using MLLR and MAPLR in recognition performance.
Keywords :
Bayes methods; hidden Markov models; learning (artificial intelligence); matrix algebra; normal distribution; speech recognition; statistical analysis; MAPLR; MLLR; batch learning; hidden Markov models; matrix variate normal distribution; maximum a posteriori linear regression; maximum likelihood linear regression; nonstationary environments; online/sequential linear regression adaptation; optimal regression matrix; pooled posterior density; prior statistics; quasi-Bayes linear regression algorithm; recognition performance; reproducible prior/posterior pair; sequential learning; speaker-independent speech recognition system; supervised speaker adaptation; unsupervised speaker adaptation; Acoustic testing; Covariance matrix; Gaussian distribution; Hidden Markov models; Linear regression; Maximum likelihood estimation; Maximum likelihood linear regression; Speech recognition; Statistical distributions; Vectors;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/TSA.2002.800555
Filename :
1021070
Link To Document :
بازگشت