مرکز منطقه ای اطلاع رساني علوم و فناوري - Online speaker adaptation based on quasi-Bayes linear regression

DocumentCode :

1749664

Title :

Online speaker adaptation based on quasi-Bayes linear regression

Author :

Chien, Jen-Tzung ; Huang, Chih-Hsien

Author_Institution :

Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan

Volume :

fYear :

2001

fDate :

2001

Firstpage :

329

Abstract :

This paper presents an online/sequential linear regression adaptation framework for hidden Markov model (HMM) based speech recognition. Our attempt is to sequentially improve the speaker-independent (SI) speech recognizer to meet nonstationary environments via linear regression adaptation of SI HMMs. A quasi-Bayes linear regression (QBLR) algorithm is developed to execute online adaptation where the regression matrix is estimated using QB theory. In the estimation, we moderately specify the prior density of the regression matrix as a matrix variate normal distribution and exactly derive the pooled posterior density belonging to the same distribution family. Accordingly, the optimal regression matrix can be easily calculated. Also, the reproducible prior/posterior density pair provides a meaningful mechanism for sequential learning of prior statistics. At each sequential epoch, only the updated prior statistics and the current observed data are required for adaptation. In general, the proposed QBLR is universal and can be reduced to the well-known maximum likelihood linear regression (MLLR) and maximum a posteriori linear regression (MAPLR). Experiments show that the QBLR is effective for speaker adaptation in car environments

Keywords :

Bayes methods; adaptive signal processing; hidden Markov models; matrix algebra; normal distribution; online operation; parameter estimation; speech recognition; statistical analysis; HMM; MAPLR; MLLR; QBLR maximum likelihood linear regression; car environment; hidden Markov model; hyperparameter estimation; matrix variate normal distribution; maximum posteriori linear regression; observed data; online speaker adaptation; online/sequential linear regression adaptation; optimal regression matrix; pooled posterior density; prior statistics; quasi-Bayes linear regression; regression matrix density; reproducible prior/posterior density; sequential epoch; sequential learning; speaker adaptation; speaker-independent speech recognition; updated prior statistics; Acoustic testing; Estimation theory; Hidden Markov models; Linear regression; Maximum likelihood estimation; Maximum likelihood linear regression; Sparse matrices; Speech recognition; Statistical distributions; Vectors;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on

Conference_Location :

Salt Lake City, UT

ISSN :

1520-6149

Print_ISBN :

0-7803-7041-4

Type :

conf

DOI :

10.1109/ICASSP.2001.940834

Filename :

940834

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1749664