Title :
Online unsupervised learning of hidden Markov models for adaptive speech recognition
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
fDate :
10/1/2001 12:00:00 AM
Abstract :
A novel framework of an online unsupervised learning algorithm is presented to flexibly adapt the existing speaker-independent hidden Markov models (HMMs) to nonstationary environments induced by varying speakers, transmission channels, ambient noises, etc. The quasi-Bayes (QB) estimate is applied to incrementally obtain word sequence and adaptation parameters for adjusting HMMs when a block of unlabelled data is enrolled. The underlying statistics of a nonstationary environment can be successively traced according to the newest enrolment data. To improve the QB estimate, the adaptive initial hyperparameters are employed in the beginning session of online learning. These hyperparameters are estimated from a cluster of training speakers closest to the test environment. Additionally, a selection process is developed to select reliable parameters from a list of candidates for unsupervised learning. A set of reliability assessment criteria is explored for selection. In a series of speaker adaptation experiments, the effectiveness of the proposed method is confirmed and it is found that using the adaptive initial hyperparameters in online learning and the multiple assessments in parameter selection can improve the recognition performance
Keywords :
Bayes methods; adaptive signal processing; online operation; parameter estimation; speech recognition; unsupervised learning; adaptive initial hyperparameters; adaptive speech recognition; ambient noise; automatic speech recognition; enrolment data; hidden Markov models; nonstationary environments; online unsupervised learning; parameter selection; quasi-Bayes estimate; recognition performance; reliability assessment criteria; speaker adaptation experiments; speaker-independent HMM; statistics; test environment; transmission channels; unlabelled data; word sequence parameters;
Journal_Title :
Vision, Image and Signal Processing, IEE Proceedings -
DOI :
10.1049/ip-vis:20010560