Title :
Online hierarchical transformation of hidden Markov models for speech recognition
Author :
Chien, Jen-Tzung
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
fDate :
11/1/1999 12:00:00 AM
Abstract :
This paper proposes a novel framework of online hierarchical transformation of hidden Markov model (HMM) parameters for adaptive speech recognition. Our goal is to incrementally transform (or adapt) all the HMM parameters to a new acoustical environment even though most of HMM units are unseen in observed adaptation data. We establish a hierarchical tree of HMM units and apply the tree to dynamically search the transformation parameters for individual HMM mixture components. In this paper, the transformation framework formulated according to the approximate Bayesian estimate, where the prior statistics and the transformation parameters can be jointly and incrementally refreshed after each consecutive adaptation data, is presented. Using this formulation, only the refreshed prior statistics and the current block of data are needed for online transformation. In a series of speaker adaptation experiments on the recognition of 408 Mandarin syllables, we examine the effects on constructing various types of hierarchical trees. The efficiency and effectiveness of proposed method on incremental adaptation of overall HMM units are also confirmed. Besides, we demonstrate the superiority of proposed online transformation to Huo´s (see ibid., vol.5, p.161-72, 1997) on-line adaptation for a wide range of adaptation data
Keywords :
Bayes methods; adaptive signal processing; approximation theory; hidden Markov models; speech recognition; statistical analysis; trees (mathematics); HMM mixture components; HMM parameters; HMM units; Mandarin syllables recognition; acoustical environment; adaptation data; adaptive speech recognition; approximate Bayesian estimate; current data block; efficiency; hidden Markov models; hierarchical tree; observed adaptation data; online hierarchical transformation; refreshed prior statistics; speaker adaptation experiments; speech recognition; transformation parameters; Algorithm design and analysis; Bayesian methods; Calibration; Hidden Markov models; Pattern recognition; Robustness; Speech recognition; Statistics; Testing; Vocabulary;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on