Title :
A compact model for speaker-adaptive training
Author :
Anastasakos, Tasos ; McDonough, John ; Schwartz, Richard ; Makhoul, John
Author_Institution :
Northeastern Univ., Boston, MA, USA
Abstract :
We formulate a novel approach to estimating the parameters of continuous density HMMs for speaker-independent (SI) continuous speech recognition. It is motivated by the fact that variability in SI acoustic models is attributed to both phonetic variation and variation among the speakers of the training population, that is independent of the information content of the speech signal. These two variation sources are decoupled and the proposed method jointly annihilates the inter-speaker variation and estimates the HMM parameters of the SI acoustic models. We compare the proposed training algorithm to the common SI training paradigm within the context of supervised adaptation. We show that the proposed acoustic models are more efficiently adapted to the test speakers, thus achieving significant overall word error rate reductions of 19% and 25% for 20K and 5K vocabulary tasks respectively
Keywords :
acoustics; hidden Markov models; learning (artificial intelligence); parameter estimation; speech recognition; acoustic model variability; continuous density hidden Markov models; inter-speaker variation; parameter estimation; phonetic variation; speaker-adaptive training algorithm; speaker-independent continuous speech recognition; speech signal information content; supervised adaptation; vocabulary; word error rate reductions; Acoustic testing; Electronic mail; Error analysis; Hidden Markov models; Loudspeakers; Maximum likelihood estimation; Parameter estimation; Speech recognition; Training data; Vocabulary;
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607807