DocumentCode :
302326
Title :
Tied-structure HMM based on parameter correlation for efficient model training
Author :
Takahashi, Satoshi ; Sagayama, Shigeki
Author_Institution :
NTT Human Interface Labs., Kanagawa, Japan
Volume :
1
fYear :
1996
fDate :
7-10 May 1996
Firstpage :
467
Abstract :
This paper proposes a new scheme of the tied-structure for constraining an HMM structure to increase training efficiency and recognition robustness. In conventional tied-structure approaches, tied parameters (or distributions, states, allophones) share the same value for decreasing model complexity. In the new framework, the tied parameters are correlated to each other rather than share the same value. To establish the appropriate correlation between model parameters, a speaker-independent initial model is trained using multiple sets of speaker-dependent data. The transfer vectors from the initial mean vectors of Gaussian distributions to the trained mean vectors are clustered to obtain sets of mean vectors that are mutually correlated across speakers. This kind of speaker-independent parameter-correlated structure can yield a lower degree-of-freedom for the model compared to the baseline speaker-independent model. Using the model with the parameter-correlated structure, speaker adaptation experiments are performed to demonstrate the higher training efficiency of the model compared to conventional speaker adaptation techniques without the correlation structure
Keywords :
Gaussian distribution; Gaussian processes; correlation methods; hidden Markov models; parameter estimation; speech recognition; Gaussian distributions; allophones; distributions; initial mean vectors; model complexity; model training; parameter correlation; recognition robustness; speaker adaptation experiments; speaker dependent data; speaker independent initial model; speaker independent parameter correlated structure; states; tied parameters; tied structure HMM; trained mean vectors; training efficiency; transfer vectors; Clustering algorithms; Gaussian distribution; Hidden Markov models; Humans; Laboratories; Loudspeakers; Probability; Robustness; Speech recognition; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
ISSN :
1520-6149
Print_ISBN :
0-7803-3192-3
Type :
conf
DOI :
10.1109/ICASSP.1996.541134
Filename :
541134
Link To Document :
بازگشت