DocumentCode :
312181
Title :
Speaker adaptation using tree structured shared-state HMMs
Author :
Ishii, Jun ; Tonomura, Masahiro ; Matsunaga, Shinichiro
Author_Institution :
ATR Interpreting Telecommun. Res. Labs., Kyoto, Japan
Volume :
2
fYear :
1996
fDate :
3-6 Oct 1996
Firstpage :
1149
Abstract :
The paper proposes a novel speaker adaptation method that flexibly controls state-sharing of HMMs according to the amount of adaptation data. In the scheme, acoustic modeling is combined with adaptation to efficiently utilize the acoustic models sharing characteristics for adaptation. The shared-state set of HMMs is determined by using tree-structured shared-state HMMs created from the history recorded for acoustic model generation. The proposed method is applied to the parameter-tying and parameter-smoothing techniques. Experiments have been performed on a Japanese phoneme recognition test using continuous density mixture Gaussian HMMs. Using 50 adaptation phrases, a 42% reduction in the phoneme recognition error rate from the speaker-independent model was achieved
Keywords :
hidden Markov models; speech recognition; tree data structures; Japanese phoneme recognition test; acoustic model generation; acoustic modeling; adaptation data; adaptation phrases; continuous density mixture Gaussian HMMs; history; parameter-smoothing techniques; parameter-tying techniques; phoneme recognition error rate; speaker adaptation method; speaker-independent model; tree structured shared-state HMMs; Acoustic testing; Clustering algorithms; Hidden Markov models; History; Humans; Loudspeakers; Smoothing methods; Speech recognition; Telecommunication control; Tree data structures;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
Type :
conf
DOI :
10.1109/ICSLP.1996.607810
Filename :
607810
Link To Document :
بازگشت