DocumentCode
294611
Title
Speaker-independent phone modeling based on speaker-dependent HMMs´ composition and clustering
Author
Kosaka, Tetsuo ; Matsunaga, Shoichi ; Kuraoka, Mikio
Author_Institution
ATR Interpreting Telephony Res. Labs., Kyoto, Japan
Volume
1
fYear
1995
fDate
9-12 May 1995
Firstpage
441
Abstract
This paper proposes a novel method for speaker-independent phone modeling based on the composition and clustering method (CCL) of speaker-dependent HMMs. In general, HMM phone models are trained by the Baum-Welch (B-W) algorithm. We, however, propose a speaker-independent phone modeling in which speaker-dependent (SD) HMMs are combined to form speaker-independent (SI) HMMs without parameter reestimation. Furthermore, by using this method, we investigate how different kinds of reference speakers influence the development of the SI models. The method is evaluated in Japanese phoneme and phrase recognition experiments. Results show that the performance of this method is similar to the conventional B-W algorithm´s with great reduction of computational cost
Keywords
acoustic signal processing; hidden Markov models; speech processing; speech recognition; Baum-Welch algorithm; Japanese phoneme recognition; Japanese phrase recognition; composition and clustering method; computational cost reduction; performance; reference speakers; speaker-dependent HMM; speaker-independent HMM; speaker-independent phone modeling; speech recognition experiments; Clustering algorithms; Clustering methods; Computational efficiency; Context modeling; Hidden Markov models; Parameter estimation; Telecommunications; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location
Detroit, MI
ISSN
1520-6149
Print_ISBN
0-7803-2431-5
Type
conf
DOI
10.1109/ICASSP.1995.479623
Filename
479623
Link To Document