Speaker-independent phone modeling based on speaker-dependent HMMs´ composition and clustering

Author

Kosaka, Tetsuo ; Matsunaga, Shoichi ; Kuraoka, Mikio

Author_Institution

ATR Interpreting Telephony Res. Labs., Kyoto, Japan

Volume

1

fYear

1995

fDate

9-12 May 1995

Firstpage

441

Abstract

This paper proposes a novel method for speaker-independent phone modeling based on the composition and clustering method (CCL) of speaker-dependent HMMs. In general, HMM phone models are trained by the Baum-Welch (B-W) algorithm. We, however, propose a speaker-independent phone modeling in which speaker-dependent (SD) HMMs are combined to form speaker-independent (SI) HMMs without parameter reestimation. Furthermore, by using this method, we investigate how different kinds of reference speakers influence the development of the SI models. The method is evaluated in Japanese phoneme and phrase recognition experiments. Results show that the performance of this method is similar to the conventional B-W algorithm´s with great reduction of computational cost

Keywords

acoustic signal processing; hidden Markov models; speech processing; speech recognition; Baum-Welch algorithm; Japanese phoneme recognition; Japanese phrase recognition; composition and clustering method; computational cost reduction; performance; reference speakers; speaker-dependent HMM; speaker-independent HMM; speaker-independent phone modeling; speech recognition experiments; Clustering algorithms; Clustering methods; Computational efficiency; Context modeling; Hidden Markov models; Parameter estimation; Telecommunications; Training data;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

Conference_Location

Detroit, MI

ISSN

1520-6149

Print_ISBN

0-7803-2431-5

Type

conf

DOI

10.1109/ICASSP.1995.479623

Filename

479623