An acoustic model adaptation using HMM-based speech synthesis

Author

Tanaka, Koji ; Kuroiwa, Shingo ; Tsuge, Satoru ; Ren, Fuji

Author_Institution

Dept. of Inf. Sci. & Intelligent Syst., Tokushima Univ., Japan

fYear

2003

fDate

26-29 Oct. 2003

Firstpage

368

Lastpage

373

Abstract

Recently, personal digital assistants like cellular phones are shifting to the IP terminal. The encoding-decoding process utilized for transmitting over IP networks deteriorates the quality of the speech data. This deterioration causes degradation in speech recognition performance. Acoustic model adaptations could improve recognition performance. However, the current adaptation methods usually require a large amount of adaptation data. A novel adaptation method using speech synthesis based on HMM (hidden Markov model) is proposed. This method does not require speech data for adaptation because speech data is generated by speech synthesis from the acoustic model. Experimental results on G.723.1 coded speech recognition show that the proposed method improves speech recognition performance. A relative improvement in word accuracy of approximately 2% was observed.

Keywords

Internet telephony; hidden Markov models; speech recognition; speech synthesis; G.723.1 coded speech recognition; HMM-based speech synthesis; IP networks; acoustic model adaptation; cellular phone; encoding-decoding process; hidden Markov model; personal digital assistants; speech recognition performance; Acoustic distortion; Adaptation model; Cellular phones; Hidden Markov models; Personal digital assistants; Speech analysis; Speech coding; Speech recognition; Speech synthesis; Telephony;

fLanguage

English

Publisher

ieee

Conference_Titel

Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on

Conference_Location

Beijing, China

Print_ISBN

0-7803-7902-0

Type

conf

DOI

10.1109/NLPKE.2003.1275933

Filename

1275933