Title :
Speech parameter generation algorithms for HMM-based speech synthesis
Author :
Tokuda, Keiichi ; Yoshimura, Takayoshi ; Masuko, Takashi ; Kobayashi, Takao ; Kitamura, Tadashi
Author_Institution :
Dept. of Comput. Sci., Nagoya Inst. of Technol., Japan
Abstract :
This paper derives a speech parameter generation algorithm for HMM-based speech synthesis, in which the speech parameter sequence is generated from HMMs whose observation vector consists of a spectral parameter vector and its dynamic feature vectors. In the algorithm, we assume that the state sequence (state and mixture sequence for the multi-mixture case) or a part of the state sequence is unobservable (i.e., hidden or latent). As a result, the algorithm iterates the forward-backward algorithm and the parameter generation algorithm for the case where the state sequence is given. Experimental results show that by using the algorithm, we can reproduce clear formant structure from multi-mixture HMMs as compared with that produced from single-mixture HMMs
Keywords :
hidden Markov models; maximum likelihood estimation; speech synthesis; HMM-based speech synthesis; dynamic feature vector; formant structure; forward-backward algorithm; multi-mixture HMM; observation vector; spectral parameter vector; speech parameter generation algorithms; speech parameter sequence; state sequence; Cepstral analysis; Character generation; Computer science; Context modeling; Databases; Hidden Markov models; Interpolation; Runtime; Speech synthesis;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
Print_ISBN :
0-7803-6293-4
DOI :
10.1109/ICASSP.2000.861820