DocumentCode
3438515
Title
Speech synthesis using HMMs with dynamic features
Author
Masuko, Takashi ; Tokuda, Keiichi ; Kobayashi, Takao ; Imai, Suguru
Author_Institution
Precision & Intelligence Lab., Tokyo Inst. of Technol., Yokohama, Japan
Volume
1
fYear
1996
fDate
7-10 May 1996
Firstpage
389
Abstract
This paper presents a new text-to-speech synthesis system based on HMM which includes dynamic features, i.e., delta and delta-delta parameters of speech. The system uses triphone HMMs as the synthesis units. The triphone HMMs share less than 2,000 clustered states, each of which is modelled by a single Gaussian distribution. For a given text to be synthesized, a sentence HMM is constructed by concatenating the triphone HMMs. Speech parameters are generated from the sentence HMM in such a way that the output probability is maximized. The speech signal is synthesized directly from the obtained parameters using the mel log spectral approximation (MLSA) filter. Without dynamic features, the discontinuity of the generated speech spectra causes glitches in the synthesized speech. On the other hand, with dynamic features, the synthesized speech becomes quite smooth and natural even if the number of clustered states is small
Keywords
Gaussian distribution; Gaussian processes; filtering theory; hidden Markov models; parameter estimation; spectral analysis; speech synthesis; Gaussian distribution; MLSA filter; clustered states; delta parameters; delta-delta parameters; dynamic features; mel log spectral approximation; output probability; sentence HMM; speech parameters; speech signal; speech spectra discontinuity; text to speech synthesis system; triphone HMM; Band pass filters; Cepstral analysis; Databases; Gaussian distribution; Hidden Markov models; Laboratories; Signal synthesis; Speech analysis; Speech recognition; Speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location
Atlanta, GA
ISSN
1520-6149
Print_ISBN
0-7803-3192-3
Type
conf
DOI
10.1109/ICASSP.1996.541114
Filename
541114
Link To Document