DocumentCode
118003
Title
A hybrid text-to-speech based on sub-band approach
Author
Inoue, Takuma ; Hara, Sunao ; Abe, Masanobu
Author_Institution
Dept. of Comput. Sci., Okayama Univ., Okayama, Japan
fYear
2014
fDate
9-12 Dec. 2014
Firstpage
1
Lastpage
4
Abstract
This paper proposes a sub-band speech synthesis approach to develop high-quality Text-to-Speech (TTS). For the low-frequency band and high-frequency band, Hidden Markov Model (HMM)-based speech synthesis and waveform-based speech synthesis are used, respectively. Both speech synthesis methods are widely known to show good performance and to have benefits and shortcomings from different points of view. One motivation is to apply the right speech synthesis method in the right frequency band. Experiment results show that in terms of the smoothness the proposed approach shows better performance than waveform-based speech synthesis, and in terms of the clarity it shows better than HMM-based speech synthesis. Consequently, the proposed approach combines the inherent benefits from both waveform-based speech synthesis and HMM-based speech synthesis.
Keywords
hidden Markov models; speech synthesis; HMM; Hidden Markov Model; TTS; hybrid text-to-speech; speech synthesis methods; subband speech synthesis approach; waveform based speech synthesis; Frequency synthesizers; Harmonic analysis; Hidden Markov models; Speech; Speech synthesis; Vocoders;
fLanguage
English
Publisher
ieee
Conference_Titel
Asia-Pacific Signal and Information Processing Association, 2014 Annual Summit and Conference (APSIPA)
Conference_Location
Siem Reap
Type
conf
DOI
10.1109/APSIPA.2014.7041575
Filename
7041575
Link To Document