DocumentCode
257793
Title
Modified post-filter to recover modulation spectrum for HMM-based speech synthesis
Author
Takamichi, Shinnosuke ; Toda, Tomoki ; Black, Alan W. ; Nakamura, Satoshi
Author_Institution
Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Nara, Japan
fYear
2014
fDate
3-5 Dec. 2014
Firstpage
547
Lastpage
551
Abstract
This paper proposes a modified post-filter to recover the Modulation Spectrum (MS) in HMM-based speech synthesis. The MS-based post-filter has been proposed to alleviate the over-smoothing effect, which is one of the major problems in HMM-based speech synthesis. It recovers the utterance-level MS of the generated speech trajectory, and we have reported that it improves speech quality. However, this post-filter is not applicable to speech parameter trajectories of various lengths, such as phrases or segments, which are shorter than an utterance. To address this problem, we propose two modified post-filters: (1) a time-invariant filter with a simplified conversion form and (2) a segment-level post-filter which is applicable to a short-term parameter sequence. Furthermore, we also propose (3) a post-filter to recover the phoneme-level MS of HMM-state duration. Experimental results show that the modified post-filters yield significant quality improvements in synthetic speech, as the conventional post-filter does.
Keywords
hidden Markov models; speech synthesis; HMM based speech synthesis; HMM state duration; MS; modified post filter; modulation spectrum; quality improvement; speech parameter trajectories; speech trajectory; time invariant filter; Hidden Markov models; Indexes; Modulation; Speech; Speech synthesis; Trajectory; HMM-based speech synthesis; modulation spectrum; over-smoothing; post-filter;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal and Information Processing (GlobalSIP), 2014 IEEE Global Conference on
Conference_Location
Atlanta, GA
Type
conf
DOI
10.1109/GlobalSIP.2014.7032177
Filename
7032177