• DocumentCode
    257793
  • Title

    Modified post-filter to recover modulation spectrum for HMM-based speech synthesis

  • Author

    Takamichi, Shinnosuke ; Toda, Tomoki ; Black, Alan W. ; Nakamura, Satoshi

  • Author_Institution
    Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Nara, Japan
  • fYear
    2014
  • fDate
    3-5 Dec. 2014
  • Firstpage
    547
  • Lastpage
    551
  • Abstract
    This paper proposes a modified post-filter to recover a Modulation Spectrum (MS) in HMM-based speech synthesis. To alleviate the over-smoothing effect which is one of the major problems in HMM-based speech synthesis, the MS-based post-filter has been proposed. It recovers the utterance-level MS of the generated speech trajectory, and we have reported its benefit to the quality improvement. However, this post-filter is not applicable to various lengths of speech parameter trajectories, such as phrases or segments, which are shorter than an utterance. To address this problem, we propose two modified post-filters, (1) the time-invariant filter with a simplified conversion form and (2) the segment-level post-filter which applicable to a short-term parameter sequence. Furthermore, we also propose (3) the post-filter to recover the phoneme-level MS of HMM-state duration. Experimental results show that the modified post-filters also yield significant quality improvements in synthetic speech as yielded by the conventional post-filter.
  • Keywords
    hidden Markov models; speech synthesis; HMM based speech synthesis; HMM state duration; MS; modified post filter; modulation spectrum; quality improvement; speech parameter trajectories; speech trajectory; time invariant filter; Hidden Markov models; Indexes; Modulation; Speech; Speech synthesis; Trajectory; HMM-based speech synthesis; modulation spectrum; over-smoothing; post-filter;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal and Information Processing (GlobalSIP), 2014 IEEE Global Conference on
  • Conference_Location
    Atlanta, GA
  • Type

    conf

  • DOI
    10.1109/GlobalSIP.2014.7032177
  • Filename
    7032177