• DocumentCode
    3123725
  • Title

    Statistical modification based post-filtering technique for HMM-based speech synthesis

  • Author

    Zhengqi Wen ; Jianhua Tao ; Hao Che

  • Author_Institution
    Nat. Lab. of Pattern Recognition, Inst. of Autom., Beijing, China
  • fYear
    2012
  • fDate
    5-8 Dec. 2012
  • Firstpage
    146
  • Lastpage
    149
  • Abstract
    The speech generated from hidden Markov model (HMM)-based speech synthesis systems (HTS) is suffered from over-smoothing problem which is due to statistical modeling. This paper will focus on post-filtering technique based on statistical modification for the generated speech parameters. The marginal statistics of parameters´ trajectory, such as mean, variance, skewness and kurtosis are adjusted according to the values generated from the HTS system. This technique is compared with global variance (GV)-based speech generation algorithm. The listening test showed that the post-filtering technique considering the mean and variance could generate almost equal result with GV model. When further considering the modification of skewness and kurtosis, the quality of generated speech has been improved.
  • Keywords
    hidden Markov models; smoothing methods; speech synthesis; statistical analysis; HMM-based speech synthesis; hidden Markov model; kurtosis; marginal statistics; mean; over-smoothing problem; post-filtering technique; skewness; speech quality; statistical modeling; statistical modification; variance; Heuristic algorithms; Hidden Markov models; High temperature superconductors; Signal processing algorithms; Speech; Speech synthesis; Training; HMM-based speech synthesis; global variance; marginal statistics; post-filtering; statistical modification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
  • Conference_Location
    Kowloon
  • Print_ISBN
    978-1-4673-2506-6
  • Electronic_ISBN
    978-1-4673-2505-9
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2012.6423456
  • Filename
    6423456