• DocumentCode
    3723847
  • Title

    Dictionary-learning-based post-filter for HMM-based speech synthesis

  • Author

    Praneeth Kurpad Narayanamurthy;Chandra Sekhar Seelamantula

  • Author_Institution
    Department of Electrical Engineering, Indian Institute of Science Bangalore - 560 012, India
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Oversmoothing of speech parameter trajectories is one of the causes for quality degradation of HMM-based speech synthesis. Various methods have been proposed to overcome this effect, the most recent ones being global variance (GV) and modulation-spectrum-based post-filter (MSPF). However, there is still a significant quality gap between natural and synthesized speech. In this paper, we propose a two-fold post-filtering technique to alleviate to a certain extent the oversmoothing of spectral and excitation parameter trajectories of HMM-based speech synthesis. For the spectral parameters, we propose a sparse-coding-based post-filter to match the trajectories of synthetic speech to that of natural speech, and for the excitation trajectory, we introduce a perceptually motivated post-filter. Experimental evaluations show quality improvement compared with existing methods.
  • Keywords
    "Hidden Markov models","Speech","Trajectory","Natural languages","Dictionaries","Speech synthesis","Training data"
  • Publisher
    ieee
  • Conference_Titel
    TENCON 2015 - 2015 IEEE Region 10 Conference
  • ISSN
    2159-3442
  • Print_ISBN
    978-1-4799-8639-2
  • Electronic_ISBN
    2159-3450
  • Type

    conf

  • DOI
    10.1109/TENCON.2015.7373091
  • Filename
    7373091