• DocumentCode
    2175342
  • Title

    HMM-based speech synthesiser using the LF-model of the glottal source

  • Author

    Cabral, João P. ; Renals, Steve ; Yamagishi, Junichi ; Richmond, Korin

  • Author_Institution
    Sch. of Comput. Sci. & Inf., Univ. Coll. Dublin, Dublin, Ireland
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    4704
  • Lastpage
    4707
  • Abstract
    A major factor which causes a deterioration in speech quality in HMM-based speech synthesis is the use of a simple delta pulse signal to generate the excitation of voiced speech. This paper sets out a new approach to using an acoustic glottal source model in HMM-based synthesisers instead of the traditional pulse signal. The goal is to improve speech quality and to better model and transform voice characteristics. We have found the new method decreases buzziness and also improves prosodic modelling. A perceptual evaluation has supported this finding by showing a 55.6% preference for the new system, as against the baseline. This improvement, while not being as significant as we had initially expected, does encourage us to work on developing the proposed speech synthesiser further.
  • Keywords
    hidden Markov models; speech synthesis; HMM-based speech synthesiser; acoustic glottal source model LF-model; delta pulse signal; perceptual evaluation; prosodic modelling; speech quality; voiced speech generation; Hidden Markov models; Mathematical model; Noise; Speech; Speech synthesis; Synthesizers; Glottal Source Modelling; HMM-based Speech Synthesis; LF-Model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947405
  • Filename
    5947405