• DocumentCode
    3428582
  • Title

    Attributing modelling errors in HMM synthesis by stepping gradually from natural to modelled speech

  • Author

    Merritt, Thomas ; Latorre, Javier ; King, Simon

  • Author_Institution
    Centre for Speech Technol. Res., Univ. of Edinburgh, Edinburgh, UK
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    4220
  • Lastpage
    4224
  • Abstract
    Even the best statistical parametric speech synthesis systems do not achieve the naturalness of good unit selection. We investigated possible causes of this. By constructing speech signals that lie in between natural speech and the output from a complete HMM synthesis system, we investigated various effects of modelling. We manipulated the temporal smoothness and the variance of the spectral parameters to create stimuli, then presented these to listeners alongside natural and vocoded speech, as well as output from a full HMM-based text-to-speech system and from an idealised `pseudo-HMM´. All speech signals, except the natural waveform, were created using vocoders employing one of two popular spectral parameterisations: Mel-Cepstra or Mel-Line Spectral Pairs. Listeners made `same or different´ pairwise judgements, from which we generated a perceptual map using Multidimensional Scaling. We draw conclusions about which aspects of HMM synthesis are limiting the naturalness of the synthetic speech.
  • Keywords
    hidden Markov models; speech synthesis; vocoders; voice equipment; HMM synthesis; Mel-Cepstra pairs; Mel-Line Spectral pairs; hidden Markov model; modelled speech; modelling errors; natural speech; speech naturalness; speech synthesis systems; vocoded speech; Hidden Markov models; Lead; Smoothing methods; Speech; hidden Markov modelling; speech synthesis; vocoding;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178766
  • Filename
    7178766