• DocumentCode
    24212
  • Title

    Integrated Expression Prediction and Speech Synthesis From Text

  • Author

    Langzhou Chen ; Gales, Mark J.F. ; Braunschweiler, Norbert ; Akamine, Masami ; Knill, Kate

  • Author_Institution
    Toshiba Res. Eur. Ltd., Cambridge, UK
  • Volume
    8
  • Issue
    2
  • fYear
    2014
  • fDate
    Apr-14
  • Firstpage
    323
  • Lastpage
    335
  • Abstract
    Generating expressive, naturally sounding, speech from text using a speech synthesis (TTS) system is a highly challenging problem. However for tasks such as audiobooks it is essential if their use is to become widespread. Generating expressive speech from text can be divided into two parts: predicting expressive information from text; and synthesizing the speech with a particular expression. Traditionally these components have been studied separately. This paper proposes an integrated approach, where the training data and representation of expressive synthesis is shared across the two components. There are several advantages to this scheme including: robust handling of automatically generated expressive labels; support for a continuous representation of expressions; and joint training of the expression predictor and speech synthesizer. Synthesis experiments indicated that the proposed approach produced far more expressive speech than both a neutral TTS and one where the expression was randomly selected. The experimental results also show the advantage of a continuous expressive synthesis space over a discrete space.
  • Keywords
    speech synthesis; text analysis; TTS system; expression predictor; expressive information; expressive synthesis; integrated expression prediction; speech synthesis from text; speech synthesizer; Adaptation models; Decision trees; Hidden Markov models; Speech; Speech synthesis; Training; Vectors; Expressive speech synthesis; audiobook; cluster adaptive training; hidden Markov model; neural network;
  • fLanguage
    English
  • Journal_Title
    Selected Topics in Signal Processing, IEEE Journal of
  • Publisher
    ieee
  • ISSN
    1932-4553
  • Type

    jour

  • DOI
    10.1109/JSTSP.2013.2294938
  • Filename
    6683056