• DocumentCode
    1937577
  • Title

    Perfect synthesis for all of the people all of the time

  • Author

    Black, Alan W.

  • fYear
    2002
  • fDate
    11-13 Sept. 2002
  • Firstpage
    167
  • Lastpage
    170
  • Abstract
    The quality of speech synthesis has drastically improved over the last ten years. Or at least it appears that this is the case. We have moved from diphones to unit selection. However, although we can produce much more natural sounding examples we have also given up an certain amount of control over what can be synthesized. We have reached the stage where playing a few examples to a non-expert can easily convince them that speech synthesis is a solved problem. This paper looks at how we might not only convince some of the people some of the time, but what we must do to produce perfect synthesis for all of the people all of the time.
  • Keywords
    speech processing; speech synthesis; speech quality; speech synthesis; unit selection; Cepstral analysis; Cost function; Encoding; Labeling; Natural languages; Spatial databases; Speech recognition; Speech synthesis; Weight measurement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Speech Synthesis, 2002. Proceedings of 2002 IEEE Workshop on
  • Print_ISBN
    0-7803-7395-2
  • Type

    conf

  • DOI
    10.1109/WSS.2002.1224400
  • Filename
    1224400