• DocumentCode
    1357630
  • Title

    Emotion Conversion Based on Prosodic Unit Selection

  • Author

    Erro, Daniel ; Navas, Eva ; Hernáez, Inma ; Saratxaga, Ibon

  • Author_Institution
    Electron. & Telecommun. Dept., Univ. of the Basque Country (UPVEHU), Bilbao, Spain
  • Volume
    18
  • Issue
    5
  • fYear
    2010
  • fDate
    7/1/2010 12:00:00 AM
  • Firstpage
    974
  • Lastpage
    983
  • Abstract
    Voice conversion has been traditionally focused on spectrum. Current systems lack a solid prosody conversion method suitable for different speaking styles. Recently, the unit selection technique has been applied to transform emotional intonation contours. This paper goes one step beyond: it explores strategies for training and configuring the selection cost function in an emotion conversion application. The proposed system, which uses accent groups as basic intonation units and performs conversion also on phoneme durations and intensity, is evaluated by means of a carefully designed subjective test involving the big six emotions. Although the expressiveness of the converted sentences is still far from that of natural emotional speech, satisfactory results are obtained when different configurations are used for different emotions.
  • Keywords
    natural language processing; speech synthesis; accent groups; emotion conversion; emotional intonation contour transform; natural emotional speech; phoneme durations; phoneme intensity; prosodic unit selection; selection cost function; speech synthesis; voice conversion; Emotional speech synthesis; intonation; prosody; unit selection; voice conversion;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2009.2038658
  • Filename
    5353715