DocumentCode :
3703408
Title :
Fundamental frequency modeling using wavelets for emotional voice conversion
Author :
Huaiping Ming;Dongyan Huang;Minghui Dong;Haizhou Li;Lei Xie;Shaofei Zhang
Author_Institution :
Institute for Infocomm Research, A?STAR, 1 Fusionopolis Way, #21-01 Connexis, Singapore 138632
fYear :
2015
Firstpage :
804
Lastpage :
809
Abstract :
This paper is to show a representation of fundamental frequency (F0) using continuous wavelet transform (CWT) for prosody modeling in emotion conversion. Emotional conversion aims at converting speech from one emotion state to another. Specifically, we use CWT to decompose F0 into a five-scale representation that corresponds to five temporal scales. A neutral voice is converted to an emotional voice under an exemplar-based voice conversion framework, where both spectrum and F0 are simultaneously converted. The simulation results demonstrate that the dynamics of F0 in different temporal scales can be well captured and converted using the five-scale CWT representation. The converted speech signals are evaluated both objectively and subjectively, that confirm the effectiveness of the proposed method.
Keywords :
"Speech","Hidden Markov models","Continuous wavelet transforms","Dictionaries","Matrix converters","Radio frequency"
Publisher :
ieee
Conference_Titel :
Affective Computing and Intelligent Interaction (ACII), 2015 International Conference on
Electronic_ISBN :
2156-8111
Type :
conf
DOI :
10.1109/ACII.2015.7344665
Filename :
7344665
Link To Document :
بازگشت