DocumentCode :
1843323
Title :
Emotional speech conversion based on spectrum-prosody dual transformation
Author :
Bingjie Li ; Zhongzhe Xiao ; Yan Shen ; Qiang Zhou ; Zhi Tao
Author_Institution :
Sch. of Phys. Sci. & Technol., Soochow Univ., Suzhou, China
Volume :
1
fYear :
2012
fDate :
21-25 Oct. 2012
Firstpage :
531
Lastpage :
535
Abstract :
A dual transformation system to transform neutral speech to emotional speech is proposed in this paper. Since spectral and prosodic features are key factors that influence the emotional effects of speech, Gaussian Mixture Model (GMM) method and the prosody rules algorithm are applied to transform the spectral and prosodic features respectively. In this paper, we transform neutral speech to angry, happy and sad speech. The training corpus is taken from Danish speech database, and the corpus used to transform is taken from the Berlin Database of Emotional Speech. It is shown in the listening test that the speech synthesized by the proposed method is perceived to portray the targeted speech emotion well. It also shows that emotions can be independent of human language from the same language family.
Keywords :
Gaussian processes; emotion recognition; speech synthesis; transforms; Berlin database; Danish speech database; GMM method; Gaussian mixture model; angry speech; emotional speech; happy speech; human language; language family; neutral speech; neutral speech transform; prosodic features; sad speech; spectral features; spectrum-prosody dual transformation-based emotional speech conversion; speech emotional effects; speech synthesis; targeted speech emotion; training corpus; GMM; emotional speech conversion; prosody rules;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing (ICSP), 2012 IEEE 11th International Conference on
Conference_Location :
Beijing
ISSN :
2164-5221
Print_ISBN :
978-1-4673-2196-9
Type :
conf
DOI :
10.1109/ICoSP.2012.6491543
Filename :
6491543
Link To Document :
بازگشت