DocumentCode :
2017201
Title :
Generating emotional speech from neutral speech
Author :
Cen, Ling ; Chan, Paul ; Dong, Minghui ; Li, Haizhou
Author_Institution :
Inst. for Infocomm Res., A*STAR, Singapore, Singapore
fYear :
2010
fDate :
Nov. 29 2010-Dec. 3 2010
Firstpage :
383
Lastpage :
386
Abstract :
Emotional speech is one of the key techniques towards a natural and realistic conversation between human and machines. Generating emotional speech by means of converting a neutral speech is desirable as this allows us to generate emotional speech from many existing text-to-speech systems. The GMM based method is capable of synthesizing the desired spectrum, while the rule-based algorithm is effective in implementing the targeted prosodic features. Note that spectral and prosodic features are key factors that project the emotional effects of speech, in this paper, we propose the synthesis of emotional speech by applying a two-stage transformation that combines the GMM and RB methods. We synthesize happy, angry and sad speech and compare the proposed method with GMM linear transformation and RB transformation respectively. The listening test has shown that the speech synthesized by the proposed method is perceived to best portray the targeted speech emotion.
Keywords :
speech synthesis; GMM linear transformation; RB transformation; emotional speech generation; natural conversation; neutral speech; realistic conversation; rule based algorithm; text-to-speech systems; Databases; Feature extraction; Hidden Markov models; Speech; Speech synthesis; Training; Gaussian Mixture Model (GMM); emotional speech synthesis; rule-based method; voice conversation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location :
Tainan
Print_ISBN :
978-1-4244-6244-5
Type :
conf
DOI :
10.1109/ISCSLP.2010.5684862
Filename :
5684862
Link To Document :
بازگشت