DocumentCode
2628952
Title
Perception of synthetic emotion expressions in speech: Categorical and dimensional annotations
Author
Kessens, Judith M. ; Neerincx, Mark A. ; Looije, Rosemarijn ; Kroes, Melanie ; Bloothooft, Gerrit
Author_Institution
TNO Defense, Safety & Security, Soesterberg, Netherlands
fYear
2009
fDate
10-12 Sept. 2009
Firstpage
1
Lastpage
5
Abstract
In this paper, both categorical and dimensional annotations have been made of neutral and emotional speech synthesis (anger, fear, sad, happy and relaxed). With various prosodic emotion manipulation techniques we found emotion classification rates of 40%, which is significantly above chance level (17%). The classification rates are higher for sentences whose semantics match the synthetic emotion. When pitch and duration were manipulated, differences in arousal were perceived, whereas differences in valence were hardly perceived. Of the investigated emotion manipulation methods, EmoFilt and EmoSpeak performed very similarly, except for the emotion fear. Copy synthesis did not perform well, probably because of suboptimal alignments and the use of multiple speakers.
Keywords
emotion recognition; speech recognition; speech synthesis; EmoFilt; EmoSpeak; categorical annotations; copy synthesis; dimensional annotations; duration manipulation; emotion classification rates; emotional speech synthesis; neutral speech synthesis; pitch manipulation; prosodic emotion manipulation techniques; semantics; synthetic emotion expression perception; synthetic emotion matching; Databases; Diabetes; Loudspeakers; Pediatrics; Robots; Safety; Security; Speech synthesis; Stochastic processes; Synthesizers;
fLanguage
English
Publisher
IEEE
Conference_Titel
Affective Computing and Intelligent Interaction and Workshops, 2009. ACII 2009. 3rd International Conference on
Conference_Location
Amsterdam
Print_ISBN
978-1-4244-4800-5
Electronic_ISBN
978-1-4244-4799-2
Type
conf
DOI
10.1109/ACII.2009.5349594
Filename
5349594