Title :
Automatic Staging of Audio with Emotions
Author :
Saheer, Lakshmi ; Cernak, M.
Abstract :
Current day text-to-speech technologies are mature enough to be acceptable in quality for the users. There is still a large gap between a synthesised speech and a real human speech due to lack of expressions and emotions. Geneemo is a technology for automatic addition of emotions and expressions to any audio. The process of staging the text is to dramatize it. The text is enriched and transformed into a performance. Similarly, "staging the audio" refers to extending text dramatisation to audio by enriching emotionally neutral audio content into a natural human speech with real expressions. The audio can be generated by any text-to-speech technology. The aim of the project is to make human computer interactions as natural as possible with expressive speech. This also opens up a portfolio of applications replacing real human voices.
Keywords :
emotion recognition; human computer interaction; speech synthesis; Geneemo; audio automatic staging; expressive speech; human computer interactions; text dramatisation; text-to-speech technology; Abstracts; Affective computing; Human computer interaction; Markov processes; Speech; Speech recognition; Speech synthesis;
Conference_Titel :
Affective Computing and Intelligent Interaction (ACII), 2013 Humaine Association Conference on
Conference_Location :
Geneva
DOI :
10.1109/ACII.2013.124