Title :
Further analysis of latent affective mapping for naturally expressive speech synthesis
Author :
Bellegarda, Jerome R.
Author_Institution :
Speech & Language Technol., Apple Inc., Cupertino, CA, USA
Abstract :
An essential step in the generation of expressive speech synthesis is the automatic detection and classification of emotions most likely to be present in textual input. At last Interspeech, we introduced latent affective mapping, a new emotion analysis approach which leverages two separate levels of semantic information: one that encapsulates the foundations of the domain considered, and one that specifically accounts for the overall affective fabric of the language. The ensuing framework exposes the emergent relationship between these two levels in order to advantageously inform the emotion classification process. This paper presents further validation of latent affective mapping, as well as a detailed analysis of its behavior given the attendant richer emotional description. The various mapping instantiations supported compare favorably with more conventional techniques based on expert knowledge. In particular, representative case studies point to a better approximation of the true probability distribution across the range of standard emotions.
Keywords :
probability; speech synthesis; automatic detection; expert knowledge; latent affective mapping; mapping instantiations; natural expressive speech synthesis; probability distribution; ventional techniques; Blogs; Manuals; Probability distribution; Semantics; Speech; Speech synthesis; Training; affective congruence; emotion detection/classification; expressive speech synthesis; latent semantic mapping;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947568