Title of article :
PROSODIC ANALYSIS AND MODELLING FOR MALAY EMOTIONAL SPEECH SYNTHESIS
Author/Authors :
MUSTAFA, Mumtaz B. university of malaya - Faculty of Computer Science and Information Technology - ICT Research Cluster, Computational Speech Group, Malaysia , AINON, Raja N. university of malaya - Faculty of Computer Science and Information Technology - ICT Research Cluster, Computational Speech Group, Malaysia , Zainuddin, Roziati university of malaya - Faculty of Computer Science and Information Technology - ICT Research Cluster, Computational Speech Group, Malaysia , Don, Zuraidah M. university of malaya - Faculty of Language and Linguistics - ICT Research Cluster, Computational Speech Group, Malaysia , Knowles, Gerry Lingenium Sdn Bhd, Malaysia , Knowles, Gerry university of malaya - ICT Research Cluster, Computational Speech Group, Malaysia , Mokhtar, Salimah university of malaya - Faculty of Computer Science and Information Technology - ICT Research Cluster, Computational Speech Group, Malaysia
From page :
102
To page :
110
Abstract :
This paper discusses an emotional prosody generator for a Malay speech synthesis system that can re-synthesize the selected vocal emotion from neutral synthesized speech output and improve the naturalness by adopting rule-based prosody conversion techniques. The role of prosodic features in emotional expression, particularly fundamental frequency and duration, has been widely investigated in several research projects. This project attempts to improve the naturalness of the synthesized emotional Malay speech by establishing an effective mechanism for the re-synthesis of emotion. Such a mechanism is created by analyzing the variation in the F0 contour of continuous emotional Malay speech against a fixed time period. The emotional prosodic generator for Malay developed in the course of this research makes use of principles of parametric prosody manipulation to synthesize four basic emotions, namely happiness, anger, sadness and fear. Subjective evaluation by means of listening tests was conducted to validate the ability of the emotions generator to generate the necessary prosody to synthesize emotional expression. The evaluation results show an overall recognition rate of between 61% and 85%.
Keywords :
Emotional speech re , synthesis , Prosody conversion , Rule , based approach, MBROLA
Journal title :
Malaysian Journal of Computer Science
Journal title :
Malaysian Journal of Computer Science
Record number :
2571898
Link To Document :
بازگشت