Title :
A source generator based modeling framework for synthesis of speech under stress
Author :
Bou-Ghazale, Sahar E. ; Hansen, John H. L.
Author_Institution :
Dept. of Electr. Eng., Duke Univ., Durham, NC, USA
Abstract :
The objective of this paper is to formulate an algorithm to generate stressed synthetic speech from neutral speech using a source generator framework previously employed for stressed speech recognition. The following goals are addressed: (i) identify the most visible indicators of stress as perceived by the listener in stressed speaking styles such as loud, Lombard effect and angry, (ii) develop a mathematical model for representing speech production under stressed conditions, and (iii) employ the above model to produce emotional/stressed synthetic speech from neutral speech. The stress modeling scheme is applied to an existing low-bit rate CELP speech coder in order to investigate (i) the coder's ability and limitations in reproducing stressed synthetic speech, and (ii) our ability to perturb coded neutral speech parameters at the synthesis stage so that the resulting speech is perceived as being under stress. Two stress perturbation algorithms are proposed and evaluated. Results from formal listener evaluations show that 87% of neutral perturbed speech was indeed perceived as stressed.
Keywords :
linear predictive coding; speech coding; speech intelligibility; speech processing; speech recognition; speech synthesis; vocoders; Lombard effect; angry; coded neutral speech parameters; emotional/stressed synthetic speech; formal listener evaluations; loud; low-bit rate CELP speech coder; mathematical model; neutral perturbed speech; neutral speech; source generator; speech modeling; speech production; speech synthesis; stress modeling; stress perturbation algorithms; stressed conditions; stressed speaking styles; stressed speech recognition; stressed synthetic speech; visible stress indicators; Laboratories; Mathematical model; Robustness; Speech analysis; Speech coding; Speech processing; Speech recognition; Speech synthesis; Stress; Telephony;
Conference_Title :
1995 International Conference on Acoustics, Speech, and Signal Processing (ICASSP-95)
Conference_Location :
Detroit, MI
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479685