A source generator based modeling framework for synthesis of speech under stress

Author

Ghazale, Sahar E Bou ; Hansen, John H L

Author_Institution

Dept. of Electr. Eng., Duke Univ., Durham, NC, USA

Volume

1

fYear

1995

fDate

9-12 May 1995

Firstpage

664

Abstract

The objective of this paper is to formulate an algorithm to generate stressed synthetic speech from neutral speech using a source generator framework previously employed for stressed speech recognition. The following goals are addressed (i) identify the most visible indicators of stress as perceived by the listener in stressed speaking styles such as loud, Lombard effect and angry, (ii) develop a mathematical model for representing speech production under stressed conditions, and (iii) employ the above model to produce emotional/stressed synthetic speech from neutral speech. The stress modeling scheme is applied to an existing low-bit rate CELP speech coder in order to investigate (i) the coder´s ability and limitations reproducing stressed synthetic speech, and (ii) our ability to perturb coded neutral speech parameters at the synthesis stage so that resulting speech is perceived as being under stress. Two stress perturbation algorithms are proposed and evaluated. Results from formal listener evaluations show that 87% of neutral perturbed speech was indeed perceived as stressed

Keywords

linear predictive coding; speech coding; speech intelligibility; speech processing; speech recognition; speech synthesis; vocoders; Lombard effect; angry; coded neutral speech parameters; emotional/stressed synthetic speech; formal listener evaluations; loud; low-bit rate CELP speech coder; mathematical model; neutral perturbed speech; neutral speech; source generator; speech modeling; speech production; speech synthesis; stress modeling; stress perturbation algorithms; stressed conditions; stressed speaking styles; stressed speech recognition; stressed synthetic speech; visible stress indicators; Laboratories; Mathematical model; Robustness; Speech analysis; Speech coding; Speech processing; Speech recognition; Speech synthesis; Stress; Telephony;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

Conference_Location

Detroit, MI

ISSN

1520-6149

Print_ISBN

0-7803-2431-5

Type

conf

DOI

10.1109/ICASSP.1995.479685

Filename

479685