DocumentCode :
312330
Title :
Synthesis of stressed speech from isolated neutral speech using HMM-based models
Author :
Bou-Ghazale, Sahar E. ; Hansen, John H L
Author_Institution :
Robust Speech Process. Lab., Duke Univ., Durham, NC, USA
Volume :
3
fYear :
1996
fDate :
3-6 Oct 1996
Firstpage :
1860
Abstract :
A novel approach is proposed for modeling speech parameter variations between neutral and stressed conditions and employed in a technique for stressed speech synthesis. The proposed method consists of modeling the variations in pitch contour, voiced speech duration and average spectral structure using hidden Markov models (HMMs). While HMMs have traditionally been used for recognition applications, here they are used to statistically model the characteristics needed for generating pitch contour and spectral slope patterns to modify the speaking style of isolated neutral words. An algorithm is developed based on an analysis-synthesis speech model, and HMM pitch and spectral stress characteristics for stress perturbation. Informal listener evaluations of the stress-modified speech confirm the HMM´s ability to capture the parameter variations under stressed conditions. The proposed HMM models are both speaker- and word-independent, but unique to each speaking style. While the modeling scheme is applicable to a variety of stress and emotional speaking styles, the evaluations presented in this study focus on angry, Lombard-effect and loud-spoken speech
Keywords :
hidden Markov models; speech synthesis; Lombard effect; analysis-synthesis speech model; angry speech; average spectral structure; emotional speaking styles; hidden Markov models; isolated neutral speech; isolated neutral words; listener evaluations; loud speech; pitch contour; speaker-independent models; speaking style modification; spectral slope patterns; spectral stress characteristics; speech parameter variations; statistical modelling; stress perturbation; stress-modified speech; stressed speech synthesis; voiced speech duration; word-independent models; Autocorrelation; Character recognition; Hidden Markov models; Laboratories; Robustness; Speech analysis; Speech coding; Speech processing; Speech synthesis; Stress;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
Type :
conf
DOI :
10.1109/ICSLP.1996.607994
Filename :
607994
Link To Document :
بازگشت