DocumentCode :
1130491
Title :
Measuring and modeling vocal source-tract interaction
Author :
Childers, Donald G. ; Wong, Chun-Fan
Author_Institution :
Dept. of Electr. Eng., Florida Univ., Gainesville, FL, USA
Volume :
41
Issue :
7
fYear :
1994
fDate :
7/1/1994 12:00:00 AM
Firstpage :
663
Lastpage :
671
Abstract :
The quality of synthetic speech is affected by two factors: intelligibility and naturalness. At present, synthesized speech may be highly intelligible, but often sounds unnatural. Speech intelligibility depends on the synthesizer´s ability to reproduce the formants, the formant bandwidths, and formant transitions, whereas speech naturalness is thought to depend on the excitation waveform characteristics for voiced and unvoiced sounds. Voiced sounds may be generated by a quasiperiodic train of glottal pulses of specified shape exciting the vocal tract filter. It is generally assumed that the glottal source and the vocal tract filter are linearly separable and do not interact. However, this assumption is often not valid, since it has been observed that appreciable source-tract interaction can occur in natural speech. Previous experiments in speech synthesis have demonstrated that the naturalness of synthetic speech does improve when source-tract interaction is simulated in the synthesis process. The purpose of this paper is two-fold: (1) to present an algorithm for automatically measuring source-tract interaction for voiced speech, and (2) to present a simple speech production model that incorporates source-tract interaction into the glottal source model, This glottal source model controls: (1) the skewness of the glottal pulse, and (2) the amount of the first formant ripple superimposed on the glottal pulse. A major application of the results of this paper is the modeling of vocal disorders.
Keywords :
biomedical measurement; physiological models; speech intelligibility; speech synthesis; algorithm; excitation waveform characteristics; first formant ripple; formant bandwidth; glottal source; quasiperiodic glottal pulses train; speech naturalness; synthetic speech quality; unvoiced sounds; vocal disorders modeling; vocal source-tract interaction; voiced sounds; Automatic control; Bandwidth; Filters; Natural languages; Pulse generation; Pulse measurements; Pulse shaping methods; Shape; Speech processing; Speech synthesis; Algorithms; Female; Glottis; Humans; Male; Models, Biological; Signal Processing, Computer-Assisted; Software; Speech Intelligibility; Speech, Alaryngeal; Voice;
fLanguage :
English
Journal_Title :
Biomedical Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9294
Type :
jour
DOI :
10.1109/10.301733
Filename :
301733
Link To Document :
بازگشت