Title :
Shape invariant time-scale and pitch modification of speech
Author :
Quatieri, Thomas F. ; McAulay, Robert J.
Author_Institution :
MIT Lincoln Lab., Lexington, MA, USA
Date :
3/1/1992
Abstract :
The simplified linear model of speech production predicts that when the rate of articulation is changed, the resulting waveform takes on the appearance of the original, except for a change in the time scale. A time-scale modification system that preserves this shape-invariance property during voicing is developed. This is done using a version of the sinusoidal analysis-synthesis system that models and independently modifies the phase contributions of the vocal tract and vocal cord excitation. An important property of the system is its ability to handle time-varying rates of change. Extensions of the method are applied to fixed and time-varying pitch modification of speech. The sine-wave analysis-synthesis system also allows for shape-invariant joint time-scale and pitch modification, and for adjusting the time scale and pitch according to speech characteristics such as the degree of voicing.
Keywords :
speech analysis and processing; speech synthesis; linear model; pitch modification; shape-invariant time-scale modification; sine-wave analysis-synthesis system; sinusoidal analysis-synthesis system; speech characteristics; speech production; time-scale modification system; vocal cord excitation; vocal tract; voicing; Concatenated codes; Frequency; Predictive models; Psychoacoustic models; Psychology; Shape; Speech analysis; Speech synthesis; Time varying systems; Vocoders
Journal_Title :
Signal Processing, IEEE Transactions on