Title :
A 450 b.p.s. vocoder with natural-sounding speech
Author :
Cheng, Yan Ming ; O'Shaughnessy, Douglas
Author_Institution :
INRS-Telecommun., Verdun, Que., Canada
Abstract :
An ARX model with glottal excitation is used to achieve an accurate and economical frequency compression of speech, and a short-term temporal decomposition is used for temporal compression. Quantization properties of the new set of coefficients and of the side information are studied through vector quantization. Moreover, the sequential structure of the vectors produced by the short-term temporal decomposition is exploited to further decrease the coding bit rate. As an application of the above techniques, a 450-b/s vocoder with a 200-ms delay is described.
Keywords :
analogue-digital conversion; encoding; speech synthesis; vocoders; 450 bit/s; ARX model; coding; frequency compression; temporal compression; temporal decomposition; vector quantization; vocoder; Bit rate; Delay; Filters; Frequency; Integrated circuit modeling; Predictive models; Signal processing; Speech; Speech coding; Speech enhancement; Speech processing; Vector quantization; Vocoders; White noise
Conference_Title :
Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990 International Conference on
Conference_Location :
Albuquerque, NM
DOI :
10.1109/ICASSP.1990.115824