Title :
Vector quantization of harmonic magnitudes for low-rate speech coders
Author :
Lupini, Peter ; Cuperman, Vladimir
Author_Institution :
Sch. of Eng. Sci., Simon Fraser Univ., Burnaby, BC, Canada
fDate :
28 Nov- 2 Dec 1994
Abstract :
Several techniques for speech coding at rates of 4 kb/s and lower require quantization of spectral magnitudes at a set of frequencies which are harmonics of the fundamental pitch period of the talker (for example: multiband excitation coding, sinusoidal transform coding, and time-frequency interpolation). The number of harmonic magnitudes to be quantized depends on the fundamental frequency value and hence is variable, changing from frame to frame. The variable number of components to be quantized makes it difficult to use fixed-dimension vector quantization for harmonic magnitude encoding. In this paper, we introduce a quantization technique called non-square transform vector quantization (NSTVQ) which uses a fixed-dimension vector quantizer combined with a variable-size non-square transform which maps the variable-dimension harmonic magnitude vector into a fixed-dimension vector. The optimal reconstruction procedure for non-square transforms is derived and shown to be equivalent to an optimal least-square estimation procedure. The proposed technique is evaluated experimentally as part of a new coding system called spectral excitation coding (SEC). The results are compared to an existing technique which estimates the spectral shape using all-pole modeling followed by vector quantization of the LSP parameters
Keywords :
harmonics; source coding; spectral analysis; speech coding; vector quantisation; vocoders; LSP parameters; all-pole modeling; encoding; fixed-dimension vector quantizer; fundamental frequency value; harmonic magnitudes; low-rate speech coders; multiband excitation coding; nonsquare transform vector quantization; optimal least-square estimation procedure; optimal reconstruction procedure; sinusoidal transform coding; spectral excitation coding; spectral magnitudes; spectral shape; speech coding; time-frequency interpolation; vector quantization; Encoding; Frequency estimation; Image reconstruction; Interpolation; Linear predictive coding; Spectral shape; Speech coding; Time frequency analysis; Transform coding; Vector quantization;
Conference_Titel :
Global Telecommunications Conference, 1994. GLOBECOM '94. Communications: The Global Bridge., IEEE
Conference_Location :
San Francisco, CA
Print_ISBN :
0-7803-1820-X
DOI :
10.1109/GLOCOM.1994.512716