DocumentCode
297115
Title
Vector quantization of harmonic magnitudes for low-rate speech coders
Author
Lupini, Peter ; Cuperman, Vladimir
Author_Institution
Sch. of Eng. Sci., Simon Fraser Univ., Burnaby, BC, Canada
Volume
2
fYear
1994
fDate
28 Nov- 2 Dec 1994
Firstpage
858
Abstract
Several techniques for speech coding at rates of 4 kb/s and lower require quantization of spectral magnitudes at a set of frequencies which are harmonics of the fundamental pitch period of the talker (for example: multiband excitation coding, sinusoidal transform coding, and time-frequency interpolation). The number of harmonic magnitudes to be quantized depends on the fundamental frequency value and hence is variable, changing from frame to frame. The variable number of components to be quantized makes it difficult to use fixed-dimension vector quantization for harmonic magnitude encoding. In this paper, we introduce a quantization technique called non-square transform vector quantization (NSTVQ) which uses a fixed-dimension vector quantizer combined with a variable-size non-square transform which maps the variable-dimension harmonic magnitude vector into a fixed-dimension vector. The optimal reconstruction procedure for non-square transforms is derived and shown to be equivalent to an optimal least-square estimation procedure. The proposed technique is evaluated experimentally as part of a new coding system called spectral excitation coding (SEC). The results are compared to an existing technique which estimates the spectral shape using all-pole modeling followed by vector quantization of the LSP parameters
Keywords
harmonics; source coding; spectral analysis; speech coding; vector quantisation; vocoders; LSP parameters; all-pole modeling; encoding; fixed-dimension vector quantizer; fundamental frequency value; harmonic magnitudes; low-rate speech coders; multiband excitation coding; nonsquare transform vector quantization; optimal least-square estimation procedure; optimal reconstruction procedure; sinusoidal transform coding; spectral excitation coding; spectral magnitudes; spectral shape; speech coding; time-frequency interpolation; vector quantization; Encoding; Frequency estimation; Image reconstruction; Interpolation; Linear predictive coding; Spectral shape; Speech coding; Time frequency analysis; Transform coding; Vector quantization;
fLanguage
English
Publisher
ieee
Conference_Titel
Global Telecommunications Conference, 1994. GLOBECOM '94. Communications: The Global Bridge., IEEE
Conference_Location
San Francisco, CA
Print_ISBN
0-7803-1820-X
Type
conf
DOI
10.1109/GLOCOM.1994.512716
Filename
512716
Link To Document