Title :
Segmental quantization of speech spectral information
Author :
Svendsen, Torbjørn
Author_Institution :
Dept. of Telecommun., Norwegian Inst. of Technol., Trondheim, Norway
Abstract :
The majority of current speech coding algorithms for medium-to-low bit rates transmit two information components, a short-time spectrum estimate and an excitation signal. Even though advanced intraframe quantization schemes have been proposed, the spectral information still consumes large proportion of the available bit rate. For many speech sounds, the speech spectrum is relatively smooth for time intervals much longer than the sampling rate of the spectrum estimates. Thus, compression can be obtained by identifying smoothly varying segments of the speech spectrum and only transmitting the spectral information once for each segment. The segment spectral information is then an approximation to the true spectrum, but if the segmentation criterion is properly chosen, the induced distortion can be controlled to be within the acceptable 1 dB mean spectral distortion limit. In the present paper the author shows that segment quantization can be applied to reduce the required bit rate for the spectral information by a factor of approximately two without compromising the total spectral distortion
Keywords :
channel capacity; data compression; quantisation (signal); spectral analysis; speech coding; bit rate; compression; excitation signal; induced distortion; segmental quantization; segmentation; short-time spectrum estimate; smoothly varying segments; speech coding algorithms; speech sounds; speech spectral information; Bit rate; Interpolation; Sampling methods; Spectral analysis; Speech coding; Steady-state; Time frequency analysis; Vector quantization;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location :
Adelaide, SA
Print_ISBN :
0-7803-1775-0
DOI :
10.1109/ICASSP.1994.389242