Title :
Multi-band vector excitation coding of speech at 4.8 kbps
Author :
Garcia-Mateo, C. ; Casajus-Quiros, F.J. ; Hernandez-Gomez, L.A.
Author_Institution :
ETSI Telecommunicacion-USC, Pontevedra, Spain
Abstract :
A speech coder is presented that combines vector excitation coding (VXC) with frequency-domain representations to obtain a high-quality, efficient scheme at 4.8 kb/s. The scheme uses different coding strategies and bit allocations for different frames of speech and different frequency bands. Attention is focused on the representation of voiced sounds and on multiband procedures for representing the short-time Fourier transform. The proposed scheme starts with a phonetically based frame segmentation. Multiband long-term prediction is included only when necessary, and random excitation in voiced sounds is spectrally shaped by using a linear predictive coding envelope from the long-term prediction error.
Keywords :
encoding; speech analysis and processing; speech intelligibility; vocoders; 4.8 kbit/s; VXC; bit allocations; coding strategies; different frames of speech; different frequency bands; frequency-domain representations; high-quality efficient scheme; linear predictive coding envelope; long-term prediction error; multiband procedures; phonetically based frame segmentation; representation of voiced sounds; short-time Fourier transform; speech coder; speech encoding; transparent synthetic speech; vector excitation coding; Bit rate; Code standards; Fourier transforms; Frequency domain analysis; Linear predictive coding; Radio spectrum management; Speech coding; Telecommunication standards; Time domain analysis; US Government
Conference_Title :
Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990 International Conference on
Conference_Location :
Albuquerque, NM
DOI :
10.1109/ICASSP.1990.115525