Title :
Toll quality variable-rate speech codec
Author_Institution :
Speech & Audio Syst. Lab., Nokia Res. Center, Tampere, Finland
Abstract :
This paper presents a source controlled variable-rate CELP type speech codec. First, a voice activity detection block distinguishes active speech frames from silence and background noise. The active speech is further classified into voiced and unvoiced frames. The voiced frames have variable bit-rate pitch-lag quantization based on the characteristics of the speech, whereas the unvoiced frames are coded without pitch information. A variable bit-rate fixed codebook excitation with a variable number of excitation pulses is determined for each speech frame. The performance of the linear analysis part of the codec as well as the input speech characteristics determine the excitation bit-rate. The average bit-rate of the codec is around 7.0 kbit/s for active speech, and the overall bit-rate ranges from 0 to 7.85 kbit/s. The described variable-rate codec produces toll quality speech equal to that of the 32 kbit/s ADPCM (G.726) standard
Keywords :
linear predictive coding; quantisation (signal); source coding; spectral analysis; speech codecs; speech coding; speech processing; variable rate codes; 7 kbit/s; CELP; active speech frames; average bit-rate; background noise; fixed codebook excitation; overall bit-rate; silence; source controlled speech codec; speech classification; toll quality speech; unvoiced frames; variable bit-rate pitch-lag quantization; variable-rate speech codec; voice activity detection block; voiced frames; Decoding; Linear predictive coding; Performance analysis; Quantization; Signal analysis; Speech analysis; Speech codecs; Speech coding; Speech enhancement; Speech synthesis;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
Print_ISBN :
0-8186-7919-0
DOI :
10.1109/ICASSP.1997.596028