Title :
Multiband pitch and residual coding of speech signals
Author :
Mermelstein, P. ; Zeng, P. ; Saikaly, M. ; Qian, Y.
Author_Institution :
INRS Telecommun., Ile des Soeurs, Que., Canada
Abstract :
We explore improvements to code-excited linear prediction (CELP) speech coders near 7 kb/s that allow attainment of toll quality while maintaining low complexity. We introduce three-band pitch predictors to increase pitch prediction gain through the use of three pitch gain parameters, one for each frequency band. Prediction of the gains in the higher bands from that determined for the lowest band avoids the need to increase the transmitted information. The same three-band structure is exploited for codebook index generation by segmenting a full-band trained codebook into three band-pass components. Very small codebooks suffice for the bands above 1 kHz, allowing relatively more bits to be assigned to the lowest-band codebook. Subjective evaluation results indicate that in channels without errors the proposed codec achieves a speech quality comparable to the 32 kb/s ADPCM standard G.726, i.e., toll quality.
Keywords :
linear predictive coding; speech coding; speech intelligibility; vocoders; 32 kbit/s; 7 kbit/s; CELP; G.726 ADPCM standard; band-pass components; code-excited linear prediction; codebook index generation; frequency band; full-band trained codebook segmentation; low complexity; multiband pitch coding; pitch gain parameters; pitch prediction gain; residual coding; speech coders; speech quality; speech signals; subjective evaluation results; three-band pitch predictors; toll quality; Code standards; Decoding; Delay; Filtering; Frequency; Power harmonic filters; Quantization; Speech codecs; Speech coding; Stochastic processes
Conference_Title :
Communication Technology Proceedings, 1998. ICCT '98. 1998 International Conference on
Conference_Location :
Beijing
Print_ISBN :
7-80090-827-5
DOI :
10.1109/ICCT.1998.741003