Title :
Analysis by synthesis speech coding with generalized pitch prediction
Author :
Mermelstein, Paul ; Qian, Yasheng
Author_Institution :
INRS-Telecommun., Quebec Univ., Verdun, Que., Canada
Abstract :
A new analysis-by-synthesis speech coding structure is presented for high-quality speech coding in the 4 to 8 kb/s range. CELP with generalized pitch prediction (GPP-CELP) differs from classical code-excited linear prediction (CELP) in that, for voiced segments, it is the speech signal rather than the LPC residual that is decomposed into a component predictable with the aid of the adaptive codebook (ACB) and a nonpredictable aperiodic component. The spectrum of the aperiodic component is estimated by linear-prediction analysis. An approximation to the aperiodic component is synthesized from a stochastic codebook of sparse pulse sequences, and its spectrum is shaped by the LPC synthesis filter. The ACB contains samples of the past reconstructed signal, low-pass filtered to increase the pitch prediction gain. For voiced segments the new structure yields higher pitch prediction gain and lower linear-prediction gain than classical CELP. Subjective and objective comparisons reveal significant advantages for GPP-CELP over classical CELP.
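The abstract compares coders by their pitch prediction gain. As a rough illustration of that quantity only, the sketch below computes the textbook single-tap long-term prediction gain of a subframe from a buffer of past samples and searches a lag range for the best value. It is not the GPP-CELP algorithm from the paper: the function names, the lag range, and the test signal are all assumptions made for this example, and the low-pass filtering of the adaptive codebook mentioned in the abstract is omitted.

```python
import math


def pitch_prediction_gain_db(target, past, lag):
    """Single-tap long-term prediction of `target` from `past` at one lag.

    `past` holds the samples immediately preceding `target`. The lag must be
    at least len(target) so the candidate vector lies entirely in `past`.
    Returns (tap_gain, prediction_gain_in_dB).
    """
    n = len(target)
    if lag < n or lag > len(past):
        raise ValueError("need len(target) <= lag <= len(past)")
    start = len(past) - lag
    candidate = past[start:start + n]  # adaptive-codebook-style vector
    energy = sum(c * c for c in candidate)
    if energy == 0.0:
        return 0.0, 0.0
    corr = sum(t * c for t, c in zip(target, candidate))
    gain = corr / energy  # least-squares tap gain
    err = sum((t - gain * c) ** 2 for t, c in zip(target, candidate))
    if err == 0.0:
        return gain, math.inf
    sig = sum(t * t for t in target)
    return gain, 10.0 * math.log10(sig / err)


def best_lag(target, past, min_lag, max_lag):
    """Exhaustive lag search; returns (lag, tap_gain, gain_dB) or None."""
    best = None
    lo = max(min_lag, len(target))
    hi = min(max_lag, len(past))
    for lag in range(lo, hi + 1):
        gain, pg = pitch_prediction_gain_db(target, past, lag)
        if best is None or pg > best[2]:
            best = (lag, gain, pg)
    return best


if __name__ == "__main__":
    # Hypothetical test signal: two harmonics with a 50-sample period.
    period = 50
    sig = [math.sin(2 * math.pi * k / period)
           + 0.5 * math.sin(4 * math.pi * k / period) for k in range(240)]
    past, target = sig[:200], sig[200:]
    print(best_lag(target, past, 40, 147))
```

A strongly periodic (voiced-like) input gives a large gain at lags equal to a multiple of the period, while an aperiodic input gives a gain near 0 dB at every lag.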
Keywords :
filtering theory; linear predictive coding; speech coding; 4 to 8 kbit/s; CELP; LPC synthesis filter; adaptive codebook; analysis-by-synthesis speech coding; code-excited linear prediction; generalized pitch prediction; high-quality speech coding; linear-prediction analysis; linear-prediction gain; nonpredictable aperiodic component; pitch prediction gain; sparse pulse sequences; stochastic codebook; voiced segments; Feedback loop; Frequency; Linear predictive coding; Low pass filters; Nonlinear filters; Quantization; Speech analysis; Speech coding; Speech synthesis; Stochastic processes
Conference_Title :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
Print_ISBN :
0-7803-5041-3
DOI :
10.1109/ICASSP.1999.758047