DocumentCode
395210
Title
Joint optimization of short-term and long-term predictors in CELP speech coders
Author
Zarrinkoub, Houman ; Mermelstein, Paul
Author_Institution
Inst. Nat. de la Recherche Scientifique, Quebec Univ., Montreal, Que., Canada
Volume
2
fYear
2003
fDate
6-10 April 2003
Abstract
The objective of this work is to investigate whether joint optimization of short-term and long-term predictors manifests significant advantages over the sequential optimization in speech coding. We propose a new joint optimization method based on Wiener filtering. The proposed analysis model resolves the pitch-bias problem of classical LPC analysis by considering the contribution of the long-term predictor while optimizing the short-term predictor. Our approach to joint optimization is based on analysis-by-synthesis and guarantees the synthesis filter stability. By applying our proposed joint optimization approach to CELP coding we obtain superior objective and subjective performance relative to CELP coding with sequential optimization. To provide voice quality equivalent to that of sequentially optimized CELP, the jointly optimized coder needs fewer FCB pulses and requires a reduced bit budget for LPC quantization. Our listening tests suggest that the JCELP coder at 4.25 kbps is equivalent in quality to the G.729 at 8 kbps.
Keywords
Wiener filters; data compression; filtering theory; linear predictive coding; optimisation; speech coding; speech intelligibility; speech synthesis; vector quantisation; 4.25 kbit/s; 8 kbit/s; CELP coding; CELP speech coders; FCB pulses; G.729; LPC analysis; LPC quantization; Wiener filtering; analysis-by-synthesis; bit budget; joint optimization; listening tests; long-term predictor; objective performance; pitch-bias problem; sequential optimization; short-term predictor; speech coding; speech quality; subjective performance; synthesis filter stability; Linear predictive coding; Optimization methods; Power harmonic filters; Predictive models; Quantization; Signal analysis; Signal synthesis; Speech coding; Speech synthesis; Wiener filter;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1202318
Filename
1202318
Link To Document