Regional Information Center for Science and Technology (RICEST) - Voicing-specific LPC quantization for variable-rate speech coding

DocumentCode :

1544767

Title :

Voicing-specific LPC quantization for variable-rate speech coding

Author :

Hagen, Roar ; Paksoy, Erdal ; Gersho, Allen

Author_Institution :

Dept. of Inf. Theory, Chalmers Univ. of Technol., Goteborg, Sweden

Volume :

Issue :

fYear :

1999

fDate :

9/1/1999

Firstpage :

485

Lastpage :

494

Abstract :

Phonetic classification of speech frames allows distinctive quantization and bit allocation schemes suited to the particular class. Separate quantization of the linear predictive coding (LPC) parameters for voiced and unvoiced speech frames is shown to offer useful gains for representing the synthesis filter commonly used in code-excited linear prediction (CELP) and other coders. Subjective test results are reported that determine the required bit rate and accuracy in the two classes of voiced and unvoiced LPC spectra for CELP coding with phonetic classification. It was found, in this context, that unvoiced spectra need 9 b/frame or more whereas voiced spectra need 25 b/frame or more with the quantization schemes used. New spectral distortion criteria needed to assure transparent LPC spectral quantization for each voicing class in CELP coders are presented. Similar subjective test results for speech synthesized from the true residual signal are also presented, leading to some interesting observations on the role of the analysis-by-synthesis structure of CELP. Objective performance assessments based on the spectral distortion measure are also presented. The theoretical distortion-rate function for the spectral distortion measure is estimated for voiced and unvoiced LPC parameters and compared with experimental results obtained with unstructured vector quantization (VQ). These results show a saving of at least 2 b/frame for unvoiced spectra compared to voiced spectra to achieve the same spectral distortion performance.
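
The abstract refers to a spectral distortion measure without defining it. As a rough illustration only, the sketch below computes the commonly used log spectral distortion (the RMS difference, in dB, between two LPC log power spectra). It is not taken from the paper: the function names, the 512-point FFT grid, the full-band averaging, and the example coefficients are all assumptions made here for illustration.

```python
import numpy as np

def lpc_log_spectrum_db(a, n_fft=512):
    """Log power spectrum in dB of the all-pole filter 1/A(z).

    `a` holds the predictor polynomial coefficients [1, a1, ..., ap]
    (hypothetical input layout assumed for this sketch).
    """
    A = np.fft.rfft(a, n_fft)             # A(e^{jw}) sampled on [0, pi]
    return -20.0 * np.log10(np.abs(A))    # 10*log10(1 / |A|^2)

def spectral_distortion_db(a_ref, a_quant, n_fft=512):
    """RMS difference in dB between two LPC log spectra (full band)."""
    diff = lpc_log_spectrum_db(a_ref, n_fft) - lpc_log_spectrum_db(a_quant, n_fft)
    return float(np.sqrt(np.mean(diff ** 2)))

if __name__ == "__main__":
    # Illustrative first-order filters, not data from the paper.
    a_unquantized = [1.0, -0.90]
    a_quantized = [1.0, -0.85]
    print(spectral_distortion_db(a_unquantized, a_quantized))
```

In an evaluation of the kind the abstract describes, a measure like this would typically be averaged over many frames, separately for the voiced and unvoiced classes; the exact band limits and averaging used by the authors are not stated in this record.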

Keywords :

linear predictive coding; parameter estimation; rate distortion theory; signal classification; spectral analysis; speech coding; variable rate codes; vector quantisation; CELP; LPC parameters; LPC spectra; VQ; analysis-by-synthesis structure; bit allocation; bit rate; code-excited linear prediction; distortion-rate function; experimental results; linear predictive coding; objective performance; phonetic classification; residual signal; spectral distortion performance; subjective test results; synthesis filter; unstructured vector quantization; unvoiced spectra; unvoiced speech frames; variable-rate speech coding; voiced spectra; voiced speech frames; voicing-specific LPC quantization; Bit rate; Distortion measurement; Linear predictive coding; Nonlinear filters; Quantization; Signal synthesis; Speech analysis; Speech coding; Speech synthesis; Testing;

fLanguage :

English

Journal_Title :

Speech and Audio Processing, IEEE Transactions on

Publisher :

IEEE

ISSN :

1063-6676

Type :

jour

DOI :

10.1109/89.784101

Filename :

784101

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1544767