DocumentCode :
1668950
Title :
Rate-distortion function for speech coding based on perceptual distortion measure
Author :
De, Aloknath ; Kabal, Peter
Author_Institution :
Dept. of Electr. Eng., McGill Univ., Montreal, Que., Canada
fYear :
1992
Firstpage :
452
Abstract :
The authors (1992) proposed a perceptual distortion measure for speech coders using an auditory (cochlear) model. This measure evaluates the neural-firing cross-entropy of the coded speech with respect to that of the original speech. Here the output space of the cochlear model is explored using this measure, in order to verify the existence of the pitch and formant information. A rate-distortion analysis for speech coding is provided. A lower bound to the rate-distortion function is evaluated based on the distortion measure, and the exact rate-distortion function is computed using the Blahut (1972) algorithm. Four state-of-the-art speech coders with rates ranging from 4.8 kb/s (CELP) to 32 kb/s (ADPCM) are studied from the viewpoint of their performance with respect to the rate-distortion limits
Keywords :
ear; hearing; speech coding; 4.8 to 32 kbits/s; ADPCM; Blahut algorithm; CELP; auditory model; cochlear model; formant; lower bound; neural-firing cross-entropy; perceptual distortion measure; performance; pitch; rate-distortion analysis; rate-distortion function; speech coders; speech coding; Band pass filters; Distortion measurement; Ear; Frequency; Performance evaluation; Rate-distortion; Signal mapping; Space exploration; Speech analysis; Speech coding;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Global Telecommunications Conference, 1992. Conference Record., GLOBECOM '92. Communication for Global Users., IEEE
Conference_Location :
Orlando, FL
Print_ISBN :
0-7803-0608-2
Type :
conf
DOI :
10.1109/GLOCOM.1992.276544
Filename :
276544
Link To Document :
بازگشت