• DocumentCode
    1668950
  • Title
    Rate-distortion function for speech coding based on perceptual distortion measure
  • Author
    De, Aloknath; Kabal, Peter
  • Author_Institution
    Dept. of Electr. Eng., McGill Univ., Montreal, Que., Canada
  • fYear
    1992
  • Firstpage
    452
  • Abstract
    The authors (1992) proposed a perceptual distortion measure for speech coders using an auditory (cochlear) model. This measure evaluates the neural-firing cross-entropy of the coded speech with respect to that of the original speech. Here the output space of the cochlear model is explored using this measure, in order to verify the existence of the pitch and formant information. A rate-distortion analysis for speech coding is provided. A lower bound to the rate-distortion function is evaluated based on the distortion measure, and the exact rate-distortion function is computed using the Blahut (1972) algorithm. Four state-of-the-art speech coders with rates ranging from 4.8 kb/s (CELP) to 32 kb/s (ADPCM) are studied from the viewpoint of their performance with respect to the rate-distortion limits.
  • Keywords
    ear; hearing; speech coding; 4.8 to 32 kbits/s; ADPCM; Blahut algorithm; CELP; auditory model; cochlear model; formant; lower bound; neural-firing cross-entropy; perceptual distortion measure; performance; pitch; rate-distortion analysis; rate-distortion function; speech coders; Band pass filters; Distortion measurement; Frequency; Performance evaluation; Rate-distortion; Signal mapping; Space exploration; Speech analysis
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Global Telecommunications Conference, 1992. Conference Record., GLOBECOM '92. Communication for Global Users., IEEE
  • Conference_Location
    Orlando, FL
  • Print_ISBN
    0-7803-0608-2
  • Type
    conf
  • DOI
    10.1109/GLOCOM.1992.276544
  • Filename
    276544
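
The abstract states that the exact rate-distortion function is computed with the Blahut (1972) algorithm. As a rough illustration of that iteration only, the sketch below computes one point on an R(D) curve for a discrete source. It is not the authors' implementation: the function name `blahut_rate_distortion`, the slope parameter `beta`, the fixed iteration count, and the toy binary source with Hamming distortion are all assumptions made for this example. The paper's neural-firing cross-entropy measure would take the place of the distortion matrix `dist`.

```python
import math


def blahut_rate_distortion(p_x, dist, beta, iters=200):
    """Return (rate in bits, distortion) for one slope parameter beta > 0.

    p_x  : source probabilities, one per source symbol
    dist : dist[x][y] is the distortion of reproducing x as y
           (a toy stand-in here, not the perceptual measure)
    """
    n_x, n_y = len(p_x), len(dist[0])
    q_y = [1.0 / n_y] * n_y  # start from a uniform reproduction marginal
    for _ in range(iters):
        # Step 1: best test channel Q(y|x) for the current marginal q(y).
        cond = []
        for x in range(n_x):
            w = [q_y[y] * math.exp(-beta * dist[x][y]) for y in range(n_y)]
            total = sum(w)
            cond.append([v / total for v in w])
        # Step 2: marginal induced by that channel.
        q_y = [sum(p_x[x] * cond[x][y] for x in range(n_x)) for y in range(n_y)]

    # Mutual information and expected distortion of the final channel.
    rate = 0.0
    distortion = 0.0
    for x in range(n_x):
        for y in range(n_y):
            joint = p_x[x] * cond[x][y]
            if joint > 0.0:
                rate += joint * math.log2(cond[x][y] / q_y[y])
                distortion += joint * dist[x][y]
    return rate, distortion


# Toy usage: uniform binary source with Hamming distortion (assumed example).
rate, d = blahut_rate_distortion([0.5, 0.5], [[0.0, 1.0], [1.0, 0.0]], math.log(9.0))
print(f"D = {d:.3f}, R = {rate:.3f} bits")
```

For this toy source the result can be checked against the known closed form R(D) = 1 - H(D): with `beta = ln 9` the sketch gives D = 0.1 and R of about 0.531 bits. Sweeping `beta` over a range of positive values traces out the rest of the curve.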