• DocumentCode
    1092343
  • Title
    Predictive coding of speech signals and subjective error criteria
  • Author
    Atal, Bishnu S. ; Schroeder, Manfred R.
  • Author_Institution
    AT&T Bell Laboratories, Murray Hill, NJ, USA
  • Volume
    27
  • Issue
    3
  • fYear
    1979
  • fDate
    6/1/1979
  • Firstpage
    247
  • Lastpage
    254
  • Abstract
    Predictive coding methods attempt to minimize the rms error in the coded signal. However, the human ear does not perceive signal distortion on the basis of rms error, regardless of its spectral shape relative to the signal spectrum. In designing a coder for speech signals, it is necessary to consider the spectrum of the quantization noise and its relation to the speech spectrum. The theory of auditory masking suggests that noise in the formant regions would be partially or totally masked by the speech signal. Thus, a large part of the perceived noise in a coder comes from frequency regions where the signal level is low. In this paper, methods for reducing the subjective distortion in predictive coders for speech signals are described and evaluated. Improved speech quality is obtained: 1) by efficient removal of formant and pitch-related redundant structure of speech before quantizing, and 2) by effective masking of the quantizer noise by the speech signal.
  • Keywords
    Distortion; Ear; Humans; Noise shaping; Predictive coding; Quantization; Signal design; Spectral shape; Speech coding; Speech enhancement
  • fLanguage
    English
  • Journal_Title
    Acoustics, Speech and Signal Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    0096-3518
  • Type
    jour
  • DOI
    10.1109/TASSP.1979.1163237
  • Filename
    1163237