Predictive coding of speech signals and subjective error criteria

Author

Atal, Bishnu S.; Schroeder, Manfred R.

Author_Institution

AT&T Bell Laboratories, Murray Hill, NJ, USA

Volume

27

Issue

3

fYear

1979

fDate

6/1/1979

Firstpage

247

Lastpage

254

Abstract

Predictive coding methods attempt to minimize the rms error in the coded signal. However, the human ear does not perceive signal distortion on the basis of rms error alone, independent of the error's spectral shape relative to the signal spectrum. In designing a coder for speech signals, it is necessary to consider the spectrum of the quantization noise and its relation to the speech spectrum. The theory of auditory masking suggests that noise in the formant regions would be partially or totally masked by the speech signal. Thus, a large part of the perceived noise in a coder comes from frequency regions where the signal level is low. In this paper, methods for reducing the subjective distortion in predictive coders for speech signals are described and evaluated. Improved speech quality is obtained: 1) by efficient removal of formant and pitch-related redundant structure of speech before quantizing, and 2) by effective masking of the quantizer noise by the speech signal.
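
The noise-shaping idea in the abstract can be illustrated with a generic noise-feedback predictive coder. The sketch below is not the authors' implementation; it is a minimal example under stated assumptions: a fixed second-order short-term predictor `a`, a uniform quantizer with step `step`, and a feedback filter obtained by scaling predictor tap k by `gamma**k` (one common choice, assumed here). All names and values are hypothetical. With predictor polynomial A(z) = 1 - sum a_k z^-k and feedback polynomial C(z) = 1 - sum c_k z^-k, the reconstruction error in this structure is the quantizer noise filtered by C(z)/A(z).

```python
import math

def bandwidth_expand(a, gamma):
    """Scale predictor tap a_k by gamma**k (k = 1, 2, ...)."""
    return [ak * gamma ** (k + 1) for k, ak in enumerate(a)]

def past_sum(taps, x, n):
    """sum_k taps[k-1] * x[n-k], treating samples before n = 0 as zero."""
    return sum(t * x[n - 1 - k] for k, t in enumerate(taps) if n - 1 - k >= 0)

def encode(s, a, c, step):
    """Noise-feedback encoder: returns quantized residual and quantizer noise."""
    u_hat, q = [], []
    for n in range(len(s)):
        d = s[n] - past_sum(a, s, n)       # prediction residual from the clean input
        u = d - past_sum(c, q, n)          # feed back filtered past quantizer noise
        uq = step * round(u / step)        # uniform quantizer (illustrative)
        u_hat.append(uq)
        q.append(uq - u)                   # quantizer noise at time n
    return u_hat, q

def decode(u_hat, a):
    """Decoder: pass the quantized residual through the synthesis filter 1/A(z)."""
    s_hat = []
    for n in range(len(u_hat)):
        s_hat.append(u_hat[n] + past_sum(a, s_hat, n))
    return s_hat

def rms(x):
    return math.sqrt(sum(v * v for v in x) / len(x))

if __name__ == "__main__":
    a = [1.2, -0.72]                       # assumed stable predictor (poles at radius ~0.85)
    step = 0.1                             # assumed quantizer step size
    s = [math.sin(0.2 * n) + 0.3 * math.sin(1.1 * n) for n in range(200)]
    for gamma in (1.0, 0.8, 0.0):
        c = bandwidth_expand(a, gamma)
        u_hat, q = encode(s, a, c, step)
        err = [y - x for x, y in zip(s, decode(u_hat, a))]
        print(f"gamma={gamma}: rms error={rms(err):.4f}, rms quantizer noise={rms(q):.4f}")
```

In this sketch, `gamma = 1.0` makes the reconstruction error equal to the quantizer noise itself (roughly flat spectrum), while `gamma = 0.0` leaves the noise shaped by 1/A(z), so that it concentrates where the predictor models signal energy. Intermediate values trade total noise power against how much of the noise sits under the spectral peaks of the signal, which is the masking argument the abstract makes.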

Keywords

Distortion; Ear; Humans; Noise shaping; Predictive coding; Quantization; Signal design; Spectral shape; Speech coding; Speech enhancement

fLanguage

English

Journal_Title

Acoustics, Speech and Signal Processing, IEEE Transactions on

Publisher

IEEE

ISSN

0096-3518

Type

jour

DOI

10.1109/TASSP.1979.1163237

Filename

1163237