DocumentCode
1092343
Title
Predictive coding of speech signals and subjective error criteria
Author
Atal, Bishnu S. ; Schroeder, Manfred R.
Author_Institution
AT&T Bell Laboratories, Murray Hill, NJ, USA
Volume
27
Issue
3
fYear
1979
fDate
6/1/1979 12:00:00 AM
Firstpage
247
Lastpage
254
Abstract
Predictive coding methods attempt to minimize the rms error in the coded signal. However, the human ear does not perceive signal distortion on the basis of rms error, regardless of its spectral shape relative to the signal spectrum. In designing a coder for speech signals, it is necessary to consider the spectrum of the quantization noise and its relation to the speech spectrum. The theory of auditory masking suggests that noise in the formant regions would be partially or totally masked by the speech signal. Thus, a large part of the perceived noise in a coder comes from frequency regions where the signal level is low. In this paper, methods for reducing the subjective distortion in predictive coders for speech signals are described and evaluated. Improved speech quality is obtained: 1) by efficient removal of formant and pitch-related redundant structure of speech before quantizing, and 2) by effective masking of the quantizer noise by the speech signal.
Keywords
Distortion; Ear; Humans; Noise shaping; Predictive coding; Quantization; Signal design; Spectral shape; Speech coding; Speech enhancement;
fLanguage
English
Journal_Title
Acoustics, Speech and Signal Processing, IEEE Transactions on
Publisher
ieee
ISSN
0096-3518
Type
jour
DOI
10.1109/TASSP.1979.1163237
Filename
1163237
Link To Document