Title :
On speech coding in a perceptual domain
Author :
Kubin, Gemot ; Kleijn, W. Bastiaan
Author_Institution :
Tech. Univ. Wien, Austria
Abstract :
For speech coders which fall within the class of waveform coders, the reconstructed signal approaches the original with increasing bit rate. In such coders, the distortion criterion generally operates on the speech signal or a signal obtained by adaptive linear filtering of the speech signal. To satisfy computational and delay constraints, the distortion criterion must be reduced to a very simple approximation of the auditory system. This drawback of conventional approaches motivates a new speech coding paradigm in which the coding is performed in a domain where the single-letter squared-error criterion forms an accurate representation of perception. The new paradigm requires a model of the auditory periphery which is accurate, can be be inverted with relatively low computational effort, and which represents the signal with relatively few parameters. We develop such a model of the auditory periphery and discuss its suitability for speech coding. The results indicate that the new paradigm in general and our auditory model in particular form a promising basis for the coding of both speech and audio at low bit rates
Keywords :
adaptive filters; adaptive signal processing; audio coding; channel bank filters; filtering theory; hearing; signal reconstruction; signal representation; speech coding; vocoders; adaptive linear filtering; audio coding; auditory periphery model; auditory system approximation; computational constraint; delay constraint; distortion criterion; filterbank; invertible auditory model; low bit rate coding; perceptual domain; reconstructed signal; single-letter squared-error criterion; source coding; speech coders; speech coding; speech signal representation; waveform coders; Auditory system; Bit rate; Decorrelation; Delay; Distortion measurement; Maximum likelihood detection; Rate distortion theory; Signal design; Source coding; Speech coding;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
Print_ISBN :
0-7803-5041-3
DOI :
10.1109/ICASSP.1999.758098