Title :
Frequency selectivity via the SpEnt methodology for wideband speech compression
Author :
Kokes, Mark G. ; Gibson, Jerry D.
Author_Institution :
Dept. of Electr. Eng., Southern Methodist Univ., Dallas, TX, USA
Abstract :
In speech and audio coding, frequency selectivity of the basis functions is an important property of the codec. The more precise the frequency selectivity, the less chance there is for audible coding effects due to uncanceled aliasing. We use Campbell's (1960) coefficient rate and the spectral entropy (SpEnt) of the source random process as a guide to formulate adaptive nonuniform modulated lapped biorthogonal transforms (NMLBT). The use of the NMLBT allows for efficient implementation of a time-varying transform which possesses both good frequency and time resolution at all instances, without the need for transitional filters. By coupling the SpEnt methodology with modulated lapped biorthogonal transforms (MLBT), we develop band combining strategies to produce an adaptive NMLBT. Due to the nature of the SpEnt methodology, the new frequency selection process comprises a nonlinear approximation method to determine the best n basis functions to represent the current speech frame. We implement a wideband speech compression scheme based on this strategy and verify its improved performance in coding speech and audio signals at 16 and 24 kbps.
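The idea of using spectral entropy to decide how many basis functions a frame needs can be illustrated with a minimal sketch. It is not the authors' NMLBT codec. It swaps in a plain DFT for the MLBT, treats exp(entropy) of the normalized power spectrum as a discrete stand-in for a coefficient rate (an assumption about how that quantity is discretized), and keeps the n largest-magnitude coefficients as a simple nonlinear approximation. All function names are hypothetical.

```python
import numpy as np

def spectral_entropy(power):
    """Shannon entropy (in nats) of a power spectrum normalized to sum to 1."""
    power = np.asarray(power, dtype=float)
    total = power.sum()
    if total <= 0.0:
        return 0.0
    p = power / total
    p = p[p > 0.0]  # treat 0 * log(0) as 0
    return float(-np.sum(p * np.log(p)))

def effective_coefficient_count(power):
    """exp(entropy): assumed discrete analogue of a coefficient rate.

    Equals the number of bins for a flat spectrum and 1 for a single spike.
    """
    return float(np.exp(spectral_entropy(power)))

def keep_best_n(coeffs, n):
    """Nonlinear approximation: zero all but the n largest-magnitude coefficients."""
    coeffs = np.asarray(coeffs)
    n = max(0, min(int(n), coeffs.size))
    out = np.zeros_like(coeffs)
    if n > 0:
        idx = np.argsort(np.abs(coeffs))[-n:]
        out[idx] = coeffs[idx]
    return out

def approximate_frame(frame):
    """Pick n from the frame's spectral entropy, then keep the best n DFT bins."""
    spectrum = np.fft.rfft(frame)  # DFT used here in place of an MLBT
    power = np.abs(spectrum) ** 2
    n = int(np.ceil(effective_coefficient_count(power)))
    kept = keep_best_n(spectrum, n)
    return np.fft.irfft(kept, n=len(frame)), n
```

A tonal frame concentrates its power in few bins, so its entropy is low and few coefficients are kept. A noise-like frame has a flatter spectrum and keeps more.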
Keywords :
adaptive codes; approximation theory; data compression; entropy; random processes; spectral analysis; speech codecs; speech coding; transform coding; transforms; 16 kbit/s; 24 kbit/s; Campbell's coefficient rate; MLBT; SpEnt methodology; adaptive NMLBT; audio coding; audio signals; codec; frequency resolution; frequency selectivity; modulated lapped biorthogonal transforms; nonlinear approximation method; nonuniform modulated lapped biorthogonal transforms; source random process; spectral entropy; speech coding; speech frame; time resolution; time-varying transform; wideband speech compression; Approximation methods; Audio coding; Entropy; Filters; Frequency; Random processes; Speech codecs; Speech coding; Speech processing; Wideband;
Conference_Title :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.941030