Title :
Rate determination based on perceptual loudness
Author :
Atti, Venkatraman ; Spanias, Andreas
Author_Institution :
Dept. of Electr. Eng., Arizona State Univ., Tempe, AZ, USA
Abstract :
We describe a perceptually-motivated rate determination algorithm (RDA) for variable bit rate speech coding. Unlike existing rate selection strategies that are based on a voice activity detector and energy thresholds, the proposed method employs a perceptual loudness (PL) measure. The TIA IS-127 enhanced variable rate codec (EVRC) has been chosen as the test-bed for evaluating the performance of the PL-based rate selection strategy relative to three existing methods. In particular, the comparative study includes the following rate determination algorithms: voice activity detection; speech frame energy-thresholding; phonetic segmentation. Experimental results demonstrate that the proposed PL-based RDA compares well against other rate selection techniques in terms of average bitrate and speech quality.
Keywords :
loudness; speech codecs; speech coding; variable rate codes; TIA IS-127; average bitrate; enhanced variable rate codec; perceptual loudness; phonetic segmentation; rate determination algorithm; speech frame energy-thresholding; speech quality; variable bit rate speech coding; voice activity detection; Background noise; Bit rate; Encoding; Energy measurement; Psychoacoustic models; Psychology; Speech analysis; Speech codecs; Speech coding; Speech enhancement;
Conference_Titel :
Circuits and Systems, 2005. ISCAS 2005. IEEE International Symposium on
Print_ISBN :
0-7803-8834-8
DOI :
10.1109/ISCAS.2005.1464721