DocumentCode :
3542132
Title :
Rate determination based on perceptual loudness
Author :
Atti, Venkatraman ; Spanias, Andreas
Author_Institution :
Dept. of Electr. Eng., Arizona State Univ., Tempe, AZ, USA
fYear :
2005
fDate :
23-26 May 2005
Firstpage :
848
Abstract :
We describe a perceptually-motivated rate determination algorithm (RDA) for variable bit rate speech coding. Unlike existing rate selection strategies that are based on a voice activity detector and energy thresholds, the proposed method employs a perceptual loudness (PL) measure. The TIA IS-127 enhanced variable rate codec (EVRC) has been chosen as the test-bed for evaluating the performance of the PL-based rate selection strategy relative to three existing methods. In particular, the comparative study includes the following rate determination algorithms: voice activity detection; speech frame energy-thresholding; phonetic segmentation. Experimental results demonstrate that the proposed PL-based RDA compares well against other rate selection techniques in terms of average bitrate and speech quality.
Keywords :
loudness; speech codecs; speech coding; variable rate codes; TIA IS-127; average bitrate; enhanced variable rate codec; perceptual loudness; phonetic segmentation; rate determination algorithm; speech frame energy-thresholding; speech quality; variable bit rate speech coding; voice activity detection; Background noise; Bit rate; Encoding; Energy measurement; Psychoacoustic models; Psychology; Speech analysis; Speech codecs; Speech coding; Speech enhancement
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Circuits and Systems, 2005. ISCAS 2005. IEEE International Symposium on
Print_ISBN :
0-7803-8834-8
Type :
conf
DOI :
10.1109/ISCAS.2005.1464721
Filename :
1464721