DocumentCode :
294568
Title :
A robust variable-rate speech coder
Author :
Shen, A. ; Tang, B. ; Alwan, A. ; Pottie, G.
Author_Institution :
Dept. of Electr. Eng., California Univ., Los Angeles, CA, USA
Volume :
1
fYear :
1995
fDate :
9-12 May 1995
Firstpage :
249
Abstract :
The goal of this study is to develop a robust, high-quality speech coder for wireless communication. The proposed coder is a perceptually-based variable-rate subband coder. The perceptual metric ensures that encoding is optimized for the human listener and is based on calculating the signal-to-mask ratio in short-time frames of the input signal. An adaptive bit allocation scheme is employed, and the subband energies are then quantized using a Max-Lloyd quantizer. The coder is fully scalable: increasing the bit rate improves the quality of the encoded speech. Subjective listening tests, using quiet and noisy input signals, indicate that the proposed coder produces high-quality speech when operating at 12 kbps or higher. In error-free conditions, the coder performs comparably to QCELP or GSM coders. For speech in background noise, however, the coder at 12 kbps significantly outperforms QCELP, and for music it outperforms both QCELP and GSM.
Keywords :
land mobile radio; quantisation (signal); speech coding; speech intelligibility; variable rate codes; vocoders; 12 kbit/s; Max-Lloyd quantizer; adaptive bit allocation; background noise; bit rates; encoded speech quality; encoding; error-free conditions; high-quality speech; high-quality speech coder; input signal; music; noisy input signals; perceptual metric; perceptually-based variable-rate subband coder; quiet input signals; robust variable-rate speech coder; short-time frames; signal-to-mask ratio; subband energies; subjective listening tests; wireless communication; Bit rate; Filter bank; Finite impulse response filter; GSM; IIR filters; Quantization; Robustness; Speech codecs; Speech enhancement; Testing; Wireless communication;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
ISSN :
1520-6149
Print_ISBN :
0-7803-2431-5
Type :
conf
DOI :
10.1109/ICASSP.1995.479520
Filename :
479520