• DocumentCode
    294568
  • Title

    A robust variable-rate speech coder

  • Author

    Shen, A. ; Tang, B. ; Alwan, A. ; Pottie, G.

  • Author_Institution
    Dept. of Electr. Eng., California Univ., Los Angeles, CA, USA
  • Volume
    1
  • fYear
    1995
  • fDate
    9-12 May 1995
  • Firstpage
    249
  • Abstract
    The goal of this study is to develop a robust and high-quality speech coder for wireless communication. The proposed coder is a perceptually-based variable-rate subband coder. The perceptual metric ensures that encoding is optimized to the human listener and is based on calculating the signal-to-mask ratio in short-time frames of the input signal. An adaptive bit allocation scheme is employed and the subband energies are then quantized using a Max-Lloyd quantizer. The coder is fully scalable: increasing the bit rate improves the quality of the encoded speech. Subjective listening tests, using quiet and noisy input signals, indicate that the proposed coder produces high-quality speech when operating at 12 kbps or higher. In error-free conditions, our coder has comparable performance to that of QCELP or GSM coders. For speech in background noise, however, our coder, at 12 kbps, outperforms QCELP significantly, and for music, it outperforms both QCELP and GSM.
  • Keywords
    land mobile radio; quantisation (signal); speech coding; speech intelligibility; variable rate codes; vocoders; 12 kbit/s; Max-Lloyd quantizer; adaptive bit allocation; background noise; bit rates; encoded speech quality; encoding; error-free conditions; high-quality speech; high-quality speech coder; input signal; music; noisy input signals; perceptual metric; perceptually-based variable-rate subband coder; quiet input signals; robust variable-rate speech coder; short-time frames; signal-to-mask ratio; subband energies; subjective listening tests; wireless communication; Bit rate; Filter bank; Finite impulse response filter; GSM; IIR filters; Quantization; Robustness; Speech codecs; Speech enhancement; Testing; Wireless communication
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
  • Conference_Location
    Detroit, MI
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-2431-5
  • Type

    conf

  • DOI
    10.1109/ICASSP.1995.479520
  • Filename
    479520