• DocumentCode
    337501
  • Title

    An algorithm for compression of wideband diverse speech and audio signals

  • Author

    Trinkaus, Trevor R. ; Clements, Mark A.

  • Author_Institution
    Center for Signal & Image Process., Georgia Inst. of Technol., Atlanta, GA, USA
  • Volume
    2
  • fYear
    1999
  • fDate
    15-19 Mar 1999
  • Firstpage
    901
  • Abstract
    A compression scheme for diverse speech and audio signals is proposed. In this scheme, signals are analyzed with a 2-band QMF filter bank followed by the application of a modulated lapped biorthogonal transform (MLBT) to each of the filter bank channels. Subsequent encoding of transform coefficients is performed using Laplacian optimized scalar and vector quantizers, whose rates are determined by an estimated noise threshold, i.e., masking threshold. Listening tests show that the coder achieves a quality at 32 kbits/s that is preferred over the ITU G.722 coder at 64 kbits/s, for speech, music, and more diverse signals consisting of speech in the presence of eventful background sounds. Both the delay of the coder, at 40 ms, and the level of complexity are moderate
  • Keywords
    audio coding; channel bank filters; data compression; filtering theory; modulation; music; quadrature mirror filters; signal representation; speech coding; transform coding; vector quantisation; 2-band QMF filter bank; 32 kbit/s; 40 ms; 64 kbit/s; ITU G.722 coder; Laplacian optimized scalar quantizer; Laplacian optimized vector quantizer; algorithm; audio signal compression; background sounds; bit allocation; complexity; delay; estimated noise threshold; filter bank channels; listening tests; masking threshold; modulated lapped biorthogonal transform; music; signal analysis; signal representation; transform coefficients encoding; wideband diverse speech signal compression; Acoustic noise; Acoustic testing; Filter bank; Laplace equations; Masking threshold; Multiple signal classification; Music; Signal analysis; Speech; Wideband;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
  • Conference_Location
    Phoenix, AZ
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-5041-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1999.759817
  • Filename
    759817