• DocumentCode
    2399563
  • Title

    Adaptive wavelet subband coding for music compression

  • Author

    Ferens, K. ; Kinsner, W.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Manitoba Univ., Winnipeg, Man., Canada
  • fYear
    1995
  • fDate
    28-30 Mar 1995
  • Firstpage
    460
  • Abstract
    This paper describes modelling of the coefficient domain in wavelet subbands of wideband audio signals for low-bit rate and high-quality compression. The purpose is to develop models of the perception of wideband audio signals in the wavelet domain. The coefficients in the wavelet subbands are quantized using a scheme that adapts to the subband signal by setting the quantization step size for a particular subband to a size that is inversely proportional to the subband energy, and then, within a subband, by modifying the energy determined step size as inversely proportional to the amplitude probability density of the coefficient. The amplitude probability density of the coefficients in each subband is modelled using learned vector/scalar quantization employing frequency sensitive competitive learning. The source data consists of 1-channel, 16-bit linear data sampled at 44.1 kHz from a CD containing major classical and pop music. Preliminary results show a bit-rate of 150 kbps, rather than 705.6 kbps, with no perceptual loss in quality. The wavelet transform provides better results for representing multifractal signals, such as wide band audio, than do other standard transforms, such as the Fourier transform
  • Keywords
    adaptive signal processing; audio coding; audio discs; audio signals; music; signal representation; unsupervised learning; vector quantisation; wavelet transforms; 150 kbit/s; 16 bit; 44.1 kHz; CD; adaptive wavelet subband coding; amplitude probability density; classical music; frequency sensitive competitive learning; high quality compression; learned vector/scalar quantization; linear data; low bit rate; multifractal signals; music compression; pop music; quantization step size; signal representation; source data; subband energy; wavelet domain; wavelet subbands coefficients; wavelet transform; wideband audio signals; Data compression; Fourier transforms; Fractals; Frequency; Multiple signal classification; Psychoacoustic models; Signal resolution; Wavelet domain; Wavelet transforms; Wideband;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Compression Conference, 1995. DCC '95. Proceedings
  • Conference_Location
    Snowbird, UT
  • ISSN
    1068-0314
  • Print_ISBN
    0-8186-7012-6
  • Type

    conf

  • DOI
    10.1109/DCC.1995.515570
  • Filename
    515570