• DocumentCode
    1610531
  • Title

    Speech and image signal compression with wavelets

  • Author

    Kinsner, W. ; Langi, A.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Manitoba Univ., Winnipeg, Man., Canada
  • fYear
    1993
  • fDate
    6/15/1905 12:00:00 AM
  • Firstpage
    368
  • Lastpage
    375
  • Abstract
    The authors consider time-frequency multiresolution analysis based on wavelets, as it applies to speech/audio and image/video signal compression. They compare the wavelet analysis to the traditional short-window techniques used in signal compression. The performance of the discrete wavelet transform in terms of the bit rates and signal quality is comparable to that for other techniques such as the discrete cosine transform (DCT) for images and code-excited linear predictive coding (CELP) for speech, but with much less computational burden. Experiments with an image and Daubechies´s four-coefficient wavelet show that truncation of wavelet coefficients as high as 90% still produces 30-dB peak signal-to-noise ratio (PSNR) quality. This is better than DCT. In an experiment on a male spoken sentence, the scheme reaches a 12.82-dB segmental signal-to-noise ratio (SEGSNR) at a rate of less than 4.8 kb/s. In comparison, the state-of-the-art CELP coding at 4.8 kbit/s can attain SEGSNR of 10-13 dB. Other experiments with images and Haar two-coefficient wavelet are also highlighted.
  • Keywords
    data compression; image coding; speech coding; time-frequency analysis; wavelet transforms; 4.8 kbit/s; Daubechies´s four-coefficient wavelet; Haar two-coefficient wavelet; discrete wavelet transform; image compression; male spoken sentence; peak signal-to-noise ratio; segmental signal-to-noise ratio; signal compression; speech compression; time-frequency multiresolution analysis; wavelets; Discrete cosine transforms; Discrete wavelet transforms; Image coding; Multiresolution analysis; PSNR; Signal analysis; Speech analysis; Time frequency analysis; Video compression; Wavelet analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    WESCANEX 93. 'Communications, Computers and Power in the Modern Environment.' Conference Proceedings., IEEE
  • Conference_Location
    Saskatoon, Sask., Canada
  • Print_ISBN
    0-7803-1319-4
  • Type

    conf

  • DOI
    10.1109/WESCAN.1993.270520
  • Filename
    270520