• DocumentCode
    310635
  • Title

    Perceptual speech coding using time and frequency masking constraints

  • Author

    Garnero, B. ; Drygajlo, Andrzej

  • Author_Institution
    Signal Process. Lab., Swiss Fed. Inst. of Technol., Lausanne, Switzerland
  • Volume
    2
  • fYear
    1997
  • fDate
    21-24 Apr 1997
  • Firstpage
    1363
  • Abstract
    This paper presents a new wide-band speech coding system based on a fast wavelet packet transform algorithm as well as a formulation of temporal and spectral psychoacoustic models of masking. The proposed FFT-like overlapped block orthogonal transform allows us to approximate the auditory critical band decomposition in an efficient manner, which is a major advantage over previous approaches that used uniform filter banks. As a result of such a decomposition, the perceptually tuned time-frequency structure of the original speech signal is preserved. This allows us to make use of the temporal and spectral properties of the human auditory system to decrease the average bit rate of the encoder, while perceptually hiding the quantization error
  • Keywords
    hearing; spectral analysis; speech coding; speech intelligibility; speech processing; time-frequency analysis; transform coding; wavelet transforms; FFT; auditory critical band decomposition; average bit rate reduction; fast wavelet packet transform algorithm; frequency masking constraints; human auditory system; overlapped block orthogonal transform; perceptual speech coding; perceptually tuned time-frequency structure; quantization error; spectral properties; spectral psychoacoustic models; speech signal; temporal properties; temporal psychoacoustic models; time masking constraints; wideband speech coding system; Auditory system; Bit rate; Filter bank; Humans; Psychoacoustic models; Speech coding; Time frequency analysis; Wavelet packets; Wavelet transforms; Wideband;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • Conference_Location
    Munich
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1997.596200
  • Filename
    596200