• DocumentCode
    1871254
  • Title
    Speech analysis and coding using a multi-resolution sinusoidal transform
  • Author
    Anderson, David V.
  • Author_Institution
    Sch. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
  • Volume
    2
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    1037
  • Abstract
    The sinusoidal transform, as developed by Quatieri and McAulay (1986), provides a sparse representation for speech signals by taking advantage of psychoacoustic masking. The currently reported work takes the sinusoidal transform one step further by considering the frequency resolution abilities of the human auditory system in more detail. The new transform is based on the wavelet principle of variable resolution in time/frequency analysis. Specifically, a sinusoidal transform is developed which uses quadrature mirror filter (QMF) banks to obtain better time resolution at high frequencies and better frequency resolution at low frequencies. This naturally provides a perceptually improved allocation of the sinusoids. The new transform matches the human auditory system better than its predecessor, and it also matches speech signals well, both fricative sounds and voiced speech. The QMF-based ST is then shown to be equivalent to a more efficient FFT-based implementation.
  • Keywords
    band-pass filters; discrete Fourier transforms; filtering theory; hearing; quadrature mirror filters; signal representation; signal resolution; speech coding; speech intelligibility; speech processing; time-frequency analysis; transform coding; wavelet transforms; DFT; QMF banks; frequency resolution; fricative sounds; high frequencies; human auditory system; low frequencies; multiresolution sinusoidal transform; psychoacoustic masking; quadrature mirror filter banks; sparse representation; speech analysis; speech coding; speech signals; time resolution; time/frequency analysis; variable resolution; voiced speech; wavelet transforms; Auditory system; Frequency; Humans; Mirrors; Psychology; Signal resolution; Speech analysis; Speech coding; Wavelet analysis; Wavelet transforms;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type
    conf
  • DOI
    10.1109/ICASSP.1996.543301
  • Filename
    543301