• DocumentCode
    584523
  • Title

    A Perceptually Motivated Multi-Band Spectral Subtraction Algorithm for Enhancement of Degraded Speech

  • Author

    Upadhyay, N. ; Karmakar, A.

  • Author_Institution
    Dept. of Electr. & Electron. Eng., Birla Inst. of Technol. & Sci., Pilani, India
  • fYear
    2012
  • fDate
    23-25 Nov. 2012
  • Firstpage
    340
  • Lastpage
    345
  • Abstract
    The spectral subtraction method is a classical approach for enhancement of degraded speech. The basic principle of this technique is to estimate the short-time spectral magnitude of speech by subtracting estimated noise from the noisy speech spectrum and to combine it with the phase of the noisy speech. Besides reducing the noise, this method generates an unnatural and unpleasant noise, called remnant noise. The other drawback of this method is that it can work only for white Gaussian noise which has a flat spectrum and is distributed uniformly over the frequency spectrum. But real-world noise is mostly colored and has a non-uniform spectrum. To take care of this kind of noises, spectral subtraction algorithm has been extended to a multi-band case with uniformly spaced frequency bands. In this paper, a perceptually motivated multi-band spectral subtraction algorithm is proposed to enhance the speech signal degraded by colored noise. In the proposed scheme, the whole speech spectrum is divided in different non-uniform bands (N = 6) in accordance to the critical-band rate scale and spectral subtraction is executed independently in each band. The simulation results as well as informal subjective evaluations show that the proposed algorithm reduces remnant noise efficiently and the enhanced speech contains minimal speech distortions with improved signal-to-noise ratio.
  • Keywords
    AWGN; spectral analysis; speech enhancement; colored noise; critical-band rate scale; degraded speech enhancement; flat spectrum; frequency spectrum; informal subjective evaluations; minimal speech distortions; noisy speech spectrum; nonuniform spectrum; perceptually motivated multiband spectral subtraction algorithm; real-world noise; remnant noise; short-time spectral magnitude estimation; signal-to-noise ratio; uniformly spaced frequency bands; unpleasant noise; white Gaussian noise; Algorithm design and analysis; Humans; Noise measurement; Signal to noise ratio; Speech; Speech enhancement; critical-band rate scale; multi-band spectral subtraction; speech enhancement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Communication Technology (ICCCT), 2012 Third International Conference on
  • Conference_Location
    Allahabad
  • Print_ISBN
    978-1-4673-3149-4
  • Type

    conf

  • DOI
    10.1109/ICCCT.2012.75
  • Filename
    6394724