• DocumentCode
    2855828
  • Title
    Noisy speech recognition with microphone array steering and Fourier/wavelet spectral subtraction
  • Author
    Denda, Yuki ; Nishiura, Takanobu ; Kawahara, H.
  • Author_Institution
    Fac. of Syst. Eng., Wakayama Univ., Japan
  • fYear
    2003
  • fDate
    28 Sept.-1 Oct. 2003
  • Firstpage
    593
  • Lastpage
    596
  • Abstract
    Capturing distant-talking speech with high quality is very important for teleconferencing and voice-controlled systems. Microphone array steering and Fourier spectral subtraction, for example, are ideal candidates for this purpose. A combination technique using both microphone array steering and Fourier spectral subtraction has also been proposed to improve performance. However, although the conventional approach robustly reduces stationary noise, it has difficulty reducing non-stationary noise. To cope with this problem, we propose a new combination technique that uses microphone array steering together with Fourier/wavelet spectral subtraction. Wavelet spectral subtraction promises to reduce non-stationary noise effectively, because the wavelet transform admits a variable time-frequency resolution in each frequency band. Evaluation experiments in a real room confirmed that the proposed combination technique provides better automatic speech recognition (ASR) performance and noise reduction rate (NRR) than the conventional combination technique in both directional and diffuse noise environments.
  • Keywords
    Fourier transform spectra; array signal processing; microphones; spectral analysis; speech recognition; wavelet transforms; Fourier spectral subtraction; automatic speech recognition; distant-talking speech; microphone array steering; noise reduction rate; nonstationary noise; teleconferencing systems; time-frequency resolution; voice-controlled systems; wavelet spectral subtraction; Acoustic noise; Additive noise; Automatic speech recognition; Frequency; Microphone arrays; Noise reduction; Speech recognition; Teleconferencing; Wavelet transforms; Working environment noise;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Statistical Signal Processing, 2003 IEEE Workshop on
  • Print_ISBN
    0-7803-7997-7
  • Type
    conf
  • DOI
    10.1109/SSP.2003.1289545
  • Filename
    1289545
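
The abstract names Fourier spectral subtraction as the conventional building block but does not give its formulation. As a rough illustration only, the sketch below shows a textbook frame-wise magnitude spectral subtraction with overlap-add in NumPy. It is not the authors' method: the function names (`estimate_noise_mag`, `spectral_subtract`), the frame length, hop size, Hann window, and spectral floor are all assumptions made for this example, and neither the microphone array steering stage nor the wavelet variant is covered.

```python
import numpy as np


def estimate_noise_mag(noise_only, frame_len=256, hop=128):
    """Average magnitude spectrum of a noise-only recording (hypothetical helper)."""
    window = np.hanning(frame_len)
    mags = [
        np.abs(np.fft.rfft(noise_only[s:s + frame_len] * window))
        for s in range(0, len(noise_only) - frame_len + 1, hop)
    ]
    return np.mean(mags, axis=0)


def spectral_subtract(noisy, noise_mag, frame_len=256, hop=128, floor=0.01):
    """Textbook magnitude spectral subtraction (illustrative, not the paper's algorithm)."""
    window = np.hanning(frame_len)
    out = np.zeros(len(noisy))
    norm = np.zeros(len(noisy))
    for start in range(0, len(noisy) - frame_len + 1, hop):
        frame = noisy[start:start + frame_len] * window
        spec = np.fft.rfft(frame)
        mag = np.abs(spec)
        phase = np.angle(spec)
        # Subtract the noise estimate, keeping a small fraction of the
        # original magnitude as a floor so no bin becomes negative.
        clean_mag = np.maximum(mag - noise_mag, floor * mag)
        clean = np.fft.irfft(clean_mag * np.exp(1j * phase), n=frame_len)
        # Weighted overlap-add: accumulate windowed frames and window energy.
        out[start:start + frame_len] += clean * window
        norm[start:start + frame_len] += window ** 2
    # Normalize by the accumulated window energy (guarding against zeros).
    return out / np.maximum(norm, 1e-8)
```

A typical use would be to call `estimate_noise_mag` on a stretch of the recording known to contain no speech and pass the result to `spectral_subtract`. Because the noise estimate is a fixed average, this kind of scheme is best suited to stationary noise, which is consistent with the limitation the abstract attributes to the conventional approach.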