• DocumentCode
    394330
  • Title

    An evaluation of adaptive beamformer based on average speech spectrum for noisy speech recognition

  • Author

    Nishiura, Takanobu ; Nakayama, Masato ; Nakamura, Satoshi

  • Author_Institution
    ATR Spoken Language Translation Res. Labs., Kyoto, Japan
  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    Distant-talking speech recognition in noisy environments is indispensable for self-moving robots or teleconference systems. However, background noise and room reverberations seriously degrade the sound-capture quality in real acoustic environments. A microphone array is an ideal candidate as an effective method for capturing distant-talking speech. AMNOR (Adaptive Microphone-array for NOise Reduction) was proposed as an adaptive beamformer for capturing the desired distant signals in noisy environments by Y. Kaneda and J. Ohga (see IEEE Trans. Acoust. Speech Sig. Process., vol.ASSP-34, no.6, p.1391-1400, 1986). Although the AMNOR has been proven effective, it can be further improved if we know the spectrum characteristics of the desired distant signals in advance. Regarding speech as a desired distant signal, we have designed an AMNOR based on the average speech spectrum. We particularly focus on the performance of the proposed AMNOR for distant-talking speech capture and recognition. Evaluation experiments in real acoustic environments confirm that ASR (automatic speech recognition) performance in noisy environments is improved by 5-10% using our AMNOR. In addition, the proposed AMNOR provides better noise reduction performance than that of the conventional AMNOR.
  • Keywords
    acoustic noise; adaptive filters; array signal processing; interference suppression; microphones; random noise; reverberation; spectral analysis; speech recognition; adaptive beamformer; adaptive filter; adaptive microphone-array for noise reduction; automatic speech recognition; average speech spectrum; background noise; distant-talking speech capture; noisy speech recognition; room reverberation; self-moving robots; teleconference systems; Acoustic noise; Automatic speech recognition; Background noise; Microphone arrays; Noise reduction; Robots; Speech analysis; Speech recognition; Teleconferencing; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1198869
  • Filename
    1198869