  • DocumentCode
    394273
  • Title
    Subband parameter optimization of microphone arrays for speech recognition in reverberant environments
  • Author
    Seltzer, Michael L.; Stern, Richard M.
  • Author_Institution
    Dept. of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA, USA
  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    We present a new subband microphone array processing algorithm specifically designed for speech recognition applications. We previously proposed a speech recognizer-based array processing algorithm that resulted in significant improvements in recognition accuracy when the speech was corrupted by additive noise and moderate levels of reverberation. However, little improvement was achieved over conventional beamforming methods in highly reverberant environments. Subband processing has been used to improve the poor performance of LMS-type algorithms when the number of filter parameters to estimate is large and the noise is highly correlated with the speech signal, e.g., in highly reverberant environments. We apply a subband approach to a new array processing architecture in which selected groups of subbands are processed jointly to maximize the likelihood of the resulting speech recognition features, as measured by the recognition system itself. By incorporating the recognizer into the filter optimization scheme, we ensure that signal components important for recognition are emphasized without undue emphasis on less critical components. By utilizing a subband approach, we can effectively apply this framework to highly reverberant environments. In doing so, we are able to achieve improvements in word error rate of over 20% compared to conventional methods in highly reverberant environments.
  • Keywords
    acoustic transducer arrays; array signal processing; filtering theory; microphones; optimisation; reverberation; speech processing; speech recognition; LMS-type algorithms; additive noise; array processing architecture; beamforming methods; filter optimization; filter parameters; log mel spectrum subband filtering; recognition accuracy; reverberant environments; reverberation; signal components; speech recognition applications; speech signal; subband microphone array processing algorithm; subband parameter optimization; word error rate; Additive noise; Algorithm design and analysis; Array signal processing; Filters; Microphone arrays; Process design; Reverberation; Speech enhancement; Speech processing; Speech recognition;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type
    conf
  • DOI
    10.1109/ICASSP.2003.1198804
  • Filename
    1198804