• DocumentCode
    865887
  • Title

    Phase-Based Dual-Microphone Speech Enhancement Using A Prior Speech Model

  • Author

    Shi, Guangji ; Aarabi, Parham ; Jiang, Hui

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of Toronto, Ont.
  • Volume
    15
  • Issue
    1
  • fYear
    2007
  • Firstpage
    109
  • Lastpage
    118
  • Abstract
    This paper proposes a phase-based dual-microphone speech enhancement technique that utilizes a prior speech model. Recently, it has been shown that phase-based dual-microphone filters can result in significant noise reduction in low signal-to-noise ratio (SNR) conditions (less than 10 dB) and negligible distortion at high SNRs (greater than 10 dB), as long as a correct filter parameter is chosen at each SNR. While prior work utilizes a constant parameter for all SNRs, we present an SNR-adaptive filter parameter estimation algorithm that maximizes the likelihood of the enhanced speech features based on a prior speech model. Experimental results using the CARVUI database show significant improvement in speech recognition accuracy over alternative techniques in low-SNR situations (e.g., at 0 dB, an improvement of 11% in word error rate [WER] over postfiltering and 23% over delay-and-sum beamforming) and negligible distortion at high SNRs. The proposed adaptive approach also significantly outperforms the original phase-based filter with a constant parameter. Furthermore, it improves the filter's robustness when there are errors in time delay estimation.
  • Keywords
    adaptive filters; microphones; speech enhancement; speech recognition; SNR-adaptive filter; a prior speech model; delay-and-sum beamforming; low signal-to-noise ratio; parameter estimation algorithm; phase-based dual-microphone speech enhancement; time delay estimation; word error rate; Delay; Error analysis; Filters; Noise reduction; Parameter estimation; Phase distortion; Signal to noise ratio; Spatial databases; Microphone array; phase-error filtering; robust speech recognition; time-frequency masking
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2006.876870
  • Filename
    4032794