• DocumentCode
    1296052
  • Title
    A Two-Channel Noise Estimator for Speech Enhancement in a Highly Nonstationary Environment
  • Author
    Choi, Min-Seok ; Kang, Hong-Goo
  • Author_Institution
    Sch. of Electr. & Electron. Eng., Yonsei Univ., Seoul, South Korea
  • Volume
    19
  • Issue
    4
  • fYear
    2011
  • fDate
    5/1/2011
  • Firstpage
    905
  • Lastpage
    915
  • Abstract
    This paper proposes a two-channel noise estimator for speech enhancement in a highly nonstationary environment. The estimator uses a spatial filter that can extract noise information even in regions where speech is present. A first-order recursion with time-frequency varying smoothing coefficients is used to accurately estimate the noise power spectral density (PSD) in both slowly and rapidly varying regions. The smoothing coefficients are determined by measuring the nonstationarity factor of the noise, i.e., the degree of noise variation. The nonstationarity factor is derived from a statistical assumption of stationary background noise, so no assumption about the type of nonstationary noise is needed. Because the proposed method estimates the noise PSD efficiently in both stationary and nonstationary regions, speech enhanced by applying the algorithm in a two-channel enhancement system shows better performance than conventional approaches in various noise environments.
  • Keywords
    recursion method; smoothing methods; spatial filters; speech enhancement; time-frequency analysis; background noise; first-order recursion method; noise extraction; noise nonstationarity factor; noise power spectral density; noise variation degree; nonstationary environment; spatial filter; time-frequency varying smoothing coefficient; two-channel enhancement system; two-channel noise estimator; First-order recursion; noise power spectral density (PSD); nonstationarity; smoothing coefficient; two-channel noise estimation
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1558-7916
  • Type
    jour
  • DOI
    10.1109/TASL.2010.2066969
  • Filename
    5549863
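
The first-order recursion with time-frequency varying smoothing coefficients mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `smooth_noise_psd`, the list-of-lists layout, and the first-frame initialisation are assumptions made for illustration, and the smoothing coefficients are taken as given inputs because the record does not state how the paper derives them from the nonstationarity factor.

```python
def smooth_noise_psd(noise_periodogram, alpha):
    """First-order recursive smoothing over time, per frequency bin.

    noise_periodogram[l][k]: instantaneous noise power for frame l, bin k
        (hypothetical input, e.g. squared magnitude of a spatial filter output).
    alpha[l][k]: smoothing coefficient in [0, 1] for frame l, bin k.
        Values near 1 give slow tracking, values near 0 give fast tracking.
    Returns psd[l][k], the smoothed noise PSD estimate.
    """
    if not noise_periodogram:
        return []
    # Assumption: initialise the estimate with the first frame.
    psd = [list(noise_periodogram[0])]
    for l in range(1, len(noise_periodogram)):
        prev = psd[-1]
        # psd(k, l) = alpha(k, l) * psd(k, l-1) + (1 - alpha(k, l)) * power(k, l)
        frame = [
            a * p + (1.0 - a) * x
            for a, p, x in zip(alpha[l], prev, noise_periodogram[l])
        ]
        psd.append(frame)
    return psd


if __name__ == "__main__":
    # One frequency bin whose noise power jumps from 1.0 to 5.0 at frame 2.
    power = [[1.0], [1.0], [5.0]]
    half = [[0.5], [0.5], [0.5]]
    print(smooth_noise_psd(power, half))
```

In this toy input the noise power jumps at the last frame; a coefficient of 0.5 moves the estimate halfway toward the new value, a coefficient of 0 follows the input immediately, and a coefficient of 1 holds the previous estimate. Letting the coefficient vary per frame and per bin is what allows one recursion to cover both slowly and rapidly varying regions.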