• DocumentCode
    1049422
  • Title

    Noise Tracking Using DFT Domain Subspace Decompositions

  • Author

    Hendriks, Richard C. ; Jensen, Jesper ; Heusdens, Richard

  • Author_Institution
    Delft Univ. of Technol., Delft
  • Volume
    16
  • Issue
    3
  • fYear
    2008
  • fDate
    3/1/2008 12:00:00 AM
  • Firstpage
    541
  • Lastpage
    553
  • Abstract
    All discrete Fourier transform (DFT) domain-based speech enhancement gain functions rely on knowledge of the noise power spectral density (PSD). Since the noise PSD is unknown in advance, estimation from the noisy speech signal is necessary. An overestimation of the noise PSD will lead to a loss in speech quality, while an underestimation will lead to an unnecessary high level of residual noise. We present a novel approach for noise tracking, which updates the noise PSD for each DFT coefficient in the presence of both speech and noise. This method is based on the eigenvalue decomposition of correlation matrices that are constructed from time series of noisy DFT coefficients. The presented method is very well capable of tracking gradually changing noise types. In comparison to state-of-the-art noise tracking algorithms the proposed method reduces the estimation error between the estimated and the true noise PSD. In combination with an enhancement system the proposed method improves the segmental SNR with several decibels for gradually changing noise types. Listening experiments show that the proposed system is preferred over the state-of-the-art noise tracking algorithm.
  • Keywords
    correlation methods; discrete Fourier transforms; matrix algebra; speech enhancement; DFT domain subspace decompositions; correlation matrices; discrete Fourier transform; domain-based speech enhancement gain functions; enhancement system; noise power spectral density; noise tracking algorithms; residual noise; speech quality; Discrete Fourier transform (DFT) domain subspace decompositions; noise tracking; speech enhancement;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2007.914977
  • Filename
    4441732