Title :
Utilizing spectro-temporal correlations for an improved speech presence probability based noise power estimation
Author :
Krawczyk-Becker, Martin ; Fischer, Dorte ; Gerkmann, Timo
Author_Institution :
Dept. of Med. Phys. & Acoust. & Cluster of Excellence “Hearing4all”, Univ. of Oldenburg, Oldenburg, Germany
Abstract :
For the enhancement of speech degraded by noise, accurate estimation of the noise power spectral density (PSD) is indispensable, especially if only a single microphone signal is available. Fast and accurate tracking of the noise PSD is particularly challenging in highly non-stationary noise types, since the distinction between speech and noise components becomes more difficult. Short-time discrete Fourier transform (STFT) based noise PSD estimation algorithms which employ estimates of the speech presence probability (SPP) with fixed priors have been shown to yield good tracking performance even in adverse noise conditions. In this paper, we compare two methods to incorporate spectro-temporal correlations to improve the tracking performance. The first method smoothes the noisy observation over time and frequency before computing the SPP, while the second is based on a Hidden Markov Model (HMM) of the speech presence and absence states. We show that the proposed modifications lead to improved noise PSD estimators which are less sensitive to spectral outliers of the noise and track changes in the noise PSD more quickly than the reference method. Further, when employed in a common speech enhancement setup, the proposed estimators achieve an increased noise reduction while keeping speech distortions at a comparable level.
Keywords :
discrete Fourier transforms; hidden Markov models; probability; speech enhancement; HMM; SPP; STFT based noise PSD estimation algorithms; hidden Markov model; noise PSD estimators; noise power spectral density; non-stationary noise types; short-time discrete Fourier transform based noise PSD estimation algorithms; spectral outliers; spectro-temporal correlations; speech distortions; speech enhancement setup; speech presence probability; Estimation; Hidden Markov models; Signal to noise ratio; Speech; Speech enhancement; Time-frequency analysis; noise power estimation; noise reduction; speech enhancement;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7177992