DocumentCode :
1103095
Title :
Environment-Optimized Speech Enhancement
Author :
Fingscheidt, Tim ; Suhadi, Suhadi ; Stan, Sorel
Author_Institution :
Inst. for Commun. Technol., Braunschweig Tech. Univ., Braunschweig
Volume :
16
Issue :
4
fYear :
2008
fDate :
5/1/2008 12:00:00 AM
Firstpage :
825
Lastpage :
834
Abstract :
In this paper, we present a training-based approach to speech enhancement that exploits the spectral statistical characteristics of clean speech and noise in a specific environment. In contrast to many state-of-the-art approaches, we do not model the probability density function (pdf) of the clean speech and the noise spectra. Instead, subband-individual weighting rules for noisy speech spectral amplitudes are separately trained for speech presence and speech absence from noise recordings in the environment of interest. Weighting rules for a variety of cost functions are given; they are parameterized and stored as a table look-up. The speech enhancement system simply works by computing the weighting rules from the table look-up indexed by the a posteriori signal-to-noise ratio (SNR) and the a priori SNR for each subband computed on a Bark scale. Optimized for an automotive environment, our approach outperforms known-environment-independent-speech enhancement techniques, namely the a priori SNR-driven Wiener filter and the minimum mean square error (MMSE) log-spectral amplitude estimator, both in terms of speech distortion and noise attenuation.
Keywords :
noise (working environment); noise abatement; signal processing; speech enhancement; Bark scale; Wiener filter; clean speech; environment-optimized speech enhancement; log-spectral amplitude estimator; minimum mean square error; noise attenuation; noise recording; noise spectra; noisy speech spectral amplitude; probability density function; signal-to-noise ratio; spectral statistical characteristics; speech distortion; subband-individual weighting rules; table look-up; Noise reduction; speech enhancement;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2008.920062
Filename :
4472220
Link To Document :
بازگشت