• DocumentCode
    1103095
  • Title

    Environment-Optimized Speech Enhancement

  • Author

    Fingscheidt, Tim ; Suhadi, Suhadi ; Stan, Sorel

  • Author_Institution
    Inst. for Commun. Technol., Braunschweig Tech. Univ., Braunschweig
  • Volume
    16
  • Issue
    4
  • fYear
    2008
  • fDate
    5/1/2008 12:00:00 AM
  • Firstpage
    825
  • Lastpage
    834
  • Abstract
    In this paper, we present a training-based approach to speech enhancement that exploits the spectral statistical characteristics of clean speech and noise in a specific environment. In contrast to many state-of-the-art approaches, we do not model the probability density function (pdf) of the clean speech and the noise spectra. Instead, subband-individual weighting rules for noisy speech spectral amplitudes are separately trained for speech presence and speech absence from noise recordings in the environment of interest. Weighting rules for a variety of cost functions are given; they are parameterized and stored as a table look-up. The speech enhancement system simply works by computing the weighting rules from the table look-up indexed by the a posteriori signal-to-noise ratio (SNR) and the a priori SNR for each subband computed on a Bark scale. Optimized for an automotive environment, our approach outperforms known-environment-independent-speech enhancement techniques, namely the a priori SNR-driven Wiener filter and the minimum mean square error (MMSE) log-spectral amplitude estimator, both in terms of speech distortion and noise attenuation.
  • Keywords
    noise (working environment); noise abatement; signal processing; speech enhancement; Bark scale; Wiener filter; clean speech; environment-optimized speech enhancement; log-spectral amplitude estimator; minimum mean square error; noise attenuation; noise recording; noise spectra; noisy speech spectral amplitude; probability density function; signal-to-noise ratio; spectral statistical characteristics; speech distortion; subband-individual weighting rules; table look-up; Noise reduction; speech enhancement;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2008.920062
  • Filename
    4472220