Title :
Audible Noise Reduction in Eigendomain for Speech Enhancement
Author :
You, Chang Huai ; Rahardja, Susanto ; Koh, Soo Ngee
Author_Institution :
Inst. for Infocomm Res., Singapore
Abstract :
A signal subspace scheme based on masking properties is proposed for enhancement of speech degraded by additive noise. Since the masking properties are related to the critical frequency band that is derived from the characteristics of human cochlea, the incorporation of masking threshold into a subspace technique requires the transformation between the frequency and eigen domains. We present and apply an invertible transformation between the frequency and eigen domains. In this paper, we use masking properties of the human auditory system to define the audible noise quantity in the eigendomain. We derive the eigen-decomposition of the estimated speech autocorrelation matrix with the assumption of white noise. Subsequently, an audible noise reduction scheme is developed based on a signal subspace technique, and the implementation of our proposed scheme is outlined. We further extend the scheme to the colored noise case. Simulation results show the superiority of our proposed scheme over other existing subspace methods in terms of segmental signal-to-noise ratio (SNR), perceptual evaluation of speech quality (PESQ), modified Bark spectral distortion (MBSD), spectrogram and informal listening tests.
Keywords :
matrix algebra; speech enhancement; speech intelligibility; additive noise; audible noise reduction; human auditory system; human cochlea; invertible transformation; masking threshold; modified Bark spectral distortion; perceptual evaluation of speech quality; signal subspace technique; signal-to-noise ratio; speech autocorrelation matrix; speech enhancement eigendomain; Acoustic noise; Additive noise; Auditory system; Degradation; Frequency; Humans; Masking threshold; Signal to noise ratio; Speech coding; Speech enhancement; Audible noise reduction; eigen-decomposition; masking properties; signal subspace; speech enhancement;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2007.899288