مرکز منطقه ای اطلاع رساني علوم و فناوري - Audible Noise Reduction in Eigendomain for Speech Enhancement

DocumentCode :

1060270

Title :

Audible Noise Reduction in Eigendomain for Speech Enhancement

Author :

You, Chang Huai ; Rahardja, Susanto ; Koh, Soo Ngee

Author_Institution :

Inst. for Infocomm Res., Singapore

Volume :

Issue :

fYear :

2007

Firstpage :

1753

Lastpage :

1765

Abstract :

A signal subspace scheme based on masking properties is proposed for enhancement of speech degraded by additive noise. Since the masking properties are related to the critical frequency band that is derived from the characteristics of human cochlea, the incorporation of masking threshold into a subspace technique requires the transformation between the frequency and eigen domains. We present and apply an invertible transformation between the frequency and eigen domains. In this paper, we use masking properties of the human auditory system to define the audible noise quantity in the eigendomain. We derive the eigen-decomposition of the estimated speech autocorrelation matrix with the assumption of white noise. Subsequently, an audible noise reduction scheme is developed based on a signal subspace technique, and the implementation of our proposed scheme is outlined. We further extend the scheme to the colored noise case. Simulation results show the superiority of our proposed scheme over other existing subspace methods in terms of segmental signal-to-noise ratio (SNR), perceptual evaluation of speech quality (PESQ), modified Bark spectral distortion (MBSD), spectrogram and informal listening tests.

Keywords :

matrix algebra; speech enhancement; speech intelligibility; additive noise; audible noise reduction; human auditory system; human cochlea; invertible transformation; masking threshold; modified Bark spectral distortion; perceptual evaluation of speech quality; signal subspace technique; signal-to-noise ratio; speech autocorrelation matrix; speech enhancement eigendomain; Acoustic noise; Additive noise; Auditory system; Degradation; Frequency; Humans; Masking threshold; Signal to noise ratio; Speech coding; Speech enhancement; Audible noise reduction; eigen-decomposition; masking properties; signal subspace; speech enhancement;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2007.899288

Filename :

4276768

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1060270