مرکز منطقه ای اطلاع رساني علوم و فناوري - A Perceptually Constrained GSVD-Based Approach for Enhancing Speech Corrupted by Colored Noise

DocumentCode :

865654

Title :

A Perceptually Constrained GSVD-Based Approach for Enhancing Speech Corrupted by Colored Noise

Author :

Ju, Gwo-Hwa ; Lee, Lin-shan

Author_Institution :

Graduate Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei

Volume :

Issue :

fYear :

2007

Firstpage :

119

Lastpage :

134

Abstract :

The singular value decomposition (SVD)-based method for single-channel speech enhancement has been shown to be very useful when the additive noise is white. For colored noise, with this approach, one needs to whiten the noise spectrum prior to SVD-based approach and perform the inverse whitening processing afterwards. A truncated quotient SVD (QSVD)-based approach has been proposed to handle this problem and found very useful. In this paper, a generalized SVD (GSVD)-based subspace approach for speech enhancement is first extended from the concept of the truncated QSVD-based approach, in which the dimension of the signal subspace can be precisely and automatically determined for each frame of the noisy signal. But with this new approach some residual noise is still perceivable under lower signal-to-noise ratio conditions. Therefore a perceptually constrained GSVD (PCGSVD)-based approach is further proposed to incorporate the masking properties of human auditory system to make sure the undesired residual noise to be nearly un-perceivable. Closed-form solutions are obtained for both the GSVD- and PCGSVD-based enhancement approaches. Very carefully performed objective evaluations and subjective listening tests show that the PCGSVD-based approach proposed here can offer improved speech quality, intelligibility and recognition accuracy, whether the noise is stationary or nonstationary, especially when the additive noise is nonwhite

Keywords :

singular value decomposition; speech enhancement; speech intelligibility; speech recognition; colored noise; inverse whitening processing; noise spectrum; noisy signal; perceptually constrained GSVD-based approach; single-channel speech enhancement; singular value decomposition; speech intelligibility; speech quality; speech recognition; undesired residual noise; Additive noise; Auditory system; Closed-form solution; Colored noise; Humans; Performance evaluation; Signal to noise ratio; Singular value decomposition; Speech enhancement; Testing; Auditory masking thresholds; colored noise; generalized singular value decomposition (GSVD); signal subspace; speech enhancement;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2006.876868

Filename :

4032775

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=865654