مرکز منطقه ای اطلاع رساني علوم و فناوري - Recognition of Reverberant Speech Using Frequency Domain Linear Prediction

DocumentCode :

978296

Title :

Recognition of Reverberant Speech Using Frequency Domain Linear Prediction

Author :

Thomas, Samuel ; Ganapathy, Sriram ; Hermansky, Hynek

Author_Institution :

IDIAP Res. Inst., Martigny

Volume :

fYear :

2008

fDate :

6/30/1905 12:00:00 AM

Firstpage :

681

Lastpage :

684

Abstract :

Performance of a typical automatic speech recognition (ASR) system severely degrades when it encounters speech from reverberant environments. Part of the reason for this degradation is the feature extraction techniques that use analysis windows which are much shorter than typical room impulse responses. We present a feature extraction technique based on modeling temporal envelopes of the speech signal in narrow subbands using frequency domain linear prediction (FDLP). FDLP provides an all-pole approximation of the Hilbert envelope of the signal obtained by linear prediction on cosine transform of the signal. ASR experiments on speech data degraded with a number of room impulse responses (with varying degrees of distortion) show significant performance improvements for the proposed FDLP features when compared to other robust feature extraction techniques (average relative reduction of 24% in word error rate). Similar improvements are also obtained for far-field data which contain natural reverberation in background noise. These results are achieved without any noticeable degradation in performance for clean speech.

Keywords :

Hilbert transforms; approximation theory; feature extraction; frequency-domain analysis; prediction theory; reverberation; speech recognition; transient response; Hilbert envelope; all-pole approximation; analysis windows; automatic speech recognition; cosine transform; feature extraction techniques; frequency domain linear prediction; reverberant speech recognition; room impulse response; temporal envelope modeling; Automatic speech recognition; Background noise; Degradation; Error analysis; Feature extraction; Frequency domain analysis; Noise robustness; Predictive models; Reverberation; Speech recognition; Automatic speech recognition; frequency domain linear prediction; reverberant speech;

fLanguage :

English

Journal_Title :

Signal Processing Letters, IEEE

Publisher :

ieee

ISSN :

1070-9908

Type :

jour

DOI :

10.1109/LSP.2008.2002708

Filename :

4666765

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=978296