Missing Feature Speech Recognition using Dereverberation and Echo Suppression in Reverberant Environments

Author

Hyung-Min Park ; Stern, Richard M.

Author_Institution

Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA

Volume

4

fYear

2007

fDate

15-20 April 2007

Abstract

This paper describes an algorithm that efficiently segregates desired speech features from spatially-separated interfering sources in reverberant environments. Although most binaural segregation techniques successfully remove interference components in the absence of reverberation, source segregation in reverberant environments remains a challenging problem. In order to reduce the effects of reverberation, we present a method that dereverberates input signals before they are segregated. The dereverberation filter is estimated from the autocorrelation of the observations and primarily deals with early reflections, while late reflections are effectively suppressed by an inhibitory mechanism that estimates their relative contribution in each time-frequency segment. Information about the salience of the target in a given time-frequency segment based on source separation is combined with the corresponding information based on reverberation suppression through the use of a continually-variable weighting function or mask. Use of the novel reverberation processing results in a relative decrease in WER of 11.5% to 20.9% and use of the combined approaches reduces relative WER by as much as 65.3%.

Keywords

echo suppression; filtering theory; reverberation; speech processing; speech recognition; autocorrelation; continually-variable weighting function; dereverberation filter; echo suppression; feature speech recognition; reverberant environments; source segregation; spatially-separated interfering sources; time-frequency segment; Autocorrelation; Automatic speech recognition; Degradation; Filter bank; Humans; Natural languages; Reverberation; Signal processing; Speech recognition; Time frequency analysis; Speech recognition; binaural processing; dereverberation; missing feature theory; spatial segregation;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on

Conference_Location

Honolulu, HI

ISSN

1520-6149

Print_ISBN

1-4244-0727-3

Type

conf

DOI

10.1109/ICASSP.2007.366929

Filename

4218117