DocumentCode
2701816
Title
Missing Feature Speech Recognition using Dereverberation and Echo Suppression in Reverberant Environments
Author
Hyung-Min Park ; Stern, Richard M.
Author_Institution
Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume
4
fYear
2007
fDate
15-20 April 2007
Abstract
This paper describes an algorithm that efficiently segregates desired speech features from spatially-separated interfering sources in reverberant environments. Although most binaural segregation techniques successfully remove interference components in the absence of reverberation, source segregation in reverberant environments remains a challenging problem. In order to reduce the effects of reverberation, we present a method that dereverberates input signals before they are segregated. The dereverberation filter is estimated from the autocorrelation of the observations and primarily deals with early reflections, while late reflections are effectively suppressed by an inhibitory mechanism that estimates their relative contribution in each time-frequency segment. Information about the salience of the target in a given time-frequency segment based on source separation is combined with the corresponding information based on reverberation suppression through the use of a continually-variable weighting function or mask. Use of the novel reverberation processing results in a relative decrease in WER of 11.5% to 20.9% and use of the combined approaches reduces relative WER by as much as 65.3%.
Keywords
echo suppression; filtering theory; reverberation; speech processing; speech recognition; autocorrelation; continually-variable weighting function; dereverberation filter; echo suppression; feature speech recognition; reverberant environments; source segregation; spatially-separated interfering sources; time-frequency segment; Autocorrelation; Automatic speech recognition; Degradation; Filter bank; Humans; Natural languages; Reverberation; Signal processing; Speech recognition; Time frequency analysis; Speech recognition; binaural processing; dereverberation; missing feature theory; spatial segregation;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location
Honolulu, HI
ISSN
1520-6149
Print_ISBN
1-4244-0727-3
Type
conf
DOI
10.1109/ICASSP.2007.366929
Filename
4218117
Link To Document