DocumentCode
1721617
Title
On the Use of Empirically Determined Impulse Responses for Improving Distant Talking Speech Recognition
Author
Plötz, Thomas ; Fink, Gernot A.
Author_Institution
Robot. Res. Inst., Dortmund Univ. of Technol., Dortmund
fYear
2008
Firstpage
156
Lastpage
159
Abstract
Recognition rates of distant talking speech recognition applications substantially decrease if the acoustic environment contains reverberation. Although standard approaches for compensating such distortions, e.g. cepstral mean subtraction (CMS), are quite effective, they are not appropriate for dynamic human machine interaction. When only short portions of speech are uttered by speakers at different positions, compensation methods fail that require several seconds of speech. For this kind of applications we present a dereverberation approach utilizing empirically determined impulse responses. Prior to speaking users are asked to produce some impulse-like signal (clapping their hands, or snipping the fingers) which is used for compensation. By means of an experimental evaluation on the German Verbmobil corpus we demonstrate the promising potential of the approach.
Keywords
compensation; reverberation; speech recognition; transient response; German Verbmobil corpus; acoustic environment; compensation method; dereverberation approach; distant talking speech recognition; impulse response; Acoustic distortion; Automatic speech recognition; Cepstral analysis; Collision mitigation; Feature extraction; Filtering; Loudspeakers; Reverberation; Speech enhancement; Speech recognition; De-reverberation; cepstral mean subtraction; distant talking speech recognition; impulse responses;
fLanguage
English
Publisher
ieee
Conference_Titel
Hands-Free Speech Communication and Microphone Arrays, 2008. HSCMA 2008
Conference_Location
Trento
Print_ISBN
978-1-4244-2337-8
Electronic_ISBN
978-1-4244-2338-5
Type
conf
DOI
10.1109/HSCMA.2008.4538710
Filename
4538710
Link To Document