DocumentCode
1721343
Title
Distant Speech Recognition: Bridging the Gaps
Author
McDonough, John ; Wölfel, Matthias
Author_Institution
Spoken Language Syst., Saarland Univ., Saarbrücken
fYear
2008
Firstpage
108
Lastpage
114
Abstract
While great progress has been made in both fields, there is currently a relatively large rift between researchers engaged in acoustic array processing and those engaged in automatic speech recognition. This is unfortunate for many reasons, but most of all because it prevents the two sides, both of whom are investigating different aspects of the same problem, from truly understanding one another and cooperating. In many cases, the two sides see each other through the eyes of strangers. If groundbreaking progress is to be made in the emerging field of distant speech recognition (DSR), this abysmal state of affairs must change. In this work, we outline five pressing problems in the DSR research field, and we make initial proposals for their solutions. The problems discussed here are by no means the only ones that must be solved in order to construct truly effective DSR systems. Nonetheless, their solution, in our view, will represent significant first steps towards this goal, inasmuch as the solution of each of these problems will require a substantial change in the mind-sets and thought patterns of those engaged in this field of research.
Keywords
acoustic arrays; acoustic signal processing; array signal processing; speech recognition; DSR research; acoustic array processing; distant speech recognition; ground breaking progress; Acoustic distortion; Array signal processing; Automatic speech recognition; Eyes; Hidden Markov models; Higher order statistics; Independent component analysis; Microphone arrays; Particle filters; Speech recognition; automatic speech recognition; beamforming; joint denoising and dereverberation; microphone arrays; multi-step linear prediction; particle filter; speech feature enhancement
fLanguage
English
Publisher
IEEE
Conference_Titel
Hands-Free Speech Communication and Microphone Arrays, 2008. HSCMA 2008
Conference_Location
Trento
Print_ISBN
978-1-4244-2337-8
Electronic_ISBN
978-1-4244-2338-5
Type
conf
DOI
10.1109/HSCMA.2008.4538699
Filename
4538699