Title :
Distant Speech Recognition: Bridging the Gaps
Author :
McDonough, John ; Wölfel, Matthias
Author_Institution :
Spoken Language Syst., Saarland Univ., Saarbrücken
Abstract :
While great progress has been made in both fields, there is currently a relatively large rift between researchers engaged in acoustic array processing and those engaged in automatic speech recognition. This is unfortunate for many reasons, but most of all because it prevents the two sides, both of which are investigating different aspects of the same problem, from truly understanding one another and cooperating. In many cases, the two sides see each other through the eyes of strangers. If groundbreaking progress is to be made in the emerging field of distant speech recognition (DSR), this abysmal state of affairs must change. In this work, we outline five pressing problems in the DSR research field, and we make initial proposals for their solutions. The problems discussed here are by no means the only ones that must be solved in order to construct truly effective DSR systems. Nonetheless, their solution, in our view, will represent significant first steps towards this goal, inasmuch as the solution of each of these problems will require a substantial change in the mind-sets and thought patterns of those engaged in this field of research.
Keywords :
acoustic arrays; acoustic signal processing; array signal processing; speech recognition; DSR research; acoustic array processing; distant speech recognition; ground breaking progress; Acoustic distortion; Array signal processing; Automatic speech recognition; Eyes; Hidden Markov models; Higher order statistics; Independent component analysis; Microphone arrays; Particle filters; Speech recognition; automatic speech recognition; beamforming; joint denoising and dereverberation; microphone arrays; multi-step linear prediction; particle filter; speech feature enhancement;
Conference_Title :
Hands-Free Speech Communication and Microphone Arrays, 2008. HSCMA 2008
Conference_Location :
Trento
Print_ISBN :
978-1-4244-2337-8
Electronic_ISBN :
978-1-4244-2338-5
DOI :
10.1109/HSCMA.2008.4538699