Title :
Bridging the Gap: Towards a Unified Framework for Hands-Free Speech Recognition Using Microphone Arrays
Author :
Seltzer, Michael L.
Author_Institution :
Microsoft Res., Redmond, WA
Abstract :
In this paper we describe two families of algorithms for hands-free speech recognition using microphone arrays. Enhancement-based approaches use a cascade of independent processing blocks to perform speech enhancement followed by speech recognition. We discuss the reasons why this approach may be sub-optimal and motivate the need for a solution that tightly integrates all processing blocks into a common unified framework. This leads to a second family of algorithms called unified approaches which considers all processing stages to be components of a single system that operates with the common goal of improved recognition accuracy. We describe several examples of such algorithms that have been shown to outperform more traditional signal-processing-based approaches. In doing so, we hope to convey the benefits of performing hands-free speech recognition in this manner and motivate further research in this area.
Keywords :
array signal processing; microphone arrays; speech enhancement; speech recognition; hands-free speech recognition; microphone arrays; speech enhancement; Array signal processing; Automatic speech recognition; Decoding; Distortion; Feature extraction; Microphone arrays; Reverberation; Speech enhancement; Speech recognition; Working environment noise; beamforming; microphone array processing; speech recognition;
Conference_Titel :
Hands-Free Speech Communication and Microphone Arrays, 2008. HSCMA 2008
Conference_Location :
Trento
Print_ISBN :
978-1-4244-2337-8
Electronic_ISBN :
978-1-4244-2338-5
DOI :
10.1109/HSCMA.2008.4538698