Title :
A detection based approach to robust speech understanding
Author_Institution :
Speech Technol. Group, Microsoft Res., Redmond, WA, USA
Abstract :
Field speech data pose great challenges to statistical modeling because the speech signal is often intermixed with extraneous sounds and other environmental noise either that are too difficult to compensate dynamically or for which it is too expensive to collect sufficient data for proper offline training. We propose a detection based method in which the speech recognizer can sharply tune to only the "meaningful" speech and gracefully ignore the "unwanted" audio segments. The method is designed to be integrated with the frame synchronous search for a single pass processing. In contrast to the conventional keyword spotting techniques, this integration allows the use of the language model for better predicting the detection targets during the search. To study its efficacy, we apply the framework to a spontaneous speech understanding application where cohesive phrases congruent to the domain semantics and application context are used as the salient feature for selective hearing. Experimental results on the effectiveness of the system in dealing with out of domain phrases and other spontaneous speech effects are encouraging.
Keywords :
acoustic noise; audio signal processing; prediction theory; random noise; speech recognition; statistical analysis; audio segments; cohesive phrases; detection targets; domain semantics; environmental noise; extraneous sounds; frame synchronous search; keyword spotting; language model; meaningful speech signal; robust speech understanding; selective hearing; speech recognizer; spontaneous speech understanding; statistical modeling; Auditory system; Automatic speech recognition; Decoding; Design methodology; Noise robustness; Predictive models; Speech coding; Speech processing; Speech recognition; Working environment noise;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1326010