DocumentCode :
417181
Title :
A detection based approach to robust speech understanding
Author :
Wang, Kuansan
Author_Institution :
Speech Technol. Group, Microsoft Res., Redmond, WA, USA
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
Field speech data pose great challenges to statistical modeling because the speech signal is often intermixed with extraneous sounds and other environmental noise either that are too difficult to compensate dynamically or for which it is too expensive to collect sufficient data for proper offline training. We propose a detection based method in which the speech recognizer can sharply tune to only the "meaningful" speech and gracefully ignore the "unwanted" audio segments. The method is designed to be integrated with the frame synchronous search for a single pass processing. In contrast to the conventional keyword spotting techniques, this integration allows the use of the language model for better predicting the detection targets during the search. To study its efficacy, we apply the framework to a spontaneous speech understanding application where cohesive phrases congruent to the domain semantics and application context are used as the salient feature for selective hearing. Experimental results on the effectiveness of the system in dealing with out of domain phrases and other spontaneous speech effects are encouraging.
Keywords :
acoustic noise; audio signal processing; prediction theory; random noise; speech recognition; statistical analysis; audio segments; cohesive phrases; detection targets; domain semantics; environmental noise; extraneous sounds; frame synchronous search; keyword spotting; language model; meaningful speech signal; robust speech understanding; selective hearing; speech recognizer; spontaneous speech understanding; statistical modeling; Auditory system; Automatic speech recognition; Decoding; Design methodology; Noise robustness; Predictive models; Speech coding; Speech processing; Speech recognition; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1326010
Filename :
1326010
Link To Document :
بازگشت