Title :
Robust hands-free Automatic Speech Recognition for human-machine interaction
Author :
Gomez, Randy ; Kawahara, Tatsuya ; Nakadai, Kazuhiro
Author_Institution :
Acad. Center for Comput. & Media Studies, Kyoto Univ., Kyoto, Japan
Abstract :
In enclosed environments where robots are deployed, the observed speech signal is smeared due to reverberation. This degrades the performance of the automatic speech recognition (ASR). Thus, hands-free speech recognition for human-machine communication is a difficult task. Most speech enhancement techniques used to address this problem enhance the contaminated waveform independent from that of the ASR. However, this approach does not necessarily improve ASR performance. In this paper, we expand the conventional spectral subtraction-based (SS) technique to deal with reverberation. In our proposed approach, the dereverberation parameters of SS are optimized to improve the likelihood of the acoustic model and not just the waveform signal. The system is capable of adaptively fine-tuning these parameters jointly with acoustic model training for effective use in ASR application. We have experimented using real reverberant data collected from an operational robot. Moreover, we also evaluated with reverberant data corrupted with environmental and robot internal noise. Experimental results show that the proposed method significantly improves the recognition performance over conventional approach.
Keywords :
acoustic signal processing; human-robot interaction; optimisation; speech recognition; acoustic model; automatic speech recognition; human-machine interaction; parameter optimization; spectral subtraction technique; Acoustics; Hidden Markov models; Noise; Optimization; Speech; Speech recognition; Training;
Conference_Titel :
Humanoid Robots (Humanoids), 2010 10th IEEE-RAS International Conference on
Conference_Location :
Nashville, TN
Print_ISBN :
978-1-4244-8688-5
Electronic_ISBN :
978-1-4244-8689-2
DOI :
10.1109/ICHR.2010.5686828