DocumentCode :
2545632
Title :
A scene-associated training method for mobile robot speech recognition in multisource reverberated environments
Author :
Liu, Jindong ; Johns, Edward ; Yang, Guang-Zhong
Author_Institution :
The Hamlyn Centre, Imperial College London, UK
fYear :
2011
fDate :
25-30 Sept. 2011
Firstpage :
542
Lastpage :
549
Abstract :
In this paper, we present a new technique for social mobile robot speech recognition based on scene-associated training models. The key contribution of the paper is a real-time framework that reduces the effect of room reverberation and ambient noise, a challenging problem in speech recognition. In classical approaches, anechoic sound is used to train the model, with the main focus on removing reverberation or noise from the sound. Our technique differs in that we train a number of speech recognizers directly from the reverberated sound, by associating each recognizer with a unique visual scene, to deal with the varying reverberation properties of different rooms. By extracting local features from a captured image and recognizing a scene, the robot can use the appropriate speech recognizer that is trained for the particular structural properties of that scene. We tested our method by using a baseline speech recognition model (HTK) across a variety of rooms and different levels of background noise. The results show that the association between a visual scene and a corresponding speech recognizer greatly improves the robot's speech recognition accuracy, together with increasing the computational speed of recognition, compared to competing techniques.
Keywords :
Databases; Feature extraction; Reverberation; Robots; Speech recognition; Training; Visualization;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Intelligent Robots and Systems (IROS), 2011 IEEE/RSJ International Conference on
Conference_Location :
San Francisco, CA
ISSN :
2153-0858
Print_ISBN :
978-1-61284-454-1
Type :
conf
DOI :
10.1109/IROS.2011.6094669
Filename :
6094669