DocumentCode
2545632
Title
A scene-associated training method for mobile robot speech recognition in multisource reverberated environments
Author
Liu, Jindong ; Johns, Edward ; Yang, Guang-Zhong
Author_Institution
The Hamlyn Centre, Imperial College London, UK
fYear
2011
fDate
25-30 Sept. 2011
Firstpage
542
Lastpage
549
Abstract
In this paper, we present a new technique for social mobile robot speech recognition based on scene-associated training models. The key contribution of the paper is a real-time framework that reduces the effect of room reverberation and ambient noise, a challenging problem in speech recognition. In classical approaches, anechoic sound is used to train the model, with the main focus on removing reverberation or noise from the sound. Our technique differs in that we train a number of speech recognizers directly from the reverberated sound, by associating each recognizer with a unique visual scene, to deal with the varying reverberation properties of different rooms. By extracting local features from a captured image and recognizing a scene, the robot can use the appropriate speech recognizer that is trained for the particular structural properties of that scene. We tested our method by using a baseline speech recognition model (HTK) across a variety of rooms and different levels of background noise. The results show that the association between a visual scene and a corresponding speech recognizer greatly improves the robot´s speech recognition accuracy, together with increasing the computational speed of recognition, compared to competing techniques.
Keywords
Databases; Feature extraction; Reverberation; Robots; Speech recognition; Training; Visualization;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Robots and Systems (IROS), 2011 IEEE/RSJ International Conference on
Conference_Location
San Francisco, CA
ISSN
2153-0858
Print_ISBN
978-1-61284-454-1
Type
conf
DOI
10.1109/IROS.2011.6094669
Filename
6094669
Link To Document