Title :
Robot audition for dynamic environments
Author :
Nakadai, Kazuhiro ; Ince, Gokhan ; Nakamura, Keisuke ; Nakajima, Hirofumi
Author_Institution :
Honda Res. Inst. Japan Co., Ltd., Wako, Japan
Abstract :
This paper addresses robot audition for dynamic environments, where speakers and/or a robot is moving within a dynamically-changing acoustic environment. Robot Audition studied so far assumed only stationary human-robot interaction scenes, and thus they have difficulties in coping with such dynamic environments. We recently developed new techniques for a robot to listen to several things simultaneously using its own ears even in dynamic environments; MUltiple SIgnal Classification based on Generalized Eigen-Value Decomposition (GEVD-MUSIC), Geometrically constrained High-order Decorrelation based Source Separation with Adaptive Step-size control (GHDSS-AS), Histogram-based Recursive Level Estimation (HRLE), and Template-based Ego Noise Suppression (TENS). GEVD-MUSIC provides noise-robust sound source localization. GHDSS-AS is a new sound source separation method which quickly adapts its sound source separation parameters to dynamic changes. HRLE is a practical post-filtering method with a small number of parameters. ENS estimates the motor noise of the robot by using templates recorded in advance and eliminates it. These methods are implemented as modules for our open-source robot audition software HARK to be easily integrated. We show that each of these methods and their combinations are effective to cope with dynamic environments through off-line experiments and on-line real-time demonstrations.
Keywords :
audio signal processing; eigenvalues and eigenfunctions; human-robot interaction; public domain software; signal classification; source separation; GEVD-MUSIC; GHDSS-AS; HARK; HRLE; TENS; dynamically-changing acoustic environment; geometrically constrained high-order decorrelation based source separation with adaptive step-size control; histogram-based recursive level estimation; multiple signal classification based on generalized eigenvalue decomposition; noise-robust sound source localization; open-source robot audition software; post-filtering method; robot audition; stationary human-robot interaction scenes; template-based ego noise suppression; Abstracts; Multiple signal classification; Robots; Signal to noise ratio; Speech; Dynamic environment; Ego noise suppression; Microphone array; Robot audition; Sound source localization; Sound source separation;
Conference_Titel :
Signal Processing, Communication and Computing (ICSPCC), 2012 IEEE International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4673-2192-1
DOI :
10.1109/ICSPCC.2012.6335729