DocumentCode :
3284433
Title :
Robust speaker direction estimation with particle filtering
Author :
Warsitz, E. ; Haeb-Umbach, Reinhold
Author_Institution :
Dept. of Commun. Eng., Paderborn Univ., Germany
fYear :
2004
fDate :
29 Sept.-1 Oct. 2004
Firstpage :
367
Lastpage :
370
Abstract :
The paper is concerned with binaural signal processing for a bimodal human-robot interface with hearing and vision. The two microphone signals are processed to obtain an enhanced single-channel input signal for the subsequent speech recognizer and to localize the acoustic source, an important piece of information for establishing natural human-robot communication. We utilize a robust adaptive algorithm for filter-and-sum beamforming (FSB) and extract speaker direction information from the resulting FIR filter coefficients. Further, particle filtering is applied to perform nonlinear Bayesian tracking of speaker movement. Good location accuracy can be achieved even in highly reverberant environments. The results obtained outperform the conventional generalized cross-correlation (GCC) method.
Keywords :
filtering theory; microphones; signal processing; speech enhancement; speech recognition; user interfaces; FIR filter coefficient; bimodal human-robot interface; binaural signal processing; enhanced single-channel input signal; filter-and-sum beamforming; generalized cross correlation method; microphone signal; nonlinear Bayesian tracking; particle filtering; robust adaptive algorithm; robust speaker direction estimation; speech recognizer; Acoustic signal processing; Adaptive signal processing; Auditory system; Filtering; Finite impulse response filter; Loudspeakers; Microphones; Robustness; Signal processing; Signal processing algorithms;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Multimedia Signal Processing, 2004 IEEE 6th Workshop on
Print_ISBN :
0-7803-8578-0
Type :
conf
DOI :
10.1109/MMSP.2004.1436569
Filename :
1436569