DocumentCode :
743426
Title :
Distributed IMM-Unscented Kalman Filter for Speaker Tracking in Microphone Array Networks
Author :
Ye Tian ; Zhe Chen ; Fuliang Yin
Author_Institution :
Sch. of Inf. & Commun. Eng., Dalian Univ. of Technol., Dalian, China
Volume :
23
Issue :
10
fYear :
2015
Firstpage :
1637
Lastpage :
1647
Abstract :
In this paper, we first propose a distributed unscented Kalman filter (DUKF) to overcome the nonlinearity of measurement model in speaker tracking. Next, for the different motion dynamics of a speaker in the in-door environment, we introduce the interacting multiple model (IMM) algorithm and propose a distributed interacting multiple model-unscented Kalman filter (IMM-UKF) for estimating time-varying speaker´s positions in a microphone array network. In the distributed IMM-UKF based speaker tracking method, the time difference of arrival (TDOA) of the speech signals received by a pair of microphones at each node is estimated by the generalized cross-correlation (GCC) method, then the distributed IMM-UKF is used to track a speaker whose position and speed significantly vary over time in a microphone array network. The proposed method can estimate speaker´s positions globally in the network and obtain a smoothed trajectory of the speaker´s movement robustly in noisy and reverberant environments, and it is scalable for speaker tracking. Simulation and real-world experiment results reveal the effectiveness of the proposed speaker tracking method.
Keywords :
Kalman filters; microphone arrays; speech processing; DUKF; GCC method; distributed IMM-unscented Kalman filter; generalized cross-correlation method; interacting multiple model algorithm; microphone array networks; speaker tracking; speech signals; time difference of arrival; time-varying speaker positions; Arrays; Heuristic algorithms; Kalman filters; Microphones; Sensors; Speech; Tracking; Distributed unscented Kalman filter (DUKF); interacting multiple model; microphone array network; speaker tracking; time difference of arrival (TDOA);
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
2329-9290
Type :
jour
DOI :
10.1109/TASLP.2015.2442418
Filename :
7118687
Link To Document :
بازگشت