• DocumentCode
    743426
  • Title

    Distributed IMM-Unscented Kalman Filter for Speaker Tracking in Microphone Array Networks

  • Author

    Ye Tian ; Zhe Chen ; Fuliang Yin

  • Author_Institution
    Sch. of Inf. & Commun. Eng., Dalian Univ. of Technol., Dalian, China
  • Volume
    23
  • Issue
    10
  • fYear
    2015
  • Firstpage
    1637
  • Lastpage
    1647
  • Abstract
    In this paper, we first propose a distributed unscented Kalman filter (DUKF) to overcome the nonlinearity of the measurement model in speaker tracking. Next, to handle the different motion dynamics of a speaker in an indoor environment, we introduce the interacting multiple model (IMM) algorithm and propose a distributed interacting multiple model-unscented Kalman filter (IMM-UKF) for estimating the time-varying positions of a speaker in a microphone array network. In the distributed IMM-UKF based speaker tracking method, the time difference of arrival (TDOA) of the speech signals received by a pair of microphones at each node is estimated by the generalized cross-correlation (GCC) method; the distributed IMM-UKF is then used to track a speaker whose position and speed vary significantly over time in the microphone array network. The proposed method can estimate the speaker's positions globally in the network and robustly obtain a smoothed trajectory of the speaker's movement in noisy and reverberant environments, and it is scalable for speaker tracking. Simulation and real-world experiment results demonstrate the effectiveness of the proposed speaker tracking method.
  • Keywords
    Kalman filters; microphone arrays; speech processing; DUKF; GCC method; distributed IMM-unscented Kalman filter; generalized cross-correlation method; interacting multiple model algorithm; microphone array networks; speaker tracking; speech signals; time difference of arrival; time-varying speaker positions; Arrays; Heuristic algorithms; Kalman filters; Microphones; Sensors; Speech; Tracking; Distributed unscented Kalman filter (DUKF); interacting multiple model; microphone array network; speaker tracking; time difference of arrival (TDOA);
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    IEEE
  • ISSN
    2329-9290
  • Type

    jour

  • DOI
    10.1109/TASLP.2015.2442418
  • Filename
    7118687
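
The abstract states that each node estimates the TDOA between a pair of microphones with the GCC method. The record does not say which GCC weighting the authors use, so the following is only a minimal sketch of the unweighted case (plain cross-correlation with a peak search), not the paper's implementation. The function name `estimate_tdoa` and its parameters (`fs` for the sampling rate in Hz, `max_lag` for the search range in samples) are hypothetical names chosen for illustration.

```python
import math


def estimate_tdoa(x1, x2, fs, max_lag):
    """Estimate the delay of x2 relative to x1 in seconds.

    Assumption: unweighted cross-correlation, the simplest GCC variant.
    A positive result means the signal reaches microphone 2 later.
    """
    n = min(len(x1), len(x2))
    best_lag, best_score = 0, -math.inf
    for lag in range(-max_lag, max_lag + 1):
        # Correlate x1[i] with x2[i + lag] over the overlapping samples.
        score = 0.0
        for i in range(max(0, -lag), min(n, n - lag)):
            score += x1[i] * x2[i + lag]
        # Keep the lag with the highest correlation peak.
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag / fs
```

Multiplying the returned delay by the speed of sound gives the range difference between the two microphones, which is the kind of nonlinear measurement a tracking filter would then consume. A frequency-domain version with a weighting function would be the usual choice for reverberant rooms; the time-domain loop here is kept only for readability.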