Title :
Multiple concurrent speaker short-term tracking using a Kalman filter bank
Author :
Oualil, Youssef ; Klakow, Dietrich
Author_Institution :
Spoken Language Syst., Saarland Univ., Saarbrucken, Germany
Abstract :
This paper presents a novel filtering approach for tracking multiple concurrent speakers with a microphone array. In this framework, a Kalman filter bank that evolves in time according to a temporal Hidden Markov Model (HMM) is proposed. This approach was designed to overcome two major problems that occur in spontaneous speech; namely, 1) the speaker overlap. This problem is solved using a bank of parallel Kalman filters that track multiple simultaneous speakers, and 2) the high discontinuity of spontaneous speech caused by short breaks and silences. This is solved using an HMM that allows speakers to change their state (speaking, silent, etc.) over time. The actual active speakers number and locations are extracted from the active filters using a second Kalman filter. Experiments on the AV16.3 showed an average tracking rate improvement of 8% compared to a short-term clustering approach, while being 7 times faster.
Keywords :
Kalman filters; active filters; hidden Markov models; microphone arrays; speaker recognition; AV16.3; HMM; Kalman filter bank; active filters; active speakers locations; active speakers number; microphone array; multiple concurrent speaker short-term tracking; multiple simultaneous speakers; parallel filters; short-term clustering approach; temporal hidden Markov model; Arrays; Bayes methods; Hidden Markov models; Kalman filters; Microphones; Noise; Speech; Kalman filter; Microphone array; hidden Markov model; multiple speaker tracking;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6853836