DocumentCode :
1083496
Title :
Binaural Tracking of Multiple Moving Sources
Author :
Roman, Nicoleta ; Wang, DeLiang
Author_Institution :
Dept. of Math., Ohio State Univ., Lima, OH
Volume :
16
Issue :
4
fYear :
2008
fDate :
5/1/2008 12:00:00 AM
Firstpage :
728
Lastpage :
739
Abstract :
This paper addresses the problem of tracking multiple moving sources using binaural input. We observe that binaural cues are strongly correlated with source locations in time-frequency regions dominated by only one source. Based on this observation, we propose a novel tracking algorithm that integrates probabilities across reliable frequency channels in order to produce a likelihood function in the target space, which describes the azimuths of all active sources at a particular time frame. Finally, a hidden Markov model (HMM) is employed to form continuous tracks and automatically detect the number of active sources across time. Results are presented for up to three moving talkers in anechoic conditions. A comparison shows that our HMM model outperforms a Kalman filter-based approach in tracking active sources across time. Our study represents a first step in addressing auditory scene analysis with moving sound sources.
Keywords :
Kalman filters; hidden Markov models; tracking filters; Kalman filter; active source tracking; anechoic condition; auditory scene analysis; binaural cues; binaural tracking; hidden Markov model; likelihood function; moving sound source; moving talker; multiple moving source tracking; reliable frequency channel; tracking algorithm; Binaural processing; hidden Markov model (HMM); moving source tracking; multisource tracking;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2008.918978
Filename :
4457928
Link To Document :
بازگشت