DocumentCode
867368
Title
Speech Separation of Multiple Moving Speakers Using Multisensor Multitarget Techniques
Author
Potamitis, Ilyas ; Kokkinakis, George
Author_Institution
Dept. of Music Technol. & Acoust., Technol. Educ. Inst. of Crete, Rethymno
Volume
37
Issue
1
fYear
2007
Firstpage
72
Lastpage
81
Abstract
The general problem addressed in this paper is that of separating the voices of active moving speakers in the presence of background noise and moderate reverberation level in the acoustic field using a single microphone array. We adapt the multisensor multitarget tracking theory to the context of microphone arrays in order to form receptive beams that lock on each moving speaker on an extended time basis and therefore, achieve voice separation. Our approach: 1) incorporates kinematical information of speakers´ movement by using an interacting multiple model (IMM) estimator per speaker in order to constrain the evolution of direction of arrival (DOA) measurements, which characterize various motions of the speakers, and 2) can directly account for measurement origin uncertainty, i.e., which measurement comes from which speaker, by using the probabilistic-data-association technique in conjunction with the IMM estimator. The effectiveness of the approach is illustrated by an extensive simulation study on tracking the DOAs of two speakers with crossing trajectories and three static speakers having a conversation with partially overlapping speech and long pauses
Keywords
direction-of-arrival estimation; microphone arrays; sensor fusion; speech processing; target tracking; direction of arrival measurements; multiple moving speakers; multisensor multitarget techniques; probabilistic-data-association technique; single microphone array; speech separation; speech signal processing; Acoustic arrays; Adaptive arrays; Background noise; Direction of arrival estimation; Loudspeakers; Measurement uncertainty; Microphone arrays; Motion estimation; Reverberation; Speech; Microphone array applications; speaker direction of arrival (DOA) tracking; speech signal processing; voice separation;
fLanguage
English
Journal_Title
Systems, Man and Cybernetics, Part A: Systems and Humans, IEEE Transactions on
Publisher
ieee
ISSN
1083-4427
Type
jour
DOI
10.1109/TSMCA.2006.886338
Filename
4032927
Link To Document