DocumentCode :
17140
Title :
Multichannel Source Separation and Tracking With RANSAC and Directional Statistics
Author :
Traa, Johannes ; Smaragdis, Paris
Author_Institution :
Electr. & Comput. Eng. Dept., Univ. of Illinois at Urbana-Champaign (UIUC), Urbana, IL, USA
Volume :
22
Issue :
12
fYear :
2014
fDate :
Dec. 2014
Firstpage :
2233
Lastpage :
2243
Abstract :
We describe multichannel blind source separation and tracking algorithms based on clustering wrapped interchannel phase difference (IPD) features. We pose the clustering problem as one of multimodal circular-linear regression and present its probabilistic formulation. Phase wrapping due to spatial aliasing is explicitly incorporated by modeling the IPD features as circular variables. We present two methods based on Expectation-Maximization (EM) and a sequential variant of RANdom SAmple Consensus (RANSAC). We show that their strengths can be combined by using RANSAC to initialize EM. The IPD clustering algorithm is applied to separate stationary speakers from a multichannel mixture. We then extend it to the case of moving speakers by tracking their directions-of-arrival with the Factorial Wrapped Kalman Filter (FWKF) using RANSAC as a data preprocessor. Experimental results demonstrate that the proposed methods perform well in the presence of reverberant babble noise and spatial aliasing. The FWKF successfully tracks and separates moving speakers with separation quality comparable to that for stationary speakers.
Keywords :
Kalman filters; blind source separation; direction-of-arrival estimation; expectation-maximisation algorithm; pattern clustering; probability; random processes; regression analysis; signal sampling; EM; FWKF; IPD feature; RANSAC; clustering wrapped interchannel phase difference feature; data preprocessor; directional statistics; directions-of-arrival tracking algorithm; expectation-maximization; factorial wrapped Kalman filter; multichannel blind source separation algorithm; multichannel mixture; multimodal circular-linear regression; phase wrapping; probabilistic formulation; random sample consensus; reverberant babble noise presence; spatial aliasing; stationary speaker separation; Arrays; Clustering algorithms; Microphones; Source separation; Speech; Speech processing; Vectors; Blind source separation (BSS); directional statistics; interchannel phase difference (IPD); wrapped Kalman filter;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
2329-9290
Type :
jour
DOI :
10.1109/TASLP.2014.2365701
Filename :
6939657
Link To Document :
بازگشت