DocumentCode
614569
Title
Real-time microphone array processing for sound source separation and localization
Author
Longji Sun ; Qi Cheng
Author_Institution
Sch. of Electr. & Comput. Eng., Oklahoma State Univ., Stillwater, OK, USA
fYear
2013
fDate
20-22 March 2013
Firstpage
1
Lastpage
6
Abstract
In this paper, the problem of sound source separation and localization is studied using a microphone array. A pure delay mixture model which is typical in outdoor environments is adopted. Our proposed approach utilizes the subspace method to estimate the directions of arrival (DOAs) of the sources from the collected mixtures. Since sound signals are generally considered broadband, the DOA estimates for a source at different frequencies are used to approximate the probability density function of the DOA. The maximum likelihood criterion is used to determine the final DOA estimate for the source. Using the estimated DOAs, the corresponding mixing and demixing matrices in the frequency domain are computed, and the source signals are recovered using the inverse short time Fourier transform (STFT). Our algorithm inherits the robustness to noise of the subspace method and also supports real-time implementation. Comprehensive simulations and experiments have been conducted to examine various aspects of the algorithm.
Keywords
Fourier transforms; acoustic generators; acoustic radiators; approximation theory; array signal processing; delays; direction-of-arrival estimation; inverse transforms; matrix algebra; maximum likelihood estimation; microphone arrays; probability; source separation; DOA; STFT; approximation theory; demixing matrix; directions of arrival estimation; frequency domain analysis; inverse short time Fourier transform; maximum likelihood criterion; mixing matrix; outdoor environment; probability density function; pure delay mixture model; real-time microphone array processing; sound signal; sound source localization; sound source separation; subspace method; Arrays; Direction-of-arrival estimation; Frequency estimation; Frequency-domain analysis; Microphones; Signal to noise ratio;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Sciences and Systems (CISS), 2013 47th Annual Conference on
Conference_Location
Baltimore, MD
Print_ISBN
978-1-4673-5237-6
Electronic_ISBN
978-1-4673-5238-3
Type
conf
DOI
10.1109/CISS.2013.6552257
Filename
6552257
Link To Document