DocumentCode :
3484329
Title :
Multiple sound sources localization with perception sensor network
Author :
Quang Nguyen ; JongSuk Choi
Author_Institution :
Univ. of Sci. & Technol. (UST), Seoul, South Korea
fYear :
2013
fDate :
26-29 Aug. 2013
Firstpage :
418
Lastpage :
423
Abstract :
This paper presents a general framework for localization of multiple sound sources in Cartesian coordinate with perception sensor network (PSN). PSN consists of Kinect sensor that has a color camera, a depth camera, and an internal microphone array and our experimental pan-tilt-zoom camera with attached microphone array. Sound localization with PSN is based on three-stage analysis. Short-time narrowband directional sound localization based on Phase difference of arrival (PDOA) is obtained in every time-frequency points, utilizing the sparseness assumption of audio mixtures. Multi-sensor directional localizations are transformed to Cartesian coordinate by a simple triangulation. The results are accumulated in all frequency bins for a block of frames and then clustered to obtain mid-term broadband localization. Furthermore, the framework is able to integrate any Bayesian filtering algorithms for long-term localization. Simulation results with four arrays (each has four microphone) show that the proposed framework successfully localize three simultaneous sources where the distance among sources is about one meter.
Keywords :
Bayes methods; acoustic radiators; acoustic signal processing; audio signal processing; cameras; direction-of-arrival estimation; distributed sensors; filtering theory; microphone arrays; sensor fusion; source separation; Bayesian filtering algorithms; Cartesian coordinate; Kinect sensor; PDOA; PSN; audio mixture sparseness; color camera; depth camera; experimental pan-tilt-zoom camera; internal microphone array; long-term localization; mid-term broadband localization; multiple sound source localization; multisensor directional localizations; perception sensor network; phase difference of arrival; short-time narrowband directional sound localization; three-stage analysis; time-frequency points; Artificial neural networks; Handheld computers; Robot kinematics; Robot sensing systems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
RO-MAN, 2013 IEEE
Conference_Location :
Gyeongju
ISSN :
1944-9445
Type :
conf
DOI :
10.1109/ROMAN.2013.6628515
Filename :
6628515
Link To Document :
بازگشت