DocumentCode
3018949
Title
Microphone Arrays as Generalized Cameras for Integrated Audio Visual Processing
Author
Donovan, Adam O. ; Duraiswami, Ramani ; Neumann, Jan
Author_Institution
Univ. of Maryland, College Park
fYear
2007
fDate
17-22 June 2007
Firstpage
1
Lastpage
8
Abstract
Combinations of microphones and cameras allow the joint audio visual sensing of a scene. Such arrangements of sensors are common in biological organisms and in applications such as meeting recording and surveillance where both modalities are necessary to provide scene understanding. Microphone arrays provide geometrical information on the source location, and allow the sound sources in the scene to be separated and the noise suppressed, while cameras allow the scene geometry and the location and motion of people and other objects to be estimated. In most previous work the fusion of the audio-visual information occurs at a relatively late stage. In contrast, we take the viewpoint that both cameras and microphone arrays are geometry sensors, and treat the microphone arrays as generalized cameras. We employ computer-vision inspired algorithms to treat the combined system of arrays and cameras. In particular, we consider the geometry introduced by a general microphone array and spherical microphone arrays. The latter show a geometry that is very close to central projection cameras, and we show how standard vision based calibration algorithms can be profitably applied to them. Experiments are presented that demonstrate the usefulness of the considered approach.
Keywords
acoustic signal processing; array signal processing; audio-visual systems; cameras; computer vision; geometry; microphone arrays; sensor fusion; audio-visual information fusion; computer-vision inspired algorithms; generalized cameras; geometrical information; geometry sensors; integrated audio visual processing; microphone arrays; scene geometry; sound sources; source location; Acoustic sensors; Audio recording; Biological systems; Biosensors; Cameras; Geometry; Layout; Microphone arrays; Sensor arrays; Surveillance;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition, 2007. CVPR '07. IEEE Conference on
Conference_Location
Minneapolis, MN
ISSN
1063-6919
Print_ISBN
1-4244-1179-3
Electronic_ISBN
1063-6919
Type
conf
DOI
10.1109/CVPR.2007.383345
Filename
4270343
Link To Document