• DocumentCode
    3018949
  • Title
    Microphone Arrays as Generalized Cameras for Integrated Audio Visual Processing
  • Author
    O'Donovan, Adam ; Duraiswami, Ramani ; Neumann, Jan
  • Author_Institution
    Univ. of Maryland, College Park
  • fYear
    2007
  • fDate
    17-22 June 2007
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Combinations of microphones and cameras allow the joint audio-visual sensing of a scene. Such arrangements of sensors are common in biological organisms and in applications such as meeting recording and surveillance, where both modalities are necessary to provide scene understanding. Microphone arrays provide geometrical information on the source location, and allow the sound sources in the scene to be separated and the noise suppressed, while cameras allow the scene geometry and the location and motion of people and other objects to be estimated. In most previous work the fusion of the audio-visual information occurs at a relatively late stage. In contrast, we take the viewpoint that both cameras and microphone arrays are geometry sensors, and treat the microphone arrays as generalized cameras. We employ computer-vision-inspired algorithms to treat the combined system of arrays and cameras. In particular, we consider the geometry introduced by a general microphone array and by spherical microphone arrays. The latter show a geometry that is very close to that of central projection cameras, and we show how standard vision-based calibration algorithms can be profitably applied to them. Experiments are presented that demonstrate the usefulness of the considered approach.
  • Keywords
    acoustic signal processing; array signal processing; audio-visual systems; cameras; computer vision; geometry; microphone arrays; sensor fusion; audio-visual information fusion; computer-vision inspired algorithms; generalized cameras; geometrical information; geometry sensors; integrated audio visual processing; microphone arrays; scene geometry; sound sources; source location; Acoustic sensors; Audio recording; Biological systems; Biosensors; Cameras; Geometry; Layout; Microphone arrays; Sensor arrays; Surveillance;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer Vision and Pattern Recognition, 2007. CVPR '07. IEEE Conference on
  • Conference_Location
    Minneapolis, MN
  • ISSN
    1063-6919
  • Print_ISBN
    1-4244-1179-3
  • Electronic_ISBN
    1063-6919
  • Type
    conf
  • DOI
    10.1109/CVPR.2007.383345
  • Filename
    4270343