• DocumentCode
    3018915
  • Title
    Harmony in Motion
  • Author
    Barzelay, Zohar; Schechner, Yoav Y.
  • Author_Institution
    Technion - Israel Inst. of Technol., Haifa
  • fYear
    2007
  • fDate
    17-22 June 2007
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Cross-modal analysis offers information beyond that extracted from individual modalities. Consider a camcorder having a single microphone in a cocktail-party: it captures several moving visual objects which emit sounds. A task for audio-visual analysis is to identify the number of independent audio-associated visual objects (AVOs), pinpoint the AVOs' spatial locations in the video and isolate each corresponding audio component. Part of these problems were considered by prior studies, which were limited to simple cases, e.g., a single AVO or stationary sounds. We describe an approach that seeks to overcome these challenges. It acknowledges the importance of temporal features that are based on significant changes in each modality. A probabilistic formalism identifies temporal coincidences between these features, yielding cross-modal association and visual localization. This association is of particular benefit in harmonic sounds, as it enables subsequent isolation of each audio source. We demonstrate this in challenging experiments, having multiple, simultaneous highly nonstationary AVOs.
  • Keywords
    audio signal processing; computer vision; feature extraction; probability; video signal processing; associated visual object; audio-visual analysis; cross-modal analysis; probabilistic formalism; visual localization; Cameras; Data mining; Hardware; Independent component analysis; Information analysis; Microphone arrays; Motion pictures; Object recognition; Video equipment
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer Vision and Pattern Recognition, 2007. CVPR '07. IEEE Conference on
  • Conference_Location
    Minneapolis, MN
  • ISSN
    1063-6919
  • Print_ISBN
    1-4244-1179-3
  • Electronic_ISBN
    1063-6919
  • Type
    conf
  • DOI
    10.1109/CVPR.2007.383344
  • Filename
    4270342