• DocumentCode
    3494748
  • Title

    Spatial synchronization of audiovisual objects by 3D audio object coding

  • Author

    Gunel, Banu ; Ekmekcioglu, Erhan ; Kondoz, Ahmet M.

  • Author_Institution
    I-Lab. Multimedia Commun. Res., Univ. of Surrey, Guildford, UK
  • fYear
    2010
  • fDate
    4-6 Oct. 2010
  • Firstpage
    460
  • Lastpage
    465
  • Abstract
    Free viewpoint video enables the visualisation of a scene from arbitrary viewpoints and directions. However, this flexibility in video rendering provides a challenge in 3D media for achieving spatial synchronicity between the audio and video objects. When the viewpoint is changed, its effect on the perceived audio scene should be considered to avoid mismatches in the perceived positions of audiovisual objects. Spatial audio coding with such flexibility requires decomposing the sound scene into audio objects initially, and then synthesizing the new scene according to the geometric relations between the A/V capturing setup, selected viewpoint and the rendering system. This paper proposes a free viewpoint audio coding framework for 3D media systems utilising multiview cameras and a microphone array. A real-time source separation technique is used for object decomposition followed by spatial audio coding. Binaural, multichannel sound systems and wave field synthesis systems are addressed. Subjective test results shows that the method achieves spatial synchronicity for various viewpoints consistently, which is not possible by conventional recording techniques.
  • Keywords
    audio coding; audio-visual systems; cameras; data visualisation; microphone arrays; rendering (computer graphics); source separation; synchronisation; video signal processing; 3D audio object coding; 3D media systems; A/V capturing setup; audiovisual objects; binaural multichannel sound systems; free viewpoint video; microphone array; multiview cameras; object decomposition; real-time source separation technique; scene visualisation; sound scene decomposition; spatial audio coding; spatial synchronization; video rendering; wave field synthesis systems; Arrays; Cameras; Loudspeakers; Microphones; Rendering (computer graphics); Three dimensional displays; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia Signal Processing (MMSP), 2010 IEEE International Workshop on
  • Conference_Location
    Saint Malo
  • Print_ISBN
    978-1-4244-8110-1
  • Electronic_ISBN
    978-1-4244-8111-8
  • Type

    conf

  • DOI
    10.1109/MMSP.2010.5662065
  • Filename
    5662065