Title :
Audio-based nonlinear video diffusion
Author :
Casanovas, Anna Llagostera ; Vandergheynst, Pierre
Author_Institution :
Signal Process. Inst. (LTS2), Ecole Polytech. Fed. de Lausanne (EPFL), Lausanne, Switzerland
Abstract :
We propose a novel non-linear video diffusion approach which is able to focus on parts of a video sequence that are relevant for applications in audio-visual analysis. The diffusion process is controlled by a diffusion coefficient based on an estimate of the synchrony between video motion and audio energy at each point of the video volume. Thus, regions whose motion is not coherent with the soundtrack are iteratively smoothed. The discretization of the proposed continuous diffusion formulation is carefully studied and its stability demonstrated. Our approach is tested in challenging situations involving sequence degradation and distracting video motion. Results show that in all cases our method is able to keep the focus of attention on the sound sources.
Keywords :
audio signal processing; audio-visual systems; image motion analysis; image sequences; video signal processing; audio energy; audio-based nonlinear video diffusion; audio-visual analysis; diffusion coefficient; sequence degradation; sound sources; video motion; video sequence; video volume; Cepstral analysis; Degradation; Diffusion processes; Layout; Motion control; Process control; Signal analysis; Speech recognition; Stability; Testing; Audio-visual processing; finite difference methods; linear/nonlinear diffusion;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5494896