مرکز منطقه ای اطلاع رساني علوم و فناوري - Learning sparse generative models of audiovisual signals

DocumentCode :

700023

Title :

Learning sparse generative models of audiovisual signals

Author :

Monaci, Gianluca ; Sommer, Friedrich T. ; Vandergheynst, Pierre

Author_Institution :

Redwood Center for Theor. Neurosci., Univ. of California Berkeley, Berkeley, CA, USA

fYear :

2008

fDate :

25-29 Aug. 2008

Firstpage :

Lastpage :

Abstract :

This paper presents a novel framework to learn sparse representations for audiovisual signals. An audiovisual signal is modeled as a sparse sum of audiovisual kernels. The kernels are bimodal functions made of synchronous audio and video components that can be positioned independently and arbitrarily in space and time. We design an algorithm capable of learning sets of such audiovisual, synchronous, shift-invariant functions by alternatingly solving a coding and a learning procedure. The proposed methodology is used to learn audiovisual features from a set of bimodal sequences. The basis functions that emerge are audio-video pairs that capture salient data structures.

Keywords :

audio coding; audio-visual systems; learning (artificial intelligence); video coding; audiovisual kernel; audiovisual signal modeling; bimodal function; bimodal sequences; coding procedure; learning sparse generative model; salient data structures; shift invariant functions; sparse representation; synchronous audio components; synchronous video components; Dictionaries; Encoding; Kernel; Matching pursuit algorithms; Signal processing; Signal processing algorithms; Visualization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Signal Processing Conference, 2008 16th European

Conference_Location :

Lausanne

ISSN :

2219-5491

Type :

conf

Filename :

7080555

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=700023