مرکز منطقه ای اطلاع رساني علوم و فناوري - Representing Videos Using Mid-level Discriminative Patches

DocumentCode :

639497

Title :

Representing Videos Using Mid-level Discriminative Patches

Author :

Jain, Abhishek ; Gupta, Arpan ; Rodriguez, M. ; Davis, Larry S.

fYear :

2013

fDate :

23-28 June 2013

Firstpage :

2571

Lastpage :

2578

Abstract :

How should a video be represented? We propose a new representation for videos based on mid-level discriminative spatio-temporal patches. These spatio-temporal patches might correspond to a primitive human action, a semantic object, or perhaps a random but informative spatio-temporal patch in the video. What defines these spatio-temporal patches is their discriminative and representative properties. We automatically mine these patches from hundreds of training videos and experimentally demonstrate that these patches establish correspondence across videos and align the videos for label transfer techniques. Furthermore, these patches can be used as a discriminative vocabulary for action classification where they demonstrate state-of-the-art performance on UCF50 and Olympics datasets.

Keywords :

image representation; video signal processing; Olympics datasets; UCF50 datasets; action classification; discriminative vocabulary; informative spatiotemporal patch; label transfer techniques; mid-level discriminative spatiotemporal patches; primitive human action; semantic object; video representation; Measurement; Semantics; Support vector machines; Training; Training data; Vectors; Videos; Action Recognition; Video Understanding;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on

Conference_Location :

Portland, OR

ISSN :

1063-6919

Type :

conf

DOI :

10.1109/CVPR.2013.332

Filename :

6619176

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=639497