Regional Information Center for Science and Technology - Learning human actions via information maximization

DocumentCode :

2401634

Title :

Learning human actions via information maximization

Author :

Liu, Jingen ; Shah, Mubarak

Author_Institution :

Comput. Vision Lab., Central Florida Univ., Orlando, FL

fYear :

2008

fDate :

23-28 June 2008

Firstpage :

Lastpage :

Abstract :

In this paper, we present a novel approach for automatically learning a compact and yet discriminative appearance-based human action model. A video sequence is represented by a bag of spatiotemporal features called video-words by quantizing the extracted 3D interest points (cuboids) from the videos. Our proposed approach is able to automatically discover the optimal number of video-word clusters by utilizing maximization of mutual information (MMI). Unlike the k-means algorithm, which is typically used to cluster spatiotemporal cuboids into video-words based on their appearance similarity, MMI clustering further groups the video-words, which are highly correlated to some group of actions. To capture the structural information of the learnt optimal video-word clusters, we explore the correlation of the compact video-word clusters. We use the modified correlogram, which is not only translation and rotation invariant, but also somewhat scale invariant. We extensively test our proposed approach on two publicly available challenging datasets: the KTH dataset and the IXMAS multiview dataset. To the best of our knowledge, we are the first to try the bag of video-words related approach on the multiview dataset. We have obtained very impressive results on both datasets.
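
The MMI grouping step described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes a toy co-occurrence table whose rows are video-words and whose columns are action classes, and it uses a plain greedy rule (merge the pair of rows whose merge loses the least mutual information). The function names `mutual_information`, `merge_rows`, and `best_merge` are hypothetical and chosen only for this example.

```python
import math


def mutual_information(counts):
    """Mutual information I(X;Y) in bits from a count table counts[x][y].

    Rows (x) stand for video-word clusters, columns (y) for action classes.
    """
    total = sum(sum(row) for row in counts)
    if total == 0:
        return 0.0
    row_sums = [sum(row) for row in counts]
    col_sums = [sum(col) for col in zip(*counts)]
    mi = 0.0
    for x, row in enumerate(counts):
        for y, c in enumerate(row):
            if c > 0:
                # p(x,y) * log2( p(x,y) / (p(x) * p(y)) )
                ratio = c * total / (row_sums[x] * col_sums[y])
                mi += (c / total) * math.log2(ratio)
    return mi


def merge_rows(counts, i, j):
    """Return a new table with rows i and j summed into one cluster."""
    merged = [a + b for a, b in zip(counts[i], counts[j])]
    kept = [row for k, row in enumerate(counts) if k not in (i, j)]
    return kept + [merged]


def best_merge(counts):
    """Find the pair of rows whose merge loses the least mutual information.

    Returns (loss_in_bits, i, j), or None if fewer than two rows exist.
    """
    base = mutual_information(counts)
    best = None
    for i in range(len(counts)):
        for j in range(i + 1, len(counts)):
            loss = base - mutual_information(merge_rows(counts, i, j))
            if best is None or loss < best[0]:
                best = (loss, i, j)
    return best


# Toy data (made up for illustration): words 0 and 1 occur only with
# action 0, word 2 occurs only with action 1.
counts = [[10, 0], [8, 0], [0, 9]]
loss, i, j = best_merge(counts)
```

In this toy table, words 0 and 1 carry the same information about the action label, so merging them costs essentially nothing, whereas merging either with word 2 would blur the two actions together. Repeating the merge until the loss exceeds a chosen threshold is one simple way a cluster count could be selected; the stopping criterion used in the paper is not stated in this record.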

Keywords :

data analysis; feature extraction; human computer interaction; image sequences; learning (artificial intelligence); spatiotemporal phenomena; video signal processing; IXMAS multiview dataset; KTH dataset; appearance-based human action model; correlogram; human action learning; information maximization; k-means algorithm; learnt optimal video-word clusters; maximization of mutual information; spatiotemporal cuboids; spatiotemporal features; video extraction; video sequence; video-words; Cameras; Clustering algorithms; Computer vision; Data mining; Feature extraction; Humans; Image motion analysis; Spatiotemporal phenomena; Testing; Video sequences;

fLanguage :

English

Publisher :

IEEE

Conference_Titel :

Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on

Conference_Location :

Anchorage, AK

ISSN :

1063-6919

Print_ISBN :

978-1-4244-2242-5

Electronic_ISBN :

1063-6919

Type :

conf

DOI :

10.1109/CVPR.2008.4587723

Filename :

4587723

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2401634