Regional Information Center for Science and Technology (RICeST) - Human action recognition using Local Spatio-Temporal Discriminant Embedding

DocumentCode :

2401771

Title :

Human action recognition using Local Spatio-Temporal Discriminant Embedding

Author :

Jia, Kui ; Yeung, Dit-Yan

Author_Institution :

Shenzhen Inst. of Adv. Integration Technol., CAS / CUHK, Shenzhen

fYear :

2008

fDate :

23-28 June 2008

Firstpage :

Lastpage :

Abstract :

Human action video sequences can be considered as nonlinear dynamic shape manifolds in the space of image frames. In this paper, we address learning and classifying human actions on embedded low-dimensional manifolds. We propose a novel manifold embedding method, called Local Spatio-Temporal Discriminant Embedding (LSTDE). The discriminating capabilities of the proposed method are two-fold: (1) for local spatial discrimination, LSTDE projects data points (silhouette-based image frames of human action sequences) in a local neighborhood into the embedding space where data points of the same action class are close while those of different classes are far apart; (2) in such a local neighborhood, each data point has an associated short video segment, which forms a local temporal subspace on the embedded manifold. LSTDE finds an optimal embedding which maximizes the principal angles between those temporal subspaces associated with data points of different classes. Benefiting from the joint spatio-temporal discriminant embedding, our method is potentially more powerful for classifying human actions with similar space-time shapes, and is able to perform recognition on a frame-by-frame or short video segment basis. Experimental results demonstrate that our method can accurately recognize human actions, and can improve the recognition performance over some representative manifold embedding methods, especially on highly confusing human action types.
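The abstract's second criterion relies on principal angles between temporal subspaces. As a rough illustration of that quantity only (not the authors' LSTDE implementation, and not the embedding optimization itself), the sketch below computes principal angles between two subspaces with NumPy. The function name `principal_angles` and the convention that each column of a matrix is one embedded frame of a short video segment are assumptions made for this example; both inputs are assumed to have full column rank.

```python
import numpy as np

def principal_angles(A, B):
    """Return the principal angles (radians, ascending) between span(A) and span(B).

    A, B: arrays of shape (d, k_a) and (d, k_b); columns are assumed to be
    linearly independent (hypothetical layout: one embedded frame per column).
    """
    # Orthonormal bases for the two column spaces (reduced QR).
    Qa, _ = np.linalg.qr(A)
    Qb, _ = np.linalg.qr(B)
    # Singular values of Qa^T Qb are the cosines of the principal angles.
    cosines = np.linalg.svd(Qa.T @ Qb, compute_uv=False)
    # Guard against round-off pushing values slightly outside [-1, 1].
    cosines = np.clip(cosines, -1.0, 1.0)
    return np.arccos(cosines)

if __name__ == "__main__":
    # Two planes in R^3 that share the first coordinate axis.
    plane_xy = np.array([[1.0, 0.0], [0.0, 1.0], [0.0, 0.0]])
    plane_xz = np.array([[1.0, 0.0], [0.0, 0.0], [0.0, 1.0]])
    print(principal_angles(plane_xy, plane_xz))
```

The two planes in the demo share one direction and are orthogonal in the other, so their principal angles are 0 and π/2. Larger angles mean the subspaces are easier to tell apart, which is the sense in which the abstract speaks of maximizing the angles between segments of different action classes.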

Keywords :

gesture recognition; image sequences; video signal processing; LSTDE; embedded low-dimensional manifolds; human action recognition; human actions; learning; local neighborhood; local spatial discrimination; local spatio-temporal discriminant embedding; manifold embedding method; nonlinear dynamic shape manifolds; recognition performance; silhouette-based image frames; spatio-temporal discriminant embedding; video segment; video sequences; Computer vision; Content addressable storage; Humans; Image recognition; Information analysis; Learning systems; Optical computing; Shape; Space technology; Video sequences;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on

Conference_Location :

Anchorage, AK

ISSN :

1063-6919

Print_ISBN :

978-1-4244-2242-5

Electronic_ISBN :

1063-6919

Type :

conf

DOI :

10.1109/CVPR.2008.4587732

Filename :

4587732

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2401771