DocumentCode :
639363
Title :
Multi-task Sparse Learning with Beta Process Prior for Action Recognition
Author :
Chunfeng Yuan ; Weiming Hu ; Guodong Tian ; Shuang Yang ; Haoran Wang
fYear :
2013
fDate :
23-28 June 2013
Firstpage :
423
Lastpage :
429
Abstract :
In this paper, we formulate human action recognition as a novel Multi-Task Sparse Learning(MTSL) framework which aims to construct a test sample with multiple features from as few bases as possible. Learning the sparse representation under each feature modality is considered as a single task in MTSL. Since the tasks are generated from multiple features associated with the same visual input, they are not independent but inter-related. We introduce a Beta process(BP) prior to the hierarchical MTSL model, which efficiently learns a compact dictionary and infers the sparse structure shared across all the tasks. The MTSL model enforces the robustness in coefficient estimation compared with performing each task independently. Besides, the sparseness is achieved via the Beta process formulation rather than the computationally expensive L1 norm penalty. In terms of non-informative gamma hyper-priors, the sparsity level is totally decided by the data. Finally, the learning problem is solved by Gibbs sampling inference which estimates the full posterior on the model parameters. Experimental results on the KTH and UCF sports datasets demonstrate the effectiveness of the proposed MTSL approach for action recognition.
Keywords :
computer vision; feature extraction; gesture recognition; image representation; inference mechanisms; learning (artificial intelligence); Gibbs sampling inference; KTH sport dataset; UCF sports dataset; beta process prior; coefficient estimation; compact dictionary learning; feature modality; hierarchical MTSL model; human action recognition; learning problem; multitask sparse learning; noninformative gamma hyperpriors; robustness; sparse representation learning; sparse structure inference; visual input; Dictionaries; Histograms; Solid modeling; Training; Vectors; Video sequences; Videos;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on
Conference_Location :
Portland, OR
ISSN :
1063-6919
Type :
conf
DOI :
10.1109/CVPR.2013.61
Filename :
6618905
Link To Document :
بازگشت