DocumentCode :
2287675
Title :
Action detection in complex scenes with spatial and temporal ambiguities
Author :
Hu, Yuxiao ; Cao, Liangliang ; Lv, Fengjun ; Yan, Shuicheng ; Gong, Yihong ; Huang, Thomas S.
Author_Institution :
Dept. of ECE, UIUC, Singapore, Singapore
fYear :
2009
fDate :
Sept. 29 2009-Oct. 2 2009
Firstpage :
128
Lastpage :
135
Abstract :
In this paper, we investigate the detection of semantic human actions in complex scenes. Unlike conventional action recognition in well-controlled environments, action detection in complex scenes suffers from cluttered backgrounds, heavy crowds, occluded bodies, and spatial-temporal boundary ambiguities caused by imperfect human detection and tracking. Conventional algorithms are likely to fail with such spatial-temporal ambiguities. In this work, the candidate regions of an action are treated as a bag of instances. Then a novel multiple-instance learning framework, named SMILE-SVM (Simulated annealing Multiple Instance LEarning Support Vector Machines), is presented for learning human action detector based on imprecise action locations. SMILE-SVM is extensively evaluated with satisfactory performances on two tasks: (1) human action detection on a public video action database with cluttered backgrounds, and (2) a real world problem of detecting whether the customers in a shopping mall show an intention to purchase the merchandise on shelf (even if they didn´t buy it eventually). In addition, the complementary nature of motion and appearance features in action detection are also validated, demonstrating a boosted performance in our experiments.
Keywords :
object detection; semantic networks; simulated annealing; support vector machines; SMILE-SVM; cluttered backgrounds; human action detection; occluded bodies; real world problem; semantic human actions detection; simulated annealing multiple instance learning support vector machines; spatial-temporal boundary ambiguities; video action database; Computer vision; Detectors; Humans; Layout; Machine learning; Merchandise; Performance evaluation; Simulated annealing; Spatial databases; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision, 2009 IEEE 12th International Conference on
Conference_Location :
Kyoto
ISSN :
1550-5499
Print_ISBN :
978-1-4244-4420-5
Electronic_ISBN :
1550-5499
Type :
conf
DOI :
10.1109/ICCV.2009.5459153
Filename :
5459153
Link To Document :
بازگشت