DocumentCode :
3296585
Title :
Video Event Detection Using Temporal Pyramids of Visual Semantics with Kernel Optimization and Model Subspace Boosting
Author :
Codella, Noel C F ; Natsev, Apostol ; Hua, Gang ; Hill, Matthew ; Cao, Liangliang ; Gong, Leiguang ; Smith, John R.
Author_Institution :
Multimedia Res. Group, IBM T. J. Watson Res. Center, Hawthorne, NY, USA
fYear :
2012
fDate :
9-13 July 2012
Firstpage :
747
Lastpage :
752
Abstract :
In this study, we present a system for video event classification that generates a temporal pyramid of static visual semantics using minimum-value, maximum-value, and average-value aggregation techniques. Kernel optimization and model subspace boosting are then applied to customize the pyramid for each event. SVM models are independently trained for each level in the pyramid using kernel selection according to 3-fold cross-validation. Kernels that both enforce static temporal order and permit temporal alignment are evaluated. Model subspace boosting is used to select the best combination of pyramid levels and aggregation techniques for each event. The NIST TRECVID Multimedia Event Detection (MED) 2011 dataset was used for evaluation. Results demonstrate that kernel optimizations using both temporally static and dynamic kernels together achieves better performance than any one particular method alone. In addition, model sub-space boosting reduces the size of the model by 80%, while maintaining 96% of the performance gain.
Keywords :
image classification; object detection; optimisation; support vector machines; video signal processing; NIST TRECVID multimedia event detection dataset; SVM models; average-value aggregation techniques; kernel optimization; kernel selection; model subspace boosting; permit temporal alignment; static temporal order; static visual semantic temporal pyramids; video event classification; video event detection; Boosting; Data models; Event detection; Kernel; Multimedia communication; Support vector machines; Visualization; Bipartite; Classification; Event; Kernel; MED; Model; Modeling; Optimization; Pyramid; SVM; Selection; Semantic; TRECVID; Temporal; Video; Visual;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo (ICME), 2012 IEEE International Conference on
Conference_Location :
Melbourne, VIC
ISSN :
1945-7871
Print_ISBN :
978-1-4673-1659-0
Type :
conf
DOI :
10.1109/ICME.2012.190
Filename :
6298492
Link To Document :
بازگشت