Title :
Video Event Detection Using Temporal Pyramids of Visual Semantics with Kernel Optimization and Model Subspace Boosting
Author :
Codella, Noel C F ; Natsev, Apostol ; Hua, Gang ; Hill, Matthew ; Cao, Liangliang ; Gong, Leiguang ; Smith, John R.
Author_Institution :
Multimedia Res. Group, IBM T. J. Watson Res. Center, Hawthorne, NY, USA
Abstract :
In this study, we present a system for video event classification that generates a temporal pyramid of static visual semantics using minimum-value, maximum-value, and average-value aggregation techniques. Kernel optimization and model subspace boosting are then applied to customize the pyramid for each event. SVM models are independently trained for each level in the pyramid using kernel selection according to 3-fold cross-validation. Kernels that both enforce static temporal order and permit temporal alignment are evaluated. Model subspace boosting is used to select the best combination of pyramid levels and aggregation techniques for each event. The NIST TRECVID Multimedia Event Detection (MED) 2011 dataset was used for evaluation. Results demonstrate that kernel optimizations using both temporally static and dynamic kernels together achieves better performance than any one particular method alone. In addition, model sub-space boosting reduces the size of the model by 80%, while maintaining 96% of the performance gain.
Keywords :
image classification; object detection; optimisation; support vector machines; video signal processing; NIST TRECVID multimedia event detection dataset; SVM models; average-value aggregation techniques; kernel optimization; kernel selection; model subspace boosting; permit temporal alignment; static temporal order; static visual semantic temporal pyramids; video event classification; video event detection; Boosting; Data models; Event detection; Kernel; Multimedia communication; Support vector machines; Visualization; Bipartite; Classification; Event; Kernel; MED; Model; Modeling; Optimization; Pyramid; SVM; Selection; Semantic; TRECVID; Temporal; Video; Visual;
Conference_Titel :
Multimedia and Expo (ICME), 2012 IEEE International Conference on
Conference_Location :
Melbourne, VIC
Print_ISBN :
978-1-4673-1659-0
DOI :
10.1109/ICME.2012.190