DocumentCode :
1236847
Title :
Video Event Recognition Using Kernel Methods with Multilevel Temporal Alignment
Author :
Xu, Dong ; Chang, Shih-Fu
Author_Institution :
Nanyang Technol. Univ., Singapore
Volume :
30
Issue :
11
fYear :
2008
Firstpage :
1985
Lastpage :
1997
Abstract :
In this work, we systematically study the problem of event recognition in unconstrained news video sequences. We adopt the discriminative kernel-based method for which video clip similarity plays an important role. First, we represent a video clip as a bag of orderless descriptors extracted from all of the constituent frames and apply the earth mover´s distance (EMD) to integrate similarities among frames from two clips. Observing that a video clip is usually comprised of multiple subclips corresponding to event evolution over time, we further build a multilevel temporal pyramid. At each pyramid level, we integrate the information from different subclips with Integer-value-constrained EMD to explicitly align the subclips. By fusing the information from the different pyramid levels, we develop temporally aligned pyramid matching (TAPM) for measuring video similarity. We conduct comprehensive experiments on the TRECVID 2005 corpus, which contains more than 6,800 clips. Our experiments demonstrate that (1) the TAPM multilevel method clearly outperforms single-level EMD (SLEMD) and (2) SLEMD outperforms keyframe and multiframe-based detection methods by a large margin. In addition, we conduct in-depth investigation of various aspects of the proposed techniques such as weight selection in SLEMD, sensitivity to temporal clustering, the effect of temporal alignment, and possible approaches for speedup. Extensive analysis of the results also reveals intuitive interpretation of video event recognition through video subclip alignment at different levels.
Keywords :
image matching; image sequences; video signal processing; Integer-value-constrained EMD; discriminative kernel-based method; event recognition; kernel methods; multilevel temporal alignment; multilevel temporal pyramid; temporal alignment; temporal clustering; temporally aligned pyramid matching; unconstrained news video sequences; video event recognition; Concept Ontology; Concept-based Video Indexing; Earth Mover´s Distance; Event Recognition; News Video; Temporally Aligned Pyramid Matching; Algorithms; Artificial Intelligence; Image Enhancement; Image Interpretation, Computer-Assisted; Information Storage and Retrieval; Pattern Recognition, Automated; Subtraction Technique; Video Recording;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2008.129
Filename :
4531742
Link To Document :
بازگشت