DocumentCode :
84233
Title :
Multimedia Event Detection Using A Classifier-Specific Intermediate Representation
Author :
Zhigang Ma ; Yi Yang ; Sebe, Nicu ; Kai Zheng ; Hauptmann, Alexander G.
Author_Institution :
Dept. of Inf. Eng. & Comput. Sci., Univ. of Trento, Trento, Italy
Volume :
15
Issue :
7
fYear :
2013
fDate :
Nov. 2013
Firstpage :
1628
Lastpage :
1637
Abstract :
Multimedia event detection (MED) plays an important role in many applications such as video indexing and retrieval. Existing work on event detection focuses mainly on sports and news events or on abnormality detection in surveillance videos. In contrast, our research aims to detect more complicated and generic events within longer video sequences. In the past, researchers have proposed using intermediate concept classifiers with concept lexica to help understand the videos. Yet it is difficult to judge how many concepts, and which ones, would be sufficient for a particular video analysis task. Additionally, obtaining robust semantic concept classifiers requires a large number of positive training examples, which in turn incurs a high human annotation cost. In this paper, we propose an approach that exploits external concept-based videos and event-based videos simultaneously to learn an intermediate representation from video features. Our algorithm integrates the classifier inference and the latent intermediate representation into a joint framework. The joint optimization of the intermediate representation and the classifier makes them mutually beneficial. In effect, the intermediate representation and the classifier are tightly correlated. The classifier-dependent intermediate representation not only accurately reflects the task semantics but is also more suitable for the specific classifier. Thus we have created a discriminative semantic analysis framework based on a tightly coupled intermediate representation. Extensive experiments on multimedia event detection using real-world videos demonstrate the effectiveness of the proposed approach.
Keywords :
image classification; image representation; image sequences; multimedia systems; optimisation; video signal processing; video surveillance; abnormality detection; classifier inference; classifier-specific intermediate representation; discriminative semantic analysis framework; event-based video; external concept-based video; joint optimization; latent intermediate representation; multimedia event detection; news event detection; sports event detection; surveillance video; tightly coupled intermediate representation; video feature; video indexing; video retrieval; video sequence; $p$-norm; intermediate representation
fLanguage :
English
Journal_Title :
Multimedia, IEEE Transactions on
Publisher :
IEEE
ISSN :
1520-9210
Type :
jour
DOI :
10.1109/TMM.2013.2264928
Filename :
6522476