DocumentCode
2453877
Title
Detecting Generic Visual Eventswith Temporal Cues
Author
Xie, Lexing ; Xu, Dong ; Ebadollahi, Shahram ; Scheinberg, Katya ; Chang, Shih-Fu ; Smith, John R.
Author_Institution
IBM T. J. Watson Res. Center, New York, NY
fYear
2006
fDate
Oct. 29 2006-Nov. 1 2006
Firstpage
54
Lastpage
58
Abstract
We present novel algorithms for detecting generic visual events from video. Target event models will produce binary decisions on each shot about classes of events involving object actions and their interactions with the scene, such as airplane taking off, exiting car, riot. While event detection has been studied in scenarios with strong scene and imaging assumptions, the detection of generic visual events from an unconstrained domain such as broadcast news has not been explored. This work extends our recent work on event detection by (1) using a novel bag-of-features representation along with the earth movers´ distance to account for the temporal variations within a shot, (2) learn the importance among input modalities with a double-convex combination along both different kernels and different support vectors, which is in turn solved via multiple kernel learning. Experiments show that the bag-of-features representation significantly outperforms the static baseline; multiple kernel learning yields promising performance improvement while providing intuitive explanations for the importance of the input kernels.
Keywords
video signal processing; bag-of-features representation; binary decisions; double-convex combination; generic visual events; input modalities; multiple kernel learning; target event models; temporal cues; Airplanes; Computer vision; Event detection; Feature extraction; Government; Hidden Markov models; Kernel; Layout; Object recognition; Streaming media;
fLanguage
English
Publisher
ieee
Conference_Titel
Signals, Systems and Computers, 2006. ACSSC '06. Fortieth Asilomar Conference on
Conference_Location
Pacific Grove, CA
ISSN
1058-6393
Print_ISBN
1-4244-0784-2
Electronic_ISBN
1058-6393
Type
conf
DOI
10.1109/ACSSC.2006.356582
Filename
4176511
Link To Document