DocumentCode :
177648
Title :
Video Event Detection via Multi-modality Deep Learning
Author :
I-Hong Jhuo ; Lee, D.T.
Author_Institution :
Inst. of Inf. Sci., Taipei, Taiwan
fYear :
2014
fDate :
24-28 Aug. 2014
Firstpage :
666
Lastpage :
671
Abstract :
Detecting complex video events based on audio and visual modalities is still a largely unresolved issue. While the conventional video representation methods extract each modality ineffectively, we propose a regularized multi-modality deep learning for video event detection. We first build an auto-encoder based on unconstrained minimization and adopt the conjugate gradient method with linear search for optimization. The learned auto-encoder can capture the relationship between the audio and visual modality corresponding to the same video event at each layer of the network. To make the network robust to local variance, we adopt the commonly used local contrast normalization and spatial maximum pooling to each modality for video representation. Compared with traditional methods using manually designed features, our method is more efficient. Experimental results on publicly available video event detection datasets demonstrate that the proposed method consistently outperforms the state-of-the-art video representation methods.
Keywords :
audio-visual systems; conjugate gradient methods; image representation; learning (artificial intelligence); linear programming; minimisation; search problems; video retrieval; video signal processing; audio modality; auto-encoder; complex video event detection; conjugate gradient method; linear search; local contrast normalization; local variance robustness; optimization; regularized multimodality deep learning; spatial maximum pooling; unconstrained minimization; video representation methods; visual modality; Event detection; Feature extraction; Image reconstruction; Neural networks; Training; Unsupervised learning; Visualization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2014 22nd International Conference on
Conference_Location :
Stockholm
ISSN :
1051-4651
Type :
conf
DOI :
10.1109/ICPR.2014.125
Filename :
6976835
Link To Document :
بازگشت