DocumentCode
3673959
Title
Cultural Event recognition with visual ConvNets and temporal models
Author
Amaia Salvador;Daniel Manchón-Vizuete;Andrea Calafell;Xavier Giró-i-Nieto;Matthias Zeppelzauer
Author_Institution
Universitat Politecnica de Catalunya (UPC), Barcelona, Catalonia/Spain
fYear
2015
fDate
6/1/2015 12:00:00 AM
Firstpage
36
Lastpage
44
Abstract
This paper presents our contribution to the ChaLearn Challenge 2015 on Cultural Event Classification. The challenge in this task is to automatically classify images from 50 different cultural events. Our solution is based on the combination of visual features extracted from convolutional neural networks with temporal information using a hierarchical classifier scheme. We extract visual features from the last three fully connected layers of both CaffeNet (pre-trained with ImageNet) and our fine tuned version for the ChaLearn challenge. We propose a late fusion strategy that trains a separate low-level SVM on each of the extracted neural codes. The class predictions of the low-level SVMs form the input to a higher level SVM, which gives the final event scores. We achieve our best result by adding a temporal refinement step into our classification scheme, which is applied directly to the output of each low-level SVM. Our approach penalizes high classification scores based on visual features when their time stamp does not match well an event-specific temporal distribution learned from the training and validation data. Our system achieved the second best result in the ChaLearn Challenge 2015 on Cultural Event Classification with a mean average precision of 0.767 on the test set.
Keywords
"Cultural differences","Visualization","Feature extraction","Support vector machines","Metadata","Computational modeling"
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition Workshops (CVPRW), 2015 IEEE Conference on
Electronic_ISBN
2160-7516
Type
conf
DOI
10.1109/CVPRW.2015.7301334
Filename
7301334
Link To Document