DocumentCode
3083649
Title
Image annotation via deep neural network
Author
Sun Chengjian ; Songhao Zhu ; Zhe Shi
Author_Institution
Sch. of Autom., Nanjing Univ. of Posts & Telecommun., Nanjing, China
fYear
2015
fDate
18-22 May 2015
Firstpage
518
Lastpage
521
Abstract
Multilabel image annotation is one of the most important open problems in computer vision field. Unlike existing works that usually use conventional visual features to annotate images, features based on deep learning have shown potential to achieve outstanding performance. In this work, we propose a multimodal deep learning framework, which aims to optimally integrate multiple deep neural networks pretrained with convolutional neural networks. In particular, the proposed framework explores a unified two-stage learning scheme that consists of (i) learning to fune-tune the parameters of deep neural network with respect to each individual modality, and (ii) learning to find the optimal combination of diverse modalities simultaneously in a coherent process. Experiments conducted on the NUS-WIDE dataset evaluate the performance of the proposed framework for multilabel image annotation, in which the encouraging results validate the effectiveness of the proposed algorithms.
Keywords
computer vision; feature extraction; neural nets; NUS-WIDE dataset; coherent process; computer vision; conventional visual features; convolutional neural networks; diverse modality; multilabel image annotation; multimodal deep learning framework; multiple deep neural networks; optimal combination; unified two-stage learning scheme; Computer architecture; Computer vision; Feature extraction; Image classification; Neural networks; Pattern recognition; Visualization;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Vision Applications (MVA), 2015 14th IAPR International Conference on
Conference_Location
Tokyo
Type
conf
DOI
10.1109/MVA.2015.7153244
Filename
7153244
Link To Document