مرکز منطقه ای اطلاع رساني علوم و فناوري - Understanding deep image representations by inverting them

DocumentCode :

3672611

Title :

Understanding deep image representations by inverting them

Author :

Aravindh Mahendran;Andrea Vedaldi

Author_Institution :

University of Oxford, USA

fYear :

2015

fDate :

6/1/2015 12:00:00 AM

Firstpage :

5188

Lastpage :

5196

Abstract :

Image representations, from SIFT and Bag of Visual Words to Convolutional Neural Networks (CNNs), are a crucial component of almost any image understanding system. Nevertheless, our understanding of them remains limited. In this paper we conduct a direct analysis of the visual information contained in representations by asking the following question: given an encoding of an image, to which extent is it possible to reconstruct the image itself? To answer this question we contribute a general framework to invert representations. We show that this method can invert representations such as HOG more accurately than recent alternatives while being applicable to CNNs too. We then use this technique to study the inverse of recent state-of-the-art CNN image representations for the first time. Among our findings, we show that several layers in CNNs retain photographically accurate information about the image, with different degrees of geometric and photometric invariance.

Keywords :

"Image reconstruction","Image representation","Visualization","Standards","TV","Neural networks","Noise"

Publisher :

ieee

Conference_Titel :

Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on

Electronic_ISBN :

1063-6919

Type :

conf

DOI :

10.1109/CVPR.2015.7299155

Filename :

7299155

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3672611