• DocumentCode
    3748770
  • Title

    Look and Think Twice: Capturing Top-Down Visual Attention with Feedback Convolutional Neural Networks

  • Author

    Chunshui Cao;Xianming Liu;Yi Yang;Yinan Yu;Jiang Wang;Zilei Wang;Yongzhen Huang;Liang Wang;Chang Huang;Wei Xu;Deva Ramanan;Thomas S. Huang

  • Author_Institution
    Univ. of Sci. &
  • fYear
    2015
  • Firstpage
    2956
  • Lastpage
    2964
  • Abstract
    While feedforward deep convolutional neural networks (CNNs) have been a great success in computer vision, it is important to note that the human visual cortex generally contains more feedback than feedforward connections. In this paper, we will briefly introduce the background of feedbacks in the human visual cortex, which motivates us to develop a computational feedback mechanism in deep neural networks. In addition to the feedforward inference in traditional neural networks, a feedback loop is introduced to infer the activation status of hidden layer neurons according to the "goal" of the network, e.g., high-level semantic labels. We analogize this mechanism as "Look and Think Twice." The feedback networks help better visualize and understand how deep neural networks work, and capture visual attention on expected objects, even in images with cluttered background and multiple objects. Experiments on ImageNet dataset demonstrate its effectiveness in solving tasks such as image classification and object localization.
  • Keywords
    "Neurons","Visualization","Biological neural networks","Feedforward neural networks","Semantics","Feedback loop","Logic gates"
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision (ICCV), 2015 IEEE International Conference on
  • Electronic_ISBN
    2380-7504
  • Type

    conf

  • DOI
    10.1109/ICCV.2015.338
  • Filename
    7410695