Title :
Closed-Loop Deep Vision
Author :
Carneiro, Gustavo ; Zhibin Liao ; Tat-Jun Chin
Author_Institution :
Australian Centre for Visual Technol., Univ. of Adelaide, Adelaide, SA, Australia
Abstract :
There has been a resurgence of interest in one of the most fundamental aspects of computer vision, which is related to the existence of a feedback mechanism in the inference of a visual classification process. Indeed, this mechanism was present in the first computer vision methodologies, but technical and theoretical issues imposed major roadblocks that forced researchers to seek alternative approaches based on pure feed-forward inference. These open loop approaches process the input image sequentially with increasingly more complex analysis steps, and any mistake made by intermediate steps impair all subsequent analysis tasks. On the other hand, closed-loop approaches involving feed- forward and feedback mechanisms can fix mistakes made during such intermediate stages. In this paper, we present a new closed- loop inference for computer vision problems based on an iterative analysis using deep belief networks (DBN). Specifically, an image is processed using a feed-forward mechanism that will produce a classification result, which is then used to sample an image from the current belief state of the DBN. Then the difference between the input image and the sampled image is fed back to the DBN for re- classification, and this process iterates until convergence. We show that our closed-loop vision inference improves the classification results compared to pure feed-forward mechanisms on the MNIST handwritten digit dataset and the Multiple Object Categories containing shapes of horses, dragonflies, llamas and rhinos.
Keywords :
belief networks; closed loop systems; computer vision; feedback; feedforward; image classification; inference mechanisms; iterative methods; DBN; MNIST handwritten digit dataset; closed-loop deep vision; closed-loop inference; closed-loop vision inference; complex analysis steps; computer vision methodology; deep belief networks; feed-forward inference; feed-forward mechanisms; feedback mechanisms; feedforward mechanism; iterative analysis; multiple object category; visual classification process; Analytical models; Computational modeling; Databases; Feature extraction; Training; Vectors; Visualization;
Conference_Titel :
Digital Image Computing: Techniques and Applications (DICTA), 2013 International Conference on
Conference_Location :
Hobart, TAS
DOI :
10.1109/DICTA.2013.6691492