Title :
Learning a dense multi-view representation for detection, viewpoint classification and synthesis of object categories
Author :
Su, Hao ; Sun, Min ; Fei-Fei, Li ; Savarese, Silvio
Author_Institution :
Dept. of Comput. Sci., Princeton Univ., Princeton, NJ, USA
fDate :
Sept. 29 2009-Oct. 2 2009
Abstract :
Recognizing object classes and their 3D viewpoints is an important problem in computer vision. Based on a part-based probabilistic representation [31], we propose a new 3D object class model that is capable of recognizing unseen views by pose estimation and synthesis. We achieve this by using a dense, multiview representation of the viewing sphere parameterized by a triangular mesh of viewpoints. Each triangle of viewpoints can be morphed to synthesize new viewpoints. By incorporating 3D geometrical constraints, our model establishes explicit correspondences among object parts across viewpoints. We propose an incremental learning algorithm to train the generative model. A cellphone video clip of an object is first used to initialize model learning. Then the model is updated by a set of unsorted training images without viewpoint labels. We demonstrate the robustness of our model on object detection, viewpoint classification and synthesis tasks. Our model performs superiorly to and on par with state-of-the-art algorithms on the Savarese et al. 2007 and PASCAL datasets in object detection. It outperforms all previous work in viewpoint classification and offers promising results in viewpoint synthesis.
Keywords :
learning (artificial intelligence); object detection; object recognition; pose estimation; probability; 3D geometrical constraints; 3D object class model; 3D viewpoint classification; cellphone video clip; computer vision; dense multiview representation Learning; incremental learning algorithm; object categories synthesis; object classes recognition; part based probabilistic representation; pose estimation; pose synthesis; viewpoint detection; viewpoint triangular mesh; Cellular phones; Computer science; Computer vision; Image recognition; Image segmentation; Layout; Mesh generation; Object detection; Robustness; Solid modeling;
Conference_Titel :
Computer Vision, 2009 IEEE 12th International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-4420-5
Electronic_ISBN :
1550-5499
DOI :
10.1109/ICCV.2009.5459168