DocumentCode :
2287916
Title :
Learning a dense multi-view representation for detection, viewpoint classification and synthesis of object categories
Author :
Su, Hao ; Sun, Min ; Fei-Fei, Li ; Savarese, Silvio
Author_Institution :
Dept. of Comput. Sci., Princeton Univ., Princeton, NJ, USA
fYear :
2009
fDate :
Sept. 29 2009-Oct. 2 2009
Firstpage :
213
Lastpage :
220
Abstract :
Recognizing object classes and their 3D viewpoints is an important problem in computer vision. Based on a part-based probabilistic representation [31], we propose a new 3D object class model that is capable of recognizing unseen views by pose estimation and synthesis. We achieve this by using a dense, multiview representation of the viewing sphere parameterized by a triangular mesh of viewpoints. Each triangle of viewpoints can be morphed to synthesize new viewpoints. By incorporating 3D geometrical constraints, our model establishes explicit correspondences among object parts across viewpoints. We propose an incremental learning algorithm to train the generative model. A cellphone video clip of an object is first used to initialize model learning. Then the model is updated by a set of unsorted training images without viewpoint labels. We demonstrate the robustness of our model on object detection, viewpoint classification and synthesis tasks. Our model performs superiorly to and on par with state-of-the-art algorithms on the Savarese et al. 2007 and PASCAL datasets in object detection. It outperforms all previous work in viewpoint classification and offers promising results in viewpoint synthesis.
Keywords :
learning (artificial intelligence); object detection; object recognition; pose estimation; probability; 3D geometrical constraints; 3D object class model; 3D viewpoint classification; cellphone video clip; computer vision; dense multiview representation Learning; incremental learning algorithm; object categories synthesis; object classes recognition; part based probabilistic representation; pose estimation; pose synthesis; viewpoint detection; viewpoint triangular mesh; Cellular phones; Computer science; Computer vision; Image recognition; Image segmentation; Layout; Mesh generation; Object detection; Robustness; Solid modeling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision, 2009 IEEE 12th International Conference on
Conference_Location :
Kyoto
ISSN :
1550-5499
Print_ISBN :
978-1-4244-4420-5
Electronic_ISBN :
1550-5499
Type :
conf
DOI :
10.1109/ICCV.2009.5459168
Filename :
5459168
Link To Document :
بازگشت