مرکز منطقه ای اطلاع رساني علوم و فناوري - Learning a dense multi-view representation for detection, viewpoint classification and synthesis of object categories

DocumentCode :

2287916

Title :

Learning a dense multi-view representation for detection, viewpoint classification and synthesis of object categories

Author :

Su, Hao ; Sun, Min ; Fei-Fei, Li ; Savarese, Silvio

Author_Institution :

Dept. of Comput. Sci., Princeton Univ., Princeton, NJ, USA

fYear :

2009

fDate :

Sept. 29 2009-Oct. 2 2009

Firstpage :

213

Lastpage :

220

Abstract :

Recognizing object classes and their 3D viewpoints is an important problem in computer vision. Based on a part-based probabilistic representation [31], we propose a new 3D object class model that is capable of recognizing unseen views by pose estimation and synthesis. We achieve this by using a dense, multiview representation of the viewing sphere parameterized by a triangular mesh of viewpoints. Each triangle of viewpoints can be morphed to synthesize new viewpoints. By incorporating 3D geometrical constraints, our model establishes explicit correspondences among object parts across viewpoints. We propose an incremental learning algorithm to train the generative model. A cellphone video clip of an object is first used to initialize model learning. Then the model is updated by a set of unsorted training images without viewpoint labels. We demonstrate the robustness of our model on object detection, viewpoint classification and synthesis tasks. Our model performs superiorly to and on par with state-of-the-art algorithms on the Savarese et al. 2007 and PASCAL datasets in object detection. It outperforms all previous work in viewpoint classification and offers promising results in viewpoint synthesis.

Keywords :

learning (artificial intelligence); object detection; object recognition; pose estimation; probability; 3D geometrical constraints; 3D object class model; 3D viewpoint classification; cellphone video clip; computer vision; dense multiview representation Learning; incremental learning algorithm; object categories synthesis; object classes recognition; part based probabilistic representation; pose estimation; pose synthesis; viewpoint detection; viewpoint triangular mesh; Cellular phones; Computer science; Computer vision; Image recognition; Image segmentation; Layout; Mesh generation; Object detection; Robustness; Solid modeling;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision, 2009 IEEE 12th International Conference on

Conference_Location :

Kyoto

ISSN :

1550-5499

Print_ISBN :

978-1-4244-4420-5

Electronic_ISBN :

1550-5499

Type :

conf

DOI :

10.1109/ICCV.2009.5459168

Filename :

5459168

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2287916