مرکز منطقه ای اطلاع رساني علوم و فناوري - Discriminatively Trained And-Or Tree Models for Object Detection

DocumentCode :

3335392

Title :

Discriminatively Trained And-Or Tree Models for Object Detection

Author :

Xi Song ; Tianfu Wu ; Yunde Jia ; Song-Chun Zhu

Author_Institution :

Lab. of Intell. Inf. Technol., Beijing Inst. of Technol., Beijing, China

fYear :

2013

fDate :

23-28 June 2013

Firstpage :

3278

Lastpage :

3285

Abstract :

This paper presents a method of learning reconfigurable And-Or Tree (AOT) models discriminatively from weakly annotated data for object detection. To explore the appearance and geometry space of latent structures effectively, we first quantize the image lattice using an over complete set of shape primitives, and then organize them into a directed a cyclic And-Or Graph (AOG) by exploiting their compositional relations. We allow overlaps between child nodes when combining them into a parent node, which is equivalent to introducing an appearance Or-node implicitly for the overlapped portion. The learning of an AOT model consists of three components: (i) Unsupervised sub-category learning (i.e., branches of an object Or-node) with the latent structures in AOG being integrated out. (ii) Weakly supervised part configuration learning (i.e., seeking the globally optimal parse trees in AOG for each sub-category). To search the globally optimal parse tree in AOG efficiently, we propose a dynamic programming (DP) algorithm. (iii) Joint appearance and structural parameters training under latent structural SVM framework. In experiments, our method is tested on PASCAL VOC 2007 and 2010 detection benchmarks of 20 object classes and outperforms comparable state-of-the-art methods.

Keywords :

directed graphs; dynamic programming; learning (artificial intelligence); object detection; 2010 detection benchmarks; PASCAL VOC 2007; directed a cyclic and-or graph; discriminatively trained and-or tree models; dynamic programming algorithm; geometry space; globally optimal parse trees; image lattice; object detection; object or-node; reconfigurable and-or tree models; shape primitives; structural SVM framework; structural parameters training; unsupervised sub-category learning; weakly annotated data; Deformable models; Lattices; Object detection; Shape; Space exploration; Support vector machines; Training; And-Or Graph; Latent Structural SVM; Object Detection; Part-based Representation;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on

Conference_Location :

Portland, OR

ISSN :

1063-6919

Type :

conf

DOI :

10.1109/CVPR.2013.421

Filename :

6619265

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3335392