مرکز منطقه ای اطلاع رساني علوم و فناوري - Online Object Tracking, Learning, and Parsing with And-Or Graphs

DocumentCode :

254328

Title :

Online Object Tracking, Learning, and Parsing with And-Or Graphs

Author :

Yang Lu ; Tianfu Wu ; Song-Chun Zhu

Author_Institution :

Dept. of Stat., Univ. of California, Los Angeles, Los Angeles, CA, USA

fYear :

2014

fDate :

23-28 June 2014

Firstpage :

3462

Lastpage :

3469

Abstract :

This paper presents a framework for simultaneously tracking, learning and parsing objects with a hierarchical and compositional and-or graph (AOG) representation. The AOG is discriminatively learned online to account for the appearance (e.g., lighting and partial occlusion) and structural (e.g., different poses and viewpoints) variations of the object itself, as well as the distractors (e.g., similar objects) in the scene background. In tracking, the state of the object (i.e., bounding box) is inferred by parsing with the current AOG using a spatial-temporal dynamic programming (DP) algorithm. When the AOG grows big for handling objects with large variations in long-term tracking, we propose a bottom-up/top-down scheduling scheme for efficient inference, which performs focused inference with the most stable and discriminative small sub-AOG. During online learning, the AOG is re-learned iteratively with two steps: (i) Identifying the false positives and false negatives of the current AOG in a new frame by exploiting the spatial and temporal constraints observed in the trajectory, (ii) Updating the structure of the AOG, and re-estimating the parameters based on the augmented training dataset. In experiments, the proposed method outperforms state-of-the-art tracking algorithms on a recent public tracking benchmark with 50 testing videos and 30 publicly available trackers evaluated [34].

Keywords :

dynamic programming; grammars; graph theory; image representation; iterative methods; learning (artificial intelligence); object tracking; scheduling; AOG; DP algorithm; augmented training dataset; bottom-up-top-down scheduling scheme; compositional and-or graph representation; false negatives; false positives; focused inference; hierarchical and-or graph representation; long-term tracking; online learning; online object tracking; parsing objects; public tracking benchmark; scene background; spatial constraints; spatial-temporal dynamic programming algorithm; structural variations; temporal constraints; Bismuth; Computational modeling; Heuristic algorithms; Hidden Markov models; Object tracking; Quantization (signal); Training; And-Or Graph; Bottom-up/Top-down Scheduling; Object Tracking; Spatial-Temporal Dynamic Programming;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on

Conference_Location :

Columbus, OH

Type :

conf

DOI :

10.1109/CVPR.2014.443

Filename :

6909838

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=254328