DocumentCode
1758572
Title
Learning to Track Multiple Targets
Author
Xiao Liu ; Dacheng Tao ; Mingli Song ; Luming Zhang ; Jiajun Bu ; Chun Chen
Author_Institution
Coll. of Comput. Sci., Zhejiang Univ., Hangzhou, China
Volume
26
Issue
5
fYear
2015
fDate
42125
Firstpage
1060
Lastpage
1073
Abstract
Monocular multiple-object tracking is a fundamental yet under-addressed computer vision problem. In this paper, we propose a novel learning framework for tracking multiple objects by detection. First, instead of heuristically defining a tracking algorithm, we learn that a discriminative structure prediction model from labeled video data captures the interdependence of multiple influence factors. Given the joint targets state from the last time step and the observation at the current frame, the joint targets state at the current time step can then be inferred by maximizing the joint probability score. Second, our detection results benefit from tracking cues. The traditional detection algorithms need a nonmaximal suppression postprocessing to select a subset from the total detection responses as the final output and a large number of selection mistakes are induced, especially under a congested circumstance. Our method integrates both detection and tracking cues. This integration helps to decrease the postprocessing mistake risk and to improve performance in tracking. Finally, we formulate the entire model training into a convex optimization problem and estimate its parameters using the cutting plane optimization. Experiments show that our method performs effectively in a large variety of scenarios, including pedestrian tracking in crowd scenes and vehicle tracking in congested traffic.
Keywords
computer vision; convex programming; learning (artificial intelligence); object detection; probability; target tracking; video signal processing; computer vision problem; congested traffic; convex optimization problem; crowd scenes; cutting plane optimization; detection cues; discriminative structure prediction model; joint probability score maximization; joint target state; labeled video data; learning framework; model training; monocular multiple-object tracking; multiple influence factor interdependence; multiple target tracking; nonmaximal suppression postprocessing; object detection; parameter estimation; pedestrian tracking; performance improvement; postprocessing mistake risk; subset selection; time step; total detection responses; tracking cues; vehicle tracking; Cameras; Detectors; Joints; Prediction algorithms; Target tracking; Trajectory; Cutting plane; discriminative model; interdependence; learning to track; multiple-object tracking; structure prediction; tracking-by-detection; tracking-by-detection.;
fLanguage
English
Journal_Title
Neural Networks and Learning Systems, IEEE Transactions on
Publisher
ieee
ISSN
2162-237X
Type
jour
DOI
10.1109/TNNLS.2014.2333751
Filename
6855348
Link To Document