Title :
Online appearance learning by template prediction
Author :
Liu, Ming ; Han, Tony X. ; Huang, Thomas S.
Author_Institution :
Dept. of Electr. & Comput. Eng., Illinois Univ., Urbana, IL, USA
Abstract :
A new object tracking framework with online appearance learning ability is proposed in this paper. The object appearances are modeled as a set of probability mass functions (PMF), defined as "object-model-set". The averaged object appearance in the video is also modeled as a PMF, named as "universal model". Given an initial template of the target object, which is the only element in the initial object-model-set, the framework tries to track the object by looking into the whole input video sequence. The dynamic programming (DP) is applied to achieve a best spatial-scale matching between the observations and the current model set, across the whole input video. The object-model-set is iteratively updated if the prediction of the matched image patch using current object-model-set is less than its prior computed using universal model. The PMFs of such matched image patches are added into the object-model-set. Thus the object appearance, which is modeled as the set of PMFs of typical views, is learned online. The tracking results can be further refined given the updated object-model-set. This make the proposed tracking framework robust to the appearance variation caused by 3D motion, partial occlusion and illumination. Also, the learned typical views facilitate other vision tasks such as recognition or 3D reconstruction. Tracking results and the learned typical view on the challenging video sequence experimentally show the robustness and strong online learning ability of the proposed frame work.
Keywords :
computer vision; dynamic programming; image matching; image sequences; learning (artificial intelligence); probability; tracking; video signal processing; dynamic programming; matched image patch; object tracking framework; object-model-set; online appearance learning; probability mass functions; template prediction; video sequence; vision tasks; Content based retrieval; Dynamic programming; Image reconstruction; Lighting; Predictive models; Robustness; Target tracking; User interfaces; Video sequences; Video surveillance;
Conference_Titel :
Advanced Video and Signal Based Surveillance, 2005. AVSS 2005. IEEE Conference on
Print_ISBN :
0-7803-9385-6
DOI :
10.1109/AVSS.2005.1577273