DocumentCode :
985176
Title :
Video object segmentation and tracking using ψ-learning classification
Author :
Liu, Yi ; Zheng, Yuan F.
Author_Institution :
Dept. of Electr. & Comput. Eng., Ohio State Univ., Columbus, OH, USA
Volume :
15
Issue :
7
fYear :
2005
fDate :
7/1/2005 12:00:00 AM
Firstpage :
885
Lastpage :
899
Abstract :
As a requisite of the emerging content-based multimedia technologies, video object (VO) extraction is of great importance. This paper presents a novel semiautomatic segmentation and tracking method for single VO extraction. Unlike traditional approaches, the proposed method formulates the separation of the VO from the background as a classification problem. Each frame is divided into small blocks of uniform size, which are called object blocks if their center pixels belong to the object, or background blocks otherwise. After a manual segmentation of the first frame, the blocks of this frame are used as the training samples for the object-background classifier. A newly developed learning tool called ψ-learning, which outperforms conventional support vector machines in linearly nonseparable cases, is employed to train the classifier. To deal with large and complex objects, a multilayer approach constructing a so-called hyperplane tree is proposed. Each node of the tree represents a hyperplane, responsible for classifying only a subset of the training samples. Multiple hyperplanes are thus needed to classify the entire set. Through the combination of the multilayer scheme and ψ-learning, one can avoid the complexity of nonlinear mapping as well as achieve high classification accuracy. During the tracking phase, the pixel in the center of every block in a successive frame is classified by a sequence of hyperplanes from the root to a leaf node of the hyperplane tree, and the class of the block is identified accordingly. All the object blocks thus form the object of interest, whose boundary unfortunately is stair-like due to the block effect. In order to obtain the pixel-wise boundary in a cost-efficient way, a pyramid boundary refining algorithm is designed, which iteratively selects a few informative pixels for class label checking and reduces uncertainty about the actual boundary of the object. The proposed method has been applied to video sequences with various spatial and temporal characteristics, and experimental results demonstrate it to be effective, efficient, and robust.
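The root-to-leaf classification described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the node layout, the per-side labels, the one-dimensional feature, and the hand-set weights are all assumptions made for illustration, and no ψ-learning training is performed here.

```python
from dataclasses import dataclass
from typing import Optional, Sequence

OBJECT, BACKGROUND = 1, -1  # hypothetical class labels


@dataclass
class HyperplaneNode:
    """One node of a hypothetical hyperplane tree: the hyperplane w.x + b = 0."""
    w: Sequence[float]
    b: float
    # Child handling samples on each side; None means that side is a leaf.
    pos_child: Optional["HyperplaneNode"] = None
    neg_child: Optional["HyperplaneNode"] = None
    # Label assigned when the corresponding side is a leaf.
    pos_label: int = OBJECT
    neg_label: int = BACKGROUND

    def on_positive_side(self, x: Sequence[float]) -> bool:
        return sum(wi * xi for wi, xi in zip(self.w, x)) + self.b >= 0


def classify(root: HyperplaneNode, x: Sequence[float]) -> int:
    """Follow hyperplane decisions from the root until a leaf side is reached."""
    node = root
    while True:
        if node.on_positive_side(x):
            child, label = node.pos_child, node.pos_label
        else:
            child, label = node.neg_child, node.neg_label
        if child is None:
            return label
        node = child


# Hand-set example (assumed, not trained): the object occupies the band
# 0.3 <= x <= 0.7, which no single hyperplane can separate from the rest.
upper = HyperplaneNode(w=[-1.0], b=0.7)                 # x <= 0.7 -> OBJECT
tree = HyperplaneNode(w=[1.0], b=-0.3, pos_child=upper)  # x < 0.3 -> BACKGROUND

print(classify(tree, [0.5]), classify(tree, [0.1]), classify(tree, [0.9]))
```

Two linear decisions in sequence separate a band that a single hyperplane cannot, which is the sense in which a tree of hyperplanes can stand in for a nonlinear mapping. In the paper's setting, the input vector would be derived from the block centered on each pixel.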
Keywords :
image classification; image resolution; image segmentation; image sequences; iterative methods; learning (artificial intelligence); multimedia communication; support vector machines; tracking; video signal processing; ψ-learning classification; background block; class label checking; content-based multimedia technology; hyperplane tree; iterative method; multilayer approach; object block; object-background classifier; pyramid boundary refining algorithm; semiautomatic segmentation; support vector machine; video object extraction; video object segmentation; video object tracking; video sequence; Algorithm design and analysis; Classification tree analysis; Costs; Iterative algorithms; Machine learning; Nonhomogeneous media; Object segmentation; Support vector machine classification; Support vector machines; Uncertainty; VO segmentation and tracking; support vector machines (SVM); video object (VO) extraction;
fLanguage :
English
Journal_Title :
Circuits and Systems for Video Technology, IEEE Transactions on
Publisher :
IEEE
ISSN :
1051-8215
Type :
jour
DOI :
10.1109/TCSVT.2005.848346
Filename :
1458830