DocumentCode :
3428376
Title :
Structured Learning of Sum-of-Submodular Higher Order Energy Functions
Author :
Fix, Alexander ; Joachims, Thorsten ; Sung Min Park ; Zabih, Ramin
Author_Institution :
Comput. Sci. Dept., Cornell Univ., Ithaca, NY, USA
fYear :
2013
fDate :
1-8 Dec. 2013
Firstpage :
3104
Lastpage :
3111
Abstract :
Sub modular functions can be exactly minimized in polynomial time, and the special case that graph cuts solve with max flow [19] has had significant impact in computer vision [5, 21, 28]. In this paper we address the important class of sum-of-sub modular (SoS) functions [2, 18], which can be efficiently minimized via a variant of max flow called sub modular flow [6]. SoS functions can naturally express higher order priors involving, e.g., local image patches, however, it is difficult to fully exploit their expressive power because they have so many parameters. Rather than trying to formulate existing higher order priors as an SoS function, we take a discriminative learning approach, effectively searching the space of SoS functions for a higher order prior that performs well on our training set. We adopt a structural SVM approach [15, 34] and formulate the training problem in terms of quadratic programming, as a result we can efficiently search the space of SoS priors via an extended cutting-plane algorithm. We also show how the state-of-the-art max flow method for vision problems [11] can be modified to efficiently solve the sub modular flow problem. Experimental comparisons are made against the OpenCV implementation of the Grab Cut interactive segmentation technique [28], which uses hand-tuned parameters instead of machine learning. On a standard dataset [12] our method learns higher order priors with hundreds of parameter values, and produces significantly better segmentations. While our focus is on binary labeling problems, we show that our techniques can be naturally generalized to handle more than two labels.
Keywords :
computer vision; image segmentation; learning (artificial intelligence); quadratic programming; support vector machines; GrabCut interactive segmentation technique; OpenCV; SoS functions; binary labeling problems; computer vision; discriminative learning approach; extended cutting-plane algorithm; hand-tuned parameters; image segmentations; local image patches; machine learning; max flow method; polynomial time; quadratic programming; structural SVM approach; structured learning; submodular flow; sum-of-submodular higher order energy functions; Computational modeling; Computer vision; Minimization; Optimization; Support vector machines; Training; Vectors; Graph cuts; Max flow; Structured prediction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision (ICCV), 2013 IEEE International Conference on
Conference_Location :
Sydney, NSW
ISSN :
1550-5499
Type :
conf
DOI :
10.1109/ICCV.2013.385
Filename :
6751497
Link To Document :
بازگشت