• DocumentCode
    253526
  • Title

    Weakly Supervised Multiclass Video Segmentation

  • Author

    Xiao Liu ; Dacheng Tao ; Mingli Song ; Ying Ruan ; Chun Chen ; Jiajun Bu

  • Author_Institution
    Zhejiang Provincial Key Lab. of Service Robot, Zhejiang Univ., Hangzhou, China
  • fYear
    2014
  • fDate
    23-28 June 2014
  • Firstpage
    57
  • Lastpage
    64
  • Abstract
    The desire of enabling computers to learn semantic concepts from large quantities of Internet videos has motivated increasing interests on semantic video understanding, while video segmentation is important yet challenging for understanding videos. The main difficulty of video segmentation arises from the burden of labeling training samples, making the problem largely unsolved. In this paper, we present a novel nearest neighbor-based label transfer scheme for weakly supervised video segmentation. Whereas previous weakly supervised video segmentation methods have been limited to the two-class case, our proposed scheme focuses on more challenging multiclass video segmentation, which finds a semantically meaningful label for every pixel in a video. Our scheme enjoys several favorable properties when compared with conventional methods. First, a weakly supervised hashing procedure is carried out to handle both metric and semantic similarity. Second, the proposed nearest neighbor-based label transfer algorithm effectively avoids overfitting caused by weakly supervised data. Third, a multi-video graph model is built to encourage smoothness between regions that are spatiotemporally adjacent and similar in appearance. We demonstrate the effectiveness of the proposed scheme by comparing it with several other state-of-the-art weakly supervised segmentation methods on one new Wild8 dataset and two other publicly available datasets.
  • Keywords
    image segmentation; learning (artificial intelligence); semantic Web; video signal processing; video streaming; Internet videos; Wild8 dataset; labeling training samples; metric similarity; multivideo graph model; neighbor-based label transfer scheme; semantic similarity; semantic video; video segmentation; video. pixel; weakly supervised multiclass video segmentation; Image segmentation; Labeling; Measurement; Semantics; Spatiotemporal phenomena; Streaming media; Training; label transfer; nearest neighbor; video segmentation; weakly supervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
  • Conference_Location
    Columbus, OH
  • Type

    conf

  • DOI
    10.1109/CVPR.2014.15
  • Filename
    6909409