• DocumentCode
    73935
  • Title

    Semisupervised Feature Selection via Spline Regression for Video Semantic Recognition

  • Author

    Yahong Han ; Yi Yang ; Yan Yan ; Zhigang Ma ; Sebe, Nicu ; Xiaofang Zhou

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Tianjin Univ., Tianjin, China
  • Volume
    26
  • Issue
    2
  • fYear
    2015
  • fDate
    Feb. 2015
  • Firstpage
    252
  • Lastpage
    264
  • Abstract
    To improve both the efficiency and accuracy of video semantic recognition, we can perform feature selection on the extracted video features to select a subset of features from the high-dimensional feature set for a compact and accurate video data representation. Provided the number of labeled videos is small, supervised feature selection could fail to identify the relevant features that are discriminative to target classes. In many applications, abundant unlabeled videos are easily accessible. This motivates us to develop semisupervised feature selection algorithms to better identify the relevant video features, which are discriminative to target classes by effectively exploiting the information underlying the huge amount of unlabeled video data. In this paper, we propose a framework of video semantic recognition by semisupervised feature selection via spline regression (S2FS2R). Two scatter matrices are combined to capture both the discriminative information and the local geometry structure of labeled and unlabeled training videos: A within-class scatter matrix encoding discriminative information of labeled training videos and a spline scatter output from a local spline regression encoding data distribution. An ℓ2,1-norm is imposed as a regularization term on the transformation matrix to ensure it is sparse in rows, making it particularly suitable for feature selection. To efficiently solve S2FS2R, we develop an iterative algorithm and prove its convergency. In the experiments, three typical tasks of video semantic recognition, such as video concept detection, video classification, and human action recognition, are used to demonstrate that the proposed S2FS2R achieves better performance compared with the state-of-the-art methods.
  • Keywords
    feature extraction; feature selection; image representation; iterative methods; matrix algebra; object recognition; regression analysis; splines (mathematics); video signal processing; ℓ2,1-norm; S2FS2R; data distribution encoding; discriminative information; iterative algorithm; labeled training videos; local geometry structure; regularization term; semisupervised feature selection; spline regression; spline scatter output; transformation matrix; unlabeled training videos; video data representation; video feature extraction; video semantic recognition; within-class scatter matrix; Feature extraction; Geometry; Semantics; Sparse matrices; Splines (mathematics); Training; Vectors; $ell_{2,1}$ -norm; ℓ₂,₁; semisupervised feature selection; spline regression; video analysis; video analysis.;
  • fLanguage
    English
  • Journal_Title
    Neural Networks and Learning Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2162-237X
  • Type

    jour

  • DOI
    10.1109/TNNLS.2014.2314123
  • Filename
    6786497