• DocumentCode
    1369158
  • Title

    Parallel Lasso for Large-Scale Video Concept Detection

  • Author

    Geng, Bo ; Li, Yangxi ; Tao, Dacheng ; Wang, Meng ; Zha, Zheng-Jun ; Xu, Chao

  • Author_Institution
    Key Lab. of Machine Perception (Minist. of Educ.), Peking Univ., Beijing, China
  • Volume
    14
  • Issue
    1
  • fYear
    2012
  • Firstpage
    55
  • Lastpage
    65
  • Abstract
    Existing video concept detectors are generally built upon the kernel based machine learning techniques, e.g., support vector machines, regularized least squares, and logistic regression, just to name a few. However, in order to build robust detectors, the learning process suffers from the scalability issues including the high-dimensional multi-modality visual features and the large-scale keyframe examples. In this paper, we propose parallel lasso (Plasso) by introducing the parallel distributed computation to significantly improve the scalability of lasso (the l1 regularized least squares). We apply the parallel incomplete Cholesky factorization to approximate the covariance statistics in the preprocess step, and the parallel primal-dual interior-point method with the Sherman-Morrison-Woodbury formula to optimize the model parameters. For a dataset with n samples in a d-dimensional space, compared with lasso, Plasso significantly reduces complexities from the original O(d3) for computational time and O(d2) for storage space to O(h2d/m) and O(hd/m) , respectively, if the system has m processors and the reduced dimension h is much smaller than the original dimension d . Furthermore, we develop the kernel extension of the proposed linear algorithm with the sample reweighting schema, and we can achieve similar time and space complexity improvements [time complexity from O(n3) to O(h2n/m) and the space complexity from O(n2) to O(hn/m), for a dataset with n training examples]. Experimental results on TRECVID video concept detection challenges suggest that the proposed method can obtain significant time and space savings for training effective detectors with limited communication overhead.
  • Keywords
    computational complexity; covariance analysis; feature extraction; learning (artificial intelligence); least squares approximations; matrix decomposition; parallel processing; video signal processing; Sherman-Morrison-Woodbury formula; complexity reduction; computational time; covariance statistics; high-dimensional multimodality visual features; kernel based machine learning technique; kernel extension; large-scale keyframe; large-scale video concept detection; learning process; linear algorithm; logistic regression; parallel distributed computation; parallel incomplete Cholesky factorization; parallel lasso; parallel primal-dual interior-point method; regularized least squares; sample reweighting schema; scalability issue; space complexity; storage space; support vector machine; time complexity; Algorithm design and analysis; Complexity theory; Educational institutions; Feature extraction; Kernel; Machine learning; Optimization; Incomplete cholosky factorization; lasso; parallel learning; video concept detection;
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1520-9210
  • Type

    jour

  • DOI
    10.1109/TMM.2011.2174781
  • Filename
    6069863