• DocumentCode
    1282916
  • Title

    Design and Generalization Analysis of Orthogonal Matching Pursuit Algorithms

  • Author

    Hussain, Zakria ; Shawe-Taylor, John ; Hardoon, David R. ; Dhanjal, Charanpal

  • Author_Institution
    Dept. of Comput. Sci ence, Univ. Coll. London, London, UK
  • Volume
    57
  • Issue
    8
  • fYear
    2011
  • Firstpage
    5326
  • Lastpage
    5341
  • Abstract
    We derive generalization error (loss) bounds for orthogonal matching pursuit algorithms, starting with kernel matching pursuit and sparse kernel principal components analysis. We propose (to the best of our knowledge) the first loss bound for kernel matching pursuit using a novel application of sample compression and Vapnik-Chervonenkis bounds. For sparse kernel principal components analysis, we find that it can be bounded using a standard sample compression analysis, as the subspace it constructs is a compression scheme. We demonstrate empirically that this bound is tighter than previous state-of-the-art bounds for principal components analysis, which use global and local Rademacher complexities. From this analysis we propose a novel sparse variant of kernel canonical correlation analysis and bound its generalization performance using the results developed in this paper. We conclude with a general technique for designing matching pursuit algorithms for other learning domains.
  • Keywords
    data compression; generalisation (artificial intelligence); learning (artificial intelligence); pattern matching; principal component analysis; Vapnik-Chervonenkis bounds; generalization analysis; global Rademacher complexity; kernel canonical correlation analysis; kernel matching pursuit; learning domain; local Rademacher complexity; orthogonal matching pursuit algorithm; sample compression analysis; sparse kernel principal components analysis; Algorithm design and analysis; Complexity theory; Kernel; Matching pursuit algorithms; Principal component analysis; Training; Vectors; Kernel methods; Nyström approximation; matching pursuit; principle components analysis; sample compression bounds; sparse kernel canonical correlation analysis; sparsity;
  • fLanguage
    English
  • Journal_Title
    Information Theory, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9448
  • Type

    jour

  • DOI
    10.1109/TIT.2011.2158880
  • Filename
    5961825