• DocumentCode
    655179
  • Title

    Iterative Row Sampling

  • Author

    Mu Li ; Miller, Gary L. ; Peng, Rongkun

  • Author_Institution
    Comput. Sci. Dept., CMU, Pittsburgh, PA, USA
  • fYear
    2013
  • fDate
    26-29 Oct. 2013
  • Firstpage
    127
  • Lastpage
    136
  • Abstract
    There has been significant interest and progress recently in algorithms that solve regression problems involving tall and thin matrices in input sparsity time. Given a n * d matrix where n ≥ d, these algorithms find an approximation with fewer rows, allowing one to solve a poly(d) sized problem instead. In practice, the best performances are often obtained by invoking these routines in an iterative fashion. We show these iterative methods can be adapted to give theoretical guarantees comparable to and better than the current state of the art. Our approaches are based on computing the importances of the rows, known as leverage scores, in an iterative manner. We show that alternating between computing a short matrix estimate and finding more accurate approximate leverage scores leads to a series of geometrically smaller instances. This gives an algorithm whose runtime is input sparsity plus an overhead comparable to the cost of solving a regression problem on the smaller approximation. Our results build upon the close connection between randomized matrix algorithms, iterative methods, and graph sparsification.
  • Keywords
    iterative methods; matrix algebra; regression analysis; sampling methods; graph sparsification; iterative row sampling method; leverage scores; poly(d) sized problem; randomized matrix algorithms; regression problems; short matrix estimate; tall matrices; thin matrices; Algorithm design and analysis; Approximation algorithms; Approximation methods; Probability; Runtime; Symmetric matrices; Vectors; Regression; Sampling; Well-conditioned Basis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Foundations of Computer Science (FOCS), 2013 IEEE 54th Annual Symposium on
  • Conference_Location
    Berkeley, CA
  • ISSN
    0272-5428
  • Type

    conf

  • DOI
    10.1109/FOCS.2013.22
  • Filename
    6686148