• DocumentCode
    180830
  • Title

    Spectral Approaches to Nearest Neighbor Search

  • Author

    Abdullah, Ammar ; Andoni, Alexandr ; Kannan, Ravindran ; Krauthgamer, Robert

  • fYear
    2014
  • fDate
    18-21 Oct. 2014
  • Firstpage
    581
  • Lastpage
    590
  • Abstract
    We study spectral algorithms for the high-dimensional Nearest Neighbor Search problem (NNS). In particular, we consider a semi-random setting where a dataset is chosen arbitrarily from an unknown subspace of low dimension, and then perturbed by full-dimensional Gaussian noise. We design spectral NNS algorithms whose query time depends polynomially on the dimension and logarithmically on the size of the point set. These spectral algorithms use a repeated computation of the top PCA vector/subspace, and are effective even when the random-noise magnitude is much larger than the interpoint distances. Our motivation is that in practice, a number of spectral NNS algorithms outperform the random-projection methods that seem otherwise theoretically optimal on worst-case datasets. In this paper we aim to provide theoretical justification for this disparity. The full version of this extended abstract is available on arXiv.
  • Keywords
    Gaussian noise; data handling; principal component analysis; vectors; PCA vector; full-dimensional Gaussian noise; high-dimensional NNS problem; high-dimensional nearest neighbor search problem; random-noise magnitude; spectral NNS algorithms; worst-case datasets; Algorithm design and analysis; Data structures; Nearest neighbor searches; Noise; Partitioning algorithms; Principal component analysis; Vectors; Nearest neighbor search; spectral algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Foundations of Computer Science (FOCS), 2014 IEEE 55th Annual Symposium on
  • Conference_Location
    Philadelphia, PA
  • ISSN
    0272-5428
  • Type

    conf

  • DOI
    10.1109/FOCS.2014.68
  • Filename
    6979043