Title :
Spectral Approaches to Nearest Neighbor Search
Author :
Abdullah, Ammar ; Andoni, Alexandr ; Kannan, Ravindran ; Krauthgamer, Robert
Abstract :
We study spectral algorithms for the high-dimensional Nearest Neighbor Search problem (NNS). In particular, we consider a semi-random setting where a dataset is chosen arbitrarily from an unknown subspace of low dimension, and then perturbed by full-dimensional Gaussian noise. We design spectral NNS algorithms whose query time depends polynomially on the dimension and logarithmically on the size of the point set. These spectral algorithms use a repeated computation of the top PCA vector/subspace, and are effective even when the random-noise magnitude is much larger than the interpoint distances. Our motivation is that in practice, a number of spectral NNS algorithms outperform the random-projection methods that seem otherwise theoretically optimal on worst-case datasets. In this paper we aim to provide theoretical justification for this disparity. The full version of this extended abstract is available on arXiv.
Keywords :
Gaussian noise; data handling; principal component analysis; vectors; PCA vector; full-dimensional Gaussian noise; high-dimensional NNS problem; high-dimensional nearest neighbor search problem; random-noise magnitude; spectral NNS algorithms; worst-case datasets; Algorithm design and analysis; Data structures; Nearest neighbor searches; Noise; Partitioning algorithms; Principal component analysis; Vectors; Nearest neighbor search; spectral algorithms;
Conference_Titel :
Foundations of Computer Science (FOCS), 2014 IEEE 55th Annual Symposium on
Conference_Location :
Philadelphia, PA
DOI :
10.1109/FOCS.2014.68