Regional Information Center for Science and Technology - LSH vs Randomized Partition Trees: Which One to Use for Nearest Neighbor Search?

DocumentCode :

259562

Title :

LSH vs Randomized Partition Trees: Which One to Use for Nearest Neighbor Search?

Author :

Sinha, Kaushik

Year :

2014

Date :

3-6 Dec. 2014

Firstpage :

Lastpage :

Abstract :

Recently, randomized partition trees have been theoretically shown to be very effective for high-dimensional nearest neighbor search. In this paper, we introduce a variant of randomized partition trees for the high-dimensional nearest neighbor search problem and provide theoretical justification for its choice. Experiments on various real-life datasets show that the performance of this new variant is superior to that of the previous variant as well as to that of the locality sensitive hashing (LSH) method for nearest neighbor search. In addition, we establish the connection between recently introduced notions of difficulty in the nearest neighbor search problem, namely the potential function and relative contrast.

Keywords :

file organisation; pattern classification; query formulation; LSH method; high dimensional nearest neighbor search problem; locality sensitive hashing; randomized partition trees; Accuracy; Covariance matrices; Data structures; Nearest neighbor searches; Principal component analysis; Standards; Vectors; RP Trees; nearest neighbor search

Language :

English

Publisher :

IEEE

Conference_Title :

Machine Learning and Applications (ICMLA), 2014 13th International Conference on

Conference_Location :

Detroit, MI

Type :

conf

DOI :

10.1109/ICMLA.2014.13

Filename :

7033089

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=259562