DocumentCode
3609781
Title
Effective and Efficient Algorithms for Flexible Aggregate Similarity Search in High Dimensional Spaces
Author
Houle, Michael E. ; Xiguo Ma ; Oria, Vincent
Author_Institution
Nat. Inst. of Inf., Tokyo, Japan
Volume
27
Issue
12
fYear
2015
Firstpage
3258
Lastpage
3273
Abstract
Numerous applications in different fields, such as spatial databases, multimedia databases, data mining, and recommender systems, may benefit from efficient and effective aggregate similarity search, also known as aggregate nearest neighbor (AggNN) search. Given a group of query objects Q, the goal of AggNN is to retrieve the k most similar objects from the database, where the underlying similarity measure is defined as an aggregation (usually sum or max) of the distances between the retrieved objects and every query object in Q. Recently, the problem was generalized so as to retrieve the k objects which are most similar to a fixed proportion of the elements of Q. This variant of aggregate similarity search is referred to as “flexible AggNN”, or FANN. In this work, we propose two approximation algorithms, one for the sum variant of FANN, and the other for the max variant. Extensive experiments are provided showing that, relative to state-of-the-art approaches (both exact and approximate), our algorithms produce query results with good accuracy, while at the same time being very efficient.
Keywords
approximation theory; query processing; AggNN search; aggregate nearest neighbor search; approximate approach; approximation algorithms; exact approach; fixed proportion; flexible AggNN; flexible-aggregate similarity search; high-dimensional spaces; k-most similar object retrieval; max variant FANN; query objects; similarity measure; sum variant FANN; Approximation algorithms; Approximation methods; Data mining; Multimedia databases; Recommender systems; Spatial databases; Similarity search; aggregation; dimensional test; intrinsic dimensionality;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2015.2475740
Filename
7317853
Link To Document