DocumentCode :
3335016
Title :
A general and efficient algorithm for “top” queries
Author :
Graefe, Goetz
Author_Institution :
Hewlett-Packard Labs., Palo Alto, CA
fYear :
2008
fDate :
7-12 April 2008
Firstpage :
548
Lastpage :
555
Abstract :
"Top K" queries reduce a query result to the most interesting or the most urgent items. In many cases, e.g., when the result size is unbounded due to duplicate key values, a "top" operation cannot be implemented using the standard algorithm based on an in-memory priority queue. The default alternative is a full sort. External merge sort admits multiple novel optimizations specific to "top" operations. These are simple to implement yet greatly reduce the data volume written to runs on temporary storage. Experiments demonstrate substantial performance improvements, in one case exceeding three orders of magnitude.
Keywords :
data structures; merging; query processing; sorting; duplicate key values; in-memory priority queue; merge sort; top K query optimizations; Computer aided software engineering; Data mining; Gold; Laboratories; Medals; Memory management; Query processing; Robustness; Scalability; Sorting;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering Workshop, 2008. ICDEW 2008. IEEE 24th International Conference on
Conference_Location :
Cancun
Print_ISBN :
978-1-4244-2161-9
Electronic_ISBN :
978-1-4244-2162-6
Type :
conf
DOI :
10.1109/ICDEW.2008.4498379
Filename :
4498379
Link To Document :
بازگشت