Title :
I/O-Efficient Bundled Range Aggregation
Author :
Yufei Tao ; Cheng Sheng
Author_Institution :
Chinese Univ. of Hong Kong, Hong Kong, China
Abstract :
This paper studies bundled range aggregation, which is conceptually equivalent to running a range aggregate query separately on multiple datasets, returning the query result on each dataset. In particular, the queried datasets can be arbitrarily chosen from a large number (hundreds or even thousands) of candidate datasets. The challenge is to minimize the query cost no matter how many and which datasets are selected. We propose a fully-dynamic data structure called aggregate bundled B-tree (aBB-tree) to settle bundled range aggregation. Specifically, the aBB-tree requires linear space, answers any query in O(logB N) I/Os, and can be updated in O(logB N) I/Os (where N is the total size of all the candidate datasets, and B the disk page size), under the circumstances where the number of datasets is O(B). The practical efficiency of our technique is demonstrated with extensive experiments.
Keywords :
computational complexity; query processing; tree data structures; I/O-efficient bundled range aggregation; O(logB N) I/Os; aBB-tree; aggregate bundled B-tree; candidate datasets; fully-dynamic data structure; linear space; multiple datasets; query answering; query cost minimization; range aggregate query; Aggregates; Cities and towns; Computational modeling; Image color analysis; Indexes; Radiation detectors; Aggregation; Indexing methods; Query processing; index; range search;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
DOI :
10.1109/TKDE.2013.152