مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

2405176

Title :

Data reduction by partial preaggregation

Author :

Larson, Per-Åke

fYear :

2002

fDate :

2002

Firstpage :

706

Lastpage :

715

Abstract :

Partial preaggregation is a simple data reduction operator that can be applied to aggregation queries. Whenever we group and aggregate on a column set G, we can preaggregate on any column set that functionally determines G. Preaggregation can be used, for example, to reduce the input size to a join. Regular aggregation reduces the input to one record per group. Partial preaggregation exploits the fact that preaggregation need not be complete-if multiple records happen to be output for a group, they will be combined into the same group by the final aggregation. This paper describes a straightforward hash-based algorithm for partial preaggregation, discusses where it can be applied, and derives a mathematical model for estimating the output size. The effectiveness of the technique and the accuracy of the model are shown on both artificial and real data. It is also shown how to reduce memory requirements by combining partial preaggregation with the input phase of a subsequent join or sort operator. Partial preaggregation has been implemented, in part, in Microsoft SQL Server

Keywords :

data reduction; file organisation; query processing; Microsoft SQL Server; aggregation queries; column set; data reduction operator; hash-based algorithm; input size; join operator; mathematical model; memory requirements; multiple records; partial preaggregation; sort operator; Aggregates; Costs; Data engineering; Databases; Marketing and sales; Mathematical model; Radio access networks; Sorting;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Data Engineering, 2002. Proceedings. 18th International Conference on

Conference_Location :

San Jose, CA

ISSN :

1063-6382

Print_ISBN :

0-7695-1531-2

Type :

conf

DOI :

10.1109/ICDE.2002.994787

Filename :

994787

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2405176