DocumentCode :
2081163
Title :
Optimal load shedding with aggregates and mining queries
Author :
Mozafari, Barzan ; Zaniolo, Carlo
Author_Institution :
Comput. Sci. Dept., Univ. of California at Los Angeles, Los Angeles, CA, USA
fYear :
2010
fDate :
1-6 March 2010
Firstpage :
76
Lastpage :
88
Abstract :
To cope with bursty arrivals of high-volume data, a DSMS has to shed load while minimizing the degradation of Quality of Service (QoS). In this paper, we show that this problem can be formalized as a classical optimization task from operations research, in ways that accommodate different requirements for multiple users, different query sensitivities to load shedding, and different penalty functions. Standard nonlinear programming algorithms are adequate for non-critical situations, but for severe overloads, we propose a more efficient algorithm that runs in linear time, without compromising optimality. Our approach is applicable to a large class of queries including traditional SQL aggregates, statistical aggregates (e.g., quantiles), and data mining functions, such as k-means, naive Bayesian classifiers, decision trees, and frequent pattern discovery (where we can even specify a different error bound for each pattern). In fact, we show that these aggregate queries are special instances of a broader class of functions, that we call reciprocal-error aggregates, for which the proposed methods apply with full generality. Finally, we propose a novel architecture for supporting load shedding in an extensible system, where users can write arbitrary User Defined Aggregates (UDA), and thus confirm our analytical findings with several experiments executed on an actual DSMS.
Keywords :
Bayes methods; data mining; load shedding; nonlinear programming; query processing; DSMS; SQL aggregates; aggregate queries; data mining functions; decision trees; extensible system; frequent pattern discovery; high volume data arrivals; k-means; load shedding; mining queries; naive Bayesian classifiers; nonlinear programming algorithms; optimization; penalty functions; quality of service degradation; reciprocal error aggregates; statistical aggregates; user defined aggregates; Aggregates; Computer science; Dairy products; Data mining; Degradation; Intrusion detection; Linear programming; Monitoring; Operations research; Quality of service;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering (ICDE), 2010 IEEE 26th International Conference on
Conference_Location :
Long Beach, CA
Print_ISBN :
978-1-4244-5445-7
Electronic_ISBN :
978-1-4244-5444-0
Type :
conf
DOI :
10.1109/ICDE.2010.5447867
Filename :
5447867
Link To Document :
بازگشت