DocumentCode :
3123645
Title :
Scheduling Updates in a Real-Time Stream Warehouse
Author :
Golab, Lukasz ; Johnson, Theodore ; Shkapenyuk, Vladislav
Author_Institution :
AT&TLabs - Res., Florham Park, NJ
fYear :
2009
fDate :
March 29 2009-April 2 2009
Firstpage :
1207
Lastpage :
1210
Abstract :
This paper discusses updating a data warehouse that collects near-real-time data streams from a variety of external sources. The objective is to keep all the tables and materialized views up-to-date as new data arrive over time. We define the notion of data staleness, formalize the problem of scheduling updates in a way that minimizes average data staleness, and present scheduling algorithms designed to handle the complex environment of a real-time stream warehouse. A novel feature of our scheduling framework is that it considers the effect of an update on the staleness of the underlying tables rather than any property of the update job itself (such as deadline).
Keywords :
data handling; data warehouses; real-time systems; scheduling; data staleness; data warehouse; near-real-time data stream; scheduling algorithm; Algorithm design and analysis; Credit cards; Current measurement; Data engineering; Data warehouses; IP networks; Monitoring; Scheduling algorithm; Time measurement; USA Councils;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2009. ICDE '09. IEEE 25th International Conference on
Conference_Location :
Shanghai
ISSN :
1084-4627
Print_ISBN :
978-1-4244-3422-0
Electronic_ISBN :
1084-4627
Type :
conf
DOI :
10.1109/ICDE.2009.202
Filename :
4812502
Link To Document :
بازگشت