DocumentCode
3086141
Title
Effective Summarization of Multi-Dimensional Data Streams for Historical Stream Mining
Author
Nassar, Samer ; Sander, Joerg
Author_Institution
Alberta Univ., Edmonton
fYear
2007
fDate
9-11 July 2007
Firstpage
30
Lastpage
30
Abstract
We consider the following problem: given a very large data stream, a limited space to encode the stream, and a compression technique to compress the stream, retain the most important information from the distant past of the stream while at the same time retain high quality of the compressed information that is in the recent part of the stream to perform temporal analysis of the summarized information. Simple schemes for accumulating micro-clustering summaries of stream windows that have been previously proposed are very ineffective for solving this challenging task. We overcome the limitations of these schemes by first identifying spatial summaries that compress "similar\´ regions in the data space, and reduce their space consumption using novel approximate spatio-temporal summaries. Second, we present policies for effectively utilizing the space budget and managing these novel approximate spatio-temporal summaries.
Keywords
data analysis; data compression; data mining; compressed information; historical stream mining; multidimensional data streams; summarization; temporal analysis; Data mining; Extraterrestrial measurements; Financial management; High performance computing; History; Information analysis; Multidimensional systems; Pattern analysis; Performance analysis; Quality management;
fLanguage
English
Publisher
ieee
Conference_Titel
Scientific and Statistical Database Management, 2007. SSBDM '07. 19th International Conference on
Conference_Location
Banff, Alta.
ISSN
1551-6393
Print_ISBN
0-7695-2868-6
Electronic_ISBN
1551-6393
Type
conf
DOI
10.1109/SSDBM.2007.32
Filename
4274975
Link To Document