• DocumentCode
    3086141
  • Title

    Effective Summarization of Multi-Dimensional Data Streams for Historical Stream Mining

  • Author

    Nassar, Samer ; Sander, Joerg

  • Author_Institution
    Alberta Univ., Edmonton
  • fYear
    2007
  • fDate
    9-11 July 2007
  • Firstpage
    30
  • Lastpage
    30
  • Abstract
    We consider the following problem: given a very large data stream, a limited space to encode the stream, and a compression technique to compress the stream, retain the most important information from the distant past of the stream while at the same time retain high quality of the compressed information that is in the recent part of the stream to perform temporal analysis of the summarized information. Simple schemes for accumulating micro-clustering summaries of stream windows that have been previously proposed are very ineffective for solving this challenging task. We overcome the limitations of these schemes by first identifying spatial summaries that compress "similar\´ regions in the data space, and reduce their space consumption using novel approximate spatio-temporal summaries. Second, we present policies for effectively utilizing the space budget and managing these novel approximate spatio-temporal summaries.
  • Keywords
    data analysis; data compression; data mining; compressed information; historical stream mining; multidimensional data streams; summarization; temporal analysis; Data mining; Extraterrestrial measurements; Financial management; High performance computing; History; Information analysis; Multidimensional systems; Pattern analysis; Performance analysis; Quality management;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Scientific and Statistical Database Management, 2007. SSBDM '07. 19th International Conference on
  • Conference_Location
    Banff, Alta.
  • ISSN
    1551-6393
  • Print_ISBN
    0-7695-2868-6
  • Electronic_ISBN
    1551-6393
  • Type

    conf

  • DOI
    10.1109/SSDBM.2007.32
  • Filename
    4274975