• DocumentCode
    3325637
  • Title

    SPOT: A System for Detecting Projected Outliers From High-dimensional Data Streams

  • Author

    Zhang, Ji ; Gao, Qigang ; Wang, Hai

  • Author_Institution
    Fac. of Comput. Sci., Dalhousie Univ., Halifax, NS
  • fYear
    2008
  • fDate
    7-12 April 2008
  • Firstpage
    1628
  • Lastpage
    1631
  • Abstract
    In this paper, we present a new technique, called stream projected ouliter detector (SPOT), to deal with outlier detection problem in high-dimensional data streams. SPOT is unique in a number of aspects. First, SPOT employs a novel window-based time model and decaying cell summaries to capture statistics from the data stream. Second, sparse subspace template (SST), a set of top sparse subspaces obtained by unsupervised and/or supervised learning processes, is constructed in SPOT to detect projected outliers effectively. Multi-Objective genetic algorithm (MOGA) is employed as an effective search method in unsupervised learning for finding outlying subspaces from training data. Finally, SST is able to carry out online self- evolution to cope with dynamics of data streams. This paper provides details on the motivation and technical challenges of detecting outliers from high-dimensional data streams, present an overview of SPOT, and give the plans for system demonstration of SPOT.
  • Keywords
    data handling; data mining; genetic algorithms; learning (artificial intelligence); high-dimensional data stream; multiobjective genetic algorithm; sparse subspace template; stream projected ouliter detector; unsupervised learning; window-based time model; Computer science; Databases; Detection algorithms; Explosions; Extraterrestrial measurements; Lattices; Monitoring; Object detection; Sensor phenomena and characterization; Telecommunication traffic;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on
  • Conference_Location
    Cancun
  • Print_ISBN
    978-1-4244-1836-7
  • Electronic_ISBN
    978-1-4244-1837-4
  • Type

    conf

  • DOI
    10.1109/ICDE.2008.4497638
  • Filename
    4497638