Title :
SPOT: A System for Detecting Projected Outliers From High-dimensional Data Streams
Author :
Zhang, Ji ; Gao, Qigang ; Wang, Hai
Author_Institution :
Fac. of Comput. Sci., Dalhousie Univ., Halifax, NS
Abstract :
In this paper, we present a new technique, called stream projected ouliter detector (SPOT), to deal with outlier detection problem in high-dimensional data streams. SPOT is unique in a number of aspects. First, SPOT employs a novel window-based time model and decaying cell summaries to capture statistics from the data stream. Second, sparse subspace template (SST), a set of top sparse subspaces obtained by unsupervised and/or supervised learning processes, is constructed in SPOT to detect projected outliers effectively. Multi-Objective genetic algorithm (MOGA) is employed as an effective search method in unsupervised learning for finding outlying subspaces from training data. Finally, SST is able to carry out online self- evolution to cope with dynamics of data streams. This paper provides details on the motivation and technical challenges of detecting outliers from high-dimensional data streams, present an overview of SPOT, and give the plans for system demonstration of SPOT.
Keywords :
data handling; data mining; genetic algorithms; learning (artificial intelligence); high-dimensional data stream; multiobjective genetic algorithm; sparse subspace template; stream projected ouliter detector; unsupervised learning; window-based time model; Computer science; Databases; Detection algorithms; Explosions; Extraterrestrial measurements; Lattices; Monitoring; Object detection; Sensor phenomena and characterization; Telecommunication traffic;
Conference_Titel :
Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on
Conference_Location :
Cancun
Print_ISBN :
978-1-4244-1836-7
Electronic_ISBN :
978-1-4244-1837-4
DOI :
10.1109/ICDE.2008.4497638