DocumentCode
2725114
Title
Incremental Local Outlier Detection for Data Streams
Author
Pokrajac, Dragoljub ; Lazarevic, Aleksandar ; Latecki, Longin Jan
Author_Institution
Dept. of Comput. & Inf. Sci., Delaware State Univ., Dover, DE
fYear
2007
fDate
March 1 2007-April 5 2007
Firstpage
504
Lastpage
515
Abstract
Outlier detection has recently become an important problem in many industrial and financial applications. This problem is further complicated by the fact that in many cases, outliers have to be detected from data streams that arrive at an enormous pace. In this paper, an incremental LOF (local outlier factor) algorithm, appropriate for detecting outliers in data streams, is proposed. The proposed incremental LOF algorithm provides equivalent detection performance as the iterated static LOF algorithm (applied after insertion of each data record), while requiring significantly less computational time. In addition, the incremental LOF algorithm also dynamically updates the profiles of data points. This is a very important property, since data profiles may change over time. The paper provides theoretical evidence that insertion of a new data point as well as deletion of an old data point influence only limited number of their closest neighbors and thus the number of updates per such insertion/deletion does not depend on the total number of points TV in the data set. Our experiments performed on several simulated and real life data sets have demonstrated that the proposed incremental LOF algorithm is computationally efficient, while at the same time very successful in detecting outliers and changes of distributional behavior in various data stream applications
Keywords
data handling; data streams; incremental local outlier detection; local outlier factor algorithm; Change detection algorithms; Computational Intelligence Society; Computer networks; Data mining; Event detection; Intrusion detection; Telecommunication traffic; Traffic control; Unsupervised learning; Video surveillance;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence and Data Mining, 2007. CIDM 2007. IEEE Symposium on
Conference_Location
Honolulu, HI
Print_ISBN
1-4244-0705-2
Type
conf
DOI
10.1109/CIDM.2007.368917
Filename
4221341
Link To Document