DocumentCode :
79679
Title :
Rough Sets, Kernel Set, and Spatiotemporal Outlier Detection
Author :
Albanese, Alessia ; Pal, Sankar K. ; Petrosino, Alfredo
Author_Institution :
Dept. of Appl. Sci., Univ. of Naples Parthenope, Naples, Italy
Volume :
26
Issue :
1
fYear :
2014
fDate :
Jan. 2014
Firstpage :
194
Lastpage :
207
Abstract :
Nowadays, the high availability of data gathered from wireless sensor networks and telecommunication systems has drawn the attention of researchers on the problem of extracting knowledge from spatiotemporal data. Detecting outliers which are grossly different from or inconsistent with the remaining spatiotemporal data set is a major challenge in real-world knowledge discovery and data mining applications. In this paper, we deal with the outlier detection problem in spatiotemporal data and describe a rough set approach that finds the top outliers in an unlabeled spatiotemporal data set. The proposed method, called Rough Outlier Set Extraction (ROSE), relies on a rough set theoretic representation of the outlier set using the rough set approximations, i.e., lower and upper approximations. We have also introduced a new set, named Kernel Set, that is a subset of the original data set, which is able to describe the original data set both in terms of data structure and of obtained results. Experimental results on real-world data sets demonstrate the superiority of ROSE, both in terms of some quantitative indices and outliers detected, over those obtained by various rough fuzzy clustering algorithms and by the state-of-the-art outlier detection methods. It is also demonstrated that the kernel set is able to detect the same outliers set but with less computational time.
Keywords :
approximation theory; data mining; fuzzy set theory; pattern clustering; rough set theory; statistical analysis; ROSE method; data availability; data mining; data structure; kernel set; knowledge discovery; knowledge extraction; lower approximation; quantitative index; rough fuzzy clustering algorithms; rough outlier set extraction; rough set approach; rough set approximation; spatiotemporal outlier detection; telecommunication systems; upper approximation; wireless sensor networks; Approximation methods; Data engineering; Data mining; Kernel; Knowledge engineering; Set theory; Uncertainty; Spatiotemporal data; outlier detection; rough set and granular computing; spatiotemporal uncertainty management;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2012.234
Filename :
6365186
Link To Document :
بازگشت