DocumentCode :
2185080
Title :
Distance-based k-nearest neighbors outlier detection method in large-scale traffic data
Author :
Dang, Taurus T. ; Ngan, Henry Y.T. ; Liu, Wei
Author_Institution :
Department of Mathematics, Hong Kong Baptist University, Kowloon Tong, Hong Kong
fYear :
2015
fDate :
21-24 July 2015
Firstpage :
507
Lastpage :
510
Abstract :
This paper presents a k-nearest neighbors (kNN) method to detect outliers in large-scale traffic data collected daily in every modern city. Outliers include hardware and data errors as well as abnormal traffic behaviors. The proposed kNN method detects outliers by exploiting the relationship among neighborhoods in data points. The farther a data point is beyond its neighbors, the more possible the data is an outlier. Traffic data here was recorded in a video format, and converted to spatial-temporal (ST) traffic signals by statistics. The ST signals are then transformed to a two-dimensional (2D) (x, y) -coordinate plane by Principal Component Analysis (PCA) for dimension reduction. The distance-based kNN method is evaluated by unsupervised and semi-supervised approaches. The semi-supervised approach reaches 96.19% accuracy.
Keywords :
Accuracy; Fabrics; Histograms; Measurement; Principal component analysis; Training; Outlier detection; distance-based; kNN; large-scale; traffic data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Digital Signal Processing (DSP), 2015 IEEE International Conference on
Conference_Location :
Singapore, Singapore
Type :
conf
DOI :
10.1109/ICDSP.2015.7251924
Filename :
7251924
Link To Document :
بازگشت