Title :
Distance-based k-nearest neighbors outlier detection method in large-scale traffic data
Author :
Dang, Taurus T. ; Ngan, Henry Y.T. ; Liu, Wei
Author_Institution :
Department of Mathematics, Hong Kong Baptist University, Kowloon Tong, Hong Kong
Abstract :
This paper presents a k-nearest neighbors (kNN) method to detect outliers in large-scale traffic data collected daily in every modern city. Outliers include hardware and data errors as well as abnormal traffic behaviors. The proposed kNN method detects outliers by exploiting the relationship among neighborhoods in data points. The farther a data point is beyond its neighbors, the more possible the data is an outlier. Traffic data here was recorded in a video format, and converted to spatial-temporal (ST) traffic signals by statistics. The ST signals are then transformed to a two-dimensional (2D) (x, y) -coordinate plane by Principal Component Analysis (PCA) for dimension reduction. The distance-based kNN method is evaluated by unsupervised and semi-supervised approaches. The semi-supervised approach reaches 96.19% accuracy.
Keywords :
Accuracy; Fabrics; Histograms; Measurement; Principal component analysis; Training; Outlier detection; distance-based; kNN; large-scale; traffic data;
Conference_Titel :
Digital Signal Processing (DSP), 2015 IEEE International Conference on
Conference_Location :
Singapore, Singapore
DOI :
10.1109/ICDSP.2015.7251924