DocumentCode
2185080
Title
Distance-based k-nearest neighbors outlier detection method in large-scale traffic data
Author
Dang, Taurus T. ; Ngan, Henry Y.T. ; Liu, Wei
Author_Institution
Department of Mathematics, Hong Kong Baptist University, Kowloon Tong, Hong Kong
fYear
2015
fDate
21-24 July 2015
Firstpage
507
Lastpage
510
Abstract
This paper presents a k-nearest neighbors (kNN) method to detect outliers in large-scale traffic data collected daily in every modern city. Outliers include hardware and data errors as well as abnormal traffic behaviors. The proposed kNN method detects outliers by exploiting the relationship among neighborhoods in data points. The farther a data point is beyond its neighbors, the more possible the data is an outlier. Traffic data here was recorded in a video format, and converted to spatial-temporal (ST) traffic signals by statistics. The ST signals are then transformed to a two-dimensional (2D) (x, y) -coordinate plane by Principal Component Analysis (PCA) for dimension reduction. The distance-based kNN method is evaluated by unsupervised and semi-supervised approaches. The semi-supervised approach reaches 96.19% accuracy.
Keywords
Accuracy; Fabrics; Histograms; Measurement; Principal component analysis; Training; Outlier detection; distance-based; kNN; large-scale; traffic data;
fLanguage
English
Publisher
ieee
Conference_Titel
Digital Signal Processing (DSP), 2015 IEEE International Conference on
Conference_Location
Singapore, Singapore
Type
conf
DOI
10.1109/ICDSP.2015.7251924
Filename
7251924
Link To Document