DocumentCode
2767574
Title
Prototype based outlier detection
Author
Kim, Seungtaek ; Cho, Sungzoon
Author_Institution
Seoul Nat. Univ., Seoul
fYear
0
fDate
0-0 0
Firstpage
820
Lastpage
826
Abstract
Outliers refer to "minority" data that are different from most other data. They usually disturb data mining process. But, sometimes they provide valuable information. Thus, it is important to identify outliers in a given data set. In this paper, we propose a novel approach which scores "outlierness" based on the distance from majority data. First, prototype data are identified. Second, those prototypes that are distant from others are eliminated. Finally, the outlierness of each data point is computed as the distance from the remaining prototypes. Experiments involving various data sets show that the proposed approach performs well in terms of accuracy, robustness and versatility.
Keywords
data mining; prototypes; data mining process; minority data; outlier detection; prototype; Cellular phones; Credit cards; Data mining; Equations; Intrusion detection; Parametric statistics; Probability; Prototypes; Robustness; Statistical distributions;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2006. IJCNN '06. International Joint Conference on
Conference_Location
Vancouver, BC
Print_ISBN
0-7803-9490-9
Type
conf
DOI
10.1109/IJCNN.2006.246769
Filename
1716180
Link To Document