• DocumentCode
    2767574
  • Title

    Prototype based outlier detection

  • Author

    Kim, Seungtaek ; Cho, Sungzoon

  • Author_Institution
    Seoul Nat. Univ., Seoul
  • fYear
    0
  • fDate
    0-0 0
  • Firstpage
    820
  • Lastpage
    826
  • Abstract
    Outliers refer to "minority" data that are different from most other data. They usually disturb data mining process. But, sometimes they provide valuable information. Thus, it is important to identify outliers in a given data set. In this paper, we propose a novel approach which scores "outlierness" based on the distance from majority data. First, prototype data are identified. Second, those prototypes that are distant from others are eliminated. Finally, the outlierness of each data point is computed as the distance from the remaining prototypes. Experiments involving various data sets show that the proposed approach performs well in terms of accuracy, robustness and versatility.
  • Keywords
    data mining; prototypes; data mining process; minority data; outlier detection; prototype; Cellular phones; Credit cards; Data mining; Equations; Intrusion detection; Parametric statistics; Probability; Prototypes; Robustness; Statistical distributions;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2006. IJCNN '06. International Joint Conference on
  • Conference_Location
    Vancouver, BC
  • Print_ISBN
    0-7803-9490-9
  • Type

    conf

  • DOI
    10.1109/IJCNN.2006.246769
  • Filename
    1716180