• DocumentCode
    553149
  • Title

    An outlier mining algorithm based on local weighted k-density

  • Author

    Aiqin Liu ; Jifu Zhang

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Taiyuan Univ. of Sci. & Technol., Taiyuan, China
  • Volume
    3
  • fYear
    2011
  • fDate
    26-28 July 2011
  • Firstpage
    1504
  • Lastpage
    1508
  • Abstract
    Most local outlier mining algorithms are inefficient and depend on many pre-set parameters when applied to high-dimensional dataset. In the paper, an outlier mining algorithm based on local weighted k-density is presented. Firstly, the attribute abnormal degree of each object in neighborhood for each attribute is computed by using local attribute entropy. Secondly, the corresponding attribute weight vector is set automatically according to attribute abnormal degree, so that man-made factor is reduced. Thirdly, the time-consuming and unnecessary step of re-calculating neighborhood is removed for simplifying computation, then local weighted outlier factor in the same neighborhood as getting attribute weight is calculated and outliers are detected based on local weighted k-density. In the end, the experimental results validate that the algorithm is feasible and efficient for high-dimensional outlier mining by utilizing star spectrum data.
  • Keywords
    data mining; entropy; high-dimensional dataset; local attribute entropy; local outlier mining algorithm; local weighted k-density; star spectrum data; Algorithm design and analysis; Approximation algorithms; Clustering algorithms; Complexity theory; Data mining; Educational institutions; Entropy; abnormal degree; attribute weight vector; local attribute entropy; local weighted k-density; outlier;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery (FSKD), 2011 Eighth International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-61284-180-9
  • Type

    conf

  • DOI
    10.1109/FSKD.2011.6019777
  • Filename
    6019777