• DocumentCode
    3189396
  • Title

    Mining Distance-Based Outliers from Categorical Data

  • Author

    Li, Shuxin ; Lee, Robert ; Lang, Sheau-Dong

  • fYear
    2007
  • fDate
    28-31 Oct. 2007
  • Firstpage
    225
  • Lastpage
    230
  • Abstract
    Distance-based outlier detection is an important data mining technique that finds abnormal data objects according to some distance function. However, when this technique is applied to high-dimensional categorical data, a traditional simple matching dissimilarity measure does not provide an adequate model. In this article, we employ a new common- neighbor-based distance function to measure the proximity between a pair of data points. Experiments show that better outlier mining results can be achieved when the new distance function is utilized rather than a conventional simple matching dissimilarity measure.
  • Keywords
    Algorithm design and analysis; Computational complexity; Computer science; Conferences; Data mining; Data security; Electronic commerce; Euclidean distance; Object detection; Risk management;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshops, 2007. ICDM Workshops 2007. Seventh IEEE International Conference on
  • Conference_Location
    Omaha, NE
  • Print_ISBN
    978-0-7695-3019-2
  • Electronic_ISBN
    978-0-7695-3033-8
  • Type

    conf

  • DOI
    10.1109/ICDMW.2007.75
  • Filename
    4476672