• DocumentCode
    1797642
  • Title

    AORS: Affinity-based outlier ranking score

  • Author

    Shaohong Zhang ; Hau-San Wong ; Wen-Jun Shen ; Dongqing Xie

  • Author_Institution
    Dept. of Comput. Sci., Guangzhou Univ., Guangzhou, China
  • fYear
    2014
  • fDate
    6-11 July 2014
  • Firstpage
    1020
  • Lastpage
    1027
  • Abstract
    Outlier ranking methods can provide a quantitative measure to evaluate the outlierness of data instances in data clustering and attract great interest in pattern recognition and data mining communities. However, it has been pointed out that the diverse scaling ranges of these scores bring difficulty to result interpretation. Moreover, popular outlier ranking scores based on simple distance measures might not accurately reflect the complex affinity among data points. In this paper, we propose a new outlier ranking method based on consensus affinity of a cluster ensemble. Two new outlier ranking scores generalized from well-known clustering evaluation measures, Rvv from the RAND measure and ARIvv from Adjusted Rand Index (ARI), are adopted for outlierness evaluation. Compared to other outlierness ranking measures, the two new measures have the desired bounds without additional transformations. Consistent with the improvement of Adjusted Rand Index (ARI) over RAND, we find that ARIvv also significantly outperforms Rvv. Benefiting from the consensus affinity of a cluster ensemble, our proposed method with the ARIvv score provides significant improvement beyond a number of competing algorithms on public UCI benchmark data sets. Studies with both theoretical analysis and experimental validation show the effectiveness of our proposed methods.
  • Keywords
    learning (artificial intelligence); pattern clustering; statistical analysis; AORS method; ARI; RAND measure; adjusted Rand index; affinity-based outlier ranking score; cluster ensemble; clustering evaluation measures; data clustering; data instance; data mining; data points; distance measure; outlier ranking methods; outlierness evaluation; pattern recognition; public UCI benchmark data sets; quantitative measure; Clustering algorithms; Correlation; Educational institutions; Indexes; Partitioning algorithms; Power measurement; Uncertainty;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), 2014 International Joint Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4799-6627-1
  • Type

    conf

  • DOI
    10.1109/IJCNN.2014.6889551
  • Filename
    6889551