• DocumentCode
    547823
  • Title

    An approach to learn categorical distance based on attributes correlation

  • Author

    Khorshidpour, Z. ; Hashemi, S. ; Hamzeh, A.

  • fYear
    2011
  • fDate
    17-19 May 2011
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Measuring similarity or distance plays a key role for data mining and knowledge discovery tasks. A lot of work has been performed on continuous attributes, but for nominal attributes the similarity computation is not relatively well-understood. In this paper, we propose a novel approach to learn a family of dissimilarity measures for categorical data. Based on these measures distance between two different values of an attribute can be determined by using the certain number of attributes rather than all attributes at once. We evaluate our methods in unsupervised environment, Experiments with real data show that our dissimilarity estimation method improves the accuracy of K-Modes clustering algorithm.
  • Keywords
    data mining; pattern clustering; attributes correlation; categorical distance learning; data mining; dissimilarity estimation method; k-modes clustering algorithm; knowledge discovery; Accuracy; Clustering algorithms; Computer aided instruction; Context; Data mining; Feature extraction; Probability distribution; Categorical data; Conditional probability distribution; Distance function learning; Kullback Leibler divergence;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical Engineering (ICEE), 2011 19th Iranian Conference on
  • Conference_Location
    Tehran
  • Print_ISBN
    978-1-4577-0730-8
  • Electronic_ISBN
    978-964-463-428-4
  • Type

    conf

  • Filename
    5955712