• DocumentCode
    1175972
  • Title

    Interpretable hierarchical clustering by constructing an unsupervised decision tree

  • Author

    Basak, Jayanta ; Krishnapuram, Raghu

  • Author_Institution
    IBM India Res. Lab., Indian Inst. of Technol., New Delhi, India
  • Volume
    17
  • Issue
    1
  • fYear
    2005
  • Firstpage
    121
  • Lastpage
    132
  • Abstract
    We propose a method for hierarchical clustering based on the decision tree approach. As in the case of supervised decision tree, the unsupervised decision tree is interpretable in terms of rules, i.e., each leaf node represents a cluster, and the path from the root node to a leaf node represents a rule. The branching decision at each node of the tree is made based on the clustering tendency of the data available at the node. We present four different measures for selecting the most appropriate attribute to be used for splitting the data at every branching node (or decision node), and two different algorithms for splitting the data at each decision node. We provide a theoretical basis for the approach and demonstrate the capability of the unsupervised decision tree for segmenting various data sets. We also compare the performance of the unsupervised decision tree with that of the supervised one.
  • Keywords
    data mining; decision trees; pattern classification; pattern clustering; unsupervised learning; very large databases; data mining; data set segmentation; data splitting algorithm; interpretable hierarchical clustering; unsupervised decision tree; Classification tree analysis; Clustering algorithms; Clustering methods; Decision trees; Entropy; Minimization methods; Speech; Text categorization; Text mining; 65; Index Terms- Unsupervised decision tree; data set segmentation.; entropy;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2005.11
  • Filename
    1363769