• DocumentCode
    1602140
  • Title

    Discovering structure in labeled data

  • Author

    Graves, D. ; Pedrycz, W.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of Alberta, Edmonton, AB
  • fYear
    2008
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    A novel approach based on supervised hierarchical clustering is developed with the purpose of discovering structure in data where labels are provided. Labels can come in the form of discrete-valued class labels or continuous-valued output variables to aid hierarchical clustering in discovering the structure and the number of clusters, in particular. In the proposed method, Clusters are linked together if their discrete-valued labels are the same or in the case of continuous output variables if their outputs are similar. Similarity within a cluster in the continuous case is expressed by a measure of internal cluster dispersion. Several experiments on synthetic data with discrete-valued class labels are conducted to demonstrate the algorithm´s ability to discover class or data structure.
  • Keywords
    data mining; learning (artificial intelligence); pattern clustering; continuous-valued output variables; data discovery; discrete-valued class labels; labeled data; supervised hierarchical clustering; Clustering algorithms; Couplings; Data engineering; Data structures; Dispersion; Euclidean distance; Frequency; Joining processes; Neural networks; Upper bound; Hierarchical clustering; discrete and continuous labels; number of clusters; supervised clustering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Information Processing Society, 2008. NAFIPS 2008. Annual Meeting of the North American
  • Conference_Location
    New York City, NY
  • Print_ISBN
    978-1-4244-2351-4
  • Electronic_ISBN
    978-1-4244-2352-1
  • Type

    conf

  • DOI
    10.1109/NAFIPS.2008.4531218
  • Filename
    4531218