Title :
Discovering structure in labeled data
Author :
Graves, D. ; Pedrycz, W.
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Alberta, Edmonton, AB
Abstract :
A novel approach based on supervised hierarchical clustering is developed with the purpose of discovering structure in data where labels are provided. Labels can come in the form of discrete-valued class labels or continuous-valued output variables to aid hierarchical clustering in discovering the structure and the number of clusters, in particular. In the proposed method, Clusters are linked together if their discrete-valued labels are the same or in the case of continuous output variables if their outputs are similar. Similarity within a cluster in the continuous case is expressed by a measure of internal cluster dispersion. Several experiments on synthetic data with discrete-valued class labels are conducted to demonstrate the algorithm´s ability to discover class or data structure.
Keywords :
data mining; learning (artificial intelligence); pattern clustering; continuous-valued output variables; data discovery; discrete-valued class labels; labeled data; supervised hierarchical clustering; Clustering algorithms; Couplings; Data engineering; Data structures; Dispersion; Euclidean distance; Frequency; Joining processes; Neural networks; Upper bound; Hierarchical clustering; discrete and continuous labels; number of clusters; supervised clustering;
Conference_Titel :
Fuzzy Information Processing Society, 2008. NAFIPS 2008. Annual Meeting of the North American
Conference_Location :
New York City, NY
Print_ISBN :
978-1-4244-2351-4
Electronic_ISBN :
978-1-4244-2352-1
DOI :
10.1109/NAFIPS.2008.4531218