DocumentCode :
226700
Title :
Investigating distance metric learning in semi-supervised fuzzy c-means clustering
Author :
Lai, Daphne Teck Ching ; Garibaldi, Jonathan M. ; Reps, Jenna
Author_Institution :
Sch. of Comput. Sci., Univ. of Nottingham, Nottingham, UK
fYear :
2014
fDate :
6-11 July 2014
Firstpage :
1817
Lastpage :
1824
Abstract :
The idea behind distance metric learning (DML) is to accentuate the distance relations found in the training data, maintaining whether the data patterns are similar or dissimilar. In this paper, we investigate in using DML (GDML, LMNN, MCML and NCA) in semi-supervised Fuzzy c-means clustering and apply them on a real, biomedical dataset and on UCI datasets. We used a cross validation setting with varying amount of labelled data to test our methodology. Out of eight datasets, statistical significant improvement was found on five datasets using ssFCM with DML. This shows that DML can improve ssFCM clustering for some datasets. Further analysis using 2D PCA projection and sum of squared distances before and after DML transformation of the original data are carried out. Interestingly, DML was found to worsen ssFCM clustering in the NTBC dataset with hierarchical clusters.
Keywords :
fuzzy set theory; learning (artificial intelligence); pattern classification; pattern clustering; statistical analysis; 2D PCA projection; DML; GDML; LMNN; MCML; NCA; NTBC dataset; UCI datasets; cross-validation; dissimilar data patterns; distance metric learning; distance relations; hierarchical clusters; labelled data; real biomedical dataset; semisupervised fuzzy c-means clustering; similar data patterns; ssFCM clustering; statistical analysis; sum-of-squared distances; training data; Accuracy; Cardiography; Clustering algorithms; Measurement; Principal component analysis; Training; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems (FUZZ-IEEE), 2014 IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4799-2073-0
Type :
conf
DOI :
10.1109/FUZZ-IEEE.2014.6891673
Filename :
6891673
Link To Document :
بازگشت