DocumentCode
453841
Title
Feature Selection for Cluster Analysis: an Approach Based on the Simplified Silhouette Criterion
Author
Hruschka, Eduardo R. ; Covões, Thiago F.
Author_Institution
Catholic Univ. of Santos
Volume
1
fYear
2005
fDate
28-30 Nov. 2005
Firstpage
32
Lastpage
38
Abstract
This paper explores the problem of selecting relevant features for clustering, assuming that the number of clusters is not known a priori. The number of clusters and the subset of relevant features are usually inter-related. From this standpoint, we propose an exploratory data analysis method that considers the relationships between these two aspects. Empirical results in a number of synthetic and bioinformatics datasets show that the proposed approach can allow both reducing the number of features and providing good estimations of the number of clusters
Keywords
data analysis; feature extraction; pattern clustering; bioinformatics datasets; cluster analysis; exploratory data analysis method; feature selection; simplified silhouette criterion; Bioinformatics; Clustering algorithms; Clustering methods; Data analysis; Data visualization; Gene expression; Information filtering; Information filters; Supervised learning; Text mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence for Modelling, Control and Automation, 2005 and International Conference on Intelligent Agents, Web Technologies and Internet Commerce, International Conference on
Conference_Location
Vienna
Print_ISBN
0-7695-2504-0
Type
conf
DOI
10.1109/CIMCA.2005.1631238
Filename
1631238
Link To Document