Title :
An Experimental Study on Unsupervised Clustering-Based Feature Selection Methods
Author :
Covoes, T.F. ; Hruschka, Eduardo R.
Author_Institution :
Comput. Sci. Dept., Univ. of Sao Paulo (USP) at Sao Carlos, Sao Carlos, Brazil
fDate :
Nov. 30 2009-Dec. 2 2009
Abstract :
Feature selection is an essential task in data mining because it makes it possible not only to reduce computational times and storage requirements, but also to favor model improvement and better data understanding. In this work, we analyze three methods for unsupervised feature selection that are based on the clustering of features for redundancy removal. We report experimental results obtained in ten datasets that illustrate practical scenarios of particular interest, in which one method may be preferred over another. In order to provide some reassurance about the validity and non-randomness of the obtained results, we also present the results of statistical tests.
Keywords :
data mining; pattern clustering; statistical testing; data mining; data understanding; feature selection; redundancy removal; statistical test; storage requirement; unsupervised clustering; Application software; Clustering algorithms; Clustering methods; Computer science; Data mining; Filters; Intelligent systems; Supervised learning; Testing; Text mining; clustering problems; feature clustering; unsupervised feature selection;
Conference_Titel :
Intelligent Systems Design and Applications, 2009. ISDA '09. Ninth International Conference on
Conference_Location :
Pisa
Print_ISBN :
978-1-4244-4735-0
Electronic_ISBN :
978-0-7695-3872-3
DOI :
10.1109/ISDA.2009.79