DocumentCode
2835977
Title
An Experimental Study on Unsupervised Clustering-Based Feature Selection Methods
Author
Covoes, T.F. ; Hruschka, Eduardo R.
Author_Institution
Comput. Sci. Dept., Univ. of Sao Paulo (USP) at Sao Carlos, Sao Carlos, Brazil
fYear
2009
fDate
Nov. 30 2009-Dec. 2 2009
Firstpage
993
Lastpage
1000
Abstract
Feature selection is an essential task in data mining because it makes it possible not only to reduce computational times and storage requirements, but also to favor model improvement and better data understanding. In this work, we analyze three methods for unsupervised feature selection that are based on the clustering of features for redundancy removal. We report experimental results obtained in ten datasets that illustrate practical scenarios of particular interest, in which one method may be preferred over another. In order to provide some reassurance about the validity and non-randomness of the obtained results, we also present the results of statistical tests.
Keywords
data mining; pattern clustering; statistical testing; data mining; data understanding; feature selection; redundancy removal; statistical test; storage requirement; unsupervised clustering; Application software; Clustering algorithms; Clustering methods; Computer science; Data mining; Filters; Intelligent systems; Supervised learning; Testing; Text mining; clustering problems; feature clustering; unsupervised feature selection;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Systems Design and Applications, 2009. ISDA '09. Ninth International Conference on
Conference_Location
Pisa
Print_ISBN
978-1-4244-4735-0
Electronic_ISBN
978-0-7695-3872-3
Type
conf
DOI
10.1109/ISDA.2009.79
Filename
5364429
Link To Document