DocumentCode :
3579124
Title :
An enhanced feature selection method comprising rough set and clustering techniques
Author :
Murugan, A. ; Sridevi, T.
Author_Institution :
Department of Computer Science, Dr. Ambedkar Govt. Arts College, Chennai, Tamil Nadu, India
fYear :
2014
Firstpage :
1
Lastpage :
4
Abstract :
Feature selection or variable reduction is a fundamental problem in data mining, refers to the process of identifying the few most important features for application of a learning algorithm. The best subset contains the minimum number of dimensions retaining a suitably high accuracy on classifier in representing the original features. The objective of the proposed approach is to reduce the number of input features thus to identify the key features and eliminating irrelevant features with no predictive information using clustering technique, K-nearest neighbors (KNN) and rough set. This paper deals with two partition based clustering algorithm in data mining namely K-Means and Fuzzy C Means (FCM). These two algorithms are implemented for original data set without considering the class labels and further rough set theory implemented on the partitioned data set to generate feature subset after removing the outlier by using KNN. Wisconsin Breast Cancer datasets derived from UCI machine learning database are used for the purpose of testing the proposed hybrid method. The results show that the hybrid method is able to produce more accurate diagnosis and prognosis results than the full input model with respect to the classification accuracy.
Keywords :
Accuracy; Algorithm design and analysis; Breast cancer; Classification algorithms; Clustering algorithms; Data mining; Partitioning algorithms; classification; fcm; feature selection; k-means; machine learning; outlier detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Computing Research (ICCIC), 2014 IEEE International Conference on
Print_ISBN :
978-1-4799-3974-9
Type :
conf
DOI :
10.1109/ICCIC.2014.7238376
Filename :
7238376
Link To Document :
بازگشت