Title of article :
A framework for cost-based feature selection
Author/Authors :
Bolón-Canedo, V. and Porto-Díaz, I. and Sánchez-Maroño, N. and Alonso-Betanzos, A.
Issue Information :
Periodical with serial number, year 2014
Abstract :
Over the last few years, the dimensionality of datasets involved in data mining applications has increased dramatically. In this situation, feature selection becomes indispensable as it allows for dimensionality reduction and relevance detection. The research proposed in this paper broadens the scope of feature selection by taking into consideration not only the relevance of the features but also their associated costs. A new general framework is proposed, which consists of adding a new term to the evaluation function of a filter feature selection method so that the cost is taken into account. Although the proposed methodology could be applied to any feature selection filter, in this paper the approach is applied to two representative filter methods: Correlation-based Feature Selection (CFS) and Minimal-Redundancy-Maximal-Relevance (mRMR), as an example of use. The behavior of the proposed framework is tested on 17 heterogeneous classification datasets, employing a Support Vector Machine (SVM) as a classifier. The results of the experimental study show that the approach is sound and that it allows the user to reduce the cost without compromising the classification error.
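The abstract describes the framework only in words: a cost term is added to the evaluation function of a filter method. A minimal sketch of that idea follows, assuming a simple additive penalty of the form score = relevance - lam * cost. The function names, the weight `lam`, and the toy numbers are illustrative assumptions and are not taken from the paper, whose exact CFS and mRMR formulations are not given in this record.

```python
def cost_penalized_score(relevance, cost, lam):
    """Filter score with an added cost term (assumed form: relevance - lam * cost)."""
    return relevance - lam * cost


def rank_features(relevances, costs, lam):
    """Return feature indices sorted from best to worst penalized score."""
    scores = [cost_penalized_score(r, c, lam) for r, c in zip(relevances, costs)]
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)


# Toy data (hypothetical): feature 0 is the most relevant but also the most expensive.
relevances = [0.9, 0.8, 0.3]
costs = [1.0, 0.1, 0.1]

ranking_no_cost = rank_features(relevances, costs, lam=0.0)    # cost ignored
ranking_with_cost = rank_features(relevances, costs, lam=0.5)  # cost penalized
print(ranking_no_cost, ranking_with_cost)
```

With `lam=0.0` the ranking depends on relevance alone; raising `lam` moves the cheap, nearly as relevant feature 1 ahead of the expensive feature 0, which is the kind of trade-off between cost and classification error that the abstract refers to.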
Keywords :
Cost-based feature selection , Machine Learning , Filter methods
Journal title :
PATTERN RECOGNITION