Title of article :
A selective sampling approach to active feature selection
Author/Authors :
Liu، Huan نويسنده , , Motoda، Hiroshi نويسنده , , Yu، Lei نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2004
Abstract :
Feature selection, as a preprocessing step to machine learning, has been very effective in reducing dimensionality, removing irrelevant data, increasing learning accuracy, and improving result comprehensibility. Traditional feature selection methods resort to random sampling in dealing with data sets with a huge number of instances. In this paper, we introduce the concept of active feature selection, and investigate a selective sampling approach to active feature selection in a filter model setting. We present a formalism of selective sampling based on data variance, and apply it to a widely used feature selection algorithm Relief. Further, we show how it realizes active feature selection and reduces the required number of training instances to achieve time savings without performance deterioration. We design objective evaluation measures of performance, conduct extensive experiments using both synthetic and benchmark data sets, and observe consistent and significant improvement. We suggest some further work based on our study and experiments.
Keywords :
Feature selection and ranking , Dimensionality reduction , learning , sampling
Journal title :
ARTIFICIAL INTELLIGENCE (NON MEMBERS) (AI)
Journal title :
ARTIFICIAL INTELLIGENCE (NON MEMBERS) (AI)