Title of article :
A selective sampling approach to active feature selection Original Research Article
Author/Authors :
Huan Liu, Hiroshi Motoda, Lei Yu
Issue Information :
Journal with serial issue number, year 2004
Abstract :
Feature selection, as a preprocessing step to machine learning, has been very effective in reducing dimensionality, removing irrelevant data, increasing learning accuracy, and improving result comprehensibility. Traditional feature selection methods resort to random sampling in dealing with data sets with a huge number of instances. In this paper, we introduce the concept of active feature selection, and investigate a selective sampling approach to active feature selection in a filter model setting. We present a formalism of selective sampling based on data variance, and apply it to a widely used feature selection algorithm, Relief. Further, we show how it realizes active feature selection and reduces the required number of training instances to achieve time savings without performance deterioration. We design objective evaluation measures of performance, conduct extensive experiments using both synthetic and benchmark data sets, and observe consistent and significant improvement. We suggest some further work based on our study and experiments.
Keywords :
Dimensionality reduction, Feature selection and ranking, Sampling, Learning
Journal title :
Artificial Intelligence