Title of article

Evaluation of k-Nearest Neighbor classifier performance for direct marketing

Author/Authors

Govindarajan, M. and Chandrasekaran, RM.

Issue Information

Journal with serial number, year 2010

Pages

6

From page

253

To page

258

Abstract

Text data mining is a process of exploratory data analysis. Classification maps data into predefined groups or classes. It is often referred to as supervised learning because the classes are determined before the data are examined. This paper describes a proposed k-Nearest Neighbor classifier that performs comparative cross-validation for the existing k-Nearest Neighbor classifier. The feasibility and benefits of the proposed approach are demonstrated on a data mining problem: direct marketing, which has become an important application field of data mining. Comparative cross-validation estimates accuracy by either stratified k-fold cross-validation or equivalent repeated random subsampling. While the proposed method may have a high bias, its performance (accuracy estimation in our case) may be poor due to a high variance. Thus the accuracy with the proposed k-Nearest Neighbor classifier was less than that with the existing k-Nearest Neighbor classifier, and the smaller the improvement in runtime, the larger the improvement in precision and recall. In the proposed method, both the classification accuracy and the prediction accuracy were determined, and the prediction accuracy is comparatively high.
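
The two accuracy estimators named in the abstract, stratified k-fold cross-validation and repeated random subsampling, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`knn_predict`, `stratified_folds`, `stratified_cv_accuracy`, `random_subsampling_accuracy`), the squared Euclidean distance, the parameter defaults, and the synthetic two-cluster data are all assumptions made for illustration, and the data have no connection to the paper's direct marketing set.

```python
import random
from collections import Counter, defaultdict


def knn_predict(train_X, train_y, x, k=3):
    """Majority vote among the k training points closest to x (squared Euclidean distance)."""
    order = sorted(range(len(train_X)),
                   key=lambda i: sum((a - b) ** 2 for a, b in zip(train_X[i], x)))
    votes = Counter(train_y[i] for i in order[:k])
    return votes.most_common(1)[0][0]


def accuracy(X, y, train_idx, test_idx, k):
    """Fraction of test points whose predicted label matches the true label."""
    train_X = [X[i] for i in train_idx]
    train_y = [y[i] for i in train_idx]
    hits = sum(knn_predict(train_X, train_y, X[i], k) == y[i] for i in test_idx)
    return hits / len(test_idx)


def stratified_folds(y, n_folds, rng):
    """Deal the shuffled indices of each class round-robin into n_folds folds."""
    by_class = defaultdict(list)
    for i, label in enumerate(y):
        by_class[label].append(i)
    folds = [[] for _ in range(n_folds)]
    for label in sorted(by_class):
        idx = by_class[label]
        rng.shuffle(idx)
        for pos, i in enumerate(idx):
            folds[pos % n_folds].append(i)
    return folds


def stratified_cv_accuracy(X, y, k=3, n_folds=5, seed=0):
    """Mean accuracy over folds; each fold serves as the test set exactly once."""
    folds = stratified_folds(y, n_folds, random.Random(seed))
    scores = []
    for f in range(n_folds):
        train_idx = [i for g in range(n_folds) if g != f for i in folds[g]]
        scores.append(accuracy(X, y, train_idx, folds[f], k))
    return sum(scores) / len(scores)


def random_subsampling_accuracy(X, y, k=3, n_repeats=5, test_fraction=0.2, seed=0):
    """Mean accuracy over independent random train/test splits (not stratified)."""
    rng = random.Random(seed)
    n_test = max(1, int(len(X) * test_fraction))
    scores = []
    for _ in range(n_repeats):
        idx = list(range(len(X)))
        rng.shuffle(idx)
        scores.append(accuracy(X, y, idx[n_test:], idx[:n_test], k))
    return sum(scores) / len(scores)


# Synthetic toy data: two well-separated 2-D clusters (illustrative only).
data_rng = random.Random(42)
X = [(data_rng.gauss(0, 0.5), data_rng.gauss(0, 0.5)) for _ in range(20)] + \
    [(data_rng.gauss(5, 0.5), data_rng.gauss(5, 0.5)) for _ in range(20)]
y = [0] * 20 + [1] * 20

cv_acc = stratified_cv_accuracy(X, y)
rs_acc = random_subsampling_accuracy(X, y)
print(f"stratified 5-fold accuracy: {cv_acc:.3f}")
print(f"repeated random subsampling accuracy: {rs_acc:.3f}")
```

With 5 folds and a 20% test fraction repeated 5 times, both estimators hold out the same share of the data per round, which is one plausible reading of "equivalent" in the abstract. The difference is that k-fold tests every record exactly once, while random subsampling may test some records several times and others never.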

Keywords

DATA MINING , cross-validation , K-nearest neighbor , Runtime , Accuracy

Journal title

Expert Systems with Applications

Serial Year

2010

Record number

2347088