Title of article :
Evaluation of k-Nearest Neighbor classifier performance for direct marketing
Author/Authors :
Govindarajan، نويسنده , , M. and Chandrasekaran، نويسنده , , RM.، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2010
Pages :
6
From page :
253
To page :
258
Abstract :
Text data mining is a process of exploratory data analysis. Classification maps data into predefined groups or classes. It is often referred to as supervised learning because the classes are determined before examining the data. This paper describes the proposed k-Nearest Neighbor classifier that performs comparative cross-validation for the existing k-Nearest Neighbor classifier. The feasibility and the benefits of the proposed approach are demonstrated by means of data mining problem: direct marketing. Direct marketing has become an important application field of data mining. Comparative cross-validation involves estimation of accuracy by either stratified k-fold cross-validation or equivalent repeated random subsampling. While the proposed method may have a high bias; its performance (accuracy estimation in our case) may be poor due to a high variance. Thus the accuracy with the proposed k-Nearest Neighbor classifier was less than that with the existing k-Nearest Neighbor classifier, and the smaller the improvement in runtime the larger the improvement in precision and recall. In our proposed method we have determined the classification accuracy and prediction accuracy where the prediction accuracy is comparatively high.
Keywords :
DATA MINING , cross-validation , K-nearest neighbor , Runtime , Accuracy
Journal title :
Expert Systems with Applications
Serial Year :
2010
Journal title :
Expert Systems with Applications
Record number :
2347088
Link To Document :
بازگشت