Title of article
Data accuracyʹs impact on segmentation performance: Benchmarking RFM analysis, logistic regression, and decision trees
Author/Authors
Coussement، نويسنده , , Kristof and Van den Bossche، نويسنده , , Filip A.M. and De Bock، نويسنده , , Koen W.، نويسنده ,
Issue Information
ماهنامه با شماره پیاپی سال 2014
Pages
8
From page
2751
To page
2758
Abstract
Companies greatly benefit from knowing how problems with data quality influence the performance of segmentation techniques and which techniques are more robust to these problems than others. This study investigates the influence of problems with data accuracy – an important dimension of data quality – on three prominent segmentation techniques for direct marketing: RFM (recency, frequency, and monetary value) analysis, logistic regression, and decision trees. For two real-life direct marketing data sets analyzed, the results demonstrate that (1) under optimal data accuracy, decision trees are preferred over RFM analysis and logistic regression; (2) the introduction of data accuracy problems deteriorates the performance of all three segmentation techniques; and (3) as data becomes less accurate, decision trees retain superior to logistic regression and RFM analysis. Overall, this study recommends the use of decision trees in the context of customer segmentation for direct marketing, even under the suspicion of data accuracy problems.
Keywords
Customer segmentation , Data Quality , data accuracy , RFM , decision trees , Direct marketing
Journal title
Journal of Business Research
Serial Year
2014
Journal title
Journal of Business Research
Record number
1956217
Link To Document