• DocumentCode
    1771291
  • Title

    Handling class imbalance in customer behavior prediction

  • Author

    Nengbao Liu ; Wei Lee Woon ; Aung, Zeyar ; Afshari, Afshin

  • Author_Institution
    Inst. Center for Smart & Sustainable Syst., Masdar Inst. of Sci. & Technol., Abu Dhabi, United Arab Emirates
  • fYear
    2014
  • fDate
    19-23 May 2014
  • Firstpage
    100
  • Lastpage
    103
  • Abstract
    Class imbalance is a common problem in real world applications and it affects significantly the prediction accuracy. In this study, investigation on better handling class imbalance problem in customer behavior prediction is performed. Using a more appropriate evaluation metric (AUC), we investigated the increase of performance for under-sampling and two machine learning algorithms (weight Random Forests and RUSBoost) against a benchmark case of just using Random Forests. Results show that under-sampling is the most effective way to deal with class imbalance. RUSBoost, as a specific algorithm designed to deal with class imbalance problem, is also effective but not as good as under-sampling. Weighted Random Forests, as a cost-sensitive learner, only improves the performance of appetency classification problem out of three classification problems.
  • Keywords
    consumer behaviour; customer relationship management; data mining; learning (artificial intelligence); pattern classification; RUSBoost; appetency classification problem; appropriate evaluation metric; class imbalance handling; cost-sensitive learner; customer behavior prediction; machine learning algorithms; under-sampling; weight random forests; Accuracy; Benchmark testing; Companies; Data mining; Measurement; Training; Vegetation; Class imbalance; Customer behavior; Prediction; RUSBoost; Random forests; Under-sampling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Collaboration Technologies and Systems (CTS), 2014 International Conference on
  • Conference_Location
    Minneapolis, MN
  • Print_ISBN
    978-1-4799-5157-4
  • Type

    conf

  • DOI
    10.1109/CTS.2014.6867549
  • Filename
    6867549