• DocumentCode
    2169681
  • Title

    Research on statistics-based model for E-commerce user purchase prediction

  • Author

    Dong, Huailin ; Xie, Lingwei ; Zhang, Zhongnan

  • Author_Institution
    Software School, Xiamen University, Xiamen, China
  • fYear
    2015
  • fDate
    22-24 July 2015
  • Firstpage
    553
  • Lastpage
    557
  • Abstract
    This paper describes our work for ALIDATA DISCOVERY competition. Through analyzing massive real-world user action data provided by Tmall, one of the largest B2C online retail platforms in China, we try to predict future user purchases. The prediction results are judged by F1 Score that is consist of two parts, precision and recall rate. The provided data set contains more than 500 million action records from over 12 million distinct users. Such a massive data set drives us to finish the task in MapReduce fashion on the Open Data Processing Service (ODPS) platform. According to statistical results, we classify all users into different groups firstly. Then the rule model, timing model, statistics model are adopted for predicting future user purchases. By comparison, the statistics model obtains the best F1Score.
  • Keywords
    Big data; Computational modeling; Data mining; Data models; Data preprocessing; Predictive models; Timing; E-commerce user; MapReduce; behavior data; purchase prediction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science & Education (ICCSE), 2015 10th International Conference on
  • Conference_Location
    Cambridge, United Kingdom
  • Print_ISBN
    978-1-4799-6598-4
  • Type

    conf

  • DOI
    10.1109/ICCSE.2015.7250308
  • Filename
    7250308