DocumentCode
2169681
Title
Research on statistics-based model for E-commerce user purchase prediction
Author
Dong, Huailin ; Xie, Lingwei ; Zhang, Zhongnan
Author_Institution
Software School, Xiamen University, Xiamen, China
fYear
2015
fDate
22-24 July 2015
Firstpage
553
Lastpage
557
Abstract
This paper describes our work for ALIDATA DISCOVERY competition. Through analyzing massive real-world user action data provided by Tmall, one of the largest B2C online retail platforms in China, we try to predict future user purchases. The prediction results are judged by F1 Score that is consist of two parts, precision and recall rate. The provided data set contains more than 500 million action records from over 12 million distinct users. Such a massive data set drives us to finish the task in MapReduce fashion on the Open Data Processing Service (ODPS) platform. According to statistical results, we classify all users into different groups firstly. Then the rule model, timing model, statistics model are adopted for predicting future user purchases. By comparison, the statistics model obtains the best F1 Score.
Keywords
Big data; Computational modeling; Data mining; Data models; Data preprocessing; Predictive models; Timing; E-commerce user; MapReduce; behavior data; purchase prediction;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science & Education (ICCSE), 2015 10th International Conference on
Conference_Location
Cambridge, United Kingdom
Print_ISBN
978-1-4799-6598-4
Type
conf
DOI
10.1109/ICCSE.2015.7250308
Filename
7250308
Link To Document