Title :
Study on the application of data mining for customer groups based on the modified ID3 algorithm in the e-commerce
Author :
Yang, Feng ; Jin, Hemin ; Qi, Huirnin
Author_Institution :
Inst. of Inf. Technol., Henan Univ. of TCM, Zhengzhou, China
Abstract :
Customer information-mining in e-commerce is very important. ID3 algorithm is a mining one based on decision tree, which selects property value with the highest gains as the test attribute of its sample sets, establishes decision-making node, and divides them in turn. ID3 algorithm involves repeated logarithm operation, and it will affect the efficiency of generating decision tree when there are a large number of data, so one must change the selection criteria of data set attributes, using the Taylor formula to transform the algorithm to reduce the amount of data calculation and the generation time of decision trees and thus improve the efficiency of the decision tree classifier. It is shown that the use of improved ID3 algorithm to deal with the customer base data samples can reduce the computational cost, and improve the efficiency of the decision tree generation.
Keywords :
data mining; decision trees; electronic commerce; pattern classification; Taylor formula; customer groups; customer information-mining; data calculation; data mining; decision tree classifier; decision tree generation; decision-making node; e-commerce; modified ID3 algorithm; property value; repeated logarithm operation; Microwave integrated circuits; Data Mining; Decision Tree; ID3 Algorithm; Information Entropy; Taylor Formula;
Conference_Titel :
Computer Science and Information Processing (CSIP), 2012 International Conference on
Conference_Location :
Xi´an, Shaanxi
Print_ISBN :
978-1-4673-1410-7
DOI :
10.1109/CSIP.2012.6308929