DocumentCode
1808443
Title
Decision Tree Algorithm based on Sampling
Author
Xudong, Song ; Xiaolan, Cheng
Author_Institution
Dalian Jiaotong Univ., Dalian
fYear
2007
fDate
18-21 Sept. 2007
Firstpage
689
Lastpage
694
Abstract
As the size of the database increases, data mining algorithm faces more demanding requirements for efficiency and accuracy. Data mining for large data sets require large amounts of time and physical resources. Sampling is introduced as an effective method. Facing large data sets, a new decision tree algorithm based on sampling is put forward. It can select small initial samples with similar distribution to the original data sets to study, and stop sampling according to the time complexity requirements and convergence criteria. Comparing with the existing flexible decision tree algorithm, the algorithm can reduce the computation time and I/O complexity, while maintaining the accuracy of the tree.
Keywords
data mining; decision trees; convergence criteria; data mining algorithm; decision tree algorithm; time complexity; Classification tree analysis; Computer networks; Concurrent computing; Convergence; Data mining; Databases; Decision trees; Parallel processing; Partitioning algorithms; Sampling methods;
fLanguage
English
Publisher
ieee
Conference_Titel
Network and Parallel Computing Workshops, 2007. NPC Workshops. IFIP International Conference on
Conference_Location
Liaoning
Print_ISBN
978-0-7695-2943-1
Type
conf
DOI
10.1109/NPC.2007.133
Filename
4351564
Link To Document