DocumentCode :
2549625
Title :
An Efficient Classification System Based on Binary Search Trees for Data Streams Mining
Author :
Wang, Tao ; Li, Zhoujun ; Yan, Yuejin ; Chen, Huowang ; Yu, Jinshan
Author_Institution :
Nat. Univ. of Defense Technol., Changsha
fYear :
2007
fDate :
22-28 April 2007
Firstpage :
15
Lastpage :
15
Abstract :
Decision tree construction is a well-studied problem in data mining. Recently, there has been much interest in mining data streams. Domingos and Hulten have presented a one-pass algorithm for decision tree constructions. Their system using Hoeffding inequality to achieve a probabilistic bound on the accuracy of the tree constructed. In this paper, we revisit this problem and propose a decision tree classifier system that uses binary search trees to handle numerical attributes. The proposed system is based on the most successful VFDT, and it achieves excellent performance. The most relevant property of our system is an average large reduction in processing time, while keeps the same tree size and accuracy.
Keywords :
data analysis; data mining; decision trees; pattern classification; probability; Hoeffding inequality; binary search tree; data stream mining; decision tree classifier system; probability; Binary search trees; Classification tree analysis; Computer science; Data engineering; Data mining; Data processing; Databases; Decision trees; Input variables; Robustness;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, 2007. ICONS '07. Second International Conference on
Conference_Location :
Martinique
Print_ISBN :
0-7695-2807-4
Electronic_ISBN :
0-7695-2807-4
Type :
conf
DOI :
10.1109/ICONS.2007.12
Filename :
4196317
Link To Document :
بازگشت