DocumentCode
3341887
Title
Efficient building algorithms of decision tree for uniformly distributed uncertain data
Author
Chenggang Li ; Liping Huang ; Ling Tian
Author_Institution
Dept. of Precision Instrum. & Machanology, Univ. of Tsinghua, Beijing, China
Volume
1
fYear
2011
fDate
26-28 July 2011
Firstpage
105
Lastpage
108
Abstract
Developing algorithms for uncertain data is one of the most active themes in data mining community. A number of different decision tree classifiers have been studied in order to deal with uncertain data. This paper extends these works. In this paper, we develop a tree-pruning algorithm using sum of the tuples fractions based on probability theory. By pruning, we find that the accuracy of the classifier is improved and the efficiency of building the decision tree is also improved. Besides, we find that under the context of uniformly distribution, increasing the sampling density of the uncertain attribute value can make little contribution to improve the accuracy, but is computationally more costly. So we propose a new method of sampling. Using this sampling method, the execution time of building the decision tree is greatly decreased.
Keywords
data mining; decision trees; pattern classification; probability; building algorithm; data mining community; decision tree classifier; probability theory; sampling density; sampling method; tree-pruning algorithm; tuples fraction; uncertain attribute value; uniformly distributed uncertain data; Accuracy; Algorithm design and analysis; Classification algorithms; Decision trees; Distributed databases; Satellites; Training; data mining; decision tree; prunning; uncertain data;
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Computation (ICNC), 2011 Seventh International Conference on
Conference_Location
Shanghai
ISSN
2157-9555
Print_ISBN
978-1-4244-9950-2
Type
conf
DOI
10.1109/ICNC.2011.6022055
Filename
6022055
Link To Document