• DocumentCode
    3341887
  • Title

    Efficient building algorithms of decision tree for uniformly distributed uncertain data

  • Author

    Chenggang Li ; Liping Huang ; Ling Tian

  • Author_Institution
    Dept. of Precision Instrum. & Machanology, Univ. of Tsinghua, Beijing, China
  • Volume
    1
  • fYear
    2011
  • fDate
    26-28 July 2011
  • Firstpage
    105
  • Lastpage
    108
  • Abstract
    Developing algorithms for uncertain data is one of the most active themes in data mining community. A number of different decision tree classifiers have been studied in order to deal with uncertain data. This paper extends these works. In this paper, we develop a tree-pruning algorithm using sum of the tuples fractions based on probability theory. By pruning, we find that the accuracy of the classifier is improved and the efficiency of building the decision tree is also improved. Besides, we find that under the context of uniformly distribution, increasing the sampling density of the uncertain attribute value can make little contribution to improve the accuracy, but is computationally more costly. So we propose a new method of sampling. Using this sampling method, the execution time of building the decision tree is greatly decreased.
  • Keywords
    data mining; decision trees; pattern classification; probability; building algorithm; data mining community; decision tree classifier; probability theory; sampling density; sampling method; tree-pruning algorithm; tuples fraction; uncertain attribute value; uniformly distributed uncertain data; Accuracy; Algorithm design and analysis; Classification algorithms; Decision trees; Distributed databases; Satellites; Training; data mining; decision tree; prunning; uncertain data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Computation (ICNC), 2011 Seventh International Conference on
  • Conference_Location
    Shanghai
  • ISSN
    2157-9555
  • Print_ISBN
    978-1-4244-9950-2
  • Type

    conf

  • DOI
    10.1109/ICNC.2011.6022055
  • Filename
    6022055