• DocumentCode
    2892746
  • Title

    Inducing Uncertain Decision Tree via Cloud Model

  • Author

    Wu, Tao ; Qin, Kun

  • Author_Institution
    State Key Lab. of Software Eng., Wuhan Univ., Wuhan, China
  • fYear
    2009
  • fDate
    12-14 Oct. 2009
  • Firstpage
    85
  • Lastpage
    91
  • Abstract
    This paper addresses the decision trees induction with uncertain data. In other words, it presents a novel method, called uncertain decision trees (UDT) to handle the uncertainty during the process of inducing decision trees. Here, uncertainty is depicted via cloud model theory, a quantitative-qualitative transforming model with uncertainty, which can well integrate the fuzziness and randomness of concepts in a unified way. In the learning stage, dataset is pre-processed by cloud transformation algorithm, which climbs data into class labels via histograms or frequency distribution, and the labels are expressed by cloud concepts. In this paper, some basic definitions are proposed, including cloud distance, cloud dissimilarity matrix, cloud index, and UDT, where cloud index is a novel splitting criterion of selecting attributes for handling uncertainty. Take data from UCI for example, this paper provided an algorithm inducing UDT, and checked its validity or appropriateness. In contrast to the classical approaches, both in the learning stage and classifying stage, the proposed method develops existing methods, and it is more consistent with the human cognition, which can support uncertainty, build UDT via cloud concepts, and classify the uncertain data. Moreover, experiments and results are compared with the current methods to illustrate the feasibility, accuracy and effectiveness of the cloud based algorithm.
  • Keywords
    data mining; decision trees; uncertainty handling; cloud model; human cognition; learning stage; uncertain data; uncertain decision tree; Classification tree analysis; Clouds; Decision trees; Fuzzy sets; Induction generators; Information science; Machine learning algorithms; Measurement uncertainty; Software engineering; Uncertain systems; classifier; cloud model; data mining; decision tree; knowledge discovery; uncertain systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Semantics, Knowledge and Grid, 2009. SKG 2009. Fifth International Conference on
  • Conference_Location
    Zhuhai
  • Print_ISBN
    978-0-7695-3810-5
  • Type

    conf

  • DOI
    10.1109/SKG.2009.17
  • Filename
    5368030