• DocumentCode
    2577543
  • Title

    An improved sample selection algorithm in fuzzy decision tree induction

  • Author

    Dong, Ling-Cai ; Wang, Dan ; Wang, Xi-Zhao

  • Author_Institution
    Coll. of Math. & Comput. Sci., Hebei Univ., Baoding, China
  • fYear
    2009
  • fDate
    11-14 Oct. 2009
  • Firstpage
    629
  • Lastpage
    634
  • Abstract
    This paper improves a method of sample selection based on maximum entropy. Compared with the original method, the improved one takes the probability distribution of unlabeled instances into consideration. It selects the instances which can reduce the uncertainty of the whole unlabeled set to a great extent. The uncertainty reduction of the whole unlabeled set caused by an instance is measured by the instance´s uncertainty and its influence index on the whole unlabeled set. To calculate the influence index conveniently, we introduces the similar matrix, the elements of which are the similarities measured by the distances between instances. The new method avoids the drawbacks that some abnormal or isolated samples may be selected by original method. Thus it can select the instances with more representation and more capability to resist noises. Our experimental results show that the performance of the classifier built from samples selected by the new algorithm is better than those selected by original method in the same time complexity.
  • Keywords
    decision trees; fuzzy set theory; pattern classification; statistical distributions; classification ambiguity; fuzzy decision tree induction; probability distribution; sample selection algorithm; uncertainty reduction; Alzheimer´s disease; Databases; Decision trees; Entropy; Hidden Markov models; Machine learning; Probability distribution; Proteins; Support vector machines; Uncertainty; Sample selection; classification ambiguity; fuzzy decision tree; probability distribution; similarity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man and Cybernetics, 2009. SMC 2009. IEEE International Conference on
  • Conference_Location
    San Antonio, TX
  • ISSN
    1062-922X
  • Print_ISBN
    978-1-4244-2793-2
  • Electronic_ISBN
    1062-922X
  • Type

    conf

  • DOI
    10.1109/ICSMC.2009.5346654
  • Filename
    5346654