DocumentCode
2577543
Title
An improved sample selection algorithm in fuzzy decision tree induction
Author
Dong, Ling-Cai ; Wang, Dan ; Wang, Xi-Zhao
Author_Institution
Coll. of Math. & Comput. Sci., Hebei Univ., Baoding, China
fYear
2009
fDate
11-14 Oct. 2009
Firstpage
629
Lastpage
634
Abstract
This paper improves a method of sample selection based on maximum entropy. Compared with the original method, the improved one takes the probability distribution of unlabeled instances into consideration. It selects the instances which can reduce the uncertainty of the whole unlabeled set to a great extent. The uncertainty reduction of the whole unlabeled set caused by an instance is measured by the instance´s uncertainty and its influence index on the whole unlabeled set. To calculate the influence index conveniently, we introduces the similar matrix, the elements of which are the similarities measured by the distances between instances. The new method avoids the drawbacks that some abnormal or isolated samples may be selected by original method. Thus it can select the instances with more representation and more capability to resist noises. Our experimental results show that the performance of the classifier built from samples selected by the new algorithm is better than those selected by original method in the same time complexity.
Keywords
decision trees; fuzzy set theory; pattern classification; statistical distributions; classification ambiguity; fuzzy decision tree induction; probability distribution; sample selection algorithm; uncertainty reduction; Alzheimer´s disease; Databases; Decision trees; Entropy; Hidden Markov models; Machine learning; Probability distribution; Proteins; Support vector machines; Uncertainty; Sample selection; classification ambiguity; fuzzy decision tree; probability distribution; similarity;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man and Cybernetics, 2009. SMC 2009. IEEE International Conference on
Conference_Location
San Antonio, TX
ISSN
1062-922X
Print_ISBN
978-1-4244-2793-2
Electronic_ISBN
1062-922X
Type
conf
DOI
10.1109/ICSMC.2009.5346654
Filename
5346654
Link To Document