DocumentCode :
1121642
Title :
Analysis and Design of a Decision Tree Based on Entropy Reduction and Its Application to Large Character Set Recognition
Author :
Wang, Qing Ren ; Suen, Ching Y.
Author_Institution :
Department of Computer Science, Concordia University, Montreal, P. Q., Canada H3G 1M8.
Issue :
4
fYear :
1984
fDate :
7/1/1984
Firstpage :
406
Lastpage :
417
Abstract :
Based on a recursive process of reducing the entropy, the general decision tree classifier with overlap has been analyzed. Several theorems have been proposed and proved. When the number of pattern classes is very large, the theorems can reveal both the advantages of a tree classifier and the main difficulties in its implementation. Suppose H is Shannon's entropy measure of the given problem. The theoretical results indicate that the tree searching time can be minimized to the order O(H), but the error rate is also in the same order O(H) due to error accumulation. However, the memory requirement is in the order O(H exp(H)), which poses serious problems in the implementation of a tree classifier for a large number of classes. To solve these problems, several theorems related to the bounds on the search time, error rate, memory requirement and overlap factor in the design of a decision tree have been proposed and some principles have been established to analyze the behaviors of the decision tree. When applied to classify sets of 64, 450, and 3200 Chinese characters, respectively, the experimental results support the theoretical predictions. For 3200 classes, a very high recognition rate of 99.88 percent was achieved at a high speed of 873 samples/s when the experiment was conducted on a Cyber 172 computer using a high-level language.
Keywords :
Biomedical computing; Biomedical measurements; Character recognition; Classification tree analysis; Computer errors; Decision trees; Entropy; Error analysis; High level languages; Pattern recognition; ISOETRP; decision tree; entropy reduction; error accumulation; fuzzy logic search; gain; heuristic search; overlap; tree classifier
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
IEEE
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.1984.4767546
Filename :
4767546