DocumentCode
2370597
Title
Ensembles of cascading trees
Author
Li, Jinyan ; Liu, Huiqing
Author_Institution
Inst. for Infocomm Res., Singapore
fYear
2003
fDate
19-22 Nov. 2003
Firstpage
585
Lastpage
588
Abstract
We introduce a new method, called CS4, to construct committees of decision trees for classification. The method considers different top-ranked features as the root nodes of member trees. This idea is particularly suitable for dealing with high-dimensional bio-medical data as top-ranked features in this type of data usually possess similar merits for classification. To make a decision, the committee combines the power of individual trees in a weighted manner. Unlike Bagging or Boosting which uses bootstrapped training data, our method builds all the member trees of a committee using exactly the same set of training data. We have tested these ideas on UCI data sets as well as recent bio-medical data sets of gene expression or proteomic profiles that are usually described by more than 10,000 features. All the experimental results show that our method is efficient and that the classification performance are superior to C4.5 family algorithms.
Keywords
decision trees; learning (artificial intelligence); medical information systems; CS4; UCI data sets; biomedical data; cascading trees; classification; decision tree; gene expression; Bagging; Bioinformatics; Boosting; Classification tree analysis; Decision trees; Gain measurement; Gene expression; Proteomics; Testing; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
Print_ISBN
0-7695-1978-4
Type
conf
DOI
10.1109/ICDM.2003.1250983
Filename
1250983
Link To Document