Title :
Random Forest Based Feature Induction
Author :
Vens, Celine ; Costa, Fabrizio
Author_Institution :
Dept. of Comput. Sci., Katholieke Univ. Leuven, Leuven, Belgium
Abstract :
We propose a simple yet effective strategy to induce a task dependent feature representation using ensembles of random decision trees. The new feature mapping is efficient in space and time, and provides a metric transformation that is non parametric and not implicit in nature (i.e. not expressed via a kernel matrix), nor limited to the transductive setup. The main advantage of the proposed mapping lies in its flexibility to adapt to several types of learning tasks ranging from regression to multi-label classification, and to deal in a natural way with missing values. Finally, we provide an extensive empirical study of the properties of the learned feature representation over real and artificial datasets.
Keywords :
data mining; decision trees; feature extraction; matrix algebra; pattern classification; random processes; regression analysis; feature mapping; feature representation; metric transformation; multilabel classification; random decision trees; random forest based feature induction; regression analysis; task dependent feature representation; Complexity theory; Decision trees; Encoding; Kernel; Training; Vectors; Vegetation; feature induction; random forests;
Conference_Titel :
Data Mining (ICDM), 2011 IEEE 11th International Conference on
Conference_Location :
Vancouver,BC
Print_ISBN :
978-1-4577-2075-8
DOI :
10.1109/ICDM.2011.121