Title :
Using information gain to build meaningful decision forests for multilabel classification
Author :
Gold, Kevin ; Petrosino, Allison
Author_Institution :
Dept. of Comput. Sci., Wellesley Coll., Wellesley, MA, USA
Abstract :
“Gain-Based Separation” is a novel heuristic that modifies the standard multiclass decision tree learning algorithm to produce forests that can describe an example or object with multiple classifications. When the information gain at a node would be higher if all examples of a particular classification were removed, those examples are reserved for another tree. In this way, the algorithm performs some automated separation of classes into categories; classes are mutually exclusive within trees but not across trees. The algorithm was tested on naive subjects´ descriptions of objects to a robot, using YUV color space and basic size and distance features. The new method outperforms the common strategy of separating multilabel problems into L binary outcome decision trees, and also outperforms RAkEL, a recent method for producing random multilabel forests.
Keywords :
classification; decision trees; learning (artificial intelligence); natural language processing; pattern classification; robots; L binary outcome decision tree; YUV color space; decision tree learning algorithm; gain based separation; multilabel classification; random multilabel forest; robot; Conferences; Decision trees; Heuristic algorithms; Image color analysis; Robot sensing systems; Training;
Conference_Titel :
Development and Learning (ICDL), 2010 IEEE 9th International Conference on
Conference_Location :
Ann Arbor, MI
Print_ISBN :
978-1-4244-6900-0
DOI :
10.1109/DEVLRN.2010.5578864