DocumentCode
617854
Title
A grammatical evolution algorithm for generation of Hierarchical Multi-Label Classification rules
Author
Cerri, R. ; Barros, Rodrigo C. ; de Carvalho, Andre C. P. L. F. ; Freitas, Alex A.
Author_Institution
Inst. de Cienc. Mat. e de Comput., Univ. de Sao Paulo, Sao Carlos, Brazil
fYear
2013
fDate
20-23 June 2013
Firstpage
454
Lastpage
461
Abstract
Hierarchical Multi-Label Classification (HMC) is a challenging task in data mining and machine learning. Each instance in HMC can be classified into two or more classes simultaneously. These classes are structured in a hierarchy, in the form of either a tree or a directed acyclic graph. Therefore, an instance can be assigned to two or more paths from the hierarchical structure, resulting in a complex classification problem with hundreds or thousands of classes. Several methods have been proposed to deal with such problems, including several algorithms based on well-known bio-inspired techniques, such as neural networks, ant colony optimization, and genetic algorithms. In this work, we propose a novel global method called GEHM, which makes use of grammatical evolution for generating HMC rules. In this approach, the grammatical evolution algorithm evolves the antecedents of classification rules, in order to assign instances from a HMC dataset to a probabilistic class vector. Our method is compared to bio-inspired HMC algorithms in protein function prediction datasets. The empirical analysis conducted in this work shows that GEHM outperforms the bio-inspired algorithms with statistical significance, which suggests that grammatical evolution is a promising alternative to deal with hierarchical multi-label classification of biological data.
Keywords
data mining; directed graphs; genetic algorithms; grammars; learning (artificial intelligence); neural nets; pattern classification; trees (mathematics); GEHM; HMC; ant colony optimization; bioinspired techniques; data mining; directed acyclic graph; genetic algorithms; grammatical evolution algorithm; hierarchical multilabel classification rule generation; machine learning; neural networks; probabilistic class vector; tree; Evolution (biology); Genetic algorithms; Grammar; Prediction algorithms; Proteins; Training; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Evolutionary Computation (CEC), 2013 IEEE Congress on
Conference_Location
Cancun
Print_ISBN
978-1-4799-0453-2
Electronic_ISBN
978-1-4799-0452-5
Type
conf
DOI
10.1109/CEC.2013.6557604
Filename
6557604
Link To Document