• DocumentCode
    617854
  • Title

    A grammatical evolution algorithm for generation of Hierarchical Multi-Label Classification rules

  • Author

    Cerri, R. ; Barros, Rodrigo C. ; de Carvalho, Andre C. P. L. F. ; Freitas, Alex A.

  • Author_Institution
    Inst. de Cienc. Mat. e de Comput., Univ. de Sao Paulo, Sao Carlos, Brazil
  • fYear
    2013
  • fDate
    20-23 June 2013
  • Firstpage
    454
  • Lastpage
    461
  • Abstract
    Hierarchical Multi-Label Classification (HMC) is a challenging task in data mining and machine learning. Each instance in HMC can be classified into two or more classes simultaneously. These classes are structured in a hierarchy, in the form of either a tree or a directed acyclic graph. Therefore, an instance can be assigned to two or more paths from the hierarchical structure, resulting in a complex classification problem with hundreds or thousands of classes. Several methods have been proposed to deal with such problems, including several algorithms based on well-known bio-inspired techniques, such as neural networks, ant colony optimization, and genetic algorithms. In this work, we propose a novel global method called GEHM, which makes use of grammatical evolution for generating HMC rules. In this approach, the grammatical evolution algorithm evolves the antecedents of classification rules, in order to assign instances from a HMC dataset to a probabilistic class vector. Our method is compared to bio-inspired HMC algorithms in protein function prediction datasets. The empirical analysis conducted in this work shows that GEHM outperforms the bio-inspired algorithms with statistical significance, which suggests that grammatical evolution is a promising alternative to deal with hierarchical multi-label classification of biological data.
  • Keywords
    data mining; directed graphs; genetic algorithms; grammars; learning (artificial intelligence); neural nets; pattern classification; trees (mathematics); GEHM; HMC; ant colony optimization; bioinspired techniques; data mining; directed acyclic graph; genetic algorithms; grammatical evolution algorithm; hierarchical multilabel classification rule generation; machine learning; neural networks; probabilistic class vector; tree; Evolution (biology); Genetic algorithms; Grammar; Prediction algorithms; Proteins; Training; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Evolutionary Computation (CEC), 2013 IEEE Congress on
  • Conference_Location
    Cancun
  • Print_ISBN
    978-1-4799-0453-2
  • Electronic_ISBN
    978-1-4799-0452-5
  • Type

    conf

  • DOI
    10.1109/CEC.2013.6557604
  • Filename
    6557604