• DocumentCode
    1796692
  • Title

    Dependency network methods for Hierarchical Multi-label Classification of gene functions

  • Author

    Fabris, Fabio ; Freitas, Alex A.

  • Author_Institution
    Sch. of Comput., Univ. of Kent., Canterbury, UK
  • fYear
    2014
  • fDate
    9-12 Dec. 2014
  • Firstpage
    241
  • Lastpage
    248
  • Abstract
    Hierarchical Multi-label Classification (HMC) is a challenging real-world problem that naturally emerges in several areas. This work proposes two new algorithms using a Probabilistic Graphical Model based on Dependency Networks (DN) to solve the HMC problem of classifying gene functions into pre-established class hierarchies. DNs are especially attractive for their capability of using traditional, “out-of-the-shelf”, classification algorithms to model the relationship among classes and for their ability to cope with cyclic dependencies, resulting in greater flexibility with respect to Bayesian Networks. We tested our two algorithms: the first is a stand-alone Hierarchical Dependency Network (HDN) algorithm, and the second is a hybrid between the HDN and the Predictive Clustering Tree (PCT) algorithm, a well-known classifier for HMC. Based on our experiments, the hybrid classifier, using SVMs as base classifiers, obtained higher predictive accuracy than both the standard PCT algorithm and the HDN algorithm, considering 22 bioinformatics datasets and two out of three predictive accuracy measures specific for hierarchical classification (AU(PRC) and AUPRCw).
  • Keywords
    Bayes methods; bioinformatics; genetics; network theory (graphs); pattern classification; support vector machines; Bayesian networks; DN; HMC problem; PCT algorithm; SVM; base classifiers; bioinformatics datasets; class hierarchies; cyclic dependencies; gene function classification; hierarchical multilabel classification; hybrid algorithm; hybrid classifier; out-of-the-shelf classification algorithms; predictive accuracy measures; predictive clustering tree algorithm; probabilistic graphical model; real-world problem; stand-alone HDN algorithm; stand-alone hierarchical dependency network algorithm; standard PCT algorithm; Accuracy; Classification algorithms; Clustering algorithms; Inference algorithms; Markov processes; Prediction algorithms; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Data Mining (CIDM), 2014 IEEE Symposium on
  • Conference_Location
    Orlando, FL
  • Type

    conf

  • DOI
    10.1109/CIDM.2014.7008674
  • Filename
    7008674