Title :
Dependency network methods for Hierarchical Multi-label Classification of gene functions
Author :
Fabris, Fabio ; Freitas, Alex A.
Author_Institution :
Sch. of Comput., Univ. of Kent., Canterbury, UK
Abstract :
Hierarchical Multi-label Classification (HMC) is a challenging real-world problem that naturally emerges in several areas. This work proposes two new algorithms using a Probabilistic Graphical Model based on Dependency Networks (DN) to solve the HMC problem of classifying gene functions into pre-established class hierarchies. DNs are especially attractive for their capability of using traditional, “out-of-the-shelf”, classification algorithms to model the relationship among classes and for their ability to cope with cyclic dependencies, resulting in greater flexibility with respect to Bayesian Networks. We tested our two algorithms: the first is a stand-alone Hierarchical Dependency Network (HDN) algorithm, and the second is a hybrid between the HDN and the Predictive Clustering Tree (PCT) algorithm, a well-known classifier for HMC. Based on our experiments, the hybrid classifier, using SVMs as base classifiers, obtained higher predictive accuracy than both the standard PCT algorithm and the HDN algorithm, considering 22 bioinformatics datasets and two out of three predictive accuracy measures specific for hierarchical classification (AU(PRC) and AUPRCw).
Keywords :
Bayes methods; bioinformatics; genetics; network theory (graphs); pattern classification; support vector machines; Bayesian networks; DN; HMC problem; PCT algorithm; SVM; base classifiers; bioinformatics datasets; class hierarchies; cyclic dependencies; gene function classification; hierarchical multilabel classification; hybrid algorithm; hybrid classifier; out-of-the-shelf classification algorithms; predictive accuracy measures; predictive clustering tree algorithm; probabilistic graphical model; real-world problem; stand-alone HDN algorithm; stand-alone hierarchical dependency network algorithm; standard PCT algorithm; Accuracy; Classification algorithms; Clustering algorithms; Inference algorithms; Markov processes; Prediction algorithms; Vectors;
Conference_Titel :
Computational Intelligence and Data Mining (CIDM), 2014 IEEE Symposium on
Conference_Location :
Orlando, FL
DOI :
10.1109/CIDM.2014.7008674