DocumentCode
1796692
Title
Dependency network methods for Hierarchical Multi-label Classification of gene functions
Author
Fabris, Fabio ; Freitas, Alex A.
Author_Institution
Sch. of Comput., Univ. of Kent., Canterbury, UK
fYear
2014
fDate
9-12 Dec. 2014
Firstpage
241
Lastpage
248
Abstract
Hierarchical Multi-label Classification (HMC) is a challenging real-world problem that naturally emerges in several areas. This work proposes two new algorithms using a Probabilistic Graphical Model based on Dependency Networks (DN) to solve the HMC problem of classifying gene functions into pre-established class hierarchies. DNs are especially attractive for their capability of using traditional, “out-of-the-shelf”, classification algorithms to model the relationship among classes and for their ability to cope with cyclic dependencies, resulting in greater flexibility with respect to Bayesian Networks. We tested our two algorithms: the first is a stand-alone Hierarchical Dependency Network (HDN) algorithm, and the second is a hybrid between the HDN and the Predictive Clustering Tree (PCT) algorithm, a well-known classifier for HMC. Based on our experiments, the hybrid classifier, using SVMs as base classifiers, obtained higher predictive accuracy than both the standard PCT algorithm and the HDN algorithm, considering 22 bioinformatics datasets and two out of three predictive accuracy measures specific for hierarchical classification (AU(PRC) and AUPRCw).
Keywords
Bayes methods; bioinformatics; genetics; network theory (graphs); pattern classification; support vector machines; Bayesian networks; DN; HMC problem; PCT algorithm; SVM; base classifiers; bioinformatics datasets; class hierarchies; cyclic dependencies; gene function classification; hierarchical multilabel classification; hybrid algorithm; hybrid classifier; out-of-the-shelf classification algorithms; predictive accuracy measures; predictive clustering tree algorithm; probabilistic graphical model; real-world problem; stand-alone HDN algorithm; stand-alone hierarchical dependency network algorithm; standard PCT algorithm; Accuracy; Classification algorithms; Clustering algorithms; Inference algorithms; Markov processes; Prediction algorithms; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence and Data Mining (CIDM), 2014 IEEE Symposium on
Conference_Location
Orlando, FL
Type
conf
DOI
10.1109/CIDM.2014.7008674
Filename
7008674
Link To Document