DocumentCode
2913663
Title
Hierarchical multi-label classification for protein function prediction: A local approach based on neural networks
Author
Cerri, Ricardo ; Barros, Rodrigo C. ; De Carvalho, André C P L F
Author_Institution
Dept. of Comput. Sci., Univ. of Sao Paulo (USP), São Carlos, Brazil
fYear
2011
fDate
22-24 Nov. 2011
Firstpage
337
Lastpage
343
Abstract
In Hierarchical Multi-Label Classification problems, each instance can be classified into two or more classes simultaneously, differently from conventional classification. Additionally, the classes are structured in a hierarchy, in the form of either a tree or a directed acyclic graph. Hence, an instance can be assigned to two or more paths from the hierarchical structure, resulting in a complex classification problem with possibly hundreds of classes. Many methods have been proposed to deal with such problems, some of them employing a single classifier to deal with all classes simultaneously (global methods), and others employing many classifiers to decompose the original problem into a set of subproblems (local methods). In this work, we propose a novel local method named HMC-LMLP, which uses one Multi-Layer Perceptron per hierarchical level. The predictions in one level are used as inputs to the network responsible for the predictions in the next level. We make use of two distinct Multi-Layer Perceptron algorithms: Back-propagation and Resilient Back-propagation. In addition, we make use of an error measure specially tailored to multi-label problems for training the networks. Our method is compared to state-of-the-art hierarchical multi-label classification algorithms, in protein function prediction datasets. The experimental results show that our approach presents competitive predictive accuracy, suggesting that artificial neural networks constitute a promising alternative to deal with hierarchical multi-label classification of biological data.
Keywords
backpropagation; biology computing; classification; directed graphs; multilayer perceptrons; neural nets; proteins; HMC-LMLP; artificial neural networks; biological data; directed acyclic graph; hierarchical multi-label classification; multilayer perceptron; protein function prediction; resilient backpropagation; Decision trees; Measurement uncertainty; Neural networks; Neurons; Prediction algorithms; Proteins; Training; Machine learning; hierarchical multi-label classification; neural networks; protein function prediction;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Systems Design and Applications (ISDA), 2011 11th International Conference on
Conference_Location
Cordoba
ISSN
2164-7143
Print_ISBN
978-1-4577-1676-8
Type
conf
DOI
10.1109/ISDA.2011.6121678
Filename
6121678
Link To Document