DocumentCode :
2913663
Title :
Hierarchical multi-label classification for protein function prediction: A local approach based on neural networks
Author :
Cerri, Ricardo ; Barros, Rodrigo C. ; De Carvalho, André C P L F
Author_Institution :
Dept. of Comput. Sci., Univ. of Sao Paulo (USP), São Carlos, Brazil
fYear :
2011
fDate :
22-24 Nov. 2011
Firstpage :
337
Lastpage :
343
Abstract :
In Hierarchical Multi-Label Classification problems, each instance can be classified into two or more classes simultaneously, differently from conventional classification. Additionally, the classes are structured in a hierarchy, in the form of either a tree or a directed acyclic graph. Hence, an instance can be assigned to two or more paths from the hierarchical structure, resulting in a complex classification problem with possibly hundreds of classes. Many methods have been proposed to deal with such problems, some of them employing a single classifier to deal with all classes simultaneously (global methods), and others employing many classifiers to decompose the original problem into a set of subproblems (local methods). In this work, we propose a novel local method named HMC-LMLP, which uses one Multi-Layer Perceptron per hierarchical level. The predictions in one level are used as inputs to the network responsible for the predictions in the next level. We make use of two distinct Multi-Layer Perceptron algorithms: Back-propagation and Resilient Back-propagation. In addition, we make use of an error measure specially tailored to multi-label problems for training the networks. Our method is compared to state-of-the-art hierarchical multi-label classification algorithms, in protein function prediction datasets. The experimental results show that our approach presents competitive predictive accuracy, suggesting that artificial neural networks constitute a promising alternative to deal with hierarchical multi-label classification of biological data.
Keywords :
backpropagation; biology computing; classification; directed graphs; multilayer perceptrons; neural nets; proteins; HMC-LMLP; artificial neural networks; biological data; directed acyclic graph; hierarchical multi-label classification; multilayer perceptron; protein function prediction; resilient backpropagation; Decision trees; Measurement uncertainty; Neural networks; Neurons; Prediction algorithms; Proteins; Training; Machine learning; hierarchical multi-label classification; neural networks; protein function prediction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Systems Design and Applications (ISDA), 2011 11th International Conference on
Conference_Location :
Cordoba
ISSN :
2164-7143
Print_ISBN :
978-1-4577-1676-8
Type :
conf
DOI :
10.1109/ISDA.2011.6121678
Filename :
6121678
Link To Document :
بازگشت