DocumentCode
1283078
Title
Multilabel Learning for Protein Subcellular Location Prediction
Author
Guo-Zheng Li ; Xiao Wang ; Xiaohua Hu ; Jia-Ming Liu ; Rui-Wei Zhao
Author_Institution
Dept. of Control Sci. & Eng., Tongji Univ., Shanghai, China
Volume
11
Issue
3
fYear
2012
Firstpage
237
Lastpage
243
Abstract
Protein subcellular localization aims at predicting the location of a protein within a cell using computational methods. Knowledge of subcellular localization of proteins indicates protein functions and helps in identifying drug targets. Prediction of protein subcellular localization is an important but challenging problem, particularly when proteins may simultaneously exist at, or move between, two or more different subcellular location sites. Most of the existing protein subcellular localization methods are only used to deal with the single-location proteins. To better reflect the characteristics of multiplex proteins, we formulate prediction of subcellular localization of multiplex proteins as a multilabel learning problem. We present and compare two multilabel learning approaches, which exploit correlations between labels and leverage label-specific features, respectively, to induce a high quality prediction model. Experimental results on six protein data sets under various organisms show that our described methods achieve significantly higher performance than any of the existing methods. Among the different multilabel learning methods, we find that methods exploiting label correlations performs better than those leveraging label-specific features.
Keywords
biology computing; cellular biophysics; drugs; learning (artificial intelligence); molecular biophysics; proteins; computational methods; drug targets; high quality prediction model; leverage label-specific features; multilabel learning; multiplex proteins; protein subcellular localization; protein subcellular location prediction; single-location proteins; Accuracy; Correlation; Learning systems; Measurement; Multiplexing; Proteins; Training; Binary relevance; classifier chain; multilabel learning; protein subcellular localization; Algorithms; Artificial Intelligence; Computational Biology; Databases, Protein; Humans; Intracellular Space; Models, Biological; Models, Statistical; Proteins;
fLanguage
English
Journal_Title
NanoBioscience, IEEE Transactions on
Publisher
ieee
ISSN
1536-1241
Type
jour
DOI
10.1109/TNB.2012.2212249
Filename
6298041
Link To Document