• DocumentCode
    1283078
  • Title

    Multilabel Learning for Protein Subcellular Location Prediction

  • Author

    Guo-Zheng Li ; Xiao Wang ; Xiaohua Hu ; Jia-Ming Liu ; Rui-Wei Zhao

  • Author_Institution
    Dept. of Control Sci. & Eng., Tongji Univ., Shanghai, China
  • Volume
    11
  • Issue
    3
  • fYear
    2012
  • Firstpage
    237
  • Lastpage
    243
  • Abstract
    Protein subcellular localization aims at predicting the location of a protein within a cell using computational methods. Knowledge of subcellular localization of proteins indicates protein functions and helps in identifying drug targets. Prediction of protein subcellular localization is an important but challenging problem, particularly when proteins may simultaneously exist at, or move between, two or more different subcellular location sites. Most of the existing protein subcellular localization methods are only used to deal with the single-location proteins. To better reflect the characteristics of multiplex proteins, we formulate prediction of subcellular localization of multiplex proteins as a multilabel learning problem. We present and compare two multilabel learning approaches, which exploit correlations between labels and leverage label-specific features, respectively, to induce a high quality prediction model. Experimental results on six protein data sets under various organisms show that our described methods achieve significantly higher performance than any of the existing methods. Among the different multilabel learning methods, we find that methods exploiting label correlations performs better than those leveraging label-specific features.
  • Keywords
    biology computing; cellular biophysics; drugs; learning (artificial intelligence); molecular biophysics; proteins; computational methods; drug targets; high quality prediction model; leverage label-specific features; multilabel learning; multiplex proteins; protein subcellular localization; protein subcellular location prediction; single-location proteins; Accuracy; Correlation; Learning systems; Measurement; Multiplexing; Proteins; Training; Binary relevance; classifier chain; multilabel learning; protein subcellular localization; Algorithms; Artificial Intelligence; Computational Biology; Databases, Protein; Humans; Intracellular Space; Models, Biological; Models, Statistical; Proteins;
  • fLanguage
    English
  • Journal_Title
    NanoBioscience, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1536-1241
  • Type

    jour

  • DOI
    10.1109/TNB.2012.2212249
  • Filename
    6298041