• DocumentCode
    3703451
  • Title

    Phosphorylation sites prediction using Random Forest

  • Author

    Hamid D. Ismail;Ahoi Jones;Jung H. Kim;Robert H. Newman;KC Dukka B.

  • Author_Institution
    Department of Computational Science and Engineering, North Carolina Agricultural and Technical State University, Greensboro, 27411, United States
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Protein phosphorylation is one of the most widespread regulatory mechanisms in eukaryotes. Over the past decade, phosphorylation site prediction has emerged as an important problem in the field of bioinformatics. Here, we report a new method, termed Random Forest-based Phosphosite predictor 1.0 (RF-Phos 1.0), to predict phosphorylation sites given only the primary amino acid sequence of a protein as input. RF-Phos 1.0, which uses random forest classifiers to integrate various sequence and structural features, is able to identify putative sites of phosphorylation across many protein families. In side-by-side comparisons based on 10-fold cross validation and an independent dataset, RF-Phos 1.0 compares favorably to other existing phosphosite prediction methods, such as PhosphoSVM, GPS2.1 and Musite.
  • Keywords
    "Amino acids","Feature extraction","Entropy","Radio frequency","Protein sequence","Training"
  • Publisher
    ieee
  • Conference_Titel
    Computational Advances in Bio and Medical Sciences (ICCABS), 2015 IEEE 5th International Conference on
  • Type

    conf

  • DOI
    10.1109/ICCABS.2015.7344726
  • Filename
    7344726