DocumentCode
3703451
Title
Phosphorylation sites prediction using Random Forest
Author
Hamid D. Ismail;Ahoi Jones;Jung H. Kim;Robert H. Newman;KC Dukka B.
Author_Institution
Department of Computational Science and Engineering, North Carolina Agricultural and Technical State University, Greensboro, 27411, United States
fYear
2015
Firstpage
1
Lastpage
6
Abstract
Protein phosphorylation is one of the most widespread regulatory mechanisms in eukaryotes. Over the past decade, phosphorylation site prediction has emerged as an important problem in the field of bioinformatics. Here, we report a new method, termed Random Forest-based Phosphosite predictor 1.0 (RF-Phos 1.0), to predict phosphorylation sites given only the primary amino acid sequence of a protein as input. RF-Phos 1.0, which uses random forest classifiers to integrate various sequence and structural features, is able to identify putative sites of phosphorylation across many protein families. In side-by-side comparisons based on 10-fold cross validation and an independent dataset, RF-Phos 1.0 compares favorably to other existing phosphosite prediction methods, such as PhosphoSVM, GPS2.1 and Musite.
Keywords
"Amino acids","Feature extraction","Entropy","Radio frequency","Protein sequence","Training"
Publisher
ieee
Conference_Titel
Computational Advances in Bio and Medical Sciences (ICCABS), 2015 IEEE 5th International Conference on
Type
conf
DOI
10.1109/ICCABS.2015.7344726
Filename
7344726
Link To Document