Title of article
Using Bayesian multinomial classifier to predict whether a given protein sequence is intrinsically disordered
Author/Authors
Bulashevska، نويسنده , , Alla and Eils، نويسنده , , Roland، نويسنده ,
Issue Information
روزنامه با شماره پیاپی سال 2008
Pages
5
From page
799
To page
803
Abstract
Intrinsically disordered proteins (IDPs) lack a well-defined three-dimensional structure under physiological conditions. Intrinsic disorder is a common phenomenon, particularly in multicellular eukaryotes, and is responsible for important protein functions including regulation and signaling. Many disease-related proteins are likely to be intrinsically disordered or to have disordered regions. In this paper, a new predictor model based on the Bayesian classification methodology is introduced to predict for a given protein or protein region if it is intrinsically disordered or ordered using only its primary sequence. The method allows to incorporate length-dependent amino acid compositional differences of disordered regions by including separate statistical representations for short, middle and long disordered regions. The predictor was trained on the constructed data set of protein regions with known structural properties. In a Jack-knife test, the predictor achieved the sensitivity of 89.2% for disordered and 81.4% for ordered regions. Our method outperformed several reported predictors when evaluated on the previously published data set of Prilusky et al. [2005. FoldIndex: a simple tool to predict whether a given protein sequence is intrinsically unfolded. Bioinformatics 21 (16), 3435–3438]. Further strength of our approach is the ease of implementation.
Keywords
Disorder prediction , Model-based classification , Multinomial model , unfolded proteins
Journal title
Journal of Theoretical Biology
Serial Year
2008
Journal title
Journal of Theoretical Biology
Record number
1539465
Link To Document