DocumentCode :
2889611
Title :
Intrinsic Disorder in Putative Protein Sequences
Author :
Midic, Uros ; Obradovic, Zoran
Author_Institution :
Center for Data Analytics & Biomed. Inf., Temple Univ., Philadelphia, PA, USA
fYear :
2011
fDate :
12-15 Nov. 2011
Firstpage :
43
Lastpage :
48
Abstract :
Intrinsically disordered proteins perform a variety of crucial biological functions despite lacking stable tertiary structure under physiological conditions in vitro. State-of-the-art sequence-based predictors of intrinsic disorder are achieving per-residue accuracies over 80%. In a genome-wide study we observed big difference in predicted disorder content between confirmed and putative human proteins, and suspected that this is due to large errors introduced by gene-finding algorithms for putative sequence annotation. To test this hypothesis we trained a predictor to discriminate sequences of real proteins from synthetic sequences that mimic errors of gene finding algorithms. Its application to putative human protein sequences shows that they contain a substantial fraction of incorrectly assigned regions. These regions are predicted to have higher levels of disorder content than correctly assigned regions. Our finding provides first evidence that current practice of predicting disorder content in putative sequences should be reconsidered, as such estimates are biased.
Keywords :
bioinformatics; genetics; genomics; molecular biophysics; molecular configurations; prediction theory; proteins; biological functions; gene finding algorithms; genome; intrinsic disorder; putative protein sequences; putative sequence annotation; sequence-based predictors; stable tertiary structure; Accuracy; Amino acids; Genomics; Humans; Prediction algorithms; Protein engineering; Proteins; disorder prediction; gene finding; protein intrinsic disorder;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2011 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4577-1799-4
Type :
conf
DOI :
10.1109/BIBM.2011.32
Filename :
6120406
Link To Document :
بازگشت