DocumentCode :
1604053
Title :
Update on the Pfam5000 Strategy for Selection of Structural Genomics Targets
Author :
Chandonia, J.-M. ; Brenner, S.E.
Author_Institution :
Div. of Phys. Biosci., Lawrence Berkeley Nat. Lab., CA
fYear :
2005
fDate :
6/27/1905 12:00:00 AM
Firstpage :
751
Lastpage :
755
Abstract :
Structural genomics is an international effort to determine the three-dimensional shapes of all important biological macromolecules, with a primary focus on proteins. Target proteins should be selected according to a strategy that is medically and biologically relevant, of good financial value, and tractable. In 2003, we presented the "Pfam5000" strategy, which involves selecting the 5,000 most important families from the Pfam database as sources for targets. In this update, we show that although both the Pfam database and the number of sequenced genomes have increased in size, the expected benefits of the Pfam5000 strategy have not changed substantially. Solving the structures of proteins from the 5,000 largest Pfam families would allow accurate fold assignment for approximately 65% of all prokaryotic proteins (covering 54% of residues) and 63% of eukaryotic proteins (42% of residues). Fewer than 2,300 of the largest families on this list remain to be solved, making the project feasible in the next five years given the expected throughput to be achieved in the production phase of the Protein Structure Initiative
Keywords :
biology computing; genetics; molecular biophysics; proteins; Pfam database; Pfam5000 strategy; Protein Structure Initiative; biological macromolecules; eukaryotic proteins; prokaryotic proteins; protein structures; sequenced genomes; structural genomics targets; Bioinformatics; Databases; Genomics; Molecular biophysics; Production; Proteins; RNA; Sequences; Shape; Throughput;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Engineering in Medicine and Biology Society, 2005. IEEE-EMBS 2005. 27th Annual International Conference of the
Conference_Location :
Shanghai
Print_ISBN :
0-7803-8741-4
Type :
conf
DOI :
10.1109/IEMBS.2005.1616523
Filename :
1616523
Link To Document :
بازگشت