Title :
Building a Structured PDB: The RS-PDB Database
Author :
Szabadka, Zoltan ; Grolmusz, Vince
Author_Institution :
Dept. of Comput. Sci., Eotvos Univ., Budapest
fDate :
Aug. 30 2006-Sept. 3 2006
Abstract :
A method for automatically analyzing structures deposited in the Protein Data Bank is presented. The method is capable of detecting missing atoms, bond length deviations, and atom bumps, and of correctly identifying protein-ligand complexes. The results are organized into a database, called the Rich Structure PDB (RS-PDB for short), from which one can easily select PDB entries satisfying diverse sets of requirements. The newer and richer mmCIF format of both the PDB and its chemical component dictionary (formerly the HET Group Dictionary) was used in the construction, and the International Chemical Identifier (InChI) of IUPAC played a central role in correctly identifying distinct ligands.
Keywords :
biochemistry; biology computing; database management systems; molecular biophysics; proteins; IUPAC; International Chemical Identifier; atom bumps; bond length deviations; chemical component dictionary; ligand identification; missing atoms detection; mmCIF format; protein-ligand complexes; structured Protein Data Bank database; Atomic layer deposition; Bonding; Buildings; Chemicals; Cities and towns; Databases; Dictionaries; NIST; Proteins; USA Councils
Conference_Titel :
Engineering in Medicine and Biology Society, 2006. EMBS '06. 28th Annual International Conference of the IEEE
Conference_Location :
New York, NY
Print_ISBN :
1-4244-0032-5
Electronic_ISBN :
1557-170X
DOI :
10.1109/IEMBS.2006.259331