Title of article
Data mining the protein data bank: automatic detection and assignment of carbohydrate structures Original Research Article
Author/Authors
Thomas Lütteke، نويسنده , , Martin Frank and Lluis Fontbote ، نويسنده , , Claus-W von der Lieth، نويسنده ,
Issue Information
دوهفته نامه با شماره پیاپی سال 2004
Pages
6
From page
1015
To page
1020
Abstract
Knowledge of the 3D structure of glycans is a prerequisite for a complete understanding of the biological processes glycoproteins are involved in. However, due to a lack of standardised nomenclature, carbohydrate compounds are difficult to locate within the Protein Data Bank (PDB). Using an algorithm that detects carbohydrate structures only requiring element types and atom coordinates, we were able to detect 1663 entries containing a total of 5647 carbohydrate chains. The majority of chains are found to be N-glycosidically bound. Noncovalently bound ligands are also frequent, while O-glycans form a minority. About 30% of all carbohydrate containing PDB entries comprise one or several errors. The automatic assignment of carbohydrate structures in PDB entries will improve the cross-linking of glycobiology resources with genomic and proteomic data collections, which will be an important issue of the upcoming glycomics projects. By aiding in detection of erroneous annotations and structures, the algorithm might also help to increase database quality.
Keywords
Data analysis , 3D structure database , Glycosylation , Bioinformatics , Algorithm
Journal title
Carbohydrate Research
Serial Year
2004
Journal title
Carbohydrate Research
Record number
964052
Link To Document