• Title of article

    Data mining the protein data bank: automatic detection and assignment of carbohydrate structures Original Research Article

  • Author/Authors

    Thomas Lütteke، نويسنده , , Martin Frank and Lluis Fontbote ، نويسنده , , Claus-W von der Lieth، نويسنده ,

  • Issue Information
    دوهفته نامه با شماره پیاپی سال 2004
  • Pages
    6
  • From page
    1015
  • To page
    1020
  • Abstract
    Knowledge of the 3D structure of glycans is a prerequisite for a complete understanding of the biological processes glycoproteins are involved in. However, due to a lack of standardised nomenclature, carbohydrate compounds are difficult to locate within the Protein Data Bank (PDB). Using an algorithm that detects carbohydrate structures only requiring element types and atom coordinates, we were able to detect 1663 entries containing a total of 5647 carbohydrate chains. The majority of chains are found to be N-glycosidically bound. Noncovalently bound ligands are also frequent, while O-glycans form a minority. About 30% of all carbohydrate containing PDB entries comprise one or several errors. The automatic assignment of carbohydrate structures in PDB entries will improve the cross-linking of glycobiology resources with genomic and proteomic data collections, which will be an important issue of the upcoming glycomics projects. By aiding in detection of erroneous annotations and structures, the algorithm might also help to increase database quality.
  • Keywords
    Data analysis , 3D structure database , Glycosylation , Bioinformatics , Algorithm
  • Journal title
    Carbohydrate Research
  • Serial Year
    2004
  • Journal title
    Carbohydrate Research
  • Record number

    964052