Data mining the protein data bank: automatic detection and assignment of carbohydrate structures
Authors:Thomas Lütteke, Martin Frank, Claus-W. von der Lieth
Institution:Central Spectroscopic Department, German Cancer Research Center, INF 280, D-69120 Heidelberg, Germany. t.luetteke@dkfz.de
Abstract:Knowledge of the 3D structure of glycans is a prerequisite for a complete understanding of the biological processes in which glycoproteins are involved. However, because no standardised nomenclature is in use, carbohydrate compounds are difficult to locate within the Protein Data Bank (PDB). Using an algorithm that detects carbohydrate structures from element types and atom coordinates alone, we detected 1663 entries containing a total of 5647 carbohydrate chains. The majority of chains are N-glycosidically bound. Noncovalently bound ligands are also frequent, while O-glycans form a minority. About 30% of all carbohydrate-containing PDB entries contain one or more errors. The automatic assignment of carbohydrate structures in PDB entries will improve the cross-linking of glycobiology resources with genomic and proteomic data collections, which will be an important issue for upcoming glycomics projects. By aiding the detection of erroneous annotations and structures, the algorithm might also help to increase database quality.
Keywords:Data analysis; 3D structure database; Glycosylation; Bioinformatics; Algorithm
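
The abstract states only that detection needs element types and atom coordinates; it does not describe the algorithm itself. The following Python sketch is therefore an illustration of one way such a detection could work, not the authors' method. It assumes that a bond can be inferred when two C or O atoms lie within a fixed distance cutoff, and that a candidate sugar ring is a five- or six-membered ring with exactly one oxygen and otherwise only carbons. The cutoff value and the names `BOND_CUTOFF`, `build_bonds`, `find_sugar_rings`, and `toy_ring` are all hypothetical.

```python
import math
from itertools import combinations

# Assumed cutoff in angstroms; C-C and C-O single bonds are roughly 1.4-1.55 A.
BOND_CUTOFF = 1.9


def build_bonds(atoms):
    """Infer bonds between C/O atoms purely from interatomic distance.

    atoms is a list of (element, x, y, z) tuples.
    Returns a dict mapping atom index -> set of bonded atom indices.
    """
    heavy = [i for i, atom in enumerate(atoms) if atom[0] in ("C", "O")]
    neighbors = {i: set() for i in heavy}
    for i, j in combinations(heavy, 2):
        if math.dist(atoms[i][1:], atoms[j][1:]) <= BOND_CUTOFF:
            neighbors[i].add(j)
            neighbors[j].add(i)
    return neighbors


def find_sugar_rings(atoms, sizes=(5, 6)):
    """Return sorted index lists of rings with one O and otherwise only C."""
    neighbors = build_bonds(atoms)
    max_size = max(sizes)
    rings = set()

    def extend(path):
        # Depth-first walk; a ring closes when we step back onto the start atom.
        for nxt in neighbors[path[-1]]:
            if nxt == path[0] and len(path) in sizes:
                rings.add(frozenset(path))
            elif nxt not in path and len(path) < max_size:
                extend(path + [nxt])

    for start in neighbors:
        extend([start])

    result = []
    for ring in rings:
        elements = [atoms[i][0] for i in ring]
        if elements.count("O") == 1 and elements.count("C") == len(ring) - 1:
            result.append(sorted(ring))
    return sorted(result)


def toy_ring(elements, radius=1.5):
    """Hypothetical test helper: place atoms on a flat regular polygon."""
    n = len(elements)
    return [
        (el, radius * math.cos(2 * math.pi * k / n),
         radius * math.sin(2 * math.pi * k / n), 0.0)
        for k, el in enumerate(elements)
    ]


if __name__ == "__main__":
    # Idealised pyranose-like ring (1 O + 5 C) plus one exocyclic oxygen.
    atoms = toy_ring(["O", "C", "C", "C", "C", "C"]) + [("O", -2.9, 0.0, 0.0)]
    print(find_sugar_rings(atoms))
```

A real detector would need more than this: element-specific bond radii, handling of alternate conformations, and checks on the exocyclic substituents to distinguish sugars from other oxygen-containing rings. The sketch only shows that ring candidates can be found without relying on residue names.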
This article is indexed in ScienceDirect, PubMed, and other databases.