A complete small molecule dataset from the protein data bank |
| |
Authors: | Feldman Howard J Snyder Kevin A Ticoll Amy Pintilie Greg Hogue Christopher W V |
| |
Institution: | The Blueprint Initiative, Suite 101, 200 Elm Street, Toronto, Ont., Canada M5T 1K4. |
| |
Abstract: | A complete set of 6300 small molecule ligands was extracted from the protein data bank, and deposited online in PubChem as data source 'SMID'. This set's major improvement over prior methods is the inclusion of cyclic polypeptides and branched polysaccharides, including an unambiguous nomenclature, in addition to normal monomeric ligands. Only the best available example of each ligand structure is retained, and an additional dataset is maintained containing co-ordinates for all examples of each structure. Attempts are made to correct ambiguous atomic elements and other common errors, and a perception algorithm was used to determine bond order and aromaticity when no other information was available. |
| |
Keywords: | PDB protein data bank MMDB molecular modelling database SMID small molecule interaction database MWM maximum weighted matching |
本文献已被 ScienceDirect PubMed 等数据库收录! |
|