Statistical analysis of interface similarity in crystals of homologous proteins |
| |
Authors: | Xu Qifang Canutescu Adrian A Wang Guoli Shapovalov Maxim Obradovic Zoran Dunbrack Roland L |
| |
Affiliation: | 1 Institute for Cancer Research, Fox Chase Cancer Center, 333 Cottman Avenue, Philadelphia, PA 19111, USA 2 Center for Information Science and Technology, Temple University, 1805 North Broad Street, Philadelphia, PA 19122, USA |
| |
Abstract: | Many proteins function as homo-oligomers and are regulated via their oligomeric state. For some proteins, the stoichiometry of homo-oligomeric states under various conditions has been studied using gel filtration or analytical ultracentrifugation experiments. The interfaces involved in these assemblies may be identified using cross-linking and mass spectrometry, solution-state NMR, and other experiments. However, for most proteins, the actual interfaces that are involved in oligomerization are inferred from X-ray crystallographic structures using assumptions about interface surface areas and physical properties. Examination of interfaces across different Protein Data Bank (PDB) entries in a protein family reveals several important features. First, similarities in space group, asymmetric unit size, and cell dimensions and angles (within 1%) do not guarantee that two crystals are actually the same crystal form, containing similar relative orientations and interactions within the crystal. Conversely, two crystals in different space groups may be quite similar in terms of all the interfaces within each crystal. Second, NMR structures and an existing benchmark of PDB crystallographic entries consisting of 126 dimers as well as larger structures and 132 monomers were used to determine whether the existence or lack of common interfaces across multiple crystal forms can be used to predict whether a protein is an oligomer or not. Monomeric proteins tend to have common interfaces across only a minority of crystal forms, whereas higher-order structures exhibit common interfaces across a majority of available crystal forms. The data can be used to estimate the probability that an interface is biological if two or more crystal forms are available. Finally, the Protein Interfaces, Surfaces, and Assemblies (PISA) database available from the European Bioinformatics Institute is more consistent in identifying interfaces observed in many crystal forms compared with the PDB and the European Bioinformatics Institute's Protein Quaternary Server (PQS). The PDB, in particular, is missing highly likely biological interfaces in its biological unit files for about 10% of PDB entries. |
| |
Keywords: | PDB, Protein Data Bank SOD, superoxide dismutase PISA, Protein Interfaces, Surfaces, and Assemblies PQS, Protein Quaternary Server PiQSi, Protein Quaternary Structure Investigation SCOP, Structural Classification of Proteins ASA, accessible surface area CFG, crystal form group |
本文献已被 ScienceDirect PubMed 等数据库收录! |
|