Similar literature
 Found 20 similar articles (search time: 15 ms)
1.
The ability to determine the structure of a protein in solution is a critical tool for structural biology, as proteins in their native state are found in aqueous environments. Using a physical-chemistry-based prediction protocol, we demonstrate the ability to reproduce protein loop geometries in experimentally derived solution structures. Predictions were run on loops drawn from (1) NMR entries in the Protein Data Bank (PDB), and from (2) the RECOORD database, in which NMR entries from the PDB have been standardized and re-refined in explicit solvent. The predicted structures are validated by comparison with experimental distance restraints, a test of structural quality as defined by the WHAT IF structure validation program, root mean square deviation (RMSD) of the predicted loops to the original structural models, and comparison of the precision of the original and predicted ensembles. Results show that for the RECOORD ensembles, the predicted loops are consistent with an average of 95%, 91%, and 87% of experimental restraints for the short, medium and long loops, respectively. Prediction accuracy is strongly affected by the quality of the original models: in the PDB-derived ensembles, the percentage of experimental restraints violated increases by 2% for the short loops and by 9% for both the medium and long loops. We anticipate the application of our protocol to theoretical modeling of protein structures, such as fold recognition methods, as well as to experimental determination of protein structures, or segments, for which only sparse NMR restraint data is available.
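The restraint-based validation mentioned in this abstract can be illustrated with a minimal sketch. It assumes that each restraint is a simple upper bound on the distance between two named atoms; the function name `percent_satisfied`, the atom labels and the coordinates are all hypothetical and are not taken from the paper.

```python
import math


def percent_satisfied(coords, restraints, tolerance=0.0):
    """Return the percentage of upper-bound distance restraints a model satisfies.

    coords:     dict mapping an atom label to an (x, y, z) tuple in angstroms
    restraints: list of (atom_i, atom_j, upper_bound) tuples (hypothetical format)
    tolerance:  slack added to every upper bound before a violation is counted
    """
    if not restraints:
        raise ValueError("at least one restraint is required")
    satisfied = sum(
        1
        for atom_i, atom_j, upper in restraints
        if math.dist(coords[atom_i], coords[atom_j]) <= upper + tolerance
    )
    return 100.0 * satisfied / len(restraints)


# Toy model with three atoms and three restraints (made-up numbers).
coords = {"A": (0.0, 0.0, 0.0), "B": (3.0, 0.0, 0.0), "C": (0.0, 4.0, 0.0)}
restraints = [("A", "B", 3.5), ("A", "C", 3.5), ("B", "C", 5.5)]
print(f"{percent_satisfied(coords, restraints):.1f}% of restraints satisfied")
```

Real NMR restraint sets also carry lower bounds and ambiguous assignments, which this sketch deliberately ignores.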

2.
One of the major goals of structural genomics projects is to determine the three-dimensional structure of representative members of as many different fold families as possible. Comparative modeling is expected to fill the remaining gaps by providing structural models of homologs of the experimentally determined proteins. However, for such an approach to be successful it is essential that the quality of the experimentally determined structures is adequate. In an attempt to build a homology model for the protein dynein light chain 2A (DLC2A) we found two potential templates, both experimentally determined nuclear magnetic resonance (NMR) structures originating from structural genomics efforts. Despite their high sequence identity (96%), the folds of the two structures are markedly different. This prompted us to perform in-depth analyses of both structure ensembles and the deposited experimental data, the results of which clearly identify one of the two models as largely incorrect. Next, we analyzed the quality of a large set of recent NMR-derived structure ensembles originating from both structural genomics projects and individual structure determination groups. Unfortunately, a visual inspection of structures exhibiting lower quality scores than DLC2A reveals that the seriously flawed DLC2A structure is not an isolated incident. Overall, our results illustrate that the quality of NMR structures cannot be reliably evaluated using only traditional experimental input data and overall quality indicators as a reference, and clearly demonstrate the urgent need for a tight integration of more sophisticated structure validation tools in NMR structure determination projects. In contrast to common methodologies where structures are typically evaluated as a whole, such tools should preferentially operate on a per-residue basis.

3.
The precision of NMR structure ensembles revisited   Total citations: 4 (self-citations: 4, citations by others: 0)

4.
Membrane proteins are challenging to study, and restraints for structure determination are typically sparse or of low resolution because the membrane environment that surrounds them leads to a variety of experimental challenges. When membrane protein structures are determined by different techniques in different environments, a natural question is “which structure is most biologically relevant?” Towards answering this question, we compiled a dataset of membrane proteins with known structures determined by both solution NMR and X-ray crystallography. By investigating differences between the structures, we found that RMSDs between crystal and NMR structures are below 5 Å in the membrane region; that NMR ensembles have a higher convergence in the membrane region; and that crystal structures typically have a straighter transmembrane region, have higher stereochemical correctness, and are more tightly packed. After quantifying these differences, we used high-resolution refinement of the NMR structures to mitigate them, which paves the way for identifying and improving the structural quality of membrane proteins.

5.
Bolstad ES  Anderson AC 《Proteins》2008,73(3):566-580
Accurate ranking during in silico lead optimization is critical to drive the generation of new ligands with higher affinity, yet it is especially difficult because of the subtle changes between analogs. In order to assess the role of the structure of the receptor in delivering accurate lead ranking results, we docked a set of forty related inhibitors to structures of one species of dihydrofolate reductase (DHFR) derived from crystallographic data, NMR solution data, and homology models. In this study, the crystal structures yielded the superior results: the compounds were placed in the active site in the conserved orientation and the docking scores for 80% of the compounds clustered into the same bins as the measured affinity. Single receptor structures derived from NMR data or homology models did not serve as accurate docking receptors. To our knowledge, these are the first experiments that assess ranking of homologous lead compounds using a variety of receptor structures. We then extended the study to investigate whether ensembles, either computationally or experimentally derived, of all of the single starting structures aid, hinder or have no effect on the performance of the starting template. Impressively, when ensembles of receptor structures derived from NMR data or homology models were employed, docking accuracy improved to a level equal to that of the high-resolution crystal structures. The same experiments using a second species of DHFR and set of ligands confirm the results. A comparison of the structures of the individual ensemble members to the starting structures shows that the effect of the ensembles can be ascribed to protein flexibility in addition to absorption of computational error.

6.
Because of their large conformational heterogeneity, structural characterization of intrinsically disordered proteins (IDPs) is very challenging using classical experimental methods alone. In this study, we use NMR and small-angle x-ray scattering (SAXS) data with multiple molecular dynamics (MD) simulations to describe the conformational ensemble of the fully disordered verprolin homology domain of the neural Wiskott-Aldrich syndrome protein, which is involved in the regulation of actin polymerization. First, we studied several software packages for back-calculating SAXS scattering intensity and optimized the adjustable parameters to accurately calculate the SAXS intensity from an atomic structure. We also identified the most appropriate force fields for MD simulations of this IDP. Then, we analyzed four conformational ensembles of the neural Wiskott-Aldrich syndrome protein verprolin homology domain, two generated with the program flexible-meccano with or without NMR-derived information as input and two others generated by MD simulations with two different force fields. These four conformational ensembles were compared to available NMR and SAXS data for validation. We found that MD simulations with the AMBER-03w force field and the TIP4P/2005s water model are able to correctly describe the conformational ensemble of this 67-residue IDP at both local and global level.

7.
A method is introduced to represent an ensemble of conformers of a protein by a single structure in torsion angle space that lies closest to the averaged Cartesian coordinates while maintaining perfect covalent geometry and on average equal steric quality and an equally good fit to the experimental (e.g. NMR) data as the individual conformers of the ensemble. The single representative ‘regmean structure’ is obtained by simulated annealing in torsion angle space with the program CYANA using as input data the experimental restraints, restraints for the atom positions relative to the average Cartesian coordinates, and restraints for the torsion angles relative to the corresponding principal cluster average values of the ensemble. The method was applied to 11 proteins for which NMR structure ensembles are available, and compared to alternative, commonly used simple approaches for selecting a single representative structure, e.g. the structure from the ensemble that best fulfills the experimental and steric restraints, or the structure from the ensemble that has the lowest RMSD value to the average Cartesian coordinates. In all cases our method found a structure in torsion angle space that is significantly closer to the mean coordinates than the alternatives while maintaining the same quality as individual conformers. The method is thus suitable to generate representative single structure representations of protein structure ensembles in torsion angle space. 
Since in the case of NMR structure calculations with CYANA the single structure is calculated in the same way as the individual conformers except that weak positional and torsion angle restraints are added, we propose to represent new NMR structures by a ‘regmean bundle’ consisting of the single representative structure as the first conformer and all but one original individual conformers (the original conformer with the highest target function value is discarded in order to keep the number of conformers in the bundle constant). In this way, analyses that require a single structure can be carried out in the most meaningful way using the first model, while at the same time the additional information contained in the ensemble remains available.
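For orientation, one of the simple baseline approaches mentioned in this abstract, picking the ensemble member with the lowest RMSD to the average Cartesian coordinates, might be sketched as follows. This is not the regmean method itself. The sketch assumes the conformers are already superimposed and share the same atom order; the function names and the toy coordinates are hypothetical.

```python
import math


def mean_coordinates(ensemble):
    """Average each atom position over all conformers (assumed pre-superimposed)."""
    n_models = len(ensemble)
    n_atoms = len(ensemble[0])
    return [
        tuple(sum(model[a][k] for model in ensemble) / n_models for k in range(3))
        for a in range(n_atoms)
    ]


def rmsd(model_a, model_b):
    """Root mean square deviation between two equally ordered coordinate lists."""
    squared = sum(math.dist(p, q) ** 2 for p, q in zip(model_a, model_b))
    return math.sqrt(squared / len(model_a))


def closest_to_mean(ensemble):
    """Return (index, rmsd) of the conformer nearest to the mean coordinates."""
    mean = mean_coordinates(ensemble)
    deviations = [rmsd(model, mean) for model in ensemble]
    best = min(range(len(ensemble)), key=deviations.__getitem__)
    return best, deviations[best]


# Toy ensemble: three conformers of a two-atom fragment (made-up numbers).
ensemble = [
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    [(0.0, 0.3, 0.0), (1.0, 0.3, 0.0)],
    [(0.0, 0.9, 0.0), (1.0, 0.9, 0.0)],
]
index, deviation = closest_to_mean(ensemble)
print(f"conformer {index} lies closest to the mean (RMSD {deviation:.2f})")
```

The mean coordinates themselves generally do not have valid covalent geometry, which is the limitation that motivates computing a dedicated representative structure instead.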

8.
NMR chemical shifts provide important local structural information for proteins. Consistent structure generation from NMR chemical shift data has recently become feasible for proteins with sizes of up to 130 residues, and such structures are of a quality comparable to those obtained with the standard NMR protocol. This study investigates the influence of the completeness of chemical shift assignments on structures generated from chemical shifts. The Chemical-Shift-Rosetta (CS-Rosetta) protocol was used for de novo protein structure generation with various degrees of completeness of the chemical shift assignment, simulated by omission of entries in the experimental chemical shift data previously used for the initial demonstration of the CS-Rosetta approach. In addition, a new CS-Rosetta protocol is described that improves robustness of the method for proteins with missing or erroneous NMR chemical shift input data. This strategy, which uses traditional Rosetta for pre-filtering of the fragment selection process, is demonstrated for two paramagnetic proteins and also for two proteins with solid-state NMR chemical shift assignments.

9.
Biology is advanced by producing structural models of biological systems, such as protein complexes. Some systems are recalcitrant to traditional structure determination methods. In such cases, it may still be possible to produce useful models by integrative structure determination that depends on simultaneous use of multiple types of data. An ensemble of models that are sufficiently consistent with the data is produced by a structural sampling method guided by a data-dependent scoring function. The variation in the ensemble of models quantifies the uncertainty of the structure, generally resulting from the uncertainty in the input information and actual structural heterogeneity in the samples used to produce the data. Here, we describe how to generate, assess, and interpret ensembles of integrative structural models using our open source Integrative Modeling Platform program ( https://integrativemodeling.org ).

10.
Conformational changes in proteins are extremely important for their biochemical functions. The correlation between inherent conformational variations in a protein and conformational differences in its homologues of known structure is still unclear. In this study, we have used a structural alphabet called Protein Blocks (PBs). PBs abstract protein 3-D structures into 1-D strings over a 16-letter alphabet (a-p) based on the dihedral angles of overlapping pentapeptides. We have analyzed the variations in local conformations in terms of PBs represented in the ensembles of 801 protein structures determined using NMR spectroscopy. In the analysis of concatenated data over all the residues in all the NMR ensembles, we observe that the overall nature of inherent local structural variations in NMR ensembles is similar to the nature of local structural differences in homologous proteins, with a high correlation coefficient of 0.94. High correlation at the alignment positions corresponding to helical and β-sheet regions is to be expected. However, the correlation coefficient obtained by considering only the loop regions is also quite high (0.91). Surprisingly, segregated position-wise analysis shows that this high correlation does not hold for loop regions at the structurally equivalent positions in NMR ensembles and their homologues of known structure. This suggests that the general nature of local structural changes is unique; however, most of the local structural variations in loop regions of NMR ensembles do not correlate with their local structural differences at structurally equivalent positions in homologues.

11.
The experimental determination of scalar three-bond coupling constants represents a powerful method to probe both the structure and dynamics of proteins. The detailed structural interpretation of such coupling constants is usually based on Karplus relationships, which allow the measured couplings to be related to the torsion angles of the molecules. As the measured couplings are sensitive to thermal fluctuations, the parameters in the Karplus relationships are better derived from ensembles representing the distributions of dihedral angles present in solution, rather than from single conformations. We present a method to derive such parameters that uses ensembles of conformations determined through dynamic-ensemble refinement – a method that provides structural ensembles that simultaneously represent both the structure and the associated dynamics of a protein.
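To make the point about thermal averaging concrete, here is a small sketch of a Karplus relationship of the usual form J(θ) = A cos²θ + B cos θ + C, evaluated once for a single conformation and once as an average over an ensemble of torsion angles. The default coefficients and the phase offset are placeholder values of the kind often quoted for the HN-Hα coupling; they are assumptions for illustration and are not the parameters derived in the paper.

```python
import math


def karplus(phi_deg, a=6.51, b=-1.76, c=1.60, offset_deg=-60.0):
    """Three-bond coupling (Hz) from a torsion angle via J = A cos^2 + B cos + C.

    The coefficients a, b, c and the phase offset are illustrative assumptions.
    """
    theta = math.radians(phi_deg + offset_deg)
    return a * math.cos(theta) ** 2 + b * math.cos(theta) + c


def ensemble_average_coupling(phis_deg, **params):
    """Average the coupling over an ensemble of torsion angles (in degrees)."""
    return sum(karplus(phi, **params) for phi in phis_deg) / len(phis_deg)


# Two equally populated conformers versus a single structure at the mean angle.
phis = [-60.0, -120.0]
averaged = ensemble_average_coupling(phis)
single = karplus(sum(phis) / len(phis))
print(f"ensemble-averaged J = {averaged:.2f} Hz, J at mean angle = {single:.2f} Hz")
```

Because the relationship is nonlinear in the angle, the averaged coupling differs from the coupling at the averaged angle, which is why fitting the parameters against an ensemble can give different values than fitting against a single conformation.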

12.
The roles of unfolded states of proteins in normal folding and in diseases involving aggregation, as well as the prevalence and regulatory functions of intrinsically disordered proteins, have become increasingly recognized. The structural representation of these disordered states as ensembles of interconverting conformers can therefore provide critical insights. Experimental methods can be used to probe ensemble-averaged structural properties of disordered states, and computational approaches generate representative ensembles of conformers using experimental restraints. In particular, NMR and small-angle X-ray scattering provide quantitative data that can readily be incorporated into calculations. These techniques have yielded structural information about denatured, unfolded and intrinsically disordered proteins. The use of experimental data in different computational approaches, including ensemble molecular dynamics simulations and algorithms that assign populations to pregenerated conformers, has highlighted the presence of both local and long-range structure, and the occurrence of native-like and non-native interactions in unfolded and denatured states. Analysis of the resulting ensembles has suggested important implications of this fluctuating structure for folding, aggregation and binding.

13.
14.
Determination of the accurate three-dimensional structure of large proteins by NMR remains challenging due to a loss in the density of experimental restraints resulting from the often prerequisite perdeuteration. Solution small-angle scattering, which carries long-range translational information, presents an opportunity to enhance the structural accuracy of derived models when used in combination with global orientational NMR restraints such as residual dipolar couplings (RDCs) and residual chemical shift anisotropies (RCSAs). We have quantified the improvements in accuracy that can be obtained using this strategy for the 82 kDa enzyme Malate Synthase G (MSG), currently the largest single chain protein solved by solution NMR. Joint refinement against NMR and scattering data leads to an improvement in structural accuracy as evidenced by a decrease from approximately 4.5 to approximately 3.3 Å of the backbone rmsd between the derived model and the high-resolution X-ray structure, PDB code 1D8C. This improvement results primarily from medium-angle scattering data, which encode the overall molecular shape, rather than the lowest angle data that principally determine the radius of gyration and the maximum particle dimension. The effect of the higher angle data, which are dominated by internal density fluctuations, while beneficial, is also found to be relatively small. Our results demonstrate that joint NMR/SAXS refinement can yield significantly improved accuracy in solution structure determination and will be especially well suited for the study of systems with limited NMR restraints such as large proteins, oligonucleotides, or their complexes.
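The two global quantities that this abstract associates with the lowest-angle scattering data, the radius of gyration and the maximum particle dimension, can be computed directly from a set of atomic coordinates. The sketch below is a geometric illustration only: it treats atoms as points with optional weights, ignores solvent and scattering-contrast effects, and uses hypothetical function names and toy coordinates.

```python
import math
from itertools import combinations


def radius_of_gyration(coords, weights=None):
    """Weighted root mean square distance of the points from their centroid."""
    if weights is None:
        weights = [1.0] * len(coords)
    total = sum(weights)
    center = tuple(
        sum(w * p[k] for w, p in zip(weights, coords)) / total for k in range(3)
    )
    squared = sum(w * math.dist(p, center) ** 2 for w, p in zip(weights, coords))
    return math.sqrt(squared / total)


def max_dimension(coords):
    """Largest pairwise distance; O(n^2), fine for small illustrative inputs."""
    return max(math.dist(p, q) for p, q in combinations(coords, 2))


# Toy point set: four points on a unit circle in the xy-plane.
points = [(-1.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, -1.0, 0.0), (0.0, 1.0, 0.0)]
print(f"Rg = {radius_of_gyration(points):.2f}, Dmax = {max_dimension(points):.2f}")
```

Two quite different shapes can share the same pair of values, which is consistent with the abstract's observation that the shape information at medium angles contributes most of the improvement.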

15.
16.
Laughton CA  Orozco M  Vranken W 《Proteins》2009,75(1):206-216
NMR structures are typically deposited in databases such as the PDB in the form of an ensemble of structures. Generally, each of the models in such an ensemble satisfies the experimental data and is equally valid. No unique solution can be calculated because the experimental NMR data is insufficient, in part because it reflects the conformational variability and dynamical behavior of the molecule in solution. Even for relatively rigid molecules, the limited number of structures that are typically deposited cannot completely encompass the structural diversity allowed by the observed NMR data, but they can be chosen to try to maximize its representation. We describe here the adaptation and application of techniques more commonly used to examine large ensembles from molecular dynamics simulations to the analysis of NMR ensembles. We call the approach, which is based on principal component analysis, COCO ("Complementary Coordinates"). The COCO approach analyses the distribution of an NMR ensemble in conformational space, and generates a new ensemble that fills "gaps" in the distribution. The method is very rapid, and analysis of a 25-member ensemble and generation of a new 25-member ensemble typically takes 1-2 min on a conventional workstation. Applied to the 545 structures in the RECOORD database, we find that COCO generates new ensembles that are as structurally diverse, both from each other and from the original ensemble, as are the structures within the original ensemble. The COCO approach does not explicitly take into account the NMR restraint data, yet in tests on selected structures from the RECOORD database, the COCO ensembles are frequently good matches to this data, and certainly are structures that can be rapidly refined against the restraints to yield high-quality, novel solutions.
COCO should therefore be a useful aid in NMR structure refinement and in other situations where a richer representation of conformational variability is desired, for example in docking studies. COCO is freely accessible via the website www.ccpb.ac.uk/COCO.

17.
Various experimental studies of hen egg white lysozyme (HEWL) in water and TFE/water clearly indicate structural differences between the native state and TFE state of HEWL, e.g. the helical content of the protein in the TFE state is much higher than in the native state. However, the available detailed NMR studies were not sufficient to fully determine a structure of HEWL in the TFE state. Different molecular dynamics (MD) simulations, i.e. at room temperature, at increased temperature and using proton–proton distance restraints derived from NMR NOE data, have been used to generate configurational ensembles corresponding to the TFE state of HEWL. The configurational ensemble obtained at room temperature using atom-atom distance restraints measured for HEWL in TFE/water solution satisfies the experimental data and has the lowest protein energy. In this ensemble, residues 50–58, which are part of the β-sheet in native HEWL, adopt fluctuating α-helical secondary structure.

18.
Many methods of protein structure generation such as NMR-based solution structure determination and template-based modeling do not produce a single model, but an ensemble of models consistent with the available information. Current strategies for comparing ensembles lose information because they use only a single representative structure. Here, we describe the ENSEMBLATOR and its novel strategy to directly compare two ensembles containing the same atoms to identify significant global and local backbone differences between them on per-atom and per-residue levels, respectively. The ENSEMBLATOR has four components: eePREP (ee for ensemble-ensemble), which selects atoms common to all models; eeCORE, which identifies atoms belonging to a cutoff-distance dependent common core; eeGLOBAL, which globally superimposes all models using the defined core atoms and calculates for each atom the two intraensemble variations, the interensemble variation, and the closest approach of members of the two ensembles; and eeLOCAL, which performs a local overlay of each dipeptide and, using a novel measure of local backbone similarity, reports the same four variations as eeGLOBAL. The combination of eeGLOBAL and eeLOCAL analyses identifies the most significant differences between ensembles. We illustrate the ENSEMBLATOR's capabilities by showing how using it to analyze NMR ensembles and to compare NMR ensembles with crystal structures provides novel insights compared to published studies. One of these studies leads us to suggest that a “consistency check” of NMR-derived ensembles may be a useful analysis step for NMR-based structure determinations in general. The ENSEMBLATOR 1.0 is available as a first generation tool to carry out ensemble-ensemble comparisons.

19.
Structural genomics projects are providing large quantities of new 3D structural data for proteins. To monitor the quality of these data, we have developed the protein structure validation software suite (PSVS), for assessment of protein structures generated by NMR or X-ray crystallographic methods. PSVS is broadly applicable for structure quality assessment in structural biology projects. The software integrates under a single interface analyses from several widely-used structure quality evaluation tools, including PROCHECK (Laskowski et al., J Appl Crystallog 1993;26:283-291), MolProbity (Lovell et al., Proteins 2003;50:437-450), Verify3D (Luthy et al., Nature 1992;356:83-85), ProsaII (Sippl, Proteins 1993;17: 355-362), the PDB validation software, and various structure-validation tools developed in our own laboratory. PSVS provides standard constraint analyses, statistics on goodness-of-fit between structures and experimental data, and knowledge-based structure quality scores in standardized format suitable for database integration. The analysis provides both global and site-specific measures of protein structure quality. Global quality measures are reported as Z scores, based on calibration with a set of high-resolution X-ray crystal structures. PSVS is particularly useful in assessing protein structures determined by NMR methods, but is also valuable for assessing X-ray crystal structures or homology models. Using these tools, we assessed protein structures generated by the Northeast Structural Genomics Consortium and other international structural genomics projects, over a 5-year period. Protein structures produced from structural genomics projects exhibit quality score distributions similar to those of structures produced in traditional structural biology projects during the same time period. 
However, while some NMR structures have structure quality scores similar to those seen in higher-resolution X-ray crystal structures, the majority of NMR structures have lower scores. Potential reasons for this "structure quality score gap" between NMR and X-ray crystal structures are discussed.
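The idea of reporting a global quality measure as a Z score calibrated against a reference set can be sketched in a few lines. This is a generic illustration, not the PSVS implementation: the function name `z_score`, the choice of the sample standard deviation and the reference values are all assumptions.

```python
import statistics


def z_score(raw_score, reference_scores):
    """Express a raw quality score in standard deviations from a reference mean.

    reference_scores stands in for raw scores of a calibration set, for example
    high-resolution crystal structures; the values used below are made up.
    """
    mean = statistics.mean(reference_scores)
    spread = statistics.stdev(reference_scores)  # sample standard deviation (assumed)
    return (raw_score - mean) / spread


# Hypothetical calibration set and one structure under evaluation.
reference = [0.8, 0.9, 1.0, 1.1, 1.2]
print(f"Z = {z_score(0.7, reference):.2f}")
```

Whether a negative Z score means better or worse depends on the orientation of the underlying raw score, so the sign should be interpreted per tool.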

20.
The three-dimensional solution structure of apo rabbit lung calcyclin has been refined to high resolution through the use of heteronuclear NMR spectroscopy and 13C,15N-enriched protein. Upon completing the assignment of virtually all of the 15N, 13C and 1H NMR resonances, the solution structure was determined from a combination of 2814 NOE-derived distance constraints, and 272 torsion angle constraints derived from scalar couplings. A large number of critical inter-subunit NOEs (386) were identified from 13C-select, 13C-filtered NOESY experiments, providing a highly accurate dimer interface. The combination of distance geometry and restrained molecular dynamics calculations yielded structures with excellent agreement with the experimental data and high precision (rmsd from the mean for the backbone atoms in the eight helices: 0.33 Å). Calcyclin exhibits a symmetric dimeric fold of two identical 90 amino acid subunits, characteristic of the S100 subfamily of EF-hand Ca2+-binding proteins. The structure reveals a readily identified pair of putative sites for binding of Zn2+. In order to accurately determine the structural features that differentiate the various S100 proteins, distance difference matrices and contact maps were calculated for the NMR structural ensembles of apo calcyclin and rat and bovine S100B. These data show that the most significant variations among the structures are in the positioning of helix III and in loops, the regions with least sequence similarity. Inter-helical angles and distance differences for the proteins show that the positioning of helix III of calcyclin is most similar to that of bovine S100B, but that the helix interfaces are more closely packed in calcyclin than in either S100B structure. Surprisingly large differences were found in the positioning of helix III in the two S100B structures, despite there being only four non-identical residues, suggesting that one or both of the S100B structures require further refinement.
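A distance difference matrix of the kind used in this abstract can be sketched as follows. The example assumes two structures with the same number of equivalent atoms (for instance aligned Cα positions); the function names and toy coordinates are hypothetical.

```python
import math


def distance_matrix(coords):
    """All pairwise distances within one structure."""
    return [[math.dist(p, q) for q in coords] for p in coords]


def distance_difference_matrix(coords_a, coords_b):
    """Element-wise difference of the two intra-structure distance matrices.

    Positive entries mark atom pairs that are farther apart in structure A.
    """
    if len(coords_a) != len(coords_b):
        raise ValueError("structures must contain the same number of atoms")
    dist_a = distance_matrix(coords_a)
    dist_b = distance_matrix(coords_b)
    n = len(coords_a)
    return [[dist_a[i][j] - dist_b[i][j] for j in range(n)] for i in range(n)]


# Toy example: a bent three-atom chain versus a straight one (made-up numbers).
bent = [(0.0, 0.0, 0.0), (3.0, 0.0, 0.0), (3.0, 4.0, 0.0)]
straight = [(0.0, 0.0, 0.0), (3.0, 0.0, 0.0), (6.0, 0.0, 0.0)]
for row in distance_difference_matrix(bent, straight):
    print(["{:+.1f}".format(value) for value in row])
```

Because only internal distances enter the calculation, no superposition of the two structures is needed, which makes the comparison independent of how the structures are aligned in space.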
