共查询到20条相似文献,搜索用时 0 毫秒
1.
Marian Novotny Marvin Seibert Gerard J. Kleywegt 《Acta Crystallographica. Section D, Structural Biology》2007,63(2):270-274
The fact that protein structures are dynamic by nature and that structure models determined by X‐ray crystallography, electron microscopy (EM) and nuclear magnetic resonance (NMR) spectroscopy have limited accuracy limits the precision with which derived properties can be reported. Here, the issue of the precision of calculated solvent‐accessible surface areas (ASAs) is addressed. A number of protein structures of different sizes were selected and the effect of random shifts applied to the atomic coordinates on ASA values was investigated. Standard deviations of the ASA calculations were found to range from ∼10 to ∼80 Å2. Similar values are obtained for a handful of cases in the Protein Data Bank (PDB) where `ensembles' of crystal structures were refined against the same data set. The ASA values for 69 hen egg‐white lysozyme structures were calculated and a standard deviation of the ASA of 81 Å2 was obtained (the average ASA value was 6571 Å2). The calculated ASA values do not show any correlation with factors such as resolution or overall temperature factors. A molecular‐dynamics (MD) trajectory of lysozyme was also analysed. The ASA values during the simulation covered a range of more than 800 Å2. If different programs are used, the ASA values obtained for one small protein show a spread of almost 600 Å2. These results suggest that in most cases reporting ASA values with a precision better than 10 Å2 is probably not realistic and a precision of 50–100 Å2 would seem prudent. The precision of buried surface‐area calculations for complexes is also discussed. 相似文献
2.
An integrated system of neural networks, called SPINE, is established and optimized for predicting structural properties of proteins. SPINE is applied to three-state secondary-structure and residue-solvent-accessibility (RSA) prediction in this paper. The integrated neural networks are carefully trained with a large dataset of 2640 chains, sequence profiles generated from multiple sequence alignment, representative amino acid properties, a slow learning rate, overfitting protection, and an optimized sliding-widow size. More than 200,000 weights in SPINE are optimized by maximizing the accuracy measured by Q(3) (the percentage of correctly classified residues). SPINE yields a 10-fold cross-validated accuracy of 79.5% (80.0% for chains of length between 50 and 300) in secondary-structure prediction after one-month (CPU time) training on 22 processors. An accuracy of 87.5% is achieved for exposed residues (RSA >95%). The latter approaches the theoretical upper limit of 88-90% accuracy in assigning secondary structures. An accuracy of 73% for three-state solvent-accessibility prediction (25%/75% cutoff) and 79.3% for two-state prediction (25% cutoff) is also obtained. 相似文献
3.
4.
Mihaela Drenscko 《Molecular simulation》2017,43(3):234-241
We characterise the hydrophobic collapse of single polystyrene chains in water using molecular dynamics simulations. Specifically, we calculate the potential of mean force for the collapse of a single polystyrene chain in water using metadynamics, comparing the results between all atomistic with coarse-grained (CG) molecular simulation. We next explore the scaling behaviour of the collapsed globular shape at the minimum energy configuration, characterised by the radius of gyration, as a function of chain length. The exponent is close to one third, consistent with that predicted for a polymer chain in bad solvent. We also explore the scaling behaviour of the solvent accessible surface area (SASA) as a function of chain length, finding a similar exponent for both all atomistic and CG simulations. Furthermore, calculation of the local water density as a function of chain length near the minimum energy configuration suggests that intermediate chain lengths are more likely to form dewetted states, as compared to shorter or longer chain lengths. 相似文献
5.
Structure–dynamics interrelationships are important in understanding protein function. We have explored the empirical relationship between rotational correlation times (c and the solvent accessible surface areas (SASA) of 75 proteins with known structures. The theoretical correlation between SASA and c through the equation SASA = Krc
(2/3) is also considered. SASA was determined from the structure, c
calc was determined from diffusion tensor calculations, and c
expt was determined from NMR backbone13 C or 15N relaxation rate measurements. The theoretical and experimental values of c correlate with SASA with regression analyses values of Kr as 1696 and 1896 m2s-(2/3), respectively, and with corresponding correlation coefficients of 0.92 and 0.70. 相似文献
6.
Isolated β-hairpins in water have a temperature dependence of their conformational stability qualitatively resembling that of globular proteins, showing both cold and hot unfolding transitions. It is shown that a molecular-level rationalization of this cold unfolding can be provided extending the approach devised for globular proteins (Graziano G. Phys Chem Chem Phys 2010; 12:14245-14252). The decrease in the solvent-excluded volume upon folding, measured by the decrease in the solvent accessible surface area, produces a gain in configurational/translational entropy of water molecules that is the main stabilizing contribution of the folded conformation. This always stabilizing Gibbs energy contribution has a parabolic-like temperature dependence in water and is exactly counterbalanced at two temperatures (i.e., the cold and hot unfolding temperatures) by the always destabilizing Gibbs energy contribution due to the loss in conformational degrees of freedom of the peptide chain. 相似文献
7.
Characterizing the interactions between amino acid residues and lipid molecules is important for understanding the assembly of transmembrane helices and for studying membrane protein folding. In this study we develop TMLIP (TransMembrane helix-LIPid), an empirically derived propensity of individual residue types to face lipid membrane based on statistical analysis of high-resolution structures of membrane proteins. Lipid accessibilities of amino acid residues within the transmembrane (TM) region of 29 structures of helical membrane proteins are studied with a spherical probe of radius of 1.9 A. Our results show that there are characteristic preferences for residues to face the headgroup region and the hydrocarbon core region of lipid membrane. Amino acid residues Lys, Arg, Trp, Phe, and Leu are often found exposed at the headgroup regions of the membrane, where they have high propensity to face phospholipid headgroups and glycerol backbones. In the hydrocarbon core region, the strongest preference for interacting with lipids is observed for Ile, Leu, Phe and Val. Small and polar amino acid residues are usually buried inside helical bundles and are strongly lipophobic. There is a strong correlation between various hydrophobicity scales and the propensity of a given residue to face the lipids in the hydrocarbon region of the bilayer. Our data suggest a possibly significant contribution of the lipophobic effect to the folding of membrane proteins. This study shows that membrane proteins have exceedingly apolar exteriors rather than highly polar interiors. Prediction of lipid-facing surfaces of boundary helices using TMLIP1 results in a 54% accuracy, which is significantly better than random (25% accuracy). We also compare performance of TMLIP with another lipid propensity scale, kPROT, and with several hydrophobicity scales using hydrophobic moment analysis. 相似文献
8.
A solvation term based on the solvent accessible surface area (SASA) is combined with the CHARMM polar hydrogen force field for the efficient simulation of peptides and small proteins in aqueous solution. Only two atomic solvation parameters are used: one is negative for favoring the direct solvation of polar groups and the other positive for taking into account the hydrophobic effect on apolar groups. To approximate the water screening effects on the intrasolute electrostatic interactions, a distance-dependent dielectric function is used and ionic side chains are neutralized. The use of an analytical approximation of the SASA renders the model extremely efficient (i.e., only about 50% slower than in vacuo simulations). The limitations and range of applicability of the SASA model are assessed by simulations of proteins and structured peptides. For the latter, the present study and results reported elsewhere show that with the SASA model it is possible to sample a significant amount of folding/unfolding transitions, which permit the study of the thermodynamics and kinetics of folding at an atomic level of detail. 相似文献
9.
Chaotropic agents are cosolutes that can disrupt the hydrogen bonding network between water molecules and reduce the stability of the native state of proteins by weakening the hydrophobic effect. In this work, we represent the chaotropic agent as a factor that reduces the amount of order in the structures formed by water molecules, both in the bulk and the hydration shells around hydrophobic amino acids. In this framework we show that low chaotrope concentrations lead to a destabilization of the native state of proteins, and that high concentrations induce complete denaturation. We also find that the reduction of the number of bulk ordered states of water molecules can give origin to an effective interaction between chaotropic molecules and proteins. 相似文献
10.
Emmanuel D. Levy 《Journal of molecular biology》2010,403(4):660-670
Analysis of proteins commonly requires the partition of their structure into regions such as the surface, interior, or interface. Despite the frequent use of such categorization, no consensus definition seems to exist. This study thus aims at providing a definition that is general, is simple to implement, and yields new biological insights. This analysis relies on 397, 196, and 701 protein structures from Escherichia coli, Saccharomyces cerevisiae, and Homo sapiens, respectively, and the conclusions are consistent across all three species. A threshold of 25% relative accessible surface area best segregates amino acids at the interior and at the surface. This value is further used to extend the core-rim model of protein-protein interfaces and to introduce a third region called support. Interface core, rim, and support regions contain similar numbers of residues on average, but core residues contribute over two-thirds of the contact surface. The amino acid composition of each region remains similar across different organisms and interface types. The interface core composition is intermediate between the surface and the interior, but the compositions of the support and the rim are virtually identical with those of the interior and the surface, respectively. The support and rim could thus “preexist” in proteins, and evolving a new interaction could require mutations to form an interface core only. Using the interface regions defined, it is shown through simulations that only two substitutions are necessary to shift the average composition of a 1000-Å2 surface patch involving ∼ 28 residues to that of an equivalent interface. This analysis and conclusions will help understand the notion of promiscuity in protein-protein interaction networks. 相似文献
11.
The mechanisms of deposition and dissociation are implicated in the assembly of amyloid fibrils. To investigate the kinetics of unbinding of Abeta(16-22) monomers from preformed fibrils, we use molecular dynamics (MD) simulations and the structures for Abeta(16-22) amyloid fibrils. Consistent with experimental studies, the dissociation of Abeta(16-22) peptides involves two main stages, locked and docked, after which peptides unbind. The lifetime of the locked state, in which a peptide retains fibril-like structure and interactions, extends up to 0.5 micros under normal physiological conditions. Upon cooperative rupture of all fibril-like hydrogen bonds (HBs) with the fibril, a peptide enters a docked state. This state is populated by disordered random coil conformations and its lifetime ranges from approximately 10 to 200 ns. The docked state is stabilized by hydrophobic side chain interactions, while the contribution from HBs is small. Our simulations also suggest that the peptides located on fibril edges may form stable beta-strand conformations distinct from the fibril "bulk". We propose that such edge peptides can act as fibril caps, which impede fibril elongation. Our results indicate that the interactions between unbinding peptides constitute the molecular basis for cooperativity of peptide dissociation. The kinetics of fibril growth is reconstructed from unbinding assuming the reversibility of deposition/dissociation pathways. The relation of in silica dissociation kinetics to experimental observations is discussed. 相似文献
12.
13.
In order to rationalize the physicochemical properties of human serum-transferrin (STf) and the STf-receptor (TfR) recognition process, we have tried to predict the 3D structures of apo- and iron-loaded STf using a homology modeling technique to study the changes in the structural characteristics that take place upon the uptake of iron by STf in solution. The crystal structures of both forms for ovotransferrin were used as templates for the STf modeling. The modeled structure of STf gave a satisfactory interpretation for the typical physicochemical properties such that (1) STf has a negative electrophoretic mobility and its value increases with iron uptake, and (2) the radius of gyration Rg of Tf decreases with iron uptake. It was found that upon iron binding, interdomain closures take place with large movements of the NII and CII subdomains comprising the N- and C-lobes in STf through a hinge-bending motion, accompanied by the opening of the bridge region with a displacement of more than 15 Å. Moreover, in view of the findings from our capillary electrophoresis experiments that the electrostatic interactions significantly contribute to a specific binding of Fe2-STf with TfR, it is inferred that the connecting (bridge) and its neighboring region associated with a surface exposure of negative charge play an important role in the STf-receptor recognition process. 相似文献
14.
提出了一种用于生成分子光滑表面的新算法.该算法从分布在一个包含整个分子表面的椭球上的三角网络开始,逐步收缩网络直到所有的三角形最佳贴近分子表面.所使用的收缩包络椭球的技术只要稍加修改就可用于蛋白质空腔的表示. 相似文献
15.
This article attempts to increase the prediction accuracy of residue solvent accessibility and real-value backbone torsion angles of proteins through improved learning. Most methods developed for improving the backpropagation algorithm of artificial neural networks are limited to small neural networks. Here, we introduce a guided-learning method suitable for networks of any size. The method employs a part of the weights for guiding and the other part for training and optimization. We demonstrate this technique by predicting residue solvent accessibility and real-value backbone torsion angles of proteins. In this application, the guiding factor is designed to satisfy the intuitive condition that for most residues, the contribution of a residue to the structural properties of another residue is smaller for greater separation in the protein-sequence distance between the two residues. We show that the guided-learning method makes a 2-4% reduction in 10-fold cross-validated mean absolute errors (MAE) for predicting residue solvent accessibility and backbone torsion angles, regardless of the size of database, the number of hidden layers and the size of input windows. This together with introduction of two-layer neural network with a bipolar activation function leads to a new method that has a MAE of 0.11 for residue solvent accessibility, 36 degrees for psi, and 22 degrees for phi. The method is available as a Real-SPINE 3.0 server in http://sparks.informatics.iupui.edu. 相似文献
16.
One strategy for ab initio protein structure prediction is to generate a large number of possible structures (decoys) and select the most fitting ones based on a scoring or free energy function. The conformational space of a protein is huge, and chances are rare that any heuristically generated structure will directly fall in the neighborhood of the native structure. It is desirable that, instead of being thrown away, the unfitting decoy structures can provide insights into native structures so prediction can be made progressively. First, we demonstrate that a recently parameterized physics-based effective free energy function based on the GROMOS96 force field and a generalized Born/surface area solvent model is, as several other physics-based and knowledge-based models, capable of distinguishing native structures from decoy structures for a number of widely used decoy databases. Second, we observe a substantial increase in correlations of the effective free energies with the degree of similarity between the decoys and the native structure, if the similarity is measured by the content of native inter-residue contacts in a decoy structure rather than its root-mean-square deviation from the native structure. Finally, we investigate the possibility of predicting native contacts based on the frequency of occurrence of contacts in decoy structures. For most proteins contained in the decoy databases, a meaningful amount of native contacts can be predicted based on plain frequencies of occurrence at a relatively high level of accuracy. Relative to using plain frequencies, overwhelming improvements in sensitivity of the predictions are observed for the 4_state_reduced decoy sets by applying energy-dependent weighting of decoy structures in determining the frequency. There, approximately 80% native contacts can be predicted at an accuracy of approximately 80% using energy-weighted frequencies. The sensitivity of the plain frequency approach is much lower (20% to 40%). Such improvements are, however, not observed for the other decoy databases. The rationalization and implications of the results are discussed. 相似文献
17.
Membrane proteins that regulate solute movement are often built from multiple copies of an identical polypeptide chain. These complexes represent striking examples of self-assembling systems that recruit monomers only until a prescribed level for function is reached. Here we report that three modes of assembly - distinguished by sequence and stoichiometry - describe all helical membrane protein complexes currently solved to high resolution. Using the 13 presently available non-redundant homo-oligomeric structures, we show that two of these types segregate with protein function: one produces energy-dependent transporters, while the other builds channels for passive diffusion. Given such limited routes to functional complexes, membrane proteins that self-assemble exist on the edge of aggregation, susceptible to mutations that may underlie human diseases. 相似文献
18.
We describe an algorithm to compute native structures of proteins from their primary sequences. The novel aspects of this method are: 1) The hydrophobic potential was set to be proportional to the nonpolar solvent accessible surface. To make computation feasible, we developed a new algorithm to compute the solvent accessible surface areas rapidly. 2) The supersecondary structures of each protein were predicted and used as restraints during the conformation searching processes. This algorithm was applied to five proteins. The overall fold of these proteins can be computed from their sequences, with deviations from crystal structures of 1.48–4.48 Å for Cα atoms. Proteins 31:247–257, 1998. © 1998 Wiley-Liss, Inc. 相似文献
19.
Proteins can move freely in three-dimensional space. As a result, their structural properties, such as solvent accessible surface area, backbone dihedral angles, and atomic distances, are continuous variables. However, these properties are often arbitrarily divided into a few classes to facilitate prediction by statistical learning techniques. In this work, we establish an integrated system of neural networks (called Real-SPINE) for real-value prediction and apply the method to predict residue-solvent accessibility and backbone psi dihedral angles of proteins based on information derived from sequences only. Real-SPINE is trained with a large data set of 2640 protein chains, sequence profiles generated from multiple sequence alignment, representative amino-acid properties, a slow learning rate, overfitting protection, and predicted secondary structures. The method optimizes more than 200,000 weights and yields a 10-fold cross-validated Pearson's correlation coefficient (PCC) of 0.74 between predicted and actual solvent accessible surface areas and 0.62 between predicted and actual psi angles. In particular, 90% of 2640 proteins have a PCC value greater than 0.6 between predicted and actual solvent-accessible surface areas. The results of Real-SPINE can be compared with the best reported correlation coefficients of 0.64-0.67 for solvent-accessible surface areas and 0.47 for psi angles. The real-SPINE server, executable programs, and datasets are freely available on http://sparks.informatics.iupui.edu. 相似文献
20.
Patcharapong Thangsunan Sakunna Wongsaipun Sila Kittiwachana 《Journal of biomolecular structure & dynamics》2020,38(2):460-473
AbstractDevelopment of a highly accurate prediction model for protein–ligand inhibition has been a major challenge in drug discovery. Herein, we describe a novel predictive model for the inhibition of HIV-1 integrase (IN)-LEDGF/p75 protein-protein interaction. The model was constructed using energy parameters approximated from molecular dynamics (MD) simulations and molecular mechanics Poisson-Boltzmann surface area (MM-PBSA) calculations. Chemometric analysis using partial least squares (PLS) regression revealed that solvent accessible surface area energy (ΔGSASA) is the major determinant parameter contributing greatly to the prediction accuracy. PLS prediction model on the ΔGSASA values collected from 41 complexes yielded a strong correlation between the predicted and the actual inhibitory activities (R2 = 0.9666, RMSEC of pIC50 values = 0.0890). Additionally, for the test set of 14 complexes, the model performed satisfactorily with very low pIC50 errors (Q2 = 0.5168, RMSEP = 0.3325). A strong correlation between the buried surface areas on the IN protein, when bound with IN-LEDGF/p75 inhibitors, and the respective ΔGSASA values was also obtained. Furthermore, the current method could identify ‘hot spots’of amino acid residues highly influential to the inhibitory activity prediction. This could present fruitful implications in binding site determination and future inhibitor developments targeting protein-protein interactions.Communicated by Ramaswamy H. Sarma 相似文献