共查询到20条相似文献,搜索用时 15 毫秒
1.
Statistical potentials are frequently engaged in the protein structural prediction and protein folding for conformational evaluation. Theoretically, to describe the many‐body effect, pairwise interaction between two atom groups should be corrected by their relative geometric orientation. The potential functions developed by this means are called orientation‐dependent statistical potentials and have exhibited substantially improved performance. However, none of the currently available orientation‐dependent statistical potentials use any reference state, which has been proven to greatly enhance the power of distance‐dependent statistical potentials in numerous previous studies. In this work, we designed a reasonable reference state for the orientation‐dependent statistical potentials: using the average geometric relationship between atom pairs in known structures by neglecting their residue identities. The statistical potential developed using this reference state (called ORDER_AVE) prevails most available rival potentials in a series of tests on the decoy sets, although the information of side chain atoms (except the β‐carbon) is absent in its construction. Proteins 2014; 82:2383–2393. © 2014 Wiley Periodicals, Inc. 相似文献
2.
Extracting knowledge-based statistical potential from known structures of proteins is proved to be a simple, effective method to obtain an approximate free-energy function. However, the different compositions of amino acid residues at the core, the surface, and the binding interface of proteins prohibited the establishment of a unified statistical potential for folding and binding despite the fact that the physical basis of the interaction (water-mediated interaction between amino acids) is the same. Recently, a physical state of ideal gas, rather than a statistically averaged state, has been used as the reference state for extracting the net interaction energy between amino acid residues of monomeric proteins. Here, we find that this monomer-based potential is more accurate than an existing all-atom knowledge-based potential trained with interfacial structures of dimers in distinguishing native complex structures from docking decoys (100% success rate vs. 52% in 21 dimer/trimer decoy sets). It is also more accurate than a recently developed semiphysical empirical free-energy functional enhanced by an orientation-dependent hydrogen-bonding potential in distinguishing native state from Rosetta docking decoys (94% success rate vs. 74% in 31 antibody-antigen and other complexes based on Z score). In addition, the monomer potential achieved a 93% success rate in distinguishing true dimeric interfaces from artificial crystal interfaces. More importantly, without additional parameters, the potential provides an accurate prediction of binding free energy of protein-peptide and protein-protein complexes (a correlation coefficient of 0.87 and a root-mean-square deviation of 1.76 kcal/mol with 69 experimental data points). This work marks a significant step toward a unified knowledge-based potential that quantitatively captures the common physical principle underlying folding and binding. A Web server for academic users, established for the prediction of binding free energy and the energy evaluation of the protein-protein complexes, may be found at http://theory.med.buffalo.edu. 相似文献
3.
Maryam Murtaza Sarjinder Singh Zawar Hussain 《Biometrical journal. Biometrische Zeitschrift》2021,63(1):134-147
In this paper, we develop a new methodology that indicates that the use of correlated scrambling variables in the randomized response technique may play an important role in increasing the efficiency of an estimator of the population mean of a sensitive variable. Although it is clear analytically that the proposed estimator is more efficient than its existing competitors, we have investigated the magnitude of the gain in efficiency through simulation studies that involve both real secondary data from the health sciences, as well as artificial data. We also derive an estimator of the variance of the proposed estimator of mean and we study the coverage of 95% confidence intervals based on this variance estimator. An application using real primary data on smoking by university students is also included. 相似文献
4.
Zhang C Liu S Zhou H Zhou Y 《Protein science : a publication of the Protein Society》2004,13(2):400-411
Structure prediction on a genomic scale requires a simplified energy function that can efficiently sample the conformational space of polypeptide chains. A good energy function at minimum should discriminate native structures against decoys. Here, we show that a recently developed, residue-specific, all-atom knowledge-based potential (167 atomic types) based on distance-scaled, finite ideal-gas reference state (DFIRE-all-atom) can be substantially simplified to 20 residue types located at side-chain center of mass (DFIRE-SCM) without a significant change in its capability of structure discrimination. Using 96 standard multiple decoy sets, we show that there is only a small reduction (from 80% to 78%) in success rate of ranking native structures as the top 1. The success rate is higher than two previously developed, all-atom distance-dependent statistical pair potentials. Applied to structure selections of 21 docking decoys without modification, the DFIRE-SCM potential is 29% more successful in recognizing native complex structures than an all-atom statistical potential trained by a database of dimeric interfaces. The potential also achieves 92% accuracy in distinguishing true dimeric interfaces from artificial crystal interfaces. In addition, the DFIRE potential with the C(alpha) positions as the interaction centers recognizes 123 native structures out of a comprehensive 125-protein TOUCHSTONE decoy set in which each protein has 24,000 decoys with only C(alpha) positions. Furthermore, the performance by DFIRE-SCM on newly established 25 monomeric and 31 docking Rosetta-decoy sets is comparable to (or better than in the case of monomeric decoy sets) that of a recently developed, all-atom Rosetta energy function enhanced with an orientation-dependent hydrogen bonding potential. 相似文献
5.
The distance-dependent structure-derived potentials developed so far all employed a reference state that can be characterized as a residue (atom)-averaged state. Here, we establish a new reference state called the distance-scaled, finite ideal-gas reference (DFIRE) state. The reference state is used to construct a residue-specific all-atom potential of mean force from a database of 1011 nonhomologous (less than 30% homology) protein structures with resolution less than 2 A. The new all-atom potential recognizes more native proteins from 32 multiple decoy sets, and raises an average Z-score by 1.4 units more than two previously developed, residue-specific, all-atom knowledge-based potentials. When only backbone and C(beta) atoms are used in scoring, the performance of the DFIRE-based potential, although is worse than that of the all-atom version, is comparable to those of the previously developed potentials on the all-atom level. In addition, the DFIRE-based all-atom potential provides the most accurate prediction of the stabilities of 895 mutants among three knowledge-based all-atom potentials. Comparison with several physical-based potentials is made. 相似文献
6.
Yu Su Ao Zhou Xuefeng Xia Wen Li Zhirong Sun 《Protein science : a publication of the Protein Society》2009,18(12):2550-2558
Quantitative prediction of protein–protein binding affinity is essential for understanding protein–protein interactions. In this article, an atomic level potential of mean force (PMF) considering volume correction is presented for the prediction of protein–protein binding affinity. The potential is obtained by statistically analyzing X‐ray structures of protein–protein complexes in the Protein Data Bank. This approach circumvents the complicated steps of the volume correction process and is very easy to implement in practice. It can obtain more reasonable pair potential compared with traditional PMF and shows a classic picture of nonbonded atom pair interaction as Lennard‐Jones potential. To evaluate the prediction ability for protein–protein binding affinity, six test sets are examined. Sets 1–5 were used as test set in five published studies, respectively, and set 6 was the union set of sets 1–5, with a total of 86 protein–protein complexes. The correlation coefficient (R) and standard deviation (SD) of fitting predicted affinity to experimental data were calculated to compare the performance of ours with that in literature. Our predictions on sets 1–5 were as good as the best prediction reported in the published studies, and for union set 6, R = 0.76, SD = 2.24 kcal/mol. Furthermore, we found that the volume correction can significantly improve the prediction ability. This approach can also promote the research on docking and protein structure prediction. 相似文献
7.
Knowledge‐based methods for analyzing protein structures, such as statistical potentials, primarily consider the distances between pairs of bodies (atoms or groups of atoms). Considerations of several bodies simultaneously are generally used to characterize bonded structural elements or those in close contact with each other, but historically do not consider atoms that are not in direct contact with each other. In this report, we introduce an information‐theoretic method for detecting and quantifying distance‐dependent through‐space multibody relationships between the sidechains of three residues. The technique introduced is capable of producing convergent and consistent results when applied to a sufficiently large database of randomly chosen, experimentally solved protein structures. The results of our study can be shown to reproduce established physico‐chemical properties of residues as well as more recently discovered properties and interactions. These results offer insight into the numerous roles that residues play in protein structure, as well as relationships between residue function, protein structure, and evolution. The techniques and insights presented in this work should be useful in the future development of novel knowledge‐based tools for the evaluation of protein structure. Proteins 2014; 82:3450–3465. © 2014 Wiley Periodicals, Inc. 相似文献
8.
The conformations of loops are determined by the water-mediated interactions between amino acid residues. Energy functions that describe the interactions can be derived either from physical principles (physical-based energy function) or statistical analysis of known protein structures (knowledge-based statistical potentials). It is commonly believed that statistical potentials are appropriate for coarse-grained representation of proteins but are not as accurate as physical-based potentials when atomic resolution is required. Several recent applications of physical-based energy functions to loop selections appear to support this view. In this article, we apply a recently developed DFIRE-based statistical potential to three different loop decoy sets (RAPPER, Jacobson, and Forrest-Woolf sets). Together with a rotamer library for side-chain optimization, the performance of DFIRE-based potential in the RAPPER decoy set (385 loop targets) is comparable to that of AMBER/GBSA for short loops (two to eight residues). The DFIRE is more accurate for longer loops (9 to 12 residues). Similar trend is observed when comparing DFIRE with another physical-based OPLS/SGB-NP energy function in the large Jacobson decoy set (788 loop targets). In the Forrest-Woolf decoy set for the loops of membrane proteins, the DFIRE potential performs substantially better than the combination of the CHARMM force field with several solvation models. The results suggest that a single-term DFIRE-statistical energy function can provide an accurate loop prediction at a fraction of computing cost required for more complicate physical-based energy functions. A Web server for academic users is established for loop selection at the softwares/services section of the Web site http://theory.med.buffalo.edu/. 相似文献
9.
Sjors H. W. Scheres Piet Gros 《Acta Crystallographica. Section D, Structural Biology》2003,59(3):438-446
Conditional optimization allows the incorporation of extensive geometrical information in protein structure refinement, without the requirement of an explicit chemical assignment of the individual atoms. Here, a mean‐force potential for the conditional optimization of protein structures is presented that expresses knowledge of common protein conformations in terms of interatomic distances, torsion angles and numbers of neighbouring atoms. Information is included for protein fragments up to several residues long in α‐helical, β‐strand and loop conformations, comprising the main chain and side chains up to the γ position in three distinct rotamers. Using this parameter set, conditional optimization of three small protein structures against 2.0 Å observed diffraction data shows a large radius of convergence, validating the presented force field and illustrating the feasibility of the approach. The generally applicable force field allows the development of novel phase‐improvement procedures using the conditional optimization technique. 相似文献
10.
Michael E. Compton 《Plant Cell, Tissue and Organ Culture》1994,37(3):217-242
Statistical analyses are an essential part of biological research. Statistical methods are available to biological researchers that range from very simple to extremely complex. Therefore, caution should be used when selecting a statistical method. When possible it is best to avoid complicated statistical procedures that are difficult to interpret and may hinder the researcher's ability to make treatment comparisons. Instead a method should be chosen that compliments a logical and practical treatment design. Statistics should be used as a tool to compare treatments of interest and should not dictate the treatments. Experimental designs should take into account the eventual analysis, otherwise one could conceive of a design that could not be analyzed or, when analyzed, would not answer the desired questions. Therefore, time should be spent before conducting an experiment to plan an experimental design and analysis that best compliments the treatment scheme and questions to be answered. The purpose of this paper is to present examples of experimental designs, means separation procedures, data transformations and presentation methods suitable for plant cell and tissue culture data.Abbreviations ANOVA analysis of variance - BA benzyladenine - CV coefficient of variation - DF degrees of freedom - IAA indole-3-acetic acid - IBA indole-3-butyric acid - LOF lack-of-fit - MSE mean square error - P-ITB phenyl indole-3-thiolobutyrate - S standard deviation - SE standard error of the mean - TDZ thidiazuron 相似文献
11.
To gain insight into the free energy changes accompanying protein hydrophobic core formation, we have used computer simulations to study the formation of small clusters of nonpolar solutes in water. A barrier to association is observed at the largest solute separation that does not allow substantial solvent penetration. The barrier reflects an effective increase in the size of the cavity occupied by the expanded but water-excluding cluster relative to both the close-packed cluster and the fully solvated separated solutes; a similar effect may contribute to the barrier to protein folding/unfolding. Importantly for the simulation of protein folding without explicit solvent, we find that the interactions between nonpolar solutes of varying size and number can be approximated by a linear function of the molecular surface, but not the solvent-accessible surface of the solutes. Comparison of the free energy of cluster formation to that of dimer formation suggests that the assumption of pair additivity implicit in current protein database derived potentials may be in error. 相似文献
12.
Brent L. Lee 《Molecular simulation》2018,44(13-14):1147-1157
AbstractComputer simulations of passive membrane permeation provide important microscopic insights into the molecular mechanism of this important biological process that are complementary to experimental data. Our review focuses on the main approaches for calculating the free energy, or potential of mean force, for permeation of small molecules through lipid bilayers. The theoretical background for most currently used methods for potential of mean force calculation is described, including particle insertion, thermodynamic integration, umbrella sampling, metadynamics, adaptive biasing force and milestoning. A brief comparison of strengths and weaknesses of the competing approaches is presented. This is followed by a survey of results obtained by the different methods, with special attention to describing the mechanistic insights generated by modelling and illustrating capabilities of the different techniques. We conclude with a discussion of recent advances and future directions in modelling membrane permeation, including latest methodological enhancements, consideration of multiple slow variables and memory effects. 相似文献
13.
《Molecular simulation》2012,38(8-9):751-761
Polymer nanocomposites consist of nanoscale additives embedded in a polymer matrix and are widely used in the automobiles, optics and microelectronics industries. Since the composition and the morphology of the polymer nanocomposite impact its macroscopic properties, significant efforts have been made to understand how parameters, such as polymer and nanoparticle chemistries, molecular weight of the matrix polymers and nanoparticle size, help tune the morphology. Theory and simulations have proven to be useful tools in this field due to their ability to link molecular level interactions, the morphology and the macroscopic properties. Due to the computational intensity of molecular simulations of a dense polymer matrix, there has been a strong effort on the theoretical front to develop methodologies that map out equilibrium structure and phase behaviour of polymer nanocomposites over a large parameter space. In this paper, we review the details of the self-consistent polymer reference interaction site model (PRISM)–Monte Carlo (MC) simulation method which integrates theory and simulation to study phase behaviour in polymer nanocomposites. We discuss two specific cases of polymer nanocomposites containing polymer-grafted nanoparticles with chemical and physical heterogeneity in grafts in which this self-consistent PRISM–MC approach has been used to study effective inter-filler interactions and phase behaviour. 相似文献
14.
Based on integrative computational hybrid approaches that combined statistical coupling analysis (SCA), molecular dynamics (MD), and normal mode analysis (NMA), evolutionarily coupled residues involved in functionally relevant motion in the adenylate kinase protein family were identified. The hybrids identified four top-ranking site pairs that belong to a conserved hydrogen bond network that is involved in the enzyme's flexibility. A second group of top-ranking site pairs was identified in critical regions for functional dynamics, such as those related to enzymatic turnover. The high consistency of the results obtained by SCA with NMA (SCA.NMA) and by SCA.MD hybrid analyses suggests that suitable replacement of the matrix of cross-correlation analysis of atomic fluctuations (derived by using NMA) with those based on MD contributes to the identification of such sites by means of a fast computational calculation. The analysis presented here strongly supports the hypothesis that evolutionary forces, such as coevolution at the sequence level, have promoted functional dynamic properties of the adenylate kinase protein family. Finally, these hybrid approaches can be used to identify, at the residue level, protein motion coordination patterns not previously observed, such as in hinge regions. 相似文献
15.
In his review of neural binding problems, Feldman (Cogn Neurodyn 7:1–11, 2013) addressed two types of models as solutions of (novel) variable binding. The one type uses labels such as phase synchrony of activation. The other (‘connectivity based’) type uses dedicated connections structures to achieve novel variable binding. Feldman argued that label (synchrony) based models are the only possible candidates to handle novel variable binding, whereas connectivity based models lack the flexibility required for that. We argue and illustrate that Feldman’s analysis is incorrect. Contrary to his conclusion, connectivity based models are the only viable candidates for models of novel variable binding because they are the only type of models that can produce behavior. We will show that the label (synchrony) based models analyzed by Feldman are in fact examples of connectivity based models. Feldman’s analysis that novel variable binding can be achieved without existing connection structures seems to result from analyzing the binding problem in a wrong frame of reference, in particular in an outside instead of the required inside frame of reference. Connectivity based models can be models of novel variable binding when they possess a connection structure that resembles a small-world network, as found in the brain. We will illustrate binding with this type of model with episode binding and the binding of words, including novel words, in sentence structures. 相似文献
16.
Near-native selections from docking decoys have proved challenging especially when unbound proteins are used in the molecular docking. One reason is that significant atomic clashes in docking decoys lead to poor predictions of binding affinities of near native decoys. Atomic clashes can be removed by structural refinement through energy minimization. Such an energy minimization, however, will lead to an unrealistic bias toward docked structures with large interfaces. Here, we extend an empirical energy function developed for protein design to protein-protein docking selection by introducing a simple reference state that removes the unrealistic dependence of binding affinity of docking decoys on the buried solvent accessible surface area of interface. The energy function called EMPIRE (EMpirical Protein-InteRaction Energy), when coupled with a refinement strategy, is found to provide a significantly improved success rate in near native selections when applied to RosettaDock and refined ZDOCK docking decoys. Our work underlines the importance of removing nonspecific interactions from specific ones in near native selections from docking decoys. 相似文献
17.
We describe the derivation and testing of a knowledge-based atomic environment potential for the modeling of protein structural energetics. An analysis of the probabilities of atomic interactions in a dataset of high-resolution protein structures shows that the probabilities of non-bonded inter-atomic contacts are not statistically independent events, and that the multi-body contact frequencies are poorly predicted from pairwise contact potentials. A pseudo-energy function is defined that measures the preferences for protein atoms to be in a given microenvironment defined by the number of contacting atoms in the environment and its atomic composition. This functional form is tested for its ability to recognize native protein structures amongst an ensemble of decoy structures and a detailed relative performance comparison is made with a number of common functions used in protein structure prediction. 相似文献
18.
石山苣苔属(Petrocodon Hance)是著名的观赏花卉之一,但气候动荡和人类活动的强烈干扰,使其绝大部分被评估为极危(CR),至少也是易危(VU)以上。为重建末次间冰期以来石山苣苔属潜在适生区的时空变化,探讨适生区对环境变化的响应关系,为石山苣苔属的起源、地理分化研究和中国特有种质资源保护、园林开发利用提供理论指导,该研究结合120个分布记录和17个环境变量,应用优化的MaxEnt模型和地理信息技术(ArcGIS)对石山苣苔属在中国及中南半岛的适生区及其分布格局进行模拟,并基于逐步多元线性回归分析、冗余分析和蒙特卡洛检验评估影响石山苣苔属当前地理分布的主导变量。结果表明:(1)优化的MaxEnt模型的预测精度高,AUC值大于0.96;石山苣苔属当前适生区从中国西南部连续分布至越南北部,零散分布于中国中部和南部、块状分布于缅甸北部,其中以中国云贵高原南部为最佳适生区。(2)制约石山苣苔属当前地理分布的主导环境变量为最干月降雨量(bio14)、最热季度平均降雨量(bio18)、最湿季降雨量(bio16)、温度变化方差(bio4)、最冷月最低温度(bio6)、海拔(alt)。(3)在... 相似文献
19.
We have developed a method to both predict the geometry and the relative stability of point mutants that may be used for arbitrary mutations. The geometry optimization procedure was first tested on a new benchmark of 2141 ordered pairs of X-ray crystal structures of proteins that differ by a single point mutation, the largest data set to date. An empirical energy function, which includes terms representing the energy contributions of the folded and denatured proteins and uses the predicted mutant side chain conformation, was fit to a training set consisting of half of a diverse set of 1816 experimental stability values for single point mutations in 81 different proteins. The data included a substantial number of small to large residue mutations not considered by previous prediction studies. After removing 22 (approximately 2%) outliers, the stability calculation gave a standard deviation of 1.08 kcal/mol with a correlation coefficient of 0.82. The prediction method was then tested on the remaining half of the experimental data, giving a standard deviation of 1.10 kcal/mol and covariance of 0.66 for 97% of the test set. A regression fit of the energy function to a subset of 137 mutants, for which both native and mutant structures were available, gave a prediction error comparable to that for the complete training set with predicted side chain conformations. We found that about half of the variation is due to conformation-independent residue contributions. Finally, a fit to the experimental stability data using these residue parameters exclusively suggests guidelines for improving protein stability in the absence of detailed structure information. 相似文献
20.
The enzyme biotin carboxylase (BC) uses adenosine triphosphate (ATP) to carboxylate biotin and is involved in fatty acid synthesis. Structural evidence suggests that the B domain of BC undergoes a large hinge motion of ~45° when binding and releasing substrates. Escherichia coli BC can function as a natural homodimer and as a mutant monomer. Using molecular dynamics simulations, we evaluate the free energy profile along a closure angle of the B domain of E. coli BC for three cases: a monomer without bound Mg(2)ATP, a monomer with bound Mg(2)ATP, and a homodimer with bound Mg(2)ATP in one subunit. The simulation results show that a closed state is the most probable for the monomer with or without bound Mg(2)ATP. For the dimer with Mg(2)ATP in one of its subunits, communication between the two subunits was observed. Specifically, in the dimer, the opening of the subunit without Mg(2)ATP caused the other subunit to open, and hysteresis was observed upon reclosing it. The most stable state of the dimer is one in which the B domain of both subunits is closed; however, the open state for the B domain without Mg(2)ATP is only approximately 2k(B)T higher in free energy than the closed state. A simple diffusion model indicates that the mean times for opening and closing of the B domain in the monomer with and without Mg(2)ATP are much smaller than the overall reaction time, which is on the order of seconds. 相似文献