共查询到20条相似文献,搜索用时 31 毫秒
1.
2.
Ji Guo Su Xiao Ming Han Xiao Zhang Yan Xue Hou Jian Zhuo Zhu Yi Dong Wu 《Journal of biomolecular structure & dynamics》2016,34(3):560-571
Protein collective motions play a critical role in many biochemical processes. How to predict the functional motions and the related key residue interactions in proteins is important for our understanding in the mechanism of the biochemical processes. Normal mode analysis (NMA) of the elastic network model (ENM) is one of the effective approaches to investigate the structure-encoded motions in proteins. However, the motion modes revealed by the conventional NMA approach do not necessarily correspond to a specific function of protein. In the present work, a new analysis method was proposed to identify the motion modes responsible for a specific function of proteins and then predict the key residue interactions involved in the functional motions by using a perturbation approach. In our method, an internal coordinate that accounts for the specific function was introduced, and the Cartesian coordinate space was transformed into the internal/Cartesian space by using linear approximation, where the introduced internal coordinate serves as one of the axes of the coordinate space. NMA of ENM in this internal/Cartesian space was performed and the function-relevant motion modes were identified according to their contributions to the specific function of proteins. Then the key residue interactions important for the functional motions of the protein were predicted as the interactions whose perturbation largely influences the fluctuation along the internal coordinate. Using our proposed methods, the maltose transporter (MalFGK2) from E. Coli was studied. The functional motions and the key residue interactions that are related to the channel-gating function of this protein were successfully identified. 相似文献
3.
S. Saraswathi J. L. Fernández-Martínez A. Koliński R. L. Jernigan A. Kloczkowski 《Journal of molecular modeling》2013,19(10):4337-4348
Exponential growth in the number of available protein sequences is unmatched by the slower growth in the number of structures. As a result, the development of efficient and fast protein secondary structure prediction methods is essential for the broad comprehension of protein structures. Computational methods that can efficiently determine secondary structure can in turn facilitate protein tertiary structure prediction, since most methods rely initially on secondary structure predictions. Recently, we have developed a fast learning optimized prediction methodology (FLOPRED) for predicting protein secondary structure (Saraswathi et al. in JMM 18:4275, 2012). Data are generated by using knowledge-based potentials combined with structure information from the CATH database. A neural network-based extreme learning machine (ELM) and advanced particle swarm optimization (PSO) are used with this data to obtain better and faster convergence to more accurate secondary structure predicted results. A five-fold cross-validated testing accuracy of 83.8 % and a segment overlap (SOV) score of 78.3 % are obtained in this study. Secondary structure predictions and their accuracy are usually presented for three secondary structure elements: α-helix, β-strand and coil but rarely have the results been analyzed with respect to their constituent amino acids. In this paper, we use the results obtained with FLOPRED to provide detailed behaviors for different amino acid types in the secondary structure prediction. We investigate the influence of the composition, physico-chemical properties and position specific occurrence preferences of amino acids within secondary structure elements. In addition, we identify the correlation between these properties and prediction accuracy. The present detailed results suggest several important ways that secondary structure predictions can be improved in the future that might lead to improved protein design and engineering. 相似文献
4.
Cunliang Geng Anna Vangone Gert E. Folkers Li C. Xue Alexandre M. J. J. Bonvin 《Proteins》2019,87(2):110-119
Quantitative evaluation of binding affinity changes upon mutations is crucial for protein engineering and drug design. Machine learning-based methods are gaining increasing momentum in this field. Due to the limited number of experimental data, using a small number of sensitive predictive features is vital to the generalization and robustness of such machine learning methods. Here we introduce a fast and reliable predictor of binding affinity changes upon single point mutation, based on a random forest approach. Our method, iSEE, uses a limited number of interface Structure, Evolution, and Energy-based features for the prediction. iSEE achieves, using only 31 features, a high prediction performance with a Pearson correlation coefficient (PCC) of 0.80 and a root mean square error of 1.41 kcal/mol on a diverse training dataset consisting of 1102 mutations in 57 protein-protein complexes. It competes with existing state-of-the-art methods on two blind test datasets. Predictions for a new dataset of 487 mutations in 56 protein complexes from the recently published SKEMPI 2.0 database reveals that none of the current methods perform well (PCC < 0.42), although their combination does improve the predictions. Feature analysis for iSEE underlines the significance of evolutionary conservations for quantitative prediction of mutation effects. As an application example, we perform a full mutation scanning of the interface residues in the MDM2–p53 complex. 相似文献
5.
Vibrational entropy differences between mesophile and thermophile proteins and their use in protein engineering 下载免费PDF全文
Vincent Frappier Rafael Najmanovich 《Protein science : a publication of the Protein Society》2015,24(4):474-483
We recently introduced ENCoM, an elastic network atomic contact model, as the first coarse-grained normal mode analysis method that accounts for the nature of amino acids and can predict the effect of mutations on thermostability based on changes vibrational entropy. In this proof-of-concept article, we use pairs of mesophile and thermophile homolog proteins with identical structures to determine if a measure of vibrational entropy based on normal mode analysis can discriminate thermophile from mesophile proteins. We observe that in around 60% of cases, thermophile proteins are more rigid at equivalent temperatures than their mesophile counterpart and this difference can guide the design of proteins to increase their thermostability through series of mutations. We observe that mutations separating thermophile proteins from their mesophile orthologs contribute independently to a decrease in vibrational entropy and discuss the application and implications of this methodology to protein engineering. 相似文献
6.
Yuting Chen Haoyu Lu Ning Zhang Zefeng Zhu Shuqin Wang Minghui Li 《PLoS computational biology》2020,16(12)
Computational methods that predict protein stability changes induced by missense mutations have made a lot of progress over the past decades. Most of the available methods however have very limited accuracy in predicting stabilizing mutations because existing experimental sets are dominated by mutations reducing protein stability. Moreover, few approaches could consistently perform well across different test cases. To address these issues, we developed a new computational method PremPS to more accurately evaluate the effects of missense mutations on protein stability. The PremPS method is composed of only ten evolutionary- and structure-based features and parameterized on a balanced dataset with an equal number of stabilizing and destabilizing mutations. A comprehensive comparison of the predictive performance of PremPS with other available methods on nine benchmark datasets confirms that our approach consistently outperforms other methods and shows considerable improvement in estimating the impacts of stabilizing mutations. A protein could have multiple structures available, and if another structure of the same protein is used, the predicted change in stability for structure-based methods might be different. Thus, we further estimated the impact of using different structures on prediction accuracy, and demonstrate that our method performs well across different types of structures except for low-resolution structures and models built based on templates with low sequence identity. PremPS can be used for finding functionally important variants, revealing the molecular mechanisms of functional influences and protein design. PremPS is freely available at https://lilab.jysw.suda.edu.cn/research/PremPS/, which allows to do large-scale mutational scanning and takes about four minutes to perform calculations for a single mutation per protein with ~ 300 residues and requires ~ 0.4 seconds for each additional mutation. 相似文献
7.
Sampling bias and the use of ecological niche modeling in conservation planning: a field evaluation in a biodiversity hotspot 总被引:2,自引:0,他引:2
Gabriel C. Costa Cristiano Nogueira Ricardo B. Machado Guarino R. Colli 《Biodiversity and Conservation》2010,19(3):883-899
Ecological niche modeling (ENM) has become an important tool in conservation biology. Despite its recent success, several
basic issues related to algorithm performance are still being debated. We assess the ability of two of the most popular algorithms,
GARP and Maxent, to predict distributions when sampling is geographically biased. We use an extensive data set collected in
the Brazilian Cerrado, a biodiversity hotspot in South America. We found that both algorithms give richness predictions that
are very similar to other traditionally used richness estimators. Also, both algorithms correctly predicted the presence of
most species collected during fieldwork, and failed to predict species collected only in very few cases (usually species with
very few known localities, i.e., <5). We also found that Maxent tends to be more sensitive to sampling bias than GARP. However,
Maxent performs better when sampling is poor (e.g., low number of data points). Our results indicates that ENM, even when
provided with limited and geographically biased localities, is a very useful technique to estimate richness and composition
of unsampled areas. We conclude that data generated by ENM maximize the utility of existing biodiversity data, providing a
very useful first evaluation. However, for reliable conservation decisions ENM data must be followed by well-designed field
inventories, especially for the detection of restricted range, rare species. 相似文献
8.
Vijaya Parthiban M Michael Gromiha Madenhalli Abhinandan Dietmar Schomburg 《BMC structural biology》2007,7(1):54
Background
Understanding and predicting protein stability upon point mutations has wide-spread importance in molecular biology. Several prediction models have been developed in the past with various algorithms. Statistical potentials are one of the widely used algorithms for the prediction of changes in stability upon point mutations. Although the methods provide flexibility and the capability to develop an accurate and reliable prediction model, it can be achieved only by the right selection of the structural factors and optimization of their parameters for the statistical potentials. In this work, we have selected five atom classification systems and compared their efficiency for the development of amino acid atom potentials. Additionally, torsion angle potentials have been optimized to include the orientation of amino acids in such a way that altered backbone conformation in different secondary structural regions can be included for the prediction model. This study also elaborates the importance of classifying the mutations according to their solvent accessibility and secondary structure specificity. The prediction efficiency has been calculated individually for the mutations in different secondary structural regions and compared. 相似文献9.
Predicting secondary structures from a protein sequence is an important step for characterizing the structural properties of a protein. Existing methods for protein secondary structure prediction can be broadly classified into template based or sequence profile based methods. We propose a novel framework that bridges the gap between the two fundamentally different approaches. Our framework integrates the information from the fuzzy k-nearest neighbor algorithm and position-specific scoring matrices using a neural network. It combines the strengths of the two methods and has a better potential to use the information in both the sequence and structure databases than existing methods. We implemented the framework into a software system MUPRED. MUPRED has achieved three-state prediction accuracy (Q3) ranging from 79.2 to 80.14%, depending on which benchmark dataset is used. A higher Q3 can be achieved if a query protein has a significant sequence identity (>25%) to a template in PDB. MUPRED also estimates the prediction accuracy at the individual residue level more quantitatively than existing methods. The MUPRED web server and executables are freely available at http://digbio.missouri.edu/mupred. 相似文献
10.
《Journal of structural biology》2023,215(1):107939
Ramachandran plots, which describe protein structures by plotting the dihedral angle pairs of the backbone on a two-dimensional plane, have played an important role in structural biology over the past few decades. However, despite continued discovery of new protein structures to date, the Ramachandran plot is still constructed by only a small number of data points, and further it cannot reflect the steric information of proteins. Here, we investigated the secondary structure of proteins in terms of static and dynamic characteristics. As for static feature, the Ramachandran plot was revisited for the dataset consisting of 9,148 non-redundant high-resolution protein structures released in the protein data bank until April 1, 2022. By calculating amino acid propensities, it was found that the proportion of secondary structures with respect to residue depth is directly related to their hydrophobicity. As for dynamic feature, normal mode analysis (NMA) based on an elastic network model (ENM) was carried out for the dataset using our KOSMOS web server (http://bioengineering.skku.ac.kr/kosmos/). All ENM-based NMA results were stored in the KOSMOS database, allowing researchers to use them in various ways. In this process, it was commonly found that high B-factors appeared at the edge of the alpha helix region, which was elucidated by introducing residue depth. In addition, by investigating the change in dihedral angle, it was possible to quantitatively survey the contribution of structural change of protein on the Ramachandran plot. In conclusion, our statistical analysis of protein characteristics will provide insight into a range of protein structural studies. 相似文献
11.
Evolutionary potentials: structure specific knowledge-based potentials exploiting the evolutionary record of sequence homologs 下载免费PDF全文
We introduce a new type of knowledge-based potentials for protein structure prediction, called 'evolutionary potentials', which are derived using a single experimental protein structure and all three-dimensional models of its homologous sequences. The new potentials have been benchmarked against other knowledge-based potentials, resulting in a significant increase in accuracy for model assessment. In contrast to standard knowledge-based potentials, we propose that evolutionary potentials capture key determinants of thermodynamic stability and specific sequence constraints required for fast folding. 相似文献
12.
In silico platform for predicting and initiating β‐turns in a protein at desired locations 下载免费PDF全文
Gajendra P. S. Raghava 《Proteins》2015,83(5):910-921
Numerous studies have been performed for analysis and prediction of β‐turns in a protein. This study focuses on analyzing, predicting, and designing of β‐turns to understand the preference of amino acids in β‐turn formation. We analyzed around 20,000 PDB chains to understand the preference of residues or pair of residues at different positions in β‐turns. Based on the results, a propensity‐based method has been developed for predicting β‐turns with an accuracy of 82%. We introduced a new approach entitled “Turn level prediction method,” which predicts the complete β‐turn rather than focusing on the residues in a β‐turn. Finally, we developed BetaTPred3, a Random forest based method for predicting β‐turns by utilizing various features of four residues present in β‐turns. The BetaTPred3 achieved an accuracy of 79% with 0.51 MCC that is comparable or better than existing methods on BT426 dataset. Additionally, models were developed to predict β‐turn types with better performance than other methods available in the literature. In order to improve the quality of prediction of turns, we developed prediction models on a large and latest dataset of 6376 nonredundant protein chains. Based on this study, a web server has been developed for prediction of β‐turns and their types in proteins. This web server also predicts minimum number of mutations required to initiate or break a β‐turn in a protein at specified location of a protein. Proteins 2015; 83:910–921. © 2015 Wiley Periodicals, Inc. 相似文献
13.
14.
Normal mode analysis (NMA) has been a powerful tool for studying protein dynamics. Elastic network models (ENM), through their simplicity, have made normal mode computations accessible to a much broader research community and for many more biomolecular systems. The drawback of ENMs, however, is that they are less accurate than NMA. In this work, through steps of simplification that starts with NMA and ends with ENMs we build a tight connection between NMA and ENMs. In the process of bridging between the two, we have also discovered several high‐quality simplified models. Our best simplified model has a mean correlation with the original NMA that is as high as 0.88. In addition, the model is force‐field independent and does not require energy minimization, and thus can be applied directly to experimental structures. Another benefit of drawing the connection is a clearer understanding why ENMs work well and how it can be further improved. We discovered that can be greatly enhanced by including an additional torsional term and a geometry term. Proteins 2014; 82:2157–2168. © 2014 Wiley Periodicals, Inc. 相似文献
15.
Correlated mutations have been repeatedly exploited for intramolecular contact map prediction. Over the last decade these efforts yielded several methods for measuring correlated mutations. Nevertheless, the application of correlated mutations for the prediction of intermolecular interactions has not yet been explored. This gap is due to several obstacles, such as 3D complexes availability, paralog discrimination, and the availability of sequence pairs that are required for inter- but not intramolecular analyses. Here we selected for analysis fusion protein families that bypass some of these obstacles. We find that several correlated mutation measurements yield reasonable accuracy for intramolecular contact map prediction on the fusion dataset. However, the accuracy level drops sharply in intermolecular contacts prediction. This drop in accuracy does not occur always. In the Cohesin-Dockerin family, reasonable accuracy is achieved in the prediction of both intra- and intermolecular contacts. The Cohesin-Dockerin family is well suited for correlated mutation analysis. Because, however, this family constitutes a special case (it has radical mutations, has domain repeats, within each species each Dockerin domain interacts with each Cohesin domain, see below), the successful prediction in this family does not point to a general potential in using correlated mutations for predicting intermolecular contacts. Overall, the results of our study indicate that current methodologies of correlated mutations analysis are not suitable for large-scale intermolecular contact prediction, and thus cannot assist in docking. With current measurements, sequence availability, sequence annotations, and underdeveloped sequence pairing methods, correlated mutations can yield reasonable accuracy only for a handful of families. 相似文献
16.
Pairwise contact energy statistical potentials can help to find probability of point mutations 下载免费PDF全文
To adopt a particular fold, a protein requires several interactions between its amino acid residues. The energetic contribution of these residue–residue interactions can be approximated by extracting statistical potentials from known high resolution structures. Several methods based on statistical potentials extracted from unrelated proteins are found to make a better prediction of probability of point mutations. We postulate that the statistical potentials extracted from known structures of similar folds with varying sequence identity can be a powerful tool to examine probability of point mutation. By keeping this in mind, we have derived pairwise residue and atomic contact energy potentials for the different functional families that adopt the (α/β)8 TIM‐Barrel fold. We carried out computational point mutations at various conserved residue positions in yeast Triose phosphate isomerase enzyme for which experimental results are already reported. We have also performed molecular dynamics simulations on a subset of point mutants to make a comparative study. The difference in pairwise residue and atomic contact energy of wildtype and various point mutations reveals probability of mutations at a particular position. Interestingly, we found that our computational prediction agrees with the experimental studies of Silverman et al. (Proc Natl Acad Sci 2001;98:3092–3097) and perform better prediction than iMutant and Cologne University Protein Stability Analysis Tool. The present work thus suggests deriving pairwise contact energy potentials and molecular dynamics simulations of functionally important folds could help us to predict probability of point mutations which may ultimately reduce the time and cost of mutation experiments. Proteins 2016; 85:54–64. © 2016 Wiley Periodicals, Inc. 相似文献
17.
Nobuko Hamasaki-Katagiri Raheleh Salari Andrew Wu Yini Qi Tal Schiller Amanda C. Filiberto Enrique F. Schisterman Anton A. Komar Teresa M. Przytycka Chava Kimchi-Sarfaty 《Journal of molecular biology》2013
A fundamental goal of medical genetics is the accurate prediction of genotype–phenotype correlations. As an approach to develop more accurate in silico tools for prediction of disease-causing mutations of structural proteins, we present a gene- and disease-specific prediction tool based on a large systematic analysis of missense mutations from hemophilia A (HA) patients. Our HA-specific prediction tool, HApredictor, showed disease prediction accuracy comparable to other publicly available prediction software. In contrast to those methods, its performance is not limited to non-synonymous mutations. Given the role of synonymous mutations in disease and drug codon optimization, we propose that utilizing a gene- and disease-specific method can be highly useful to make functional predictions possible even for synonymous mutations. Incorporating computational metrics at both nucleotide and amino acid levels along with multiple protein sequence/structure alignment significantly improved the predictive performance of our tool. HApredictor is freely available for download at http://www.ncbi.nlm.nih.gov/CBBresearch/Przytycka/HA_Predict/index.htm. 相似文献
18.
BACE1 is a key protease involved in the proteolysis of amyloid precursor protein (APP) that generates a toxic peptide amyloid beta (Aβ), a pathological feature of Alzheimer's disease (AD). The enzyme is believed to possess an open and a closed conformation that corresponds to its free and inhibitor-bound form respectively. Here, we study the dynamic transition of BACE1 employing normal mode analysis (NMA) using a simplified elastic network model (ENM). Estimation of the catalytic cavity volume on the structures of BACE1 encoded by the lowest frequency normal mode reveals the dynamical transition of the enzyme from the open to the closed conformer. Detailed analysis reveals that concerted movement of different loop segments in the active site of the protein, namely flap regions, 10s loop, A loop and F loop, squeeze the catalytic cavity between the N-terminal and C-terminal lobe of the substrate binding domain of BACE1. We also propose that the NMA encoded multiple receptor conformations (MRC) of BACE1 elucidate the pharmacophoric feature necessary to inhibit the enzyme by a polyphenol, myricetin. van der Waals interaction is found to be the main driving force that guides the ligand induced conformational switching to the closed conformer. We suggest that NMA derived MRC of BACE1 is an efficient way to treat the receptor flexibility in docking and thus can be further applied in virtual screening and structure based drug design. 相似文献
19.
The energetics of protein‐DNA interactions are often modeled using so‐called statistical potentials, that is, energy models derived from the atomic structures of protein‐DNA complexes. Many statistical protein‐DNA potentials based on differing theoretical assumptions have been investigated, but little attention has been paid to the types of data and the parameter estimation process used in deriving the statistical potentials. We describe three enhancements to statistical potential inference that significantly improve the accuracy of predicted protein‐DNA interactions: (i) incorporation of binding energy data of protein‐DNA complexes, in conjunction with their X‐ray crystal structures, (ii) use of spatially‐aware parameter fitting, and (iii) use of ensemble‐based parameter fitting. We apply these enhancements to three widely‐used statistical potentials and use the resulting enhanced potentials in a structure‐based prediction of the DNA binding sites of proteins. These enhancements are directly applicable to all statistical potentials used in protein‐DNA modeling, and we show that they can improve the accuracy of predicted DNA binding sites by up to 21%. Proteins 2013. © 2012 Wiley Periodicals, Inc. 相似文献
20.
Carlos H.M. Rodrigues Douglas E.V. Pires David B. Ascher 《Protein science : a publication of the Protein Society》2021,30(1):60-69
Predicting the effect of missense variations on protein stability and dynamics is important for understanding their role in diseases, and the link between protein structure and function. Approaches to estimate these changes have been proposed, but most only consider single‐point missense variants and a static state of the protein, with those that incorporate dynamics are computationally expensive. Here we present DynaMut2, a web server that combines Normal Mode Analysis (NMA) methods to capture protein motion and our graph‐based signatures to represent the wildtype environment to investigate the effects of single and multiple point mutations on protein stability and dynamics. DynaMut2 was able to accurately predict the effects of missense mutations on protein stability, achieving Pearson's correlation of up to 0.72 (RMSE: 1.02 kcal/mol) on a single point and 0.64 (RMSE: 1.80 kcal/mol) on multiple‐point missense mutations across 10‐fold cross‐validation and independent blind tests. For single‐point mutations, DynaMut2 achieved comparable performance with other methods when predicting variations in Gibbs Free Energy (ΔΔG) and in melting temperature (ΔTm). We anticipate our tool to be a valuable suite for the study of protein flexibility analysis and the study of the role of variants in disease. DynaMut2 is freely available as a web server and API at http://biosig.unimelb.edu.au/dynamut2 . 相似文献