共查询到20条相似文献,搜索用时 15 毫秒
1.
The aggregation of alpha-helix-rich proteins into beta-sheet-rich amyloid fibrils is associated with fatal diseases, such as Alzheimer's disease and prion disease. During an aggregation process, protein secondary structure elements-alpha-helices-undergo conformational changes to beta-sheets. The fact that proteins with different sequences and structures undergo a similar transition on aggregation suggests that the sequence nonspecific hydrogen bond interaction among protein backbones is an important factor. We perform molecular dynamics simulations of a polyalanine model, which is an alpha-helix in its native state and observe a metastable beta-hairpin intermediate. Although a beta-hairpin has larger potential energy than an alpha-helix, the entropy of a beta-hairpin is larger because of fewer constraints imposed by the hydrogen bonds. In the vicinity of the transition temperature, we observe the interconversion of the alpha-helix and beta-sheet states via a random coil state. We also study the effect of the environment by varying the relative strength of side-chain interactions for a designed peptide-an alpha-helix in its native state. For a certain range of side-chain interaction strengths, we find that the intermediate beta-hairpin state is destabilized and even disappears, suggesting an important role of the environment in the aggregation propensity of a peptide. 相似文献
2.
Accuracy of predicting protein secondary structure and solvent accessibility from sequence information has been improved significantly by using information contained in multiple sequence alignments as input to a neural 'network system. For the Asilomar meeting, predictions for 13 proteins were generated automatically using the publicly available prediction method PHD. The results confirm the estimate of 72% three-state prediction accuracy. The fairly accurate predictions of secondary structure segments made the tool useful as a starting point for modeling of higher dimensional aspects of protein structure. © 1995 Wiley-Liss, Inc. 相似文献
3.
Michael Schantz Klausen Martin Closter Jespersen Henrik Nielsen Kamilla Kjærgaard Jensen Vanessa Isabell Jurtz Casper Kaae Sønderby Morten Otto Alexander Sommer Ole Winther Morten Nielsen Bent Petersen Paolo Marcatili 《Proteins》2019,87(6):520-527
The ability to predict local structural features of a protein from the primary sequence is of paramount importance for unraveling its function in absence of experimental structural information. Two main factors affect the utility of potential prediction tools: their accuracy must enable extraction of reliable structural information on the proteins of interest, and their runtime must be low to keep pace with sequencing data being generated at a constantly increasing speed. Here, we present NetSurfP-2.0, a novel tool that can predict the most important local structural features with unprecedented accuracy and runtime. NetSurfP-2.0 is sequence-based and uses an architecture composed of convolutional and long short-term memory neural networks trained on solved protein structures. Using a single integrated model, NetSurfP-2.0 predicts solvent accessibility, secondary structure, structural disorder, and backbone dihedral angles for each residue of the input sequences. We assessed the accuracy of NetSurfP-2.0 on several independent test datasets and found it to consistently produce state-of-the-art predictions for each of its output features. We observe a correlation of 80% between predictions and experimental data for solvent accessibility, and a precision of 85% on secondary structure 3-class predictions. In addition to improved accuracy, the processing time has been optimized to allow predicting more than 1000 proteins in less than 2 hours, and complete proteomes in less than 1 day. 相似文献
4.
The inclusion of explicit solvent water in molecular dynamics refinement of NMR structures ought to provide the most physically meaningful accounting for the effects of solvent on structure, but is computationally expensive. In order to evaluate the validity of commonly used vacuum refinements and of recently developed continuum solvent model methods, we have used three different methods to refine a set of NMR solution structures of a medium sized protein, Escherichia coliglutaredoxin 2, from starting structures calculated using the program DYANA. The three different refinement protocols used molecular dynamics simulated annealing with the program AMBER in vacuum (VAC), including a generalized Born (GB) solvent model, and a full calculation including explicit solvent water (WAT). The structures obtained using the three methods of refinements were very similar, a reflection of their generally well-determined nature. However, the structures refined with the generalized Born model were more similar to those from explicit water refinement than those refined in vacuum. Significant improvement was seen in the percentage of backbone dihedral angles in the most favored regions of , space and in hydrogen bond pattern for structures refined with the GB and WAT models, compared with the structures refined in vacuum. The explicit water calculation took an average of 200 h of CPU time per structure on an SGI cluster, compared to 15–90 h for the GB calculation (depending on the parameters used) and 2 h for the vacuum calculation. The generalized Born solvent model proved to be an excellent compromise between the vacuum and explicit water refinements, giving results comparable to those of the explicit water calculation. Some improvement for and angle distribution and hydrogen bond pattern can also be achieved by energy minimizing the vacuum structures with the GB model, which takes a much shorter time than MD simulations with the GB model. 相似文献
5.
Qi W Zhang A Patel D Lee S Harrington JL Zhao L Schaefer D Good TA Fernandez EJ 《Biotechnology and bioengineering》2008,100(6):1214-1227
Increasing evidence indicates that soluble aggregates of amyloid beta protein (Abeta) are neurotoxic. However, difficulty in isolating these unstable, dynamic species impedes studies of Abeta and other aggregating peptides and proteins. In this study, hydrogen-deuterium exchange (HX) detected by mass spectrometry (MS) was used to measure Abeta(1-40) aggregate distributions without purification or modification that might alter the aggregate structure or distribution. Different peaks in the mass spectra were assigned to monomer, low molecular weight oligomer, intermediate, and fibril based on HX labeling behavior and complementary assays. After 1 h labeling, the intermediates incorporated approximately ten more deuterons relative to fibrils, indicating a more solvent exposed structure of such intermediates. HX-MS also showed that the intermediate species dissociated much more slowly to monomer than did the very low molecular weight oligomers that were formed at very early times in Abeta aggregation. Atomic force microscopy (AFM) measurements revealed the intermediates were roughly spherical with relatively homogenous diameters of 30-50 nm. Quantitative analysis of the HX mass spectra showed that the amount of intermediate species was correlated with Abeta toxicity patterns reported in a previous study under the same conditions. This study also demonstrates the potential of the HX-MS approach to characterizing complex, multi-component oligomer distributions of aggregating peptides and proteins. 相似文献
6.
The prediction of 1D structural properties of proteins is an important step toward the prediction of protein structure and function, not only in the ab initio case but also when homology information to known structures is available. Despite this the vast majority of 1D predictors do not incorporate homology information into the prediction process. We develop a novel structural alignment method, SAMD, which we use to build alignments of putative remote homologues that we compress into templates of structural frequency profiles. We use these templates as additional input to ensembles of recursive neural networks, which we specialise for the prediction of query sequences that show only remote homology to any Protein Data Bank structure. We predict four 1D structural properties – secondary structure, relative solvent accessibility, backbone structural motifs, and contact density. Secondary structure prediction accuracy, tested by five‐fold cross‐validation on a large set of proteins allowing less than 25% sequence identity between training and test set and query sequences and templates, exceeds 82%, outperforming its ab initio counterpart, other state‐of‐the‐art secondary structure predictors (Jpred 3 and PSIPRED) and two other systems based on PSI‐BLAST and COMPASS templates. We show that structural information from homologues improves prediction accuracy well beyond the Twilight Zone of sequence similarity, even below 5% sequence identity, for all four structural properties. Significant improvement over the extraction of structural information directly from PDB templates suggests that the combination of sequence and template information is more informative than templates alone. Proteins 2009. © 2009 Wiley‐Liss, Inc. 相似文献
7.
8.
In template-based modeling of protein structures, the generation of the alignment between the target and the template is a critical step that significantly affects the accuracy of the final model. This paper proposes an alignment algorithm SSALN that learns substitution matrices and position-specific gap penalties from a database of structurally aligned protein pairs. In addition to the amino acid sequence information, secondary structure and solvent accessibility information of a position are used to derive substitution scores and position-specific gap penalties. In a test set of CASP5 targets, SSALN outperforms sequence alignment methods such as a Smith-Waterman algorithm with BLOSUM50 and PSI_BLAST. SSALN also generates better alignments than PSI_BLAST in the CASP6 test set. LOOPP server prediction based on an SSALN alignment is ranked the best for target T0280_1 in CASP6. SSALN is also compared with several threading methods and sequence alignment methods on the ProSup benchmark. SSALN has the highest alignment accuracy among the methods compared. On the Fischer's benchmark, SSALN performs better than CLUSTALW and GenTHREADER, and generates more alignments with accuracy >50%, >60% or >70% than FUGUE, but fewer alignments with accuracy >80% than FUGUE. All the supplemental materials can be found at http://www.cs.cornell.edu/ approximately jianq/research.htm. 相似文献
9.
Dmitry A. Karasev Darya A. Veselova Alexander V. Veselovsky Boris N. Sobolev Victor G. Zgoda Alexander I. Archakov 《Proteins》2018,86(1):13-20
Protein phosphorylation is widely used in biological regulatory processes. The study of spatial features related to phosphorylation sites is necessary to increase the efficacy of recognition of phosphorylation patterns in protein sequences. Using the data on phosphosites found in amino acid sequences, we mapped these sites onto 3D structures and studied the structural variability of the same sites in different PDB entries related to the same proteins. Solvent accessibility was calculated for the residues known to be phosphorylated. A significant change in accessibility was shown for many sites, but several ones were determined as buried in all the structures considered. Most phosphosites were found in coil regions. However, a significant portion was located in the structurally stable ordered regions. Comparison of structures with the same sites in modified and unmodified states showed that the region surrounding a site could be significantly shifted due to phosphorylation. Comparison between non‐modified structures (as well as between the modified ones) suggested that phosphorylation stabilizes one of the possible conformations. The local structure around the site could be changed due to phosphorylation, but often the initial conformation of the site surrounding is not altered within bounds of a rather large substructure. In this case, we can observe an extensive displacement within a protein domain. Phosphorylation without structural alteration seems to provide the interface for domain‐domain or protein‐protein interactions. Accounting for structural features is important for revealing more specific patterns of phosphorylation. It is also necessary for explaining structural changes as a basis for regulatory processes. 相似文献
10.
Owing to the use of evolutionary information and advanced machine learning protocols, secondary structures of amino acid residues in proteins can be predicted from the primary sequence with more than 75% per-residue accuracy for the 3-state (i.e., helix, beta-strand, and coil) classification problem. In this work we investigate whether further progress may be achieved by incorporating the relative solvent accessibility (RSA) of an amino acid residue as a fingerprint of the overall topology of the protein. Toward that goal, we developed a novel method for secondary structure prediction that uses predicted RSA in addition to attributes derived from evolutionary profiles. Our general approach follows the 2-stage protocol of Rost and Sander, with a number of Elman-type recurrent neural networks (NNs) combined into a consensus predictor. The RSA is predicted using our recently developed regression-based method that provides real-valued RSA, with the overall correlation coefficients between the actual and predicted RSA of about 0.66 in rigorous tests on independent control sets. Using the predicted RSA, we were able to improve the performance of our secondary structure prediction by up to 1.4% and achieved the overall per-residue accuracy between 77.0% and 78.4% for the 3-state classification problem on different control sets comprising, together, 603 proteins without homology to proteins included in the training. The effects of including solvent accessibility depend on the quality of RSA prediction. In the limit of perfect prediction (i.e., when using the actual RSA values derived from known protein structures), the accuracy of secondary structure prediction increases by up to 4%. We also observed that projecting real-valued RSA into 2 discrete classes with the commonly used threshold of 25% RSA decreases the classification accuracy for secondary structure prediction. While the level of improvement of secondary structure prediction may be different for prediction protocols that implicitly account for RSA in other ways, we conclude that an increase in the 3-state classification accuracy may be achieved when combining RSA with a state-of-the-art protocol utilizing evolutionary profiles. The new method is available through a Web server at http://sable.cchmc.org. 相似文献
11.
The present study is an attempt to develop a neural network-based method for predicting the real value of solvent accessibility from the sequence using evolutionary information in the form of multiple sequence alignment. In this method, two feed-forward networks with a single hidden layer have been trained with standard back-propagation as a learning algorithm. The Pearson's correlation coefficient increases from 0.53 to 0.63, and mean absolute error decreases from 18.2 to 16% when multiple-sequence alignment obtained from PSI-BLAST is used as input instead of a single sequence. The performance of the method further improves from a correlation coefficient of 0.63 to 0.67 when secondary structure information predicted by PSIPRED is incorporated in the prediction. The final network yields a mean absolute error value of 15.2% between the experimental and predicted values, when tested on two different nonhomologous and nonredundant datasets of varying sizes. The method consists of two steps: (1) in the first step, a sequence-to-structure network is trained with the multiple alignment profiles in the form of PSI-BLAST-generated position-specific scoring matrices, and (2) in the second step, the output obtained from the first network and PSIPRED-predicted secondary structure information is used as an input to the second structure-to-structure network. Based on the present study, a server SARpred (http://www.imtech.res.in/raghava/sarpred/) has been developed that predicts the real value of solvent accessibility of residues for a given protein sequence. We have also evaluated the performance of SARpred on 47 proteins used in CASP6 and achieved a correlation coefficient of 0.68 and a MAE of 15.9% between predicted and observed values. 相似文献
12.
We developed dictionaries of two-, three-, and five-residue patterns in proteins and computed the average solvent accessibility of the central residues in their native proteins. These dictionaries serve as a look-up table for making subsequent predictions of solvent accessibility of amino acid residues. We find that predictions made in this way are very close to those made using more sophisticated methods of solvent accessibility prediction. We also analyzed the effect of immediate neighbors on the solvent accessibility of residues. This helps us in understanding how the same residue type may have different accessible surface areas in different proteins and in different positions of the same protein. We observe that certain residues have a tendency to increase or decrease the solvent accessibility of their neighboring residues in C- or N-terminal positions. Interestingly, the C-terminal and N-terminal neighbor residues are found to have asymmetric roles in modifying solvent accessibility of residues. As expected, similar neighbors enhance the hydrophobic or hydrophilic character of residues. Detailed look-up tables are provided on the web at www.netasa.org/look-up/. 相似文献
13.
Sulfur atoms have been known to participate in hydrogen bonds (H‐bonds) and these sulfur‐containing H‐bonds (SCHBs) are suggested to play important roles in certain biological processes. This study aims to comprehensively characterize all the SCHBs in 500 high‐resolution protein structures (≤1.8 Å). We categorized SCHBs into six types according to donor/acceptor behaviors and used explicit hydrogen approach to distinguish SCHBs from those of nonhydrogen bonding interactions. It is revealed that sulfur atom is a very poor H‐bond acceptor, but a moderately good H‐bond donor. In α‐helix, considerable SCHBs were found between the sulphydryl group of cysteine residue i and the carbonyl oxygen of residue i‐4, and these SCHBs exert effects in stabilizing helices. Although for other SCHBs, they possess no specific secondary structural preference, their geometric characteristics in proteins and in free small compounds are significantly distinct, indicating the protein SCHBs are geometrically distorted. Interestingly, sulfur atom in the disulfide bond tends to form bifurcated H‐bond whereas in cysteine‐cysteine pairs prefer to form dual H‐bond. These special H‐bonds remarkably boost the interaction between H‐bond donor and acceptor. By oxidation/reduction manner, the mutual transformation between the dual H‐bonds and disulfide bonds for cysteine‐cysteine pairs can accurately adjust the structural stability and biological function of proteins in different environments. Furthermore, few loose H‐bonds were observed to form between the sulphydryl groups and aromatic rings, and in these cases the donor H is almost over against the rim rather than the center of the aromatic ring. Proteins 2009. © 2008 Wiley‐Liss, Inc. 相似文献
14.
Information on relative solvent accessibility (RSA) of amino acid residues in proteins provides valuable clues to the prediction of protein structure and function. A two-stage approach with support vector machines (SVMs) is proposed, where an SVM predictor is introduced to the output of the single-stage SVM approach to take into account the contextual relationships among solvent accessibilities for the prediction. By using the position-specific scoring matrices (PSSMs) generated by PSI-BLAST, the two-stage SVM approach achieves accuracies up to 90.4% and 90.2% on the Manesh data set of 215 protein structures and the RS126 data set of 126 nonhomologous globular proteins, respectively, which are better than the highest published scores on both data sets to date. A Web server for protein RSA prediction using a two-stage SVM method has been developed and is available (http://birc.ntu.edu.sg/~pas0186457/rsa.html). 相似文献
15.
It is well known that nonpolar residues are largely buried in the interior of proteins, whereas polar and ionizable residues tend to be more localized on the protein surface where they are solvent exposed. Such a distribution of residues between surface and interior is well understood from a thermodynamic point: nonpolar side chains are excluded from the contact with the solvent water, whereas polar and ionizable groups have favorable interactions with the water and thus are preferred at the protein surface. However, there is an increasing amount of information suggesting that polar and ionizable residues do occur in the protein core, including at positions that have no known functional importance. This is inconsistent with the observations that dehydration of polar and in particular ionizable groups is very energetically unfavorable. To resolve this, we performed a detailed analysis of the distribution of fractional burial of polar and ionizable residues using a large set of ?2600 nonhomologous protein structures. We show that when ionizable residues are fully buried, the vast majority of them form hydrogen bonds and/or salt bridges with other polar/ionizable groups. This observation resolves an apparent contradiction: the energetic penalty of dehydration of polar/ionizable groups is paid off by favorable energy of hydrogen bonding and/or salt bridge formation in the protein interior. Our conclusion agrees well with the previous findings based on the continuum models for electrostatic interactions in proteins. Proteins 2011; © 2011 Wiley‐Liss, Inc. 相似文献
16.
We have improved the multiple linear regression (MLR) algorithm for protein secondary structure prediction by combining it with the evolutionary information provided by multiple sequence alignment of PSI-BLAST. On the CB513 dataset, the three states average overall per-residue accuracy, Q(3), reached 76.4%, while segment overlap accuracy, SOV99, reached 73.2%, using a rigorous jackknife procedure and the strictest reduction of eight states DSSP definition to three states. This represents an improvement of approximately 5% on overall per-residue accuracy compared with previous work. The relative solvent accessibility prediction also benefited from this combination of methods. The system achieved 77.7% average jackknifed accuracy for two states prediction based on a 25% relative solvent accessibility mode, with a Mathews' correlation coefficient of 0.548. The improved MLR secondary structure and relative solvent accessibility prediction server is available at http://spg.biosci.tsinghua.edu.cn/. 相似文献
17.
Elmira I. Yakupova Ivan M. Vikhlyantsev Liya G. Bobyleva Nikita V. Penkov Alexander A. Timchenko Maria A. Timchenko 《Journal of biomolecular structure & dynamics》2018,36(9):2237-2248
A comparative study of amyloid properties of the aggregates of smooth muscle titin (SMT) from chicken gizzard was carried out. These aggregates were formed in two solutions: 0.15 M glycine-KOH, pH 7.2–7.4 (SMT(Gly)) and 0.2 M KCl, 10 mM imidazole, pH 7.0 (SMT(KCl)). Electron microscopy data showed that SMT aggregates has an amorphous structure in both cases. The results of atomic-force microscopy demonstrated slight differences in morphology in two types of aggregates. The SMT(Gly) aggregates were represented as branching chains, composed of spherical aggregates approximately 300–500 nm in diameter and up to 35 nm in height. The SMT(KCl) aggregates formed sponge-like structures with strands of 8–10 nm in height. Structural analysis of SMT aggregates by X-ray diffraction revealed the presence of cross-β-sheet structure in the samples under study. In the presence of SMT(Gly) aggregates, thioflavine T fluorescence intensity was higher (~3-fold times) compared with that in the presence of SMT(KCl) aggregates. Congo red-stained SMT(Gly) aggregates had yellow to apple-green birefringence under polarized light, which was not observed for SMT(KCl) aggregates. Dynamic light scattering data showed the similar rate of aggregation for both types of aggregates, though SMT(KCl) aggregates were able to partially disaggregate under increased ionic strength of the solution. The ability of SMT to aggregation followed by disaggregation may be functionally significant in the cell. 相似文献
18.
A multiple linear regression method was applied to predict real values of solvent accessibility from the sequence and evolutionary information. This method allowed us to obtain coefficients of regression and correlation between the occurrence of an amino-acid residue at a specific target and its sequence neighbor positions on the one hand, and the solvent accessibility of that residue on the other. Our linear regression model based on sequence information and evolutionary models was found to predict residue accessibility with 18.9% and 16.2% mean absolute error respectively, which is better than or comparable to the best available methods. A correlation matrix for several neighbor positions to examine the role of evolutionary information at these positions has been developed and analyzed. As expected, the effective frequency of hydrophobic residues at target positions shows a strong negative correlation with solvent accessibility, whereas the reverse is true for charged and polar residues. The correlation of solvent accessibility with effective frequencies at neighboring positions falls abruptly with distance from target residues. Longer protein chains have been found to be more accurately predicted than their smaller counterparts. 相似文献
19.
AbstractAtomistic molecular dynamics simulations have been performed on the peptide amphiphiles (PAs) with four amyloid beta peptide fragments as head groups. The stable structures were monitored by the root mean square deviation with respect to the energy minimised initial structures. Random coil and β-sheet structures with hydrogen bonds along and perpendicular to the long axis of the nanofibre were obtained due to the different nature of the head groups. Influences of pH and capping ends on the nanofibre structures were investigated through variation of the protonation states of the ionic amino acids in the peptides. The peptides with opposite charges on both sides were found to have the fewest β-sheet structures, and the charges on the outer terminal tended to destruct the β-sheets while those at the inner side did not. The isolated charge in the centre of peptides was found to be able to promote the formation of regular β-sheets, while multiple charged residues could not support ordered β-sheet structures. When charge neutralisation occurred between adjacent residues, regular β-sheet laminates might also occur for systems with charges at the outer terminal. With the increase of β-sheet structures formed, the original twisted structures found for random coil structures of the PAs could be diminished by the hydrogen bonds. 相似文献
20.
Qi-Shi Du Dong Chen Neng-Zhong Xie Kuo-Chen Chou 《Journal of biomolecular structure & dynamics》2013,31(9):1957-1972
Although not being classified as the most fundamental protein structural elements like α-helices and β-strands, the loop segment may play considerable roles for protein stability, flexibility, and dynamic activity. Meanwhile, the protein loop is also quite elusive; i.e. its interactions with the other parts of protein as well as its own shape-maintaining forces have still remained as a puzzle or at least not quite clear yet. Here, we report a molecular force, the so-called polar hydrogen–π interaction (Hp–π), which may play an important role in supporting the backbones of protein loops. By conducting the potential energy surface scanning calculations on the quasi π-plane of peptide bond unit, we have observed the following intriguing phenomena: (1) when the polar hydrogen atom of a peptide unit is perpendicularly pointing to the π-plane of other peptide bond units, a remarkable Hp–π interaction occurs; (2) the interaction is distance and orientation dependent, acting in a broad space, and belonging to the ‘point-to-plane’ one. The molecular force reported here may provide useful interaction concepts and insights into better understanding the loop’s unique stability and flexibility feature, as well as the driving force of the protein global folding. 相似文献