首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The DSSP program assigns protein secondary structure to one of eight states. This discrete assignment cannot describe the continuum of thermal fluctuations. Hence, a continuous assignment is proposed. Technically, the continuum results from averaging over ten discrete DSSP assignments with different hydrogen bond thresholds. The final continuous assignment for a single NMR model successfully reflected the structural variations observed between all NMR models in the ensemble. The structural variations between NMR models were verified to correlate with thermal motion; these variations were captured by the continuous assignments. Because the continuous assignment reproduces the structural variation between many NMR models from one single model, functionally important variation can be extracted from a single X-ray structure. Thus, continuous assignments of secondary structure may affect future protein structure analysis, comparison, and prediction.  相似文献   

2.
We have developed an automatic algorithm STRIDE for protein secondary structure assignment from atomic coordinates based on the combined use of hydrogen bond energy and statistically derived backbone torsional angle information. Parameters of the pattern recognition procedure were optimized using designations provided by the crystallographers as a standard-of-truth. Comparison to the currently most widely used technique DSSP by Kabsch and Sander (Biopolymers 22:2577-2637, 1983) shows that STRIDE and DSSP assign secondary structural states in 58 and 31% of 226 protein chains in our data sample, respectively, in greater agreement with the specific residue-by-residue definitions provided by the discoverers of the structures while in 11% of the chains, the assignments are the same. STRIDE delineates every 11th helix and every 32nd strand more in accord with published assignments. © 1995 Wiley-Liss, Inc.  相似文献   

3.
Lee J 《Proteins》2006,65(2):453-462
Many of the recent secondary structure prediction methods incorporate the idea of fuzzy set theory, where instead of assigning a definite secondary structure to a query residue, probability for the residue being in each of the conformational states is estimated. Moreover, continuous assignment of conformational states to the experimentally observed protein structures can be performed in order to reflect inherent flexibility. Although various measures have been developed for evaluating performances of secondary structure prediction methods, they depend only on the most probable secondary structures. They do not assess the accuracy of the probabilities produced by fuzzy prediction methods, and they cannot incorporate information contained in continuous assignments of conformational states to observed structures. Three important measures for evaluating performance of a secondary structure prediction algorithm, Q score, Segment OVerlap (SOV) measure, and the k-state correlation coefficient (Corr), are deformed into fuzzy measures F score, Fuzzy OVerlap (FOV) measure, and the fuzzy correlation coefficient (Forr), so that the new measures not only assess probabilistic outputs of fuzzy prediction methods, but also incorporate information from continuous assignments of secondary structure. As an example of application, prediction results of four fuzzy secondary structure prediction methods, PSIPRED, PROFking, SABLE, and PREDICT, are assessed using the new fuzzy measures.  相似文献   

4.
Zhang W  Dunker AK  Zhou Y 《Proteins》2008,71(1):61-67
How to make an objective assignment of secondary structures based on a protein structure is an unsolved problem. Defining the boundaries between helix, sheet, and coil structures is arbitrary, and commonly accepted standard assignments do not exist. Here, we propose a criterion that assesses secondary structure assignment based on the similarity of the secondary structures assigned to pairwise sequence-alignment benchmarks, where these benchmarks are determined by prior structural alignments of the protein pairs. This criterion is used to rank six secondary structure assignment methods: STRIDE, DSSP, SECSTR, KAKSI, P-SEA, and SEGNO with three established sequence-alignment benchmarks (PREFAB, SABmark, and SALIGN). STRIDE and KAKSI achieve comparable success rates in assigning the same secondary structure elements to structurally aligned residues in the three benchmarks. Their success rates are between 1-4% higher than those of the other four methods. The consensus of STRIDE, KAKSI, SECSTR, and P-SEA, called SKSP, improves assignments over the best single method in each benchmark by an additional 1%. These results support the usefulness of the sequence-alignment benchmarks as a means to evaluate secondary structure assignment. The SKSP server and the benchmarks can be accessed at http://sparks.informatics.iupui.edu  相似文献   

5.

Background

Secondary structures are elements of great importance in structural biology, biochemistry and bioinformatics. They are broadly composed of two repetitive structures namely α-helices and β-sheets, apart from turns, and the rest is associated to coil. These repetitive secondary structures have specific and conserved biophysical and geometric properties. PolyProline II (PPII) helix is yet another interesting repetitive structure which is less frequent and not usually associated with stabilizing interactions. Recent studies have shown that PPII frequency is higher than expected, and they could have an important role in protein – protein interactions.

Methodology/Principal Findings

A major factor that limits the study of PPII is that its assignment cannot be carried out with the most commonly used secondary structure assignment methods (SSAMs). The purpose of this work is to propose a PPII assignment methodology that can be defined in the frame of DSSP secondary structure assignment. Considering the ambiguity in PPII assignments by different methods, a consensus assignment strategy was utilized. To define the most consensual rule of PPII assignment, three SSAMs that can assign PPII, were compared and analyzed. The assignment rule was defined to have a maximum coverage of all assignments made by these SSAMs. Not many constraints were added to the assignment and only PPII helices of at least 2 residues length are defined.

Conclusions/Significance

The simple rules designed in this study for characterizing PPII conformation, lead to the assignment of 5% of all amino as PPII. Sequence – structure relationships associated with PPII, defined by the different SSAMs, underline few striking differences. A specific study of amino acid preferences in their N and C-cap regions was carried out as their solvent accessibility and contact patterns. Thus the assignment of PPII can be coupled with DSSP and thus opens a simple way for further analysis in this field.  相似文献   

6.
Park SY  Yoo MJ  Shin J  Cho KH 《BMB reports》2011,44(2):118-122
Most widely used secondary structure assignment methods such as DSSP identify structural elements based on N-H and C=O hydrogen bonding patterns from X-ray or NMR-determined coordinates. Secondary structure assignment algorithms using limited Cα information have been under development as well, but their accuracy is only ~80% compared to DSSP. We have hereby developed SABA (Secondary Structure Assignment Program Based on only Alpha Carbons) with~90% accuracy. SABA defines a novel geometrical parameter, termed a pseudo center, which is the midpoint of two continuous Cαs. SABA is capable of identifying α-helices, 3(10)-helices, and β-strands with high accuracy by using cut-off criteria on distances and dihedral angles between two or more pseudo centers. In addition to assigning secondary structures to Cα-only structures, algorithms using limited Cα information with high accuracy have the potential to enhance the speed of calculations for high capacity structure comparison.  相似文献   

7.
The significant biological role of RNA has further highlighted the need for improving the accuracy, efficiency and the reach of methods for investigating RNA structure and function. Nuclear magnetic resonance (NMR) spectroscopy is vital to furthering the goals of RNA structural biology because of its distinctive capabilities. However, the dispersion pattern in the NMR spectra of RNA makes automated resonance assignment, a key step in NMR investigation of biomolecules, remarkably challenging. Herein we present RNA Probabilistic Assignment of Imino Resonance Shifts (RNA-PAIRS), a method for the automated assignment of RNA imino resonances with synchronized verification and correction of predicted secondary structure. RNA-PAIRS represents an advance in modeling the assignment paradigm because it seeds the probabilistic network for assignment with experimental NMR data, and predicted RNA secondary structure, simultaneously and from the start. Subsequently, RNA-PAIRS sets in motion a dynamic network that reverberates between predictions and experimental evidence in order to reconcile and rectify resonance assignments and secondary structure information. The procedure is halted when assignments and base-parings are deemed to be most consistent with observed crosspeaks. The current implementation of RNA-PAIRS uses an initial peak list derived from proton-nitrogen heteronuclear multiple quantum correlation (1H–15N 2D HMQC) and proton–proton nuclear Overhauser enhancement spectroscopy (1H–1H 2D NOESY) experiments. We have evaluated the performance of RNA-PAIRS by using it to analyze NMR datasets from 26 previously studied RNAs, including a 111-nucleotide complex. For moderately sized RNA molecules, and over a range of comparatively complex structural motifs, the average assignment accuracy exceeds 90%, while the average base pair prediction accuracy exceeded 93%. RNA-PAIRS yielded accurate assignments and base pairings consistent with imino resonances for a majority of the NMR resonances, even when the initial predictions are only modestly accurate. RNA-PAIRS is available as a public web-server at .  相似文献   

8.
J M Moore  W J Chazin  R Powls  P E Wright 《Biochemistry》1988,27(20):7806-7816
Two-dimensional 1H NMR methods have been used to make sequence-specific resonance assignments for the 97 amino acid residues of the plastocyanin from the green alga Scenedesmus obliquus. Assignments were obtained for all backbone protons and the majority of the side-chain protons. Spin system identification relied heavily on the observation of relayed connectivities to the backbone amide proton. Sequence-specific assignments were made by using the sequential assignment procedure. During this process, an extra valine residue was identified that had not been detected in the original amino acid sequence. Elements of regular secondary structure were identified from characteristic NOE connectivities between backbone protons, 3JHN alpha coupling constant values, and the observation of slowly exchanging amide protons. The protein in solution contains eight beta-strands, one short segment of helix, five reverse turns, and five loops. The beta-strands may be arranged into two beta-sheets on the basis of extensive cross-strand NOE connectivities. The chain-folding topology determined from the NMR experiments is that of a Greek key beta-barrel and is similar to that observed for French bean plastocyanin in solution and poplar plastocyanin in the crystalline state. While the overall structures are similar, several differences in local structure between the S. obliquus and higher plant plastocyanins have been identified.  相似文献   

9.
A suite of programs called CAMRA (Computer Aided Magnetic Resonance Assignment) has been developed for computer assisted residue-specific assignments of proteins. CAMRA consists of three units: ORB, CAPTURE and PROCESS. ORB predicts NMR chemical shifts for unassigned proteins using a chemical shift database of previously assigned homologous proteins supplemented by a statistically derived chemical shift database in which the shifts are categorized according to their residue, atom and secondary structure type. CAPTURE generates a list of valid peaks from NMR spectra by filtering out noise peaks and other artifacts and then separating the derived peak list into distinct spin systems. PROCESS combines the chemical shift predictions from ORB with the spin systems identified by CAPTURE to obtain residue specific assignments. PROCESS ranks the top choices for an assignment along with scores and confidence values. In contrast to other auto-assignment programs, CAMRA does not use any connectivity information but instead is based solely on matching predicted shifts with observed spin systems. As such, CAMRA represents a new and unique approach for the assignment of protein NMR spectra. CAMRA will be particularly useful in conjunction with other assignment methods and under special circumstances, such as the assignment of flexible regions in proteins where sufficient NOE information is generally not available. CAMRA was tested on two medium-sized proteins belonging to the chemokine family. It was found to be effective in predicting the assignment providing a database of previously assigned proteins with at least 30% sequence identity is available. CAMRA is versatile and can be used to include and evaluate heteronuclear and three-dimensional experiments.  相似文献   

10.
We present two NMR experiments, (3,2)D HNHA and (3,2)D HNHB, for rapid and accurate measurement of 3J(H N-H alpha) and 3J(N-H beta) coupling constants in polypeptides based on the principle of G-matrix Fourier transform NMR spectroscopy and quantitative J-correlation. These experiments, which facilitate fast acquisition of three-dimensional data with high spectral/digital resolution and chemical shift dispersion, will provide renewed opportunities to utilize them for sequence specific resonance assignments, estimation/characterization of secondary structure with/without prior knowledge of resonance assignments, stereospecific assignment of prochiral groups and 3D structure determination, refinement and validation. Taken together, these experiments have a wide range of applications from structural genomics projects to studying structure and folding in polypeptides.  相似文献   

11.

Background  

The structure of proteins may change as a result of the inherent flexibility of some protein regions. We develop and explore probabilistic machine learning methods for predicting a continuum secondary structure, i.e. assigning probabilities to the conformational states of a residue. We train our methods using data derived from high-quality NMR models.  相似文献   

12.
Protein structure determination by NMR can in principle be speeded up both by reducing the measurement time on the NMR spectrometer and by a more efficient analysis of the spectra. Here we study the reliability of protein structure determination based on a single type of spectra, namely nuclear Overhauser effect spectroscopy (NOESY), using a fully automated procedure for the sequence-specific resonance assignment with the recently introduced FLYA algorithm, followed by combined automated NOE distance restraint assignment and structure calculation with CYANA. This NOESY-FLYA method was applied to eight proteins with 63–160 residues for which resonance assignments and solution structures had previously been determined by the Northeast Structural Genomics Consortium (NESG), and unrefined and refined NOESY data sets have been made available for the Critical Assessment of Automated Structure Determination of Proteins by NMR project. Using only peak lists from three-dimensional 13C- or 15N-resolved NOESY spectra as input, the FLYA algorithm yielded for the eight proteins 91–98 % correct backbone and side-chain assignments if manually refined peak lists are used, and 64–96 % correct assignments based on raw peak lists. Subsequent structure calculations with CYANA then produced structures with root-mean-square deviation (RMSD) values to the manually determined reference structures of 0.8–2.0 Å if refined peak lists are used. With raw peak lists, calculations for 4 proteins converged resulting in RMSDs to the reference structure of 0.8–2.8 Å, whereas no convergence was obtained for the four other proteins (two of which did already not converge with the correct manual resonance assignments given as input). These results show that, given high-quality experimental NOESY peak lists, the chemical shift assignments can be uncovered, without any recourse to traditional through-bond type assignment experiments, to an extent that is sufficient for calculating accurate three-dimensional structures.  相似文献   

13.
Combined automated NOE assignment and structure determination module (CANDID) is a new software for efficient NMR structure determination of proteins by automated assignment of the NOESY spectra. CANDID uses an iterative approach with multiple cycles of NOE cross-peak assignment and protein structure calculation using the fast DYANA torsion angle dynamics algorithm, so that the result from each CANDID cycle consists of exhaustive, possibly ambiguous NOE cross-peak assignments in all available spectra and a three-dimensional protein structure represented by a bundle of conformers. The input for the first CANDID cycle consists of the amino acid sequence, the chemical shift list from the sequence-specific resonance assignment, and listings of the cross-peak positions and volumes in one or several two, three or four-dimensional NOESY spectra. The input for the second and subsequent CANDID cycles contains the three-dimensional protein structure from the previous cycle, in addition to the complete input used for the first cycle. CANDID includes two new elements that make it robust with respect to the presence of artifacts in the input data, i.e. network-anchoring and constraint-combination, which have a key role in de novo protein structure determinations for the successful generation of the correct polypeptide fold by the first CANDID cycle. Network-anchoring makes use of the fact that any network of correct NOE cross-peak assignments forms a self-consistent set; the initial, chemical shift-based assignments for each individual NOE cross-peak are therefore weighted by the extent to which they can be embedded into the network formed by all other NOE cross-peak assignments. Constraint-combination reduces the deleterious impact of artifact NOE upper distance constraints in the input for a protein structure calculation by combining the assignments for two or several peaks into a single upper limit distance constraint, which lowers the probability that the presence of an artifact peak will influence the outcome of the structure calculation. CANDID test calculations were performed with NMR data sets of four proteins for which high-quality structures had previously been solved by interactive protocols, and they yielded comparable results to these reference structure determinations with regard to both the residual constraint violations, and the precision and accuracy of the atomic coordinates. The CANDID approach has further been validated by de novo NMR structure determinations of four additional proteins. The experience gained in these calculations shows that once nearly complete sequence-specific resonance assignments are available, the automated CANDID approach results in greatly enhanced efficiency of the NOESY spectral analysis. The fact that the correct fold is obtained in cycle 1 of a de novo structure calculation is the single most important advance achieved with CANDID, when compared with previously proposed automated NOESY assignment methods that do not use network-anchoring and constraint-combination.  相似文献   

14.
Several static structural models exist for γδ resolvase, a self-coded DNA recombinase of the γδ transposon. While these reports are invaluable to formulation of a mechanistic hypothesis for DNA strand exchange, several questions remain. Foremost among them concerns the protomer structural dynamics within the protein/DNA synaptosome. Solution NMR chemical shift assignments have been made for truncated variants of the natural wild-type dimer, which is inactive without the full synaptosome structure, and a mutationally activated tetramer. Of the 134 residues, backbone 1H, 15N, and 13Cα assignments are made for 121–124 residues in the dimer, but only 76–80 residues of the tetramer. These assignment differences are interpreted by comparison to X-ray diffraction models of the recombinase dimer and tetramer. Inspection of intramolecular and intermolecular structural variation between these models suggests a correspondence between sequence regions at subunit interfaces unique to tetramer, and the regions that can be sequentially assigned in the dimer but not the tetramer. The loss of sequential context for assignment is suggestive of stochastic fluctuation between structural states involving protomer–protomer interactions exclusive to the activated tetrameric state, and may be indicative of dynamics which pertain to the recombinase mechanism.  相似文献   

15.
Estimation of secondary structure in polypeptides is important for studying their structure, folding and dynamics. In NMR spectroscopy, such information is generally obtained after sequence specific resonance assignments are completed. We present here a new methodology for assignment of secondary structure type to spin systems in proteins directly from NMR spectra, without prior knowledge of resonance assignments. The methodology, named Combination of Shifts for Secondary Structure Identification in Proteins (CSSI-PRO), involves detection of specific linear combination of backbone 1Hα and 13C′ chemical shifts in a two-dimensional (2D) NMR experiment based on G-matrix Fourier transform (GFT) NMR spectroscopy. Such linear combinations of shifts facilitate editing of residues belonging to α-helical/β-strand regions into distinct spectral regions nearly independent of the amino acid type, thereby allowing the estimation of overall secondary structure content of the protein. Comparison of the predicted secondary structure content with those estimated based on their respective 3D structures and/or the method of Chemical Shift Index for 237 proteins gives a correlation of more than 90% and an overall rmsd of 7.0%, which is comparable to other biophysical techniques used for structural characterization of proteins. Taken together, this methodology has a wide range of applications in NMR spectroscopy such as rapid protein structure determination, monitoring conformational changes in protein-folding/ligand-binding studies and automated resonance assignment. Electronic supplementary material  The online version of this article (doi:) contains supplementary material, which is available to authorized users.  相似文献   

16.
NMR resonance assignment is one of the key steps in solving an NMR protein structure. The assignment process links resonance peaks to individual residues of the target protein sequence, providing the prerequisite for establishing intra- and inter-residue spatial relationships between atoms. The assignment process is tedious and time-consuming, which could take many weeks. Though there exist a number of computer programs to assist the assignment process, many NMR labs are still doing the assignments manually to ensure quality. This paper presents a new computational method based on the combination of a suite of algorithms for automating the assignment process, particularly the process of backbone resonance peak assignment. We formulate the assignment problem as a constrained weighted bipartite matching problem. While the problem, in the most general situation, is NP-hard, we present an efficient solution based on a branch-and-bound algorithm with effective bounding techniques using two recently introduced approximation algorithms. We also devise a greedy filtering algorithm for reducing the search space. Our experimental results on 70 instances of (pseudo) real NMR data derived from 14 proteins demonstrate that the new solution runs much faster than a recently introduced (exhaustive) two-layer algorithm and recovers more correct peak assignments than the two-layer algorithm. Our result demonstrates that integrating different algorithms can achieve a good tradeoff between backbone assignment accuracy and computation time.  相似文献   

17.
ADAPT-NMR (Assignment-directed Data collection Algorithm utilizing a Probabilistic Toolkit in NMR) represents a groundbreaking prototype for automated protein structure determination by nuclear magnetic resonance (NMR) spectroscopy. With a [(13)C,(15)N]-labeled protein sample loaded into the NMR spectrometer, ADAPT-NMR delivers complete backbone resonance assignments and secondary structure in an optimal fashion without human intervention. ADAPT-NMR achieves this by implementing a strategy in which the goal of optimal assignment in each step determines the subsequent step by analyzing the current sum of available data. ADAPT-NMR is the first iterative and fully automated approach designed specifically for the optimal assignment of proteins with fast data collection as a byproduct of this goal. ADAPT-NMR evaluates the current spectral information, and uses a goal-directed objective function to select the optimal next data collection step(s) and then directs the NMR spectrometer to collect the selected data set. ADAPT-NMR extracts peak positions from the newly collected data and uses this information in updating the analysis resonance assignments and secondary structure. The goal-directed objective function then defines the next data collection step. The procedure continues until the collected data support comprehensive peak identification, resonance assignments at the desired level of completeness, and protein secondary structure. We present test cases in which ADAPT-NMR achieved results in two days or less that would have taken two months or more by manual approaches.  相似文献   

18.
Proton resonance assignments of horse ferrocytochrome c   总被引:4,自引:0,他引:4  
Two-dimensional nuclear magnetic resonance (NMR) spectroscopy was used to assign the proton resonances of horse ferrocytochrome c. Assignments were based on the main chain directed (MCD) and sequential assignment procedures. The fundamental units of the MCD approach, the main-chain NH-C alpha H-C beta H J-coupled subspin systems of each amino acid residue (NAB sets), were defined by analysis of direct and relayed coherence transfer spectra. Recognition of main-chain NOE connectivity patterns specified in the MCD algorithm then allowed NAB sets to be aligned in their proper juxtaposition within secondary structural units. The units of secondary structure were placed within the polypeptide sequence of identification of a small number of side-chain J-coupled spin systems, found by direct recognition in 2D spectra of some J-coupled spin systems and by pairwise comparisons of the J-correlated spectra of six homologous cytochromes c having a small number of known amino acid differences. The placement of a given segment in this way defines the amino acid identity of all its NAB sets. This foreknowledge allowed the vast majority of the side-chain resonances to be discerned in J-correlated spectra. Extensive confirmation of the assignments derives internally from multiple main-chain NOE connectivities and their consistency following temperature-induced changes of the chemical shifts of NOE-correlated protons. The observed patterns of main-chain NOEs provide some structural information and suggest small but potentially significant differences between the solution structure observed by NMR and that defined earlier in crystallographic studies at 2.8-A resolution.  相似文献   

19.
W A Lim  D C Farruggio  R T Sauer 《Biochemistry》1992,31(17):4324-4333
We have characterized the properties of a set of variants of the N-terminal domain of lambda repressor bearing disruptive mutations in the hydrophobic core. These mutations include some that dramatically alter the total core residue volume (by up to six methylene groups) and some that place a single polar residue into the otherwise hydrophobic core. The structural properties of the purified proteins have been studied by CD spectroscopy, biological activity, recognition by conformation-specific monoclonal antibodies, and 1H NMR spectroscopy. The stabilities of the proteins have been measured by thermal and guanidine hydrochloride denaturation. Proteins with disruptive core mutations are found to display a continuum of increasingly nonnative properties. Large internal volume changes cause both significant conformational rearrangements and destabilization by up to 5 kcal/mol. Variants with polar substitutions at core positions no longer behave like well-folded proteins but rather display characteristics of molten globules. However, even proteins bearing some of the most disruptive mutations retain many of the crude secondary and tertiary structural features of the wild-type protein. These results indicate that primitive elements of native structure can form in the absence of normal core packing.  相似文献   

20.
We describe a simple approach to classify amino acid residue types in NMR spectra of proteins for supporting the backbone resonance assignments. It makes use of the differences in biosynthetic pathways of the 20 amino acids in Escherichia coli. Therefore, it is distinct from the parameters routinely exploited in the backbone resonance assignment such as chemical shifts and spin topology information. The combination of biosynthetically directed fractional 13C-labeling and uniform 15N-labeling enables us to obtain both residue-type specific information and sequential connectivities from a single protein sample. The residue-type classification exploiting biosynthetic pathways can be used for accelerating the conventional backbone assignment procedure.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号