首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Protein structure determination by NMR can in principle be speeded up both by reducing the measurement time on the NMR spectrometer and by a more efficient analysis of the spectra. Here we study the reliability of protein structure determination based on a single type of spectra, namely nuclear Overhauser effect spectroscopy (NOESY), using a fully automated procedure for the sequence-specific resonance assignment with the recently introduced FLYA algorithm, followed by combined automated NOE distance restraint assignment and structure calculation with CYANA. This NOESY-FLYA method was applied to eight proteins with 63–160 residues for which resonance assignments and solution structures had previously been determined by the Northeast Structural Genomics Consortium (NESG), and unrefined and refined NOESY data sets have been made available for the Critical Assessment of Automated Structure Determination of Proteins by NMR project. Using only peak lists from three-dimensional 13C- or 15N-resolved NOESY spectra as input, the FLYA algorithm yielded for the eight proteins 91–98 % correct backbone and side-chain assignments if manually refined peak lists are used, and 64–96 % correct assignments based on raw peak lists. Subsequent structure calculations with CYANA then produced structures with root-mean-square deviation (RMSD) values to the manually determined reference structures of 0.8–2.0 Å if refined peak lists are used. With raw peak lists, calculations for 4 proteins converged resulting in RMSDs to the reference structure of 0.8–2.8 Å, whereas no convergence was obtained for the four other proteins (two of which did already not converge with the correct manual resonance assignments given as input). These results show that, given high-quality experimental NOESY peak lists, the chemical shift assignments can be uncovered, without any recourse to traditional through-bond type assignment experiments, to an extent that is sufficient for calculating accurate three-dimensional structures.  相似文献   

2.
Reliable automated NOE assignment and structure calculation on the basis of a largely complete, assigned input chemical shift list and a list of unassigned NOESY cross peaks has recently become feasible for routine NMR protein structure calculation and has been shown to yield results that are equivalent to those of the conventional, manual approach. However, these algorithms rely on the availability of a virtually complete list of the chemical shifts. This paper investigates the influence of incomplete chemical shift assignments on the reliability of NMR structures obtained with automated NOESY cross peak assignment. The program CYANA was used for combined automated NOESY assignment with the CANDID algorithm and structure calculations with torsion angle dynamics at various degrees of completeness of the chemical shift assignment which was simulated by random omission of entries in the experimental 1H chemical shift lists that had been used for the earlier, conventional structure determinations of two proteins. Sets of structure calculations were performed choosing the omitted chemical shifts randomly among all assigned hydrogen atoms, or among aromatic hydrogen atoms. For comparison, automated NOESY assignment and structure calculations were performed with the complete experimental chemical shift but under random omission of NOESY cross peaks. When heteronuclear-resolved three-dimensional NOESY spectra are available the current CANDID algorithm yields in the absence of up to about 10% of the experimental 1H chemical shifts reliable NOE assignments and three-dimensional structures that deviate by less than 2 Å from the reference structure obtained using all experimental chemical shift assignments. In contrast, the algorithm can accommodate the omission of up to 50% of the cross peaks in heteronuclear- resolved NOESY spectra without producing structures with a RMSD of more than 2 Å to the reference structure. When only homonuclear NOESY spectra are available, the algorithm is slightly more susceptible to missing data and can tolerate the absence of up to about 7% of the experimental 1H chemical shifts or of up to 30% of the NOESY peaks.Abbreviations: BmPBPA – Bombyx mori pheromone binding protein form A; CYANA – combined assignment and dynamics algorithm for NMR applications; NMR – nuclear magnetic resonance; NOE – nuclear Overhauser effect; NOESY – NOE spectroscopy; RMSD – root-mean-square deviation; WmKT – Williopsis mrakii killer toxin  相似文献   

3.
The amount of experimental restraints e.g., NOEs is often too small for calculating high quality three-dimensional structures by restrained molecular dynamics. Considering this as a typical missing value problem we propose here a model based data imputation technique that should lead to an improved estimation of the correct structure. The novel automated method implemented in AUREMOL makes a more efficient use of the experimental information to obtain NMR structures with higher accuracy. It creates a large set of substitute restraints that are used either alone or together with the experimental restraints. The new approach was successfully tested on three examples: firstly, the Ras-binding domain of Byr2 from Schizosaccharomyces pombe, the mutant HPr (H15A) from Staphylococcus aureus, and a X-ray structure of human ubiquitin. In all three examples, the quality of the resulting final bundles was improved considerably by the use of additional substitute restraints, as assessed quantitatively by the calculation of RMSD values to the “true” structure and NMR R-factors directly calculated from the original NOESY spectra or the published diffraction data.  相似文献   

4.
The labeling of proteins with stable isotopes enhances the NMR method for the determination of 3D protein structures in solution. Stereo-array isotope labeling (SAIL) provides an optimal stereospecific and regiospecific pattern of stable isotopes that yields sharpened lines, spectral simplification without loss of information, and the ability to collect rapidly and evaluate fully automatically the structural restraints required to solve a high-quality solution structure for proteins up to twice as large as those that can be analyzed using conventional methods. Here, we describe a protocol for the preparation of SAIL proteins by cell-free methods, including the preparation of S30 extract and their automated structure analysis using the FLYA algorithm and the program CYANA. Once efficient cell-free expression of the unlabeled or uniformly labeled target protein has been achieved, the NMR sample preparation of a SAIL protein can be accomplished in 3 d. A fully automated FLYA structure calculation can be completed in 1 d on a powerful computer system.  相似文献   

5.
Automated structure determination from NMR spectra   总被引:2,自引:0,他引:2  
Automated methods for protein structure determination by NMR have increasingly gained acceptance and are now widely used for the automated assignment of distance restraints and the calculation of three-dimensional structures. This review gives an overview of the techniques for automated protein structure analysis by NMR, including both NOE-based approaches and methods relying on other experimental data such as residual dipolar couplings and chemical shifts, and presents the FLYA algorithm for the fully automated NMR structure determination of proteins that is suitable to substitute all manual spectra analysis and thus overcomes a major efficiency limitation of the NMR method for protein structure determination.  相似文献   

6.
Novel algorithms are presented for automated NOESY peak picking and NOE signal identification in homonuclear 2D and heteronuclear-resolved 3D [1H,1H]-NOESY spectra during de novoprotein structure determination by NMR, which have been implemented in the new software ATNOS (automated NOESY peak picking). The input for ATNOS consists of the amino acid sequence of the protein, chemical shift lists from the sequence-specific resonance assignment, and one or several 2D or 3D NOESY spectra. In the present implementation, ATNOS performs multiple cycles of NOE peak identification in concert with automated NOE assignment with the software CANDID and protein structure calculation with the program DYANA. In the second and subsequent cycles, the intermediate protein structures are used as an additional guide for the interpretation of the NOESY spectra. By incorporating the analysis of the raw NMR data into the process of automated de novoprotein NMR structure determination, ATNOS enables direct feedback between the protein structure, the NOE assignments and the experimental NOESY spectra. The main elements of the algorithms for NOESY spectral analysis are techniques for local baseline correction and evaluation of local noise level amplitudes, automated determination of spectrum-specific threshold parameters, the use of symmetry relations, and the inclusion of the chemical shift information and the intermediate protein structures in the process of distinguishing between NOE peaks and artifacts. The ATNOS procedure has been validated with experimental NMR data sets of three proteins, for which high-quality NMR structures had previously been obtained by interactive interpretation of the NOESY spectra. The ATNOS-based structures coincide closely with those obtained with interactive peak picking. Overall, we present the algorithms used in this paper as a further important step towards objective and efficient de novoprotein structure determination by NMR.  相似文献   

7.
The quality of protein structures determined by nuclear magnetic resonance (NMR) spectroscopy is contingent on the number and quality of experimentally-derived resonance assignments, distance and angular restraints. Two key features of protein NMR data have posed challenges for the routine and automated structure determination of small to medium sized proteins; (1) spectral resolution – especially of crowded nuclear Overhauser effect spectroscopy (NOESY) spectra, and (2) the reliance on a continuous network of weak scalar couplings as part of most common assignment protocols. In order to facilitate NMR structure determination, we developed a semi-automated strategy that utilizes non-uniform sampling (NUS) and multidimensional decomposition (MDD) for optimal data collection and processing of selected, high resolution multidimensional NMR experiments, combined it with an ABACUS protocol for sequential and side chain resonance assignments, and streamlined this procedure to execute structure and refinement calculations in CYANA and CNS, respectively. Two graphical user interfaces (GUIs) were developed to facilitate efficient analysis and compilation of the data and to guide automated structure determination. This integrated method was implemented and refined on over 30 high quality structures of proteins ranging from 5.5 to 16.5 kDa in size.  相似文献   

8.
The C-terminal trigger sequence is essential in the coiled-coil formation of GCN4-p1; its conformational properties are thus of importance for understanding this process at the atomic level. A solution NMR model structure of a peptide, GCN4p16–31, encompassing the GCN4-p1 trigger sequence was proposed a few years ago. Derived using a standard single-structure refinement protocol based on 172 nuclear Overhauser effect (NOE) distance restraints, 14 hydrogen-bond and 11 ϕ torsional-angle restraints, the resulting set of 20 NMR model structures exhibits regular α-helical structure. However, the set slightly violates some measured NOE bounds and does not reproduce all 15 measured 3J(HN-H)-coupling constants, indicating that different conformers of GCN4p16–31 might be present in solution. With the aim to resolve structures compatible with all NOE upper distance bounds and 3J-coupling constants, we executed several structure refinement protocols employing unrestrained and restrained molecular dynamics (MD) simulations with two force fields. We find that only configurational ensembles obtained by applying simultaneously time-averaged NOE distance and 3J-coupling constant restraining with either force field reproduce all the experimental data. Additionally, analyses of the simulated ensembles show that the conformational variability of GCN4p16–31 in solution admitted by the available set of 187 measured NMR data is larger than represented by the set of the NMR model structures. The conformations of GCN4p16–31 in solution differ in the orientation not only of the side-chains but also of the backbone. The inconsistencies between the NMR model structures and the measured NMR data are due to the neglect of averaging effects and the inclusion of hydrogen-bond and torsional-angle restraints that have little basis in the primary, i.e. measured NMR data.  相似文献   

9.
A procedure for automated protein structure determination is presented that is based on an iterative procedure during which the NOESY peak list assignment and the structure calculation are performed simultaneously. The input consists of a list of NOESY peak positions and a list of chemical shifts as obtained from sequence-specific resonance assignment. For the present applications of this approach the previously introduced NOAH routine was implemented in the distance geometry program DIANA. As an illustration, experimental 2D and 3D NOESY cross-peak lists of six proteins have been analyzed, for which complete sequence-specific 1H assignments are available for the polypeptide backbone and the amino acid side chains. The automated method assigned 70–90% of all NOESY cross peaks, which is on average 10% less than with the interactive approach, and only between 0.8% and 2.4% of the automatically assigned peaks had a different assignment than in the corresponding manually assigned peak lists. The structures obtained with NOAH/DIANA are in close agreement with those from manually assigned peak lists, and with both approaches the residual constraint violations correspond to high-quality NMR structure determinations. Systematic comparisons of the bundles of conformers that represent corresponding automatically and interactively determined structures document the absence of significant bias in either approach, indicating that an important step has been made towards automation of structure determination from NMR spectra.  相似文献   

10.
CD2 associated protein (CD2AP) is an adaptor protein that plays an important role in cell to cell union needed for the kidney function. CD2AP interacts, as an adaptor protein, with different natural targets, such as CD2, nefrin, c-Cbl and podocin. These proteins are believed to interact to one of the three SH3 domains that are positioned in the N-terminal region of CD2AP. To understand the network of interactions between the natural targets and the three SH3 domains (SH3-A, B and C), we have started to determine the structures of the individual SH3 domains. Here we present the high-resolution structure of the SH3-C domain derived from NMR data. Full backbone and side-chain assignments were obtained from triple-resonance spectra. The structure was determined from distance restraints derived from high-resolution 600 and 800 MHz NOESY spectra, together with phi and psi torsion angle restraints based on the analysis of 1HN, 15N, 1Hα, 13Cα, 13CO and 13Cβ chemical shifts. Structures were calculated using CYANA and refined in water using RECOORD. The three-dimensional structure of CD2AP SH3-C contains all the features that are typically found in other SH3 domains, including the general binding site for the recognition of polyproline sequences.  相似文献   

11.
Combined automated NOE assignment and structure determination module (CANDID) is a new software for efficient NMR structure determination of proteins by automated assignment of the NOESY spectra. CANDID uses an iterative approach with multiple cycles of NOE cross-peak assignment and protein structure calculation using the fast DYANA torsion angle dynamics algorithm, so that the result from each CANDID cycle consists of exhaustive, possibly ambiguous NOE cross-peak assignments in all available spectra and a three-dimensional protein structure represented by a bundle of conformers. The input for the first CANDID cycle consists of the amino acid sequence, the chemical shift list from the sequence-specific resonance assignment, and listings of the cross-peak positions and volumes in one or several two, three or four-dimensional NOESY spectra. The input for the second and subsequent CANDID cycles contains the three-dimensional protein structure from the previous cycle, in addition to the complete input used for the first cycle. CANDID includes two new elements that make it robust with respect to the presence of artifacts in the input data, i.e. network-anchoring and constraint-combination, which have a key role in de novo protein structure determinations for the successful generation of the correct polypeptide fold by the first CANDID cycle. Network-anchoring makes use of the fact that any network of correct NOE cross-peak assignments forms a self-consistent set; the initial, chemical shift-based assignments for each individual NOE cross-peak are therefore weighted by the extent to which they can be embedded into the network formed by all other NOE cross-peak assignments. Constraint-combination reduces the deleterious impact of artifact NOE upper distance constraints in the input for a protein structure calculation by combining the assignments for two or several peaks into a single upper limit distance constraint, which lowers the probability that the presence of an artifact peak will influence the outcome of the structure calculation. CANDID test calculations were performed with NMR data sets of four proteins for which high-quality structures had previously been solved by interactive protocols, and they yielded comparable results to these reference structure determinations with regard to both the residual constraint violations, and the precision and accuracy of the atomic coordinates. The CANDID approach has further been validated by de novo NMR structure determinations of four additional proteins. The experience gained in these calculations shows that once nearly complete sequence-specific resonance assignments are available, the automated CANDID approach results in greatly enhanced efficiency of the NOESY spectral analysis. The fact that the correct fold is obtained in cycle 1 of a de novo structure calculation is the single most important advance achieved with CANDID, when compared with previously proposed automated NOESY assignment methods that do not use network-anchoring and constraint-combination.  相似文献   

12.
The identification of proton contacts from NOE spectra remains the major bottleneck in NMR protein structure calculations. We describe an automated assignment-free system for deriving proton contact probabilities from NOESY peak lists that can be viewed as a quantitative extension of manual assignment techniques. Rather than assigning contacts to NOESY crosspeaks, a rigorous Bayesian methodology is used to transform initial proton contact probabilities derived from a set of 2992 protein structures into posterior probabilities using the observed crosspeaks as evidence. Given a target protein, the Bayesian approach is used to derive probabilities for all possible proton contacts. We evaluated the accuracy of this approach at predicting proton contacts on 60 15N separated NOESY and 13C separated NOESY datasets simulated from experimentally determined NMR structures and compared it to CYANA, an established method for proton constraint assignment. On average, at the highest confidence level, our method accurately identifies 3.16/3.17 long range contacts per residue and 12.11/12.18 interresidue proton contacts per residue. These accuracies represent a significant increase over the performance of CYANA on the same data set. On a difficult real dataset that is publicly available, the coverage is lower but our method retains its advantage in accuracy over CANDID/CYANA. The algorithm is publicly available via the Protinfo NMR webserver .  相似文献   

13.
The three-dimensional structure determination of RNAs by NMR spectroscopy relies on chemical shift assignment, which still constitutes a bottleneck. In order to develop more efficient assignment strategies, we analysed relationships between sequence and 1H and 13C chemical shifts. Statistics of resonances from regularly Watson–Crick base-paired RNA revealed highly characteristic chemical shift clusters. We developed two approaches using these statistics for chemical shift assignment of double-stranded RNA (dsRNA): a manual approach that yields starting points for resonance assignment and simplifies decision trees and an automated approach based on the recently introduced automated resonance assignment algorithm FLYA. Both strategies require only unlabeled RNAs and three 2D spectra for assigning the H2/C2, H5/C5, H6/C6, H8/C8 and H1′/C1′ chemical shifts. The manual approach proved to be efficient and robust when applied to the experimental data of RNAs with a size between 20 nt and 42 nt. The more advanced automated assignment approach was successfully applied to four stem-loop RNAs and a 42 nt siRNA, assigning 92–100% of the resonances from dsRNA regions correctly. This is the first automated approach for chemical shift assignment of non-exchangeable protons of RNA and their corresponding 13C resonances, which provides an important step toward automated structure determination of RNAs.  相似文献   

14.
Summary A strategy is presented for the semiautomated assignment and 3D structure determination of proteins from heteronuclear multidimensional Nuclear Magnetic Resonance (NMR) data. This approach involves the computer-based assignment of the NMR signals, identification of distance restraints from nuclear Overhauser effects, and generation of 3D structures by using the NMR-derived restraints. The protocol is described in detail and illustrated on a resonance assignment and structure determination of the FK506 binding protein (FKBP, 107 amino acids) complexed to the immunosuppressant, ascomycin. The 3D structures produced from this automated protocol attained backbone and heavy atom rmsd of 1.17 and 1.69 Å, respectively. Although more highly resolved structures of the complex have been obtained by standard interpretation of NMR data (Meadows et al. (1993) Biochemistry, 32, 754–765), the structures generated with this automated protocol required minimal manual intervention during the spectral assignment and 3D structure calculations stages. Thus, the protocol may yield an approximate order of magnitude reduction in the time required for the generation of 3D structures of proteins from NMR data.  相似文献   

15.
The NOAH/DIAMOD suite uses feedback filtering and self-correcting distance geometry to generate 3D structures from unassigned NOESY spectra. In this study we determined the minimum set of experiments needed to generate a high quality structure bundle. Different combinations of 3D 15N-edited, 13C-edited HSQC-NOESY and 2D homonuclear 1H-1H NOESY spectra of the 77 amino acid protein, myeloid progenitor inhibitory factor-1 (MPIF-1) were used as input for NOAH/DIAMOD calculations. The quality of the assignments of NOESY cross peaks and the accuracy of the automatically generated 3D structures were compared to those obtained with a conventional manual procedure. Combining data from two types of experiments synergistically increased the number of peaks assigned unambiguously in both individual spectra. As a general trend for the accuracy of the structures we observed structural variations in the backbone fold of the final structures of about 2 Å for single spectral data, of 1 Å to 1.5 Å for double spectral data, and of 0.6 Å for triple spectral data sets. The quality of the assignments and 3D structures from the optimal data using all three spectra were similar to those obtained from traditional assignment methods with structural variations within the bundle of 0.6 Å and 1.3 Å for backbone and heavy atoms, respectively. Almost all constraints (97%) of the automatic NOESY cross peak assignments were cross compatible with the structures from the conventional manual assignment procedure, and an even larger proportion (99%) of the manually derived constraints were compatible with the automatically determined 3D structures. The two mean structures determined by both methods differed only by 1.3 Å rmsd for the backbone atoms in the well-defined regions of the protein. Thus NOAD/DIAMOD analysis of spectra from labeled proteins provides a reliable method for high throughput analysis of genomic targets.  相似文献   

16.
Ribonucleic acid structure determination by NMR spectroscopy relies primarily on local structural restraints provided by 1H 1H NOEs and J-couplings. When employed loosely, these restraints are broadly compatible with A- and B-like helical geometries and give rise to calculated structures that are highly sensitive to the force fields employed during refinement. A survey of recently reported NMR structures reveals significant variations in helical parameters, particularly the major groove width. Although helical parameters observed in high-resolution X-ray crystal structures of isolated A-form RNA helices are sensitive to crystal packing effects, variations among the published X-ray structures are significantly smaller than those observed in NMR structures. Here we show that restraints derived from aromatic 1H 13C residual dipolar couplings (RDCs) and residual chemical shift anisotropies (RCSAs) can overcome NMR restraint and force field deficiencies and afford structures with helical properties similar to those observed in high-resolution X-ray structures.  相似文献   

17.
Enkephalin represents one of the bioactive peptide molecules most extensively investigated both in solution and in the crystal state. Depending upon the environment chosen for such studies, three main conformational states were identified for this flexible, linear pentapeptide—i.e., an extended conformation, a single-bend, and a double-bend structure. Since CD and Fourier transform ir (FTIR) spectra of Leu-enkephalin solubilized in negatively charged reverse micelles of bis (2-ethylhexyl)sulfosuccinate sodium salt/isooctane/water were supportive of a restricted conformational space of the aromatic side chains and of a bended type fold, we have analyzed by nmr the conformational preferences of Leu-enkephalin in reverse micelles using a synthetic [13C, 15N]-backbone-labeled sample. The overall conformation derived from nuclear Overhauser effect spectroscopy (NOESY) and 15N-filtered rotating frame NOESY (ROESY) spectra and by distance geometry calculations is a double-bend fold of the backbone that is comparable to one of the known x-ray structures. Thereby the tyrosine side chain is inserted into the hydrophobic core of the reverse micelles in a restrained conformational space as well evidenced by NOEs between the aromatic ring protons and the surfactant. The proximity of the aromatic rings of tyrosine and phenylalanine indicate a preferred structure consistent with the postulated conformation of the opioid peptide in the δ-receptor-bound state. These results confirm the interesting and promising properties of reverse micelles as membrane mimetica. © 1997 John Wiley & Sons, Inc. Biopoly 41: 591–606, 1997  相似文献   

18.
Strong solvent signals lead to a disappearance of weak protein signals close to the solvent resonance frequency and to base plane variations all over the spectrum. AUREMOL-SSA provides an automated approach for solvent artifact removal from multidimensional NMR protein spectra. Its core algorithm is based on singular spectrum analysis (SSA) in the time domain and is combined with an automated base plane correction in the frequency domain. The performance of the method has been tested on synthetic and experimental spectra including two-dimensional NOESY and TOCSY spectra and a three-dimensional 1H,13C-HCCH-TOCSY spectrum. It can also be applied to frequency domain spectra since an optional inverse Fourier transformation is included in the algorithm.  相似文献   

19.
An automated procedure for NOE assignment and three-dimensional structure refinement is presented. The input to the procedure consists of (1) an ensemble of preliminary protein NMR structures, (2) partial sequence-specific assignments for the protein and (3) the positions and volumes of unassigned NOESY cross peaks. Chemical shifts for unassigned side chain protons are predicted from the preliminary structures. The chemical shifts and unassigned NOESY cross peaks are input to an automated procedure for NOE assignment and structure calculation (ARIA) [Nilges et al. (1997) J. Mol. Biol., 269, 408–422]. ARIA is optimized for the task of structure refinement of larger proteins. Errors are filtered to ensure that sequence-specific assignments are reliable. The procedure is applied to the 27.8 kDa single-chain T cell receptor (scTCR). Preliminary NMR structures, nearly complete backbone assignments, partial assignments of side chain protons and more than 1300 unassigned NOESY cross peaks are input. Using the procedure, the resonant frequencies of more than 40 additional side chain protons are assigned. Over 400 new NOE cross peaks are assigned unambiguously. Distances derived from the automatically assigned NOEs improve the precision and quality of calculated scTCR structures. In the refined structures, a hydrophobic cluster of side chains on the scTCR surface that binds major histocompatibility complex (MHC)/antigen is revealed. It is composed of the side chains of residues from three loops and stabilizes the conformation of residues that interact with MHC.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号