首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
Protein structure determination by NMR can in principle be speeded up both by reducing the measurement time on the NMR spectrometer and by a more efficient analysis of the spectra. Here we study the reliability of protein structure determination based on a single type of spectra, namely nuclear Overhauser effect spectroscopy (NOESY), using a fully automated procedure for the sequence-specific resonance assignment with the recently introduced FLYA algorithm, followed by combined automated NOE distance restraint assignment and structure calculation with CYANA. This NOESY-FLYA method was applied to eight proteins with 63–160 residues for which resonance assignments and solution structures had previously been determined by the Northeast Structural Genomics Consortium (NESG), and unrefined and refined NOESY data sets have been made available for the Critical Assessment of Automated Structure Determination of Proteins by NMR project. Using only peak lists from three-dimensional 13C- or 15N-resolved NOESY spectra as input, the FLYA algorithm yielded for the eight proteins 91–98 % correct backbone and side-chain assignments if manually refined peak lists are used, and 64–96 % correct assignments based on raw peak lists. Subsequent structure calculations with CYANA then produced structures with root-mean-square deviation (RMSD) values to the manually determined reference structures of 0.8–2.0 Å if refined peak lists are used. With raw peak lists, calculations for 4 proteins converged resulting in RMSDs to the reference structure of 0.8–2.8 Å, whereas no convergence was obtained for the four other proteins (two of which did already not converge with the correct manual resonance assignments given as input). These results show that, given high-quality experimental NOESY peak lists, the chemical shift assignments can be uncovered, without any recourse to traditional through-bond type assignment experiments, to an extent that is sufficient for calculating accurate three-dimensional structures.  相似文献   

2.
ADAPT-NMR (Assignment-directed Data collection Algorithm utilizing a Probabilistic Toolkit in NMR) represents a groundbreaking prototype for automated protein structure determination by nuclear magnetic resonance (NMR) spectroscopy. With a [(13)C,(15)N]-labeled protein sample loaded into the NMR spectrometer, ADAPT-NMR delivers complete backbone resonance assignments and secondary structure in an optimal fashion without human intervention. ADAPT-NMR achieves this by implementing a strategy in which the goal of optimal assignment in each step determines the subsequent step by analyzing the current sum of available data. ADAPT-NMR is the first iterative and fully automated approach designed specifically for the optimal assignment of proteins with fast data collection as a byproduct of this goal. ADAPT-NMR evaluates the current spectral information, and uses a goal-directed objective function to select the optimal next data collection step(s) and then directs the NMR spectrometer to collect the selected data set. ADAPT-NMR extracts peak positions from the newly collected data and uses this information in updating the analysis resonance assignments and secondary structure. The goal-directed objective function then defines the next data collection step. The procedure continues until the collected data support comprehensive peak identification, resonance assignments at the desired level of completeness, and protein secondary structure. We present test cases in which ADAPT-NMR achieved results in two days or less that would have taken two months or more by manual approaches.  相似文献   

3.
Determination of the high resolution solution structure of a protein using nuclear magnetic resonance (NMR) spectroscopy requires that resonances observed in the NMR spectra be unequivocally assigned to individual nuclei of the protein. With the advent of modern, two-dimensional NMR techniques arose methodologies for assigning the1H resonances based on 2D, homonuclear1H NMR experiments. These include the sequential assignment strategy and the main chain directed strategy. These basic strategies have been extended to include newer 3D homonuclear experiments and 2D and 3D heteronuclear resolved and edited methods. Most recently a novel, conceptually new approach to the problem has been introduced that relies on heteronuclear, multidimensional so-called triple resonance experiments for both backbone and sidechain resonance assignments in proteins. This article reviews the evolution of strategies for the assignment of resonances of proteins.  相似文献   

4.
Summary A strategy is presented for the semiautomated assignment and 3D structure determination of proteins from heteronuclear multidimensional Nuclear Magnetic Resonance (NMR) data. This approach involves the computer-based assignment of the NMR signals, identification of distance restraints from nuclear Overhauser effects, and generation of 3D structures by using the NMR-derived restraints. The protocol is described in detail and illustrated on a resonance assignment and structure determination of the FK506 binding protein (FKBP, 107 amino acids) complexed to the immunosuppressant, ascomycin. The 3D structures produced from this automated protocol attained backbone and heavy atom rmsd of 1.17 and 1.69 Å, respectively. Although more highly resolved structures of the complex have been obtained by standard interpretation of NMR data (Meadows et al. (1993) Biochemistry, 32, 754–765), the structures generated with this automated protocol required minimal manual intervention during the spectral assignment and 3D structure calculations stages. Thus, the protocol may yield an approximate order of magnitude reduction in the time required for the generation of 3D structures of proteins from NMR data.  相似文献   

5.
The protocols currently used for protein structure determination by nuclear magnetic resonance (NMR) depend on the determination of a large number of upper distance limits for proton-proton pairs. Typically, this task is performed manually by an experienced researcher rather than automatically by using a specific computer program. To assess whether it is indeed possible to generate in a fully automated manner NMR structures adequate for deposition in the Protein Data Bank, we gathered 10 experimental data sets with unassigned nuclear Overhauser effect spectroscopy (NOESY) peak lists for various proteins of unknown structure, computed structures for each of them using different, fully automatic programs, and compared the results to each other and to the manually solved reference structures that were not available at the time the data were provided. This constitutes a stringent "blind" assessment similar to the CASP and CAPRI initiatives. This study demonstrates the feasibility of routine, fully automated protein structure determination by NMR.  相似文献   

6.
The automation of protein structure determination using NMR is coming of age. The tedious processes of resonance assignment, followed by assignment of NOE (nuclear Overhauser enhancement) interactions (now intertwined with structure calculation), assembly of input files for structure calculation, intermediate analyses of incorrect assignments and bad input data, and finally structure validation are all being automated with sophisticated software tools. The robustness of the different approaches continues to deal with problems of completeness and uniqueness; nevertheless, the future is very bright for automation of NMR structure generation to approach the levels found in X-ray crystallography. Currently, near completely automated structure determination is possible for small proteins, and the prospect for medium-sized and large proteins is good.  相似文献   

7.
Estimation of secondary structure in polypeptides is important for studying their structure, folding and dynamics. In NMR spectroscopy, such information is generally obtained after sequence specific resonance assignments are completed. We present here a new methodology for assignment of secondary structure type to spin systems in proteins directly from NMR spectra, without prior knowledge of resonance assignments. The methodology, named Combination of Shifts for Secondary Structure Identification in Proteins (CSSI-PRO), involves detection of specific linear combination of backbone 1Hα and 13C′ chemical shifts in a two-dimensional (2D) NMR experiment based on G-matrix Fourier transform (GFT) NMR spectroscopy. Such linear combinations of shifts facilitate editing of residues belonging to α-helical/β-strand regions into distinct spectral regions nearly independent of the amino acid type, thereby allowing the estimation of overall secondary structure content of the protein. Comparison of the predicted secondary structure content with those estimated based on their respective 3D structures and/or the method of Chemical Shift Index for 237 proteins gives a correlation of more than 90% and an overall rmsd of 7.0%, which is comparable to other biophysical techniques used for structural characterization of proteins. Taken together, this methodology has a wide range of applications in NMR spectroscopy such as rapid protein structure determination, monitoring conformational changes in protein-folding/ligand-binding studies and automated resonance assignment. Electronic supplementary material  The online version of this article (doi:) contains supplementary material, which is available to authorized users.  相似文献   

8.
One bottleneck in NMR structure determination lies in the laborious and time-consuming process of side-chain resonance and NOE assignments. Compared to the well-studied backbone resonance assignment problem, automated side-chain resonance and NOE assignments are relatively less explored. Most NOE assignment algorithms require nearly complete side-chain resonance assignments from a series of through-bond experiments such as HCCH-TOCSY or HCCCONH. Unfortunately, these TOCSY experiments perform poorly on large proteins. To overcome this deficiency, we present a novel algorithm, called Nasca (NOE Assignment and Side-Chain Assignment), to automate both side-chain resonance and NOE assignments and to perform high-resolution protein structure determination in the absence of any explicit through-bond experiment to facilitate side-chain resonance assignment, such as HCCH-TOCSY. After casting the assignment problem into a Markov Random Field (MRF), Nasca extends and applies combinatorial protein design algorithms to compute optimal assignments that best interpret the NMR data. The MRF captures the contact map information of the protein derived from NOESY spectra, exploits the backbone structural information determined by RDCs, and considers all possible side-chain rotamers. The complexity of the combinatorial search is reduced by using a dead-end elimination (DEE) algorithm, which prunes side-chain resonance assignments that are provably not part of the optimal solution. Then an A* search algorithm is employed to find a set of optimal side-chain resonance assignments that best fit the NMR data. These side-chain resonance assignments are then used to resolve the NOE assignment ambiguity and compute high-resolution protein structures. Tests on five proteins show that Nasca assigns resonances for more than 90% of side-chain protons, and achieves about 80% correct assignments. The final structures computed using the NOE distance restraints assigned by Nasca have backbone RMSD 0.8–1.5 Å from the reference structures determined by traditional NMR approaches.  相似文献   

9.
Determination of precise and accurate protein structures by NMR generally requires weeks or even months to acquire and interpret all the necessary NMR data. However, even medium-accuracy fold information can often provide key clues about protein evolution and biochemical function(s). In this article we describe a largely automatic strategy for rapid determination of medium-accuracy protein backbone structures. Our strategy derives from ideas originally introduced by other groups for determining medium-accuracy NMR structures of large proteins using deuterated, (13)C-, (15)N-enriched protein samples with selective protonation of side-chain methyl groups ((13)CH(3)). Data collection includes acquiring NMR spectra for automatically determining assignments of backbone and side-chain (15)N, H(N) resonances, and side-chain (13)CH(3) methyl resonances. These assignments are determined automatically by the program AutoAssign using backbone triple resonance NMR data, together with Spin System Type Assignment Constraints (STACs) derived from side-chain triple-resonance experiments. The program AutoStructure then derives conformational constraints using these chemical shifts, amide (1)H/(2)H exchange, nuclear Overhauser effect spectroscopy (NOESY), and residual dipolar coupling data. The total time required for collecting such NMR data can potentially be as short as a few days. Here we demonstrate an integrated set of NMR software which can process these NMR spectra, carry out resonance assignments, interpret NOESY data, and generate medium-accuracy structures within a few days. The feasibility of this combined data collection and analysis strategy starting from raw NMR time domain data was illustrated by automatic analysis of a medium accuracy structure of the Z domain of Staphylococcal protein A.  相似文献   

10.
Determination of high-quality small protein structures by nuclear magnetic resonance (NMR) methods generally requires acquisition and analysis of an extensive set of structural constraints. The process generally demands extensive backbone and sidechain resonance assignments, and weeks or even months of data collection and interpretation. Here we demonstrate rapid and high-quality protein NMR structure generation using CS-Rosetta with a perdeuterated protein sample made at a significantly reduced cost using new bacterial culture condensation methods. Our strategy provides the basis for a high-throughput approach for routine, rapid, high-quality structure determination of small proteins. As an example, we demonstrate the determination of a high-quality 3D structure of a small 8 kDa protein, E. coli cold shock protein A (CspA), using <4 days of data collection and fully automated data analysis methods together with CS-Rosetta. The resulting CspA structure is highly converged and in excellent agreement with the published crystal structure, with a backbone RMSD value of 0.5 Å, an all atom RMSD value of 1.2 Å to the crystal structure for well-defined regions, and RMSD value of 1.1 Å to crystal structure for core, non-solvent exposed sidechain atoms. Cross validation of the structure with 15N- and 13C-edited NOESY data obtained with a perdeuterated 15N, 13C-enriched 13CH3 methyl protonated CspA sample confirms that essentially all of these independently-interpreted NOE-based constraints are already satisfied in each of the 10 CS-Rosetta structures. By these criteria, the CS-Rosetta structure generated by fully automated analysis of data for a perdeuterated sample provides an accurate structure of CspA. This represents a general approach for rapid, automated structure determination of small proteins by NMR.  相似文献   

11.
Eukaryotic proteins with important biological function can be partially unstructured, conformational flexible, or heterogenic. Crystallization trials often fail for such proteins. In NMR spectroscopy, parts of the polypeptide chain undergoing dynamics in unfavorable time regimes cannot be observed. De novo NMR structure determination is seriously hampered when missing signals lead to an incomplete chemical shift assignment resulting in an information content of the NOE data insufficient to determine the structure ab initio. We developed a new protein structure determination strategy for such cases based on a novel NOE assignment strategy utilizing a number of model structures but no explicit reference structure as it is used for bootstrapping like algorithms. The software distinguishes in detail between consistent and mutually exclusive pairs of possible NOE assignments on the basis of different precision levels of measured chemical shifts searching for a set of maximum number of consistent NOE assignments in agreement with 3D space. Validation of the method using the structure of the low molecular‐weight‐protein tyrosine phosphatase A (MptpA) showed robust results utilizing protein structures with 30–45% sequence identity and 70% of the chemical shift assignments. About 60% of the resonance assignments are sufficient to identify those structural models with highest conformational similarity to the real structure. The software was benchmarked by de novo solution structures of fibroblast growth factor 21 (FGF21) and the extracellular fibroblast growth factor receptor domain FGFR4 D2, which both failed in crystallization trials and in classical NMR structure determination. Proteins 2013; 81:2007–2022. © 2013 Wiley Periodicals, Inc.  相似文献   

12.
The J-UNIO (JCSG protocol using the software UNIO) procedure for automated protein structure determination by NMR in solution is introduced. In the present implementation, J-UNIO makes use of APSY-NMR spectroscopy, 3D heteronuclear-resolved [(1)H,(1)H]-NOESY experiments, and the software UNIO. Applications with proteins from the JCSG target list with sizes up to 150 residues showed that the procedure is highly robust and efficient. In all instances the correct polypeptide fold was obtained in the first round of automated data analysis and structure calculation. After interactive validation of the data obtained from the automated routine, the quality of the final structures was comparable to results from interactive structure determination. Special advantages are that the NMR data have been recorded with 6-10 days of instrument time per protein, that there is only a single step of chemical shift adjustments to relate the backbone signals in the APSY-NMR spectra with the corresponding backbone signals in the NOESY spectra, and that the NOE-based amino acid side chain chemical shift assignments are automatically focused on those residues that are heavily weighted in the structure calculation. The individual working steps of J-UNIO are illustrated with the structure determination of the protein YP_926445.1 from Shewanella amazonensis, and the results obtained with 17 JCSG targets are critically evaluated.  相似文献   

13.
Protein structure determination in solution by NMR spectroscopy   总被引:1,自引:0,他引:1  
The introduction of nuclear magnetic resonance (NMR) spectroscopy as a second method for protein structure determination at atomic resolution, in addition to x-ray diffraction in single crystals, has already led to a significant increase in the number of known protein structures. The NMR method provides data that are in many ways complementary to those obtained from x-ray crystallography and thus promises to widen our view of protein molecules, giving a clearer insight into the relation between structure and function.  相似文献   

14.
We present two NMR experiments, (3,2)D HNHA and (3,2)D HNHB, for rapid and accurate measurement of 3J(H N-H alpha) and 3J(N-H beta) coupling constants in polypeptides based on the principle of G-matrix Fourier transform NMR spectroscopy and quantitative J-correlation. These experiments, which facilitate fast acquisition of three-dimensional data with high spectral/digital resolution and chemical shift dispersion, will provide renewed opportunities to utilize them for sequence specific resonance assignments, estimation/characterization of secondary structure with/without prior knowledge of resonance assignments, stereospecific assignment of prochiral groups and 3D structure determination, refinement and validation. Taken together, these experiments have a wide range of applications from structural genomics projects to studying structure and folding in polypeptides.  相似文献   

15.
Novel algorithms are presented for automated NOESY peak picking and NOE signal identification in homonuclear 2D and heteronuclear-resolved 3D [1H,1H]-NOESY spectra during de novoprotein structure determination by NMR, which have been implemented in the new software ATNOS (automated NOESY peak picking). The input for ATNOS consists of the amino acid sequence of the protein, chemical shift lists from the sequence-specific resonance assignment, and one or several 2D or 3D NOESY spectra. In the present implementation, ATNOS performs multiple cycles of NOE peak identification in concert with automated NOE assignment with the software CANDID and protein structure calculation with the program DYANA. In the second and subsequent cycles, the intermediate protein structures are used as an additional guide for the interpretation of the NOESY spectra. By incorporating the analysis of the raw NMR data into the process of automated de novoprotein NMR structure determination, ATNOS enables direct feedback between the protein structure, the NOE assignments and the experimental NOESY spectra. The main elements of the algorithms for NOESY spectral analysis are techniques for local baseline correction and evaluation of local noise level amplitudes, automated determination of spectrum-specific threshold parameters, the use of symmetry relations, and the inclusion of the chemical shift information and the intermediate protein structures in the process of distinguishing between NOE peaks and artifacts. The ATNOS procedure has been validated with experimental NMR data sets of three proteins, for which high-quality NMR structures had previously been obtained by interactive interpretation of the NOESY spectra. The ATNOS-based structures coincide closely with those obtained with interactive peak picking. Overall, we present the algorithms used in this paper as a further important step towards objective and efficient de novoprotein structure determination by NMR.  相似文献   

16.
The study of multidomain or large proteins in solution by NMR spectroscopy has been made possible in recent years by the development of new spectroscopic methods. However, resonance overlap found in large proteins remains a limiting factor, making resonance assignments and structure determination of large proteins very difficult. In this study, we present an expressed protein ligation protocol that can be used for the segmental isotopic labeling of virtually any multidomain or high molecular mass protein, independent of both the folding state and the solubility of the protein fragments, as well as independent of whether the fragments are interacting. The protocol was applied successfully to two different multidomain proteins containing RNA recognition motifs (RRMs), heterogeneous nuclear ribonucleoprotein L and Npl3p. High yields of segmentally labeled proteins could be obtained, allowing characterization of the interdomain interactions with NMR spectroscopy. We found that the RRMs of heterogeneous nuclear ribonucleoprotein L interact, whereas those of Npl3p are independent. Subsequently, the structures of the two RRMs of Npl3p were determined on the basis of samples in which each RRM was expressed individually. The two Npl3p RRMs adopt the expected βαββαβ fold.  相似文献   

17.
The recent expansion of structural genomics has increased the demands for quick and accurate protein structure determination by NMR spectroscopy. The conventional strategy without an automated protocol can no longer satisfy the needs of high-throughput application to a large number of proteins, with each data set including many NMR spectra, chemical shifts, NOE assignments, and calculated structures. We have developed the new software KUJIRA, a package of integrated modules for the systematic and interactive analysis of NMR data, which is designed to reduce the tediousness of organizing and manipulating a large number of NMR data sets. In combination with CYANA, the program for automated NOE assignment and structure determination, we have established a robust and highly optimized strategy for comprehensive protein structure analysis. An application of KUJIRA in accordance with our new strategy was carried out by a non-expert in NMR structure analysis, demonstrating that the accurate assignment of the chemical shifts and a high-quality structure of a small protein can be completed in a few weeks. The high completeness of the chemical shift assignment and the NOE assignment achieved by the systematic analysis using KUJIRA and CYANA led, in practice, to increased reliability of the determined structure.  相似文献   

18.
Combined automated NOE assignment and structure determination module (CANDID) is a new software for efficient NMR structure determination of proteins by automated assignment of the NOESY spectra. CANDID uses an iterative approach with multiple cycles of NOE cross-peak assignment and protein structure calculation using the fast DYANA torsion angle dynamics algorithm, so that the result from each CANDID cycle consists of exhaustive, possibly ambiguous NOE cross-peak assignments in all available spectra and a three-dimensional protein structure represented by a bundle of conformers. The input for the first CANDID cycle consists of the amino acid sequence, the chemical shift list from the sequence-specific resonance assignment, and listings of the cross-peak positions and volumes in one or several two, three or four-dimensional NOESY spectra. The input for the second and subsequent CANDID cycles contains the three-dimensional protein structure from the previous cycle, in addition to the complete input used for the first cycle. CANDID includes two new elements that make it robust with respect to the presence of artifacts in the input data, i.e. network-anchoring and constraint-combination, which have a key role in de novo protein structure determinations for the successful generation of the correct polypeptide fold by the first CANDID cycle. Network-anchoring makes use of the fact that any network of correct NOE cross-peak assignments forms a self-consistent set; the initial, chemical shift-based assignments for each individual NOE cross-peak are therefore weighted by the extent to which they can be embedded into the network formed by all other NOE cross-peak assignments. Constraint-combination reduces the deleterious impact of artifact NOE upper distance constraints in the input for a protein structure calculation by combining the assignments for two or several peaks into a single upper limit distance constraint, which lowers the probability that the presence of an artifact peak will influence the outcome of the structure calculation. CANDID test calculations were performed with NMR data sets of four proteins for which high-quality structures had previously been solved by interactive protocols, and they yielded comparable results to these reference structure determinations with regard to both the residual constraint violations, and the precision and accuracy of the atomic coordinates. The CANDID approach has further been validated by de novo NMR structure determinations of four additional proteins. The experience gained in these calculations shows that once nearly complete sequence-specific resonance assignments are available, the automated CANDID approach results in greatly enhanced efficiency of the NOESY spectral analysis. The fact that the correct fold is obtained in cycle 1 of a de novo structure calculation is the single most important advance achieved with CANDID, when compared with previously proposed automated NOESY assignment methods that do not use network-anchoring and constraint-combination.  相似文献   

19.
Xu Y  Zheng Y  Fan JS  Yang D 《Nature methods》2006,3(11):931-937
So far high-resolution structure determination by nuclear magnetic resonance (NMR) spectroscopy has been limited to proteins <30 kDa, although global fold determination is possible for substantially larger proteins. Here we present a strategy for assigning backbone and side-chain resonances of large proteins without deuteration, with which one can obtain high-resolution structures from (1)H-(1)H distance restraints. The strategy uses information from through-bond correlation experiments to filter intraresidue and sequential correlations from through-space correlation experiments, and then matches the filtered correlations to obtain sequential assignment. We demonstrate this strategy on three proteins ranging from 24 to 65 kDa for resonance assignment and on maltose binding protein (42 kDa) and hemoglobin (65 kDa) for high-resolution structure determination. The strategy extends the size limit for structure determination by NMR spectroscopy to 42 kDa for monomeric proteins and to 65 kDa for differentially labeled multimeric proteins without the need for deuteration or selective labeling.  相似文献   

20.
Eisenreich W  Bacher A 《Phytochemistry》2007,68(22-24):2799-2815
Rapid progress in instrumentation and software made nuclear magnetic resonance spectroscopy (NMR) one of the most powerful analytical methods in biological sciences. Whereas the development of multidimensional NMR pulse sequences is an ongoing process, a small subset of two-dimensional NMR experiments is typically sufficient for the rapid structure determination of small metabolites. The use of sophisticated three- and four-dimensional NMR experiments enables the determination of the three-dimensional structures of proteins with a molecular weight up to 100 kDa, and solution structures of more than 100 plant proteins have been established by NMR spectroscopy. NMR has also been introduced to the emerging field of metabolomics where it can provide unbiased information about metabolite profiles of plant extracts. In recent times, high-resolution NMR has become a key technology for the elucidation of biosynthetic pathways and metabolite flux via quantitative assessment of multiple isotopologues. This review summarizes some of the recent advances of high-resolution NMR spectroscopy in the field of plant sciences.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号