首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The quality of protein structures determined by nuclear magnetic resonance (NMR) spectroscopy is contingent on the number and quality of experimentally-derived resonance assignments, distance and angular restraints. Two key features of protein NMR data have posed challenges for the routine and automated structure determination of small to medium sized proteins; (1) spectral resolution – especially of crowded nuclear Overhauser effect spectroscopy (NOESY) spectra, and (2) the reliance on a continuous network of weak scalar couplings as part of most common assignment protocols. In order to facilitate NMR structure determination, we developed a semi-automated strategy that utilizes non-uniform sampling (NUS) and multidimensional decomposition (MDD) for optimal data collection and processing of selected, high resolution multidimensional NMR experiments, combined it with an ABACUS protocol for sequential and side chain resonance assignments, and streamlined this procedure to execute structure and refinement calculations in CYANA and CNS, respectively. Two graphical user interfaces (GUIs) were developed to facilitate efficient analysis and compilation of the data and to guide automated structure determination. This integrated method was implemented and refined on over 30 high quality structures of proteins ranging from 5.5 to 16.5 kDa in size.  相似文献   

2.
3.
Direct methods in NMR based structure determination start from an unassigned ensemble of unconnected gaseous hydrogen atoms. Under favorable conditions they can produce low resolution structures of proteins. Usually a prohibitively large number of NOEs is required, to solve a protein structure ab-initio, but even with a much smaller set of distance restraints low resolution models can be obtained which resemble a protein fold. One problem is that at such low resolution and in the absence of a force field it is impossible to distinguish the correct protein fold from its mirror image. In a hybrid approach these ambiguous models have the potential to aid in the process of sequential backbone chemical shift assignment when 13Cβ and 13C′ shifts are not available for sensitivity reasons. Regardless of the overall fold they enhance the information content of the NOE spectra. These, combined with residue specific labeling and minimal triple-resonance data using 13Cα connectivity can provide almost complete sequential assignment. Strategies for residue type specific labeling with customized isotope labeling patterns are of great advantage in this context. Furthermore, this approach is to some extent error-tolerant with respect to data incompleteness, limited precision of the peak picking, and structural errors caused by misassignment of NOEs. Electronic supplementary material  The online version of this article (doi:) contains supplementary material, which is available to authorized users.  相似文献   

4.
Nuclear magnetic resonance (NMR) spectroscopy allows scientists to study protein structure, dynamics and interactions in solution. A necessary first step for such applications is determining the resonance assignment, mapping spectral data to atoms and residues in the primary sequence. Automated resonance assignment algorithms rely on information regarding connectivity (e.g., through-bond atomic interactions) and amino acid type, typically using the former to determine strings of connected residues and the latter to map those strings to positions in the primary sequence. Significant ambiguity exists in both connectivity and amino acid type information. This paper focuses on the information content available in connectivity alone and develops a novel random-graph theoretic framework and algorithm for connectivity-driven NMR sequential assignment. Our random graph model captures the structure of chemical shift degeneracy, a key source of connectivity ambiguity. We then give a simple and natural randomized algorithm for finding optimal assignments as sets of connected fragments in NMR graphs. The algorithm naturally and efficiently reuses substrings while exploring connectivity choices; it overcomes local ambiguity by enforcing global consistency of all choices. By analyzing our algorithm under our random graph model, we show that it can provably tolerate relatively large ambiguity while still giving expected optimal performance in polynomial time. We present results from practical applications of the algorithm to experimental datasets from a variety of proteins and experimental set-ups. We demonstrate that our approach is able to overcome significant noise and local ambiguity in identifying significant fragments of sequential assignments.  相似文献   

5.
Several techniques for spectral editing of 2D 13C?C13C correlation NMR of proteins are introduced. They greatly reduce the spectral overlap for five common amino acid types, thus simplifying spectral assignment and conformational analysis. The carboxyl (COO) signals of glutamate and aspartate are selected by suppressing the overlapping amide N?CCO peaks through 13C?C15N dipolar dephasing. The sidechain methine (CH) signals of valine, lecuine, and isoleucine are separated from the overlapping methylene (CH2) signals of long-chain amino acids using a multiple-quantum dipolar transfer technique. Both the COO and CH selection methods take advantage of improved dipolar dephasing by asymmetric rotational-echo double resonance (REDOR), where every other ??-pulse is shifted from the center of a rotor period tr by about 0.15 tr. This asymmetry produces a deeper minimum in the REDOR dephasing curve and enables complete suppression of the undesired signals of immobile segments. Residual signals of mobile sidechains are positively identified by dynamics editing using recoupled 13C?C1H dipolar dephasing. In all three experiments, the signals of carbons within a three-bond distance from the selected carbons are detected in the second spectral dimension via 13C spin exchange. The efficiencies of these spectral editing techniques range from 60?% for the COO and dynamic selection experiments to 25?% for the CH selection experiment, and are demonstrated on well-characterized model proteins GB1 and ubiquitin.  相似文献   

6.
7.
In NMR protein structure determination, after the resonance peaks have been identified and chemical shifts from peaks across multiple spectra have been grouped into spin systems, associating these spin systems to their host residues is the key toward the success of structural information extraction and thus the key to the success of the structure calculation. To achieve accurate enough structure calculation, a near complete and accurate assignment is a prerequisite. There are two pieces of information that can be used into the assignment, one of which is the adjacency information among the spin systems and the other is the signature information of the spin systems. The signature information reflects the fact that, generally speaking, for one type of amino acid residing in a specific local structural environment, the chemical shifts for the atoms inside the amino acid fall into some very narrow distinct ranges. In most of the existing work, normal distributions are assumed with means and standard deviations statistically collected from the available data. In this paper, we followed a simple yet effective histogram-based way to estimate for every spin system the probability that its host is a certain type of amino acid residing in a certain type of secondary structure. We used two combinations of chemical shifts to demonstrate the effectiveness of this type of histogram-based scoring schemes.  相似文献   

8.
Lin HN  Wu KP  Chang JM  Sung TY  Hsu WL 《Nucleic acids research》2005,33(14):4593-4601
NMR data from different experiments often contain errors; thus, automated backbone resonance assignment is a very challenging issue. In this paper, we present a method called GANA that uses a genetic algorithm to automatically perform backbone resonance assignment with a high degree of precision and recall. Precision is the number of correctly assigned residues divided by the number of assigned residues, and recall is the number of correctly assigned residues divided by the number of residues with known human curated answers. GANA takes spin systems as input data and uses two data structures, candidate lists and adjacency lists, to assign the spin systems to each amino acid of a target protein. Using GANA, almost all spin systems can be mapped correctly onto a target protein, even if the data are noisy. We use the BioMagResBank (BMRB) dataset (901 proteins) to test the performance of GANA. To evaluate the robustness of GANA, we generate four additional datasets from the BMRB dataset to simulate data errors of false positives, false negatives and linking errors. We also use a combination of these three error types to examine the fault tolerance of our method. The average precision rates of GANA on BMRB and the four simulated test cases are 99.61, 99.55, 99.34, 99.35 and 98.60%, respectively. The average recall rates of GANA on BMRB and the four simulated test cases are 99.26, 99.19, 98.85, 98.87 and 97.78%, respectively. We also test GANA on two real wet-lab datasets, hbSBD and hbLBD. The precision and recall rates of GANA on hbSBD are 95.12 and 92.86%, respectively, and those of hbLBD are 100 and 97.40%, respectively.  相似文献   

9.
Three-dimensional (3D) heteronuclear NMR techniques have been used to make sequential 1H and 15N resonance assignments for most of the residues of Lactobacillus casei dihydrofolate reductase (DHFR), a monomeric protein of molecular mass 18,300 Da. A uniformly 15N-labeled sample of the protein was prepared and its complex with methotrexate (MTX) studied by 3D 15N/1H nuclear Overhauser-heteronuclear multiple quantum coherence (NOESY-HMQC), Hartmann-Hahn-heteronuclear multiple quantum coherence (HOHAHA-HMQC), and HMQC-NOESY-HMQC experiments. These experiments overcame most of the spectral overlap problems caused by chemical shift degeneracies in 2D spectra and allowed the 1H-1H through-space and through-bond connectivities to be identified unambiguously, leading to the resonance assignments. The novel HMQC-NOESY-HMQC experiment allows NOE cross peaks to be detected between NH protons even when their 1H chemical shifts are degenerate as long as the amide 15N chemical shifts are nondegenerate. The 3D experiments, in combination with conventional 2D NOESY, COSY, and HOHAHA experiments on unlabelled and selectively deuterated DHFR, provide backbone assignments for 146 of the 162 residues and side-chain assignments for 104 residues of the protein. Data from the NOE-based experiments and identification of the slowly exchanging amide protons provide detailed information about the secondary structure of the binary complex of the protein with methotrexate. Sequential NHi-NHi+1 NOEs define four regions with helical structure. Two of these regions, residues 44-49 and 79-89, correspond to within one amino acid to helices C and E in the crystal structure of the DHFR.methotrexate.NADPH complex [Bolin et al. (1982) J. Biol. Chem. 257, 13650-13662], while the NMR-determined helix formed by residues 26-35 is about one turn shorter at the N-terminus than helix B in the crystal structure, which spans residues 23-34. Similarly, the NMR-determined helical region comprising residues 102-110 is somewhat offset from the crystal structure's helix F, which encompasses residues 97-107. Regions of beta-sheet structure were characterized in the binary complex by strong alpha CHi-NHi+1 NOEs and by slowly exchanging amide protons. In addition, several long-range NOEs were identified linking together these stretches to form a beta-sheet. These elements align perfectly with corresponding elements in the crystal structure of the DHFR.methotrexate.NADPH complex, which contains an eight-stranded beta-sheet, indicating that the main body of the beta-sheet is preserved in the binary complex in solution.  相似文献   

10.
Q X Hua  M A Weiss 《Biochemistry》1991,30(22):5505-5515
The solution structure and dynamics of human insulin are investigated by 2D 1H NMR spectroscopy in reference to a previously analyzed analogue, des-pentapeptide(B26-B30) insulin (DPI; Hua, Q.X., & Weiss, M.A. (1990) Biochemistry 29, 10545-10555). This spectroscopic comparison is of interest since (i) the structure of the C-terminal region of the B-chain has not been determined in the monomeric state and (ii) the role of this region in binding to the insulin receptor has been the subject of long-standing speculation. The present NMR studies are conducted in the presence of an organic cosolvent (20% acetic acid), under which conditions both proteins are monomeric and stably folded. Complete sequential assignment of human insulin is obtained and leads to the following conclusions. (1) The secondary structure of the insulin monomer (three alpha-helices and B-chain beta-turn) is similar to that observed in the 2-Zn crystal state. (2) The folding of DPI is essentially the same as the corresponding portion of intact insulin, in accord with the similarities between their respective crystal structures. However, differences between insulin and DPI are observed in the extent of conformational broadening of amide resonances, indicating that the presence or absence of residues B26-B30 influences the overall dynamics of the protein on the millisecond time scale. (3) Residues B24-B28 adopt an extended configuration in the monomer and pack against the hydrophobic core as in crystallographic dimers; residues B29 and B30 are largely disordered. This configuration differs from that described in a more organic milieu (35% acetonitrile; Kline, A.D., & Justice, R.M., Jr. (1990) Biochemistry 29, 2906-2913), suggesting that the conformation of insulin in the latter study may have been influenced by solvent composition. (4) The insulin fold is shown to provide a model for collective motions in a protein with implications for the mechanism of protein-protein recognition. To our knowledge, this paper describes the first detailed analysis of a protein NMR spectrum under conditions of extensive conformational broadening. Such an analysis is made possible in the present case by comparative study of an analogue (DPI) with more tractable spectroscopic properties.  相似文献   

11.
Solid-state NMR is an emerging structure determination technique for crystalline and non-crystalline protein assemblies, e.g., amyloids. Resonance assignment constitutes the first and often very time-consuming step to a structure. We present ssFLYA, a generally applicable algorithm for automatic assignment of protein solid-state NMR spectra. Application to microcrystals of ubiquitin and the Ure2 prion C-terminal domain, as well as amyloids of HET-s(218–289) and α-synuclein yielded 88–97 % correctness for the backbone and side-chain assignments that are classified as self-consistent by the algorithm, and 77–90 % correctness if also assignments classified as tentative by the algorithm are included.  相似文献   

12.
Error tolerant backbone resonance assignment is the cornerstone of the NMR structure determination process. Although a variety of assignment approaches have been developed, none works sufficiently well on noisy fully automatically picked peaks to enable the subsequent automatic structure determination steps. We have designed an integer linear programming (ILP) based assignment system (IPASS) that has enabled fully automatic protein structure determination for four test proteins. IPASS employs probabilistic spin system typing based on chemical shifts and secondary structure predictions. Furthermore, IPASS extracts connectivity information from the inter-residue information and the (automatically picked) (15)N-edited NOESY peaks which are then used to fix reliable fragments. When applied to automatically picked peaks for real proteins, IPASS achieves an average precision and recall of 82% and 63%, respectively. In contrast, the next best method, MARS, achieves an average precision and recall of 77% and 36%, respectively. The assignments generated by IPASS are then fed into our protein structure calculation system, FALCON-NMR, to determine the 3D structures without human intervention. The final models have backbone RMSDs of 1.25?, 0.88?, 1.49?, and 0.67? to the reference native structures for proteins TM1112, CASKIN, VRAR, and HACS1, respectively. The web server is publicly available at http://monod.uwaterloo.ca/nmr/ipass.  相似文献   

13.
We report a new residual dipolar couplings (RDCs) based NMR procedure for rapidly determining RNA tertiary structure demonstrated on a uniformly (15)N/(13)C-labeled 27 nt variant of the trans-activation response element (TAR) RNA from HIV-I. In this procedure, the time-consuming nuclear Overhauser enhancement (NOE)-based sequential assignment step is replaced by a fully automated RDC-based assignment strategy. This approach involves examination of all allowed sequence-specific resonance assignment permutations for best-fit agreement between measured RDCs and coordinates for sub-structures in a target RNA. Using idealized A-form geometries to model Watson-Crick helices and coordinates from a previous X-ray structure to model a hairpin loop in TAR, the best-fit RDC assignment solutions are determined very rapidly (相似文献   

14.
15.
DnaE intein from Nostoc punctiforme (Npu) is one of naturally occurring split inteins, which has robust protein splicing activity. Highly efficient trans-splicing activity of NpuDnaE intein could widen various biotechnological applications. However, structural basis of the efficient protein splicing activity is poorly understood. As a first step toward better understanding of protein trans-splicing mechanism, we present the backbone and side-chain resonance assignments of a single chain variant NpuDnaE intein as determined by triple resonance experiments with [13C,15N]-labeled protein.  相似文献   

16.
All non-proline residues except for the N-terminal dipeptide have been assigned in the 108-residue protein Escherichia coli thioredoxin. Central to these experiments has been the use of protein samples in which all carbon-bound hydrogen positions are substituted to 75% with deuterium by bacterial growth on partially deuteriated carbon sources and media. The dilution of the local proton density gives rise to narrower line widths with little loss in sensitivity. In addition, passive or secondary coupling to protons not directly involved in the coherence transfer process of correlation experiments is largely suppressed, thus significantly improving the resolution for side-chain couplings. Simultaneous multiresidue-type assignments have been obtained by incorporation of several amino acids with differing selective alpha- and/or beta-deuteriation into a fractionally deuteriated background. Combined with several single residue type labeling experiments, these selective labelings have yielded direct residue type assignments for two-thirds of the protein. In addition to improved resolution, the amide to carbon-bound proton NOESY spectra offered equivalent sensitivity while the amide to amide NOESY spectra offered superior sensitivity to that observed for natural abundance samples. The resultant sequential assignment has an average number of nearest-neighbor NOE connectivities of 2.35 out of the possible 3 alpha-amide, beta-amide, and amide-amide connectivities.  相似文献   

17.
Complete and accurate NMR spectral assignment is a prerequisite for high-throughput automated structure determination of biological macromolecules. However, completely automated assignment procedures generally encounter difficulties for all but the most ideal data sets. Sources of these problems include difficulty in resolving correlations in crowded spectral regions, as well as complications arising from dynamics, such as weak or missing peaks, or atoms exhibiting more than one peak due to exchange phenomena. Smartnotebook is a semi-automated assignment software package designed to combine the best features of the automated and manual approaches. The software finds and displays potential connections between residues, while the spectroscopist makes decisions on which connection is correct, allowing rapid and robust assignment. In addition, smartnotebook helps the user fit chains of connected residues to the primary sequence of the protein by comparing the experimentally determined chemical shifts with expected shifts derived from a chemical shift database, while providing bookkeeping throughout the assignment procedure.  相似文献   

18.
The 1H nuclear magnetic resonance (NMR) spectrum of Ca2+-saturated porcine calbindin D9k (78 amino acids, Mr 8800) has been assigned. Greater than 98% of the 1H resonances, including spin systems for each amino acid residue, have been identified by using an approach that integrates data from a wide range of two-dimensional scalar correlated NMR experiments [Chazin, Rance, & Wright (1988) J. Mol. Biol. 202, 603-626]. Due to the limited quantity of sample and conformational heterogeneity of the protein, two-dimensional nuclear Overhauser effect (NOE) experiments also played an essential role in the identification of spin systems. On the basis of the pattern of scalar connectivities, 43 of the 78 spin systems could be directly assigned to the appropriate residue type. This provided an ample basis for obtaining the sequence-specific resonance assignments. The elements of secondary structure are identified from sequential and medium-range NOEs, values of 3JNH alpha, and the location of slowly exchanging backbone amide protons. Four well-defined helices and a mini beta-sheet between the two calcium binding loops are present in solution. These elements of secondary structure and a few key long-range NOEs provided sufficient information to define the global fold of the protein in solution. Generally good agreement is found between the crystal structure of the minor A form of bovine calbindin D9k and the solution structure of intact porcine calbindin D9k. The only significant difference is a short one-turn helix in the loop between helices II and III in the bovine crystal structure, which is clearly absent in the porcine solution structure.  相似文献   

19.
20.
A fully automated method is presented for determining NMR solution structures of proteins using exclusively NOESY spectra as input, obviating the need to measure any spectra only for obtaining resonance assignments but devoid of structural information. Applied to two small proteins, the approach yielded structures that coincided closely with conventionally determined structures.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号