首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The construction of a consistent protein chemical shift database is an important step toward making more extensive use of this data in structural studies. Unfortunately, progress in this direction has been hampered by the quality of the available data, particularly with respect to chemical shift referencing, which is often either inaccurate or inconsistently annotated. Preprocessing of the data is therefore required to detect and correct referencing errors. In an earlier study we developed CheckShift, a program for performing this task automatically. Now we spent substantial effort in improving the running time of the CheckShift algorithm, which resulted in an running time decrease of 90%, thereby achieving equivalent quality to the former version of CheckShift. The reason for the running time decrease is twofold. Firstly we improved the search for the optimal re-referencing offset considerably. Secondly, as CheckShift is based on a secondary structure prediction from the amino acid sequence (formally PsiPred was used), we evaluated a wide range of available secondary structure prediction programs focusing on the special needs of the CheckShift algorithm. The results of this evaluation prove empirically that we can use faster secondary structure prediction programs than PsiPred without sacrificing CheckShift’s accuracy. Very recently Wang and Markley (2009) gave a small list of extreme outliers of the former version of the CheckShift web-server. Those were due to the empirical reduction of the search space implemented in the old version. The new version of CheckShift now gives very similar results to RefDB and LACS for all outliers mentioned in Table 1 of Wang and Markley (2009).  相似文献   

2.
Inconsistent 13C and 15N chemical shift referencing is a continuing problem associated with protein chemical shift assignments deposited in BioMagResBank (BMRB). Here we describe a simple and robust approach that can quantitatively determine the 13C and 15N referencing offsets solely from chemical shift assignment data and independently of 3D coordinate data. This novel structure-independent approach permitted the assessment and determination of 13C and 15N reference offsets for all protein entries deposited in the BMRB. Tests on 452 proteins with known 3D structures show that this structure-independent approach yields 13C and 15N referencing offsets that exhibit excellent agreement with those calculated on the basis of 3D structures. Furthermore, this protocol appears to improve the accuracy of chemical shift-derived secondary structural identification, and has been formally incorporated into a computer program called PSSI (http//www.pronmr.com).Supplementary material to this paper is available in electronic form at http://dx.doi.org/10.1007/s10858-004-7441-3  相似文献   

3.
The linear analysis of chemical shifts (LACS) has provided a robust method for identifying and correcting 13C chemical shift referencing problems in data from protein NMR spectroscopy. Unlike other approaches, LACS does not require prior knowledge of the three-dimensional structure or inference of the secondary structure of the protein. It also does not require extensive assignment of the NMR data. We report here a way of extending the LACS approach to 15N NMR data from proteins, so as to enable the detection and correction of inconsistencies in chemical shift referencing for this nucleus. The approach is based on our finding that the secondary 15N chemical shift of the backbone nitrogen atom of residue i is strongly correlated with the secondary chemical shift difference (experimental minus random coil) between the alpha and beta carbons of residue i − 1. Thus once alpha and beta 13C chemical shifts are available (their difference is referencing error-free), the 15N referencing can be validated, and an appropriate offset correction can be derived. This approach can be implemented prior to a structure determination and can be used to analyze potential referencing problems in database data not associated with three-dimensional structure. Application of the LACS algorithm to the current BMRB protein chemical shift database, revealed that nearly 35% of the BMRB entries have δ 15N values mis-referenced by over 0.7 ppm and over 25% of them have δ 1HN values mis-referenced by over 0.12 ppm. One implication of the findings reported here is that a backbone 15N chemical shift provides a better indicator of the conformation of the preceding residue than of the residue itself. Electronic supplementary material  The online version of this article (doi:) contains supplementary material, which is available to authorized users.  相似文献   

4.
RefDB: a database of uniformly referenced protein chemical shifts   总被引:8,自引:8,他引:0  
RefDB is a secondary database of reference-corrected protein chemical shifts derived from the BioMagResBank (BMRB). The database was assembled by using a recently developed program (SHIFTX) to predict protein 1H, 13C and 15N chemical shifts from X-ray or NMR coordinate data of previously assigned proteins. The predicted shifts were then compared with the corresponding observed shifts and a variety of statistical evaluations performed. In this way, potential mis-assignments, typographical errors and chemical referencing errors could be identified and, in many cases, corrected. This approach allows for an unbiased, instrument-independent solution to the problem of retrospectively re-referencing published protein chemical shifts. Results from this study indicate that nearly 25% of BMRB entries with 13C protein assignments and 27% of BMRB entries with 15N protein assignments required significant chemical shift reference readjustments. Additionally, nearly 40% of protein entries deposited in the BioMagResBank appear to have at least one assignment error. From this study it evident that protein NMR spectroscopists are increasingly adhering to recommended IUPAC 13C and 15N chemical shift referencing conventions, however, approximately 20% of newly deposited protein entries in the BMRB are still being incorrectly referenced. This is cause for some concern. However, the utilization of RefDB and its companion programs may help mitigate this ongoing problem. RefDB is updated weekly and the database, along with its associated software, is freely available at http://redpoll.pharmacy.ualberta.ca and the BMRB website.  相似文献   

5.
It has been estimated that more than 20% of the proteins in the BMRB are improperly referenced and that about 1% of all chemical shift assignments are mis-assigned. These statistics also reflect the likelihood that any newly assigned protein will have shift assignment or shift referencing errors. The relatively high frequency of these errors continues to be a concern for the biomolecular NMR community. While several programs do exist to detect and/or correct chemical shift mis-referencing or chemical shift mis-assignments, most can only do one, or the other. The one program (SHIFTCOR) that is capable of handling both chemical shift mis-referencing and mis-assignments, requires the 3D structure coordinates of the target protein. Given that chemical shift mis-assignments and chemical shift re-referencing issues should ideally be addressed prior to 3D structure determination, there is a clear need to develop a structure-independent approach. Here, we present a new structure-independent protocol, which is based on using residue-specific and secondary structure-specific chemical shift distributions calculated over small (3–6 residue) fragments to identify mis-assigned resonances. The method is also able to identify and re-reference mis-referenced chemical shift assignments. Comparisons against existing re-referencing or mis-assignment detection programs show that the method is as good or superior to existing approaches. The protocol described here has been implemented into a freely available Java program called “Probabilistic Approach for protein Nmr Assignment Validation (PANAV)” and as a web server () which can be used to validate and/or correct as well as re-reference assigned protein chemical shifts.  相似文献   

6.
Peak lists are commonly used in NMR as input data for various software tools such as automatic assignment and structure calculation programs. Inconsistencies of chemical shift referencing among different peak lists or between peak and chemical shift lists can cause severe problems during peak assignment. Here we present a simple and robust tool to achieve self-consistency of the chemical shift referencing among a set of peak lists. The Peakmatch algorithm matches a set of peak lists to a specified reference peak list, neither of which have to be assigned. The chemical shift referencing offset between two peak lists is determined by optimizing an assignment-free match score function using either a complete grid search or downhill simplex optimization. It is shown that peak lists from many different types of spectra can be matched reliably as long as they contain at least two corresponding dimensions. Using a simulated peak list, the Peakmatch algorithm can also be used to obtain the optimal agreement between a chemical shift list and experimental peak lists. Combining these features makes Peakmatch a useful tool that can be applied routinely before automatic assignment or structure calculation in order to obtain an optimized input data set.  相似文献   

7.
Poor chemical shift referencing, especially for 13C in protein Nuclear Magnetic Resonance (NMR) experiments, fundamentally limits and even prevents effective study of biomacromolecules via NMR, including protein structure determination and analysis of protein dynamics. To solve this problem, we constructed a Bayesian probabilistic framework that circumvents the limitations of previous reference correction methods that required protein resonance assignment and/or three-dimensional protein structure. Our algorithm named Bayesian Model Optimized Reference Correction (BaMORC) can detect and correct 13C chemical shift referencing errors before the protein resonance assignment step of analysis and without three-dimensional structure. By combining the BaMORC methodology with a new intra-peaklist grouping algorithm, we created a combined method called Unassigned BaMORC that utilizes only unassigned experimental peak lists and the amino acid sequence. Unassigned BaMORC kept all experimental three-dimensional HN(CO)CACB-type peak lists tested within ±?0.4 ppm of the correct 13C reference value. On a much larger unassigned chemical shift test set, the base method kept 13C chemical shift referencing errors to within ±?0.45 ppm at a 90% confidence interval. With chemical shift assignments, Assigned BaMORC can detect and correct 13C chemical shift referencing errors to within ±?0.22 at a 90% confidence interval. Therefore, Unassigned BaMORC can correct 13C chemical shift referencing errors when it will have the most impact, right before protein resonance assignment and other downstream analyses are started. After assignment, chemical shift reference correction can be further refined with Assigned BaMORC. These new methods will allow non-NMR experts to detect and correct 13C referencing error at critical early data analysis steps, lowering the bar of NMR expertise required for effective protein NMR analysis.  相似文献   

8.
We present SimShiftDB, a new program to extract conformational data from protein chemical shifts using structural alignments. The alignments are obtained in searches of a large database containing 13,000 structures and corresponding back-calculated chemical shifts. SimShiftDB makes use of chemical shift data to provide accurate results even in the case of low sequence similarity, and with even coverage of the conformational search space. We compare SimShiftDB to HHSearch, a state-of-the-art sequence-based search tool, and to TALOS, the current standard tool for the task. We show that for a significant fraction of the predicted similarities, SimShiftDB outperforms the other two methods. Particularly, the high coverage afforded by the larger database often allows predictions to be made for residues not involved in canonical secondary structure, where TALOS predictions are both less frequent and more error prone. Thus SimShiftDB can be seen as a complement to currently available methods.  相似文献   

9.
Females of many species obtain benefits by mating polyandrously, and often prefer novel males over previous mates. However, how do females recognise previous mates, particularly in the face of cognitive constraints? Female crickets appear to have evolved a simple but effective solution: females imbue males with their own cuticular hydrocarbons (CHCs) at mating and utilise chemosensory self‐referencing to recognise recent mates. Female CHC profiles exhibited significant additive genetic variation, demonstrating that genetically unique chemical cues are available to support chemosensory self‐referencing. CHC profiles of males became more similar to those of females after mating, indicating physical transfer of CHCs between individuals during copulation. Experimental perfuming of males with female CHCs resulted in a female aversion to males bearing chemical cues similar to their own. Chemosensory self‐referencing, therefore, could be a widespread mechanism by which females increase the diversity of their mating partners.  相似文献   

10.
Many processes in the regulation of gene expression and signaling involve the formation of protein complexes involving multi-domain proteins. Individual domains that mediate protein-protein and protein-nucleic acid interactions are typically connected by flexible linkers, which contribute to conformational dynamics and enable the formation of complexes with distinct binding partners. Solution techniques are therefore required for structural analysis and to characterize potential conformational dynamics. Nuclear magnetic resonance spectroscopy (NMR) provides such information but often only sparse data are obtained with increasing molecular weight of the complexes. It is therefore beneficial to combine NMR data with additional structural restraints from complementary solution techniques. Small angle X-ray/neutron scattering (SAXS/SANS) data can be efficiently combined with NMR-derived information, either for validation or by providing additional restraints for structural analysis. Here, we show that the combination of SAXS and SANS data can help to refine structural models obtained from data-driven docking using HADDOCK based on sparse NMR data. The approach is demonstrated with the ternary protein-protein-RNA complex involving two RNA recognition motif (RRM) domains of Sex-lethal, the N-terminal cold shock domain of Upstream-to-N-Ras, and msl-2 mRNA. Based on chemical shift perturbations we have mapped protein-protein and protein-RNA interfaces and complemented this NMR-derived information with SAXS data, as well as SANS measurements on subunit-selectively deuterated samples of the ternary complex. Our results show that, while the use of SAXS data is beneficial, the additional combination with contrast variation in SANS data resolves remaining ambiguities and improves the docking based on chemical shift perturbations of the ternary protein-RNA complex.  相似文献   

11.
NMR chemical shifts provide important local structural information for proteins. Consistent structure generation from NMR chemical shift data has recently become feasible for proteins with sizes of up to 130 residues, and such structures are of a quality comparable to those obtained with the standard NMR protocol. This study investigates the influence of the completeness of chemical shift assignments on structures generated from chemical shifts. The Chemical-Shift-Rosetta (CS-Rosetta) protocol was used for de novo protein structure generation with various degrees of completeness of the chemical shift assignment, simulated by omission of entries in the experimental chemical shift data previously used for the initial demonstration of the CS-Rosetta approach. In addition, a new CS-Rosetta protocol is described that improves robustness of the method for proteins with missing or erroneous NMR chemical shift input data. This strategy, which uses traditional Rosetta for pre-filtering of the fragment selection process, is demonstrated for two paramagnetic proteins and also for two proteins with solid-state NMR chemical shift assignments. Electronic supplementary material  The online version of this article (doi:) contains supplementary material, which is available to authorized users.  相似文献   

12.
An increasing number of protein solution structures, calculated on the basis of nuclear Overhauser enhancement cross-peak intensities observed in two- or higher dimensional NOESY experiments, are becoming available. Among these structures regions of uncertainty are frequently observed particularly with respect to loops and surface side chains. These are commonly ascribed to either a lack of NOE constraints or to some intrinsic mobility within the protein. A powerful method of structural analysis which may resolve this problem is based on the information content of the chemical shift. The value of such an analysis is illustrated here with cytochromes b5 and c, proteins for which high-quality crystallographic and NMR data are available. Comparison of these using a pseudocontact shift-based analysis indicates that NOE data should be combined with the chemical shift data in order to uncover fully the ensemble of protein states and their dynamics in solution.  相似文献   

13.
Chemical shift prediction has an unappreciated power to guide backbone resonance assignment in cases where protein structure is known. Here we describe Resonance Assignment by chemical Shift Prediction (RASP), a method that exploits this power to derive protein backbone resonance assignments from chemical shift predictions. Robust assignments can be obtained from a minimal set of only the most sensitive triple-resonance experiments, even for spectroscopically challenging proteins. Over a test set of 154 proteins RASP assigns 88 % of residues with an accuracy of 99.7 %, using only information available from HNCO and HNCA spectra. Applied to experimental data from a challenging 34 kDa protein, RASP assigns 90 % of manually assigned residues using only 40 % of the experimental data required for the manual assignment. RASP has the potential to significantly accelerate the backbone assignment process for a wide range of proteins for which structural information is available, including those for which conventional assignment strategies are not feasible.  相似文献   

14.
Protein structure determination using nuclear magnetic resonance (NMR) spectroscopy can be both time-consuming and labor intensive. Here we demonstrate how chemical shift threading can permit rapid, robust, and accurate protein structure determination using only chemical shift data. Threading is a relatively old bioinformatics technique that uses a combination of sequence information and predicted (or experimentally acquired) low-resolution structural data to generate high-resolution 3D protein structures. The key motivations behind using NMR chemical shifts for protein threading lie in the fact that they are easy to measure, they are available prior to 3D structure determination, and they contain vital structural information. The method we have developed uses not only sequence and chemical shift similarity but also chemical shift-derived secondary structure, shift-derived super-secondary structure, and shift-derived accessible surface area to generate a high quality protein structure regardless of the sequence similarity (or lack thereof) to a known structure already in the PDB. The method (called E-Thrifty) was found to be very fast (often?<?10 min/structure) and to significantly outperform other shift-based or threading-based structure determination methods (in terms of top template model accuracy)—with an average TM-score performance of 0.68 (vs. 0.50–0.62 for other methods). Coupled with recent developments in chemical shift refinement, these results suggest that protein structure determination, using only NMR chemical shifts, is becoming increasingly practical and reliable. E-Thrifty is available as a web server at http://ethrifty.ca.  相似文献   

15.
The public archives containing protein information in the form of NMR chemical shift data at the BioMagResBank (BMRB) and of 3D structure coordinates at the Protein Data Bank are continuously expanding. The quality of the data contained in these archives, however, varies. The main issue for chemical shift values is that they are determined relative to a reference frequency. When this reference frequency is set incorrectly, all related chemical shift values are systematically offset. Such wrongly referenced chemical shift values, as well as other problems such as chemical shift values that are assigned to the wrong atom, are not easily distinguished from correct values and effectively reduce the usefulness of the archive. We describe a new method to correct and validate protein chemical shift values in relation to their 3D structure coordinates. This method classifies atoms using two parameters: the per‐atom solvent accessible surface area (as calculated from the coordinates) and the secondary structure of the parent amino acid. Through the use of Gaussian statistics based on a large database of 3220 BMRB entries, we obtain per‐entry chemical shift corrections as well as Z scores for the individual chemical shift values. In addition, information on the error of the correction value itself is available, and the method can retain only dependable correction values. We provide an online resource with chemical shift, atom exposure, and secondary structure information for all relevant BMRB entries ( http://www.ebi.ac.uk/pdbe/nmr/vasco ) and hope this data will aid the development of new chemical shift‐based methods in NMR. Proteins 2010. © 2010 Wiley‐Liss, Inc.  相似文献   

16.
We present a protocol for predicting protein flexibility from NMR chemical shifts. The protocol consists of (i) ensuring that the chemical shift assignments are correctly referenced or, if not, performing a reference correction using information derived from the chemical shift index, (ii) calculating the random coil index (RCI), and (iii) predicting the expected root mean square fluctuations (RMSFs) and order parameters (S2) of the protein from the RCI. The key advantages of this protocol over existing methods for studying protein dynamics are that (i) it does not require prior knowledge of a protein's tertiary structure, (ii) it is not sensitive to the protein's overall tumbling and (iii) it does not require additional NMR measurements beyond the standard experiments for backbone assignments. When chemical shift assignments are available, protein flexibility parameters, such as S2 and RMSF, can be calculated within 1-2 h using a spreadsheet program.  相似文献   

17.
18.
Carbonyl 13C′ relaxation is dominated by the contribution from the 13C′ chemical shift anisotropy (CSA). The relaxation rates provide useful and non-redundant structural information in addition to dynamic parameters. It is straightforward to acquire, and offers complimentary structural information to the 15N relaxation data. Furthermore, the non-axial nature of the 13C′ CSA tensor results in a T1/T2 value that depends on an additional angular variable even when the diffusion tensor of the protein molecule is axially symmetric. This dependence on an extra degree of freedom provides new geometrical information that is not available from the NH dipolar relaxation. A protocol that incorporates such structural restraints into NMR structure calculation was developed within the program Xplor-NIH. Its application was illustrated with the yeast Fis1 NMR structure. Refinement against the 13C′ T1/T2 improved the overall quality of the structure, as evaluated by cross-validation against the residual dipolar coupling as well as the 15N relaxation data. In addition, possible variations of the CSA tensor were addressed. Electronic Supplementary Material The online version of this article (doi:) contains supplementary material, which is available to authorized users.  相似文献   

19.
Chemical shifts reflect the structural environment of a certain nucleus and can be used to extract structural and dynamic information. Proper calibration is indispensable to extract such information from chemical shifts. Whereas a variety of procedures exist to verify the chemical shift calibration for proteins, no such procedure is available for RNAs to date. We present here a procedure to analyze and correct the calibration of 13C NMR data of RNAs. Our procedure uses five 13C chemical shifts as a reference, each of them found in a narrow shift range in most datasets deposited in the Biological Magnetic Resonance Bank. In 49 datasets we could evaluate the 13C calibration and detect errors or inconsistencies in RNA 13C chemical shifts based on these chemical shift reference values. More than half of the datasets (27 out of those 49) were found to be improperly referenced or contained inconsistencies. This large inconsistency rate possibly explains that no clear structure–13C chemical shift relationship has emerged for RNA so far. We were able to recalibrate or correct 17 datasets resulting in 39 usable 13C datasets. 6 new datasets from our lab were used to verify our method increasing the database to 45 usable datasets. We can now search for structure–chemical shift relationships with this improved list of 13C chemical shift data. This is demonstrated by a clear relationship between ribose 13C shifts and the sugar pucker, which can be used to predict a C2′- or C3′-endo conformation of the ribose with high accuracy. The improved quality of the chemical shift data allows statistical analysis with the potential to facilitate assignment procedures, and the extraction of restraints for structure calculations of RNA.  相似文献   

20.
Random coil chemical shifts are commonly used to detect protein secondary structural elements in chemical shift index (CSI) calculations. Though this technique is widely used and seems reliable for folded proteins, the choice of reference random coil chemical shift values can significantly alter the outcome of secondary structure estimation. In order to evaluate these effects, we present a comparison of secondary structure content calculated using CSI, based on five different reference random coil chemical shift value sets, to that derived from three-dimensional structures.Our results show that none of the reference random coil data sets chosen for evaluation fully reproduces the actual secondary structures. Among the reference values generally available to date, most tend to be good estimators only of helices. Based on our evaluation, we recommend the experimental values measured by Schwarzinger et al.(2000), and statistical values obtained by Lukin et al. (1997), as good estimators of both helical and sheet content.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号