首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The construction of a consistent protein chemical shift database is an important step toward making more extensive use of this data in structural studies. Unfortunately, progress in this direction has been hampered by the quality of the available data, particularly with respect to chemical shift referencing, which is often either inaccurate or inconsistently annotated. Preprocessing of the data is therefore required to detect and correct referencing errors. We have developed a program for performing this task, based on the comparison of reported and expected chemical shift distributions. This program, named CheckShift, does not require additional data and is therefore applicable to data sets where structures are not available. Therefore CheckShift provides the possibility to re-reference chemical shifts prior to their use as structural constraints.  相似文献   

2.
Peak lists are commonly used in NMR as input data for various software tools such as automatic assignment and structure calculation programs. Inconsistencies of chemical shift referencing among different peak lists or between peak and chemical shift lists can cause severe problems during peak assignment. Here we present a simple and robust tool to achieve self-consistency of the chemical shift referencing among a set of peak lists. The Peakmatch algorithm matches a set of peak lists to a specified reference peak list, neither of which have to be assigned. The chemical shift referencing offset between two peak lists is determined by optimizing an assignment-free match score function using either a complete grid search or downhill simplex optimization. It is shown that peak lists from many different types of spectra can be matched reliably as long as they contain at least two corresponding dimensions. Using a simulated peak list, the Peakmatch algorithm can also be used to obtain the optimal agreement between a chemical shift list and experimental peak lists. Combining these features makes Peakmatch a useful tool that can be applied routinely before automatic assignment or structure calculation in order to obtain an optimized input data set.  相似文献   

3.
The linear analysis of chemical shifts (LACS) has provided a robust method for identifying and correcting 13C chemical shift referencing problems in data from protein NMR spectroscopy. Unlike other approaches, LACS does not require prior knowledge of the three-dimensional structure or inference of the secondary structure of the protein. It also does not require extensive assignment of the NMR data. We report here a way of extending the LACS approach to 15N NMR data from proteins, so as to enable the detection and correction of inconsistencies in chemical shift referencing for this nucleus. The approach is based on our finding that the secondary 15N chemical shift of the backbone nitrogen atom of residue i is strongly correlated with the secondary chemical shift difference (experimental minus random coil) between the alpha and beta carbons of residue i − 1. Thus once alpha and beta 13C chemical shifts are available (their difference is referencing error-free), the 15N referencing can be validated, and an appropriate offset correction can be derived. This approach can be implemented prior to a structure determination and can be used to analyze potential referencing problems in database data not associated with three-dimensional structure. Application of the LACS algorithm to the current BMRB protein chemical shift database, revealed that nearly 35% of the BMRB entries have δ 15N values mis-referenced by over 0.7 ppm and over 25% of them have δ 1HN values mis-referenced by over 0.12 ppm. One implication of the findings reported here is that a backbone 15N chemical shift provides a better indicator of the conformation of the preceding residue than of the residue itself. Electronic supplementary material  The online version of this article (doi:) contains supplementary material, which is available to authorized users.  相似文献   

4.
Yang M  Wyckoff GJ 《Genetica》2011,139(5):639-648
The neutral theory of molecular evolution (Kimura 1985) is the basis for most current statistical tests for detecting selection, mainly using polymorphism data within species, divergence data between species, and/or genomic structures like linkage disequilibrium (Wang et al. 2006). In most cases informative tests can only be constructed with ample variations within these parameters and many common tests are difficult to formulate when identity-by-descent is not clear, for example in gene families or repetitive elements. With the current progress being made toward whole-genome sequencing and re-sequencing efforts, as well as protein sequencing via tandem mass spectrometry where genomic sequencing is lacking, we felt it was necessary to re-visit possible methods for rapid screening and detection of evolutionary outliers. These outliers might be of interest for other research, such as candidate gene association studies or genome annotations, drug- and disease-target searches, and functional studies. We focused on methods that would work on both protein and nucleotide data, could be used on large gene or protein domain families, and could be generated quickly in order for “first pass” annotation of large scale data. For these reasons, we chose properties of trees generated routinely in molecular phylogenetic studies; genetic distance, tree shape and balance, and internal node statistics (Heard 1992). Our current research looking at protein domain family data and phylogenetic trees from PFAM (Finn et al. 2008) suggests this approach towards detecting evolutionary outliers is feasible, but additional work will be necessary to determine the parameters that suggest either positive or negative selection is occurring in specific gene families. This is particularly true when other factors such as rapid duplication and deletion of genes containing these domains is taking place, and we suggest phylogenetic statistics may be useful in combination with existing methodologies for detailed studies of gene family data.  相似文献   

5.
Accurate 1H, 15N, and 13C chemical shift assignments were determined for staphylococcal nuclease H124L (in the absence of inhibitor or activator ion). Backbone 1H and 15N assignments, obtained by analysis of three-dimensional 1H-15N HMQC-NOESY data [Wang, J., Mooberry, E.S., Walkenhorst, W.F., & Markley, J. L. (1992) Biochemistry (preceding paper in this issue)], were refined and extended by a combination of homo- and heteronuclear two-dimensional NMR experiments. Staphylococcal nuclease H124L samples used in the homonuclear 1H NMR studies were at natural isotopic abundance or labeled randomly with 2H (to an isotope level of 50%); nuclease H124L samples used for heteronuclear NMR experiments were labeled uniformly with 15N (to an isotope level greater than 95%) or uniformly with 13C (to an isotope level of 26%). Additional nuclease H124L samples were labeled selectively by incorporating single 15N- or 13C-labeled amino acids. The chemical shifts of uncomplexed enzyme were then compared with those determined previously for the nuclease H124L.pdTp.Ca2+ ternary complex [Wang, J., LeMaster, D. M., & Markley, J.L. (1990) Biochemistry 29, 88-101; Wang, J., Hinck, A.P., Loh, S. N., & Markley, J.L. (1990) Biochemistry 29, 102-113; Wang, J., Hinck, A.P., Loh, S.N., & Markley, J.L. (1990) Biochemistry 29, 4242-4253]. The results reveal that the binding of pdTp and Ca2+ induces large shifts in the resonances of several amino acid segments. These chemical shift changes are interpreted in terms of changes in backbone torsion angles that accompany the binding of pdTp and Ca2+; changes at the binding site appear to be transmitted to other regions of the molecule through networks of hydrogen bonds.  相似文献   

6.
In this paper, we improve the homology search performance by the combination of the predicted protein secondary structures and protein sequences. Previous research suggested that the straightforward combination of predicted secondary structures did not improve the homology search performance, mostly because of the errors in the structure prediction. We solved this problem by taking into account the confidence scores output by the prediction programs.  相似文献   

7.
Poor chemical shift referencing, especially for 13C in protein Nuclear Magnetic Resonance (NMR) experiments, fundamentally limits and even prevents effective study of biomacromolecules via NMR, including protein structure determination and analysis of protein dynamics. To solve this problem, we constructed a Bayesian probabilistic framework that circumvents the limitations of previous reference correction methods that required protein resonance assignment and/or three-dimensional protein structure. Our algorithm named Bayesian Model Optimized Reference Correction (BaMORC) can detect and correct 13C chemical shift referencing errors before the protein resonance assignment step of analysis and without three-dimensional structure. By combining the BaMORC methodology with a new intra-peaklist grouping algorithm, we created a combined method called Unassigned BaMORC that utilizes only unassigned experimental peak lists and the amino acid sequence. Unassigned BaMORC kept all experimental three-dimensional HN(CO)CACB-type peak lists tested within ±?0.4 ppm of the correct 13C reference value. On a much larger unassigned chemical shift test set, the base method kept 13C chemical shift referencing errors to within ±?0.45 ppm at a 90% confidence interval. With chemical shift assignments, Assigned BaMORC can detect and correct 13C chemical shift referencing errors to within ±?0.22 at a 90% confidence interval. Therefore, Unassigned BaMORC can correct 13C chemical shift referencing errors when it will have the most impact, right before protein resonance assignment and other downstream analyses are started. After assignment, chemical shift reference correction can be further refined with Assigned BaMORC. These new methods will allow non-NMR experts to detect and correct 13C referencing error at critical early data analysis steps, lowering the bar of NMR expertise required for effective protein NMR analysis.  相似文献   

8.
The theory of natural selection has been vital in unifying the biological sciences and their research with a single testable metatheory. Despite a plethora of research supporting natural selection, teaching the theory of evolution remains controversial in high schools and higher education (Wilson et al. 2009; Scott 1997). In this article, we sample the attitudes toward evolution of 170 faculty and graduate and undergraduate students in family studies and human development programs from across the United States to determine whether resistances toward evolution remain and to describe the correlates of these resistances. Results reveal that an individual’s prosocial meliorist attitudes, religious ideation, and his or her reported interest in and knowledge of evolution all uniquely contribute to whether they report evolutionary theory as being applicable to their area of research interests. We discuss the relevance of including evolutionary theory within family studies and human development research programs and make suggestions for how to implement an evolutionary studies program (Wilson et al. 2009).  相似文献   

9.
10.
The staden sequence analysis package   总被引:31,自引:0,他引:31  
I describe the current version of the sequence analysis package developed at the MRC Laboratory of Molecular Biology, which has come to be known as the “Staden Package.” The package covers most of the standard sequence analysis tasks such as restriction site searching, translation, pattern searching, comparison, gene finding, and secondary structure prediction, and provides powerful tools for DNA sequence determination. Currently the programs are only available for computers running the UNIX operating system. Detailed information about the package is available from our WWW site: http://www.mrc-lmb.cam.ac.uk/pubseq/.  相似文献   

11.
It has been estimated that more than 20% of the proteins in the BMRB are improperly referenced and that about 1% of all chemical shift assignments are mis-assigned. These statistics also reflect the likelihood that any newly assigned protein will have shift assignment or shift referencing errors. The relatively high frequency of these errors continues to be a concern for the biomolecular NMR community. While several programs do exist to detect and/or correct chemical shift mis-referencing or chemical shift mis-assignments, most can only do one, or the other. The one program (SHIFTCOR) that is capable of handling both chemical shift mis-referencing and mis-assignments, requires the 3D structure coordinates of the target protein. Given that chemical shift mis-assignments and chemical shift re-referencing issues should ideally be addressed prior to 3D structure determination, there is a clear need to develop a structure-independent approach. Here, we present a new structure-independent protocol, which is based on using residue-specific and secondary structure-specific chemical shift distributions calculated over small (3–6 residue) fragments to identify mis-assigned resonances. The method is also able to identify and re-reference mis-referenced chemical shift assignments. Comparisons against existing re-referencing or mis-assignment detection programs show that the method is as good or superior to existing approaches. The protocol described here has been implemented into a freely available Java program called “Probabilistic Approach for protein Nmr Assignment Validation (PANAV)” and as a web server () which can be used to validate and/or correct as well as re-reference assigned protein chemical shifts.  相似文献   

12.
Rapid data collection, spectral referencing, processing by time domain deconvolution, peak picking and editing, and assignment of NMR spectra are necessary components of any efficient integrated system for protein NMR structure analysis. We have developed a set of software tools designated AutoProc, AutoPeak, and AutoAssign, which function together with the data processing and peak-picking programs NMRPipe and Sparky, to provide an integrated software system for rapid analysis of protein backbone resonance assignments. In this paper we demonstrate that these tools, together with high-sensitivity triple resonance NMR cryoprobes for data collection and a Linux-based computer cluster architecture, can be combined to provide nearly complete backbone resonance assignments and secondary structures (based on chemical shift data) for a 59-residue protein in less than 30 hours of data collection and processing time. In this optimum case of a small protein providing excellent spectra, extensive backbone resonance assignments could also be obtained using less than 6 hours of data collection and processing time. These results demonstrate the feasibility of high throughput triple resonance NMR for determining resonance assignments and secondary structures of small proteins, and the potential for applying NMR in large scale structural proteomics projects.Abbreviations: BPTI – bovine pancreatic trypsin inhibitor; LP – linear prediction; FT – Fourier transform; S/N – signal-to-noise ratio; FID – free induction decay  相似文献   

13.
Over the last few decades, much effort has been taken to develop approaches for identifying good predictions of RNA secondary structure. This is due to the fact that most computational prediction methods based on free energy minimization compute a number of suboptimal foldings and we have to identify the native folding among all these possible secondary structures. Using the abstract shapes approach as introduced by Giegerich et al. (Nucleic Acids Res 32(16):4843–4851, 2004), each class of similar secondary structures is represented by one shape and the native structures can be found among the top shape representatives. In this article, we derive some interesting results answering enumeration problems for abstract shapes and secondary structures of RNA. We compute precise asymptotics for the number of different shape representations of size n and for the number of different shapes showing up when abstracting from secondary structures of size n under a combinatorial point of view. A more realistic model taking primary structures into account remains an open challenge. We give some arguments why the present techniques cannot be applied in this case.  相似文献   

14.
This paper is an attempt at exploring the possibility of reconciling the two interpretations of biolinguistics which have been recently projected by Koster (Biolinguistics 3(1):61–92, 2009). The two interpretations—trivial and nontrivial—can be roughly construed as non-internalist and internalist conceptions of biolinguistics respectively. The internalist approach boils down to a conception of language where language as a mental grammar in the form of I-language grows and functions like a biological organ. On the other hand, under such a construal consistent with Koster’s (Biolinguistics 3(1):61-92, 2009), the non-internalist version does not necessarily have to be externalist in nature; rather it is a matter of mutual reinforcement of biology and culture under the rubric of a co-evolutionary dynamics. Here it will be argued that the apparent dichotomy between these two conceptions of biolinguistics can perhaps be resolved if we have a richer synthesis that accounts for both internalism and non-internalism.  相似文献   

15.
To address many challenges in RNA structure/function prediction, the characterization of RNA''s modular architectural units is required. Using the RNA-As-Graphs (RAG) database, we have previously explored the existence of secondary structure (2D) submotifs within larger RNA structures. Here we present RAG-3D—a dataset of RNA tertiary (3D) structures and substructures plus a web-based search tool—designed to exploit graph representations of RNAs for the goal of searching for similar 3D structural fragments. The objects in RAG-3D consist of 3D structures translated into 3D graphs, cataloged based on the connectivity between their secondary structure elements. Each graph is additionally described in terms of its subgraph building blocks. The RAG-3D search tool then compares a query RNA 3D structure to those in the database to obtain structurally similar structures and substructures. This comparison reveals conserved 3D RNA features and thus may suggest functional connections. Though RNA search programs based on similarity in sequence, 2D, and/or 3D structural elements are available, our graph-based search tool may be advantageous for illuminating similarities that are not obvious; using motifs rather than sequence space also reduces search times considerably. Ultimately, such substructuring could be useful for RNA 3D structure prediction, structure/function inference and inverse folding.  相似文献   

16.
Inconsistent 13C and 15N chemical shift referencing is a continuing problem associated with protein chemical shift assignments deposited in BioMagResBank (BMRB). Here we describe a simple and robust approach that can quantitatively determine the 13C and 15N referencing offsets solely from chemical shift assignment data and independently of 3D coordinate data. This novel structure-independent approach permitted the assessment and determination of 13C and 15N reference offsets for all protein entries deposited in the BMRB. Tests on 452 proteins with known 3D structures show that this structure-independent approach yields 13C and 15N referencing offsets that exhibit excellent agreement with those calculated on the basis of 3D structures. Furthermore, this protocol appears to improve the accuracy of chemical shift-derived secondary structural identification, and has been formally incorporated into a computer program called PSSI (http//www.pronmr.com).Supplementary material to this paper is available in electronic form at http://dx.doi.org/10.1007/s10858-004-7441-3  相似文献   

17.

Background  

Joint alignment and secondary structure prediction of two RNA sequences can significantly improve the accuracy of the structural predictions. Methods addressing this problem, however, are forced to employ constraints that reduce computation by restricting the alignments and/or structures (i.e. folds) that are permissible. In this paper, a new methodology is presented for the purpose of establishing alignment constraints based on nucleotide alignment and insertion posterior probabilities. Using a hidden Markov model, posterior probabilities of alignment and insertion are computed for all possible pairings of nucleotide positions from the two sequences. These alignment and insertion posterior probabilities are additively combined to obtain probabilities of co-incidence for nucleotide position pairs. A suitable alignment constraint is obtained by thresholding the co-incidence probabilities. The constraint is integrated with Dynalign, a free energy minimization algorithm for joint alignment and secondary structure prediction. The resulting method is benchmarked against the previous version of Dynalign and against other programs for pairwise RNA structure prediction.  相似文献   

18.
The Protein Kinase C family of enzymes is a group of serine/threonine kinases that play central roles in cell-cycle regulation, development and cancer. A key step in the activation of PKC is translocation to membranes and binding of membrane-associated activators including diacylglycerol (DAG). Interaction of novel and conventional isotypes of PKC with DAG and phorbol esters occurs through the two C1 regulatory domains (C1A and C1B), which exhibit distinct ligand binding selectivity that likely controls enzyme activation by different co-activators. PKC has also been implicated in physiological responses to alcohol consumption and it has been proposed that PKCα (Slater et al. J Biol Chem 272(10):6167–6173, 1997; Slater et al. Biochemistry 43(23):7601–7609, 2004), PKCε (Das et al. Biochem J 421(3):405–413, 2009) and PKCδ (Das et al. J Biol Chem 279(36):37964–37972, 2004; Das et al. Protein Sci 15(9):2107–2119, 2006) contain specific alcohol-binding sites in their C1 domains. We are interested in understanding how ethanol affects signal transduction processes through its affects on the structure and function of the C1 domains of PKC. Here we present the 1H, 15N and 13C NMR chemical shift assignments for the Rattus norvegicus PKCδ C1A and C1B proteins.  相似文献   

19.
Numerous presentations and articles on manual inspection of pharmaceutical drug products have been released, since the pioneering articles on inspection by Knapp and associates Knapp and Kushner (J Parenter Drug Assoc 34:14, 1980); Knapp and Kushner (Bull Parenter Drug Assoc 34:369, 1980); Knapp and Kushner (J Parenter Sci Technol 35:176, 1981); Knapp and Kushner (J Parenter Sci Technol 37:170, 1983). This original work by Knapp and associates provided the industry with a statistical means of evaluating inspection performance. This methodology enabled measurement of individual inspector performance, performance of the entire inspector pool and provided basic suggestions for the conduct of manual inspection. Since that time, numerous subject matter experts (SMEs) have presented additional valuable information for the conduct of manual inspection Borchert et al. (J Parenter Sci Technol 40:212, 1986); Knapp and Abramson (J Parenter Sci Technol 44:74, 1990); Shabushnig et al. (1994); Knapp (1999); Knapp (2005); Cherris (2005); Budd (2005); Barber and Thomas (2005); Knapp (2005); Melchore (2007); Leversee and Ronald (2007); Melchore (2009); Budd (2007); Borchert et al. (1986); Berdovich (2005); Berdovich (2007); Knapp (2007); Leversee and Shabushing (2009); Budd (2009). Despite this abundance of knowledge, neither government regulations nor the multiple compendia provide more than minimal guidance or agreement for the conduct of manual inspection. One has to search the literature for useful information that has been published by SMEs in the field of Inspection. The purpose of this article is to restate the sound principles proclaimed by SMEs with the hope that they serve as a useful guideline to bring greater consistency to the conduct of manual inspection.  相似文献   

20.
Loops are the most variable regions of protein structure and are, in general, the least accurately predicted. Their prediction has been approached in two ways, ab initio and database search. In recent years, it has been thought that ab initio methods are more powerful. In light of the continued rapid expansion in the number of known protein structures, we have re‐evaluated FREAD, a database search method and demonstrate that the power of database search methods may have been underestimated. We found that sequence similarity as quantified by environment specific substitution scores can be used to significantly improve prediction. In fact, FREAD performs appreciably better for an identifiable subset of loops (two thirds of shorter loops and half of the longer loops tested) than the ab initio methods of MODELLER, PLOP, and RAPPER. Within this subset, FREAD's predictive ability is length independent, in general, producing results within 2Å RMSD, compared to an average of over 10Å for loop length 20 for any of the other tested methods. We also benchmarked the prediction protocols on a set of 212 loops from the model structures in CASP 7 and 8. An extended version of FREAD is able to make predictions for 127 of these, it gives the best prediction of the methods tested in 61 of these cases. In examining FREAD's ability to predict in the model environment, we found that whole structure quality did not affect the quality of loop predictions. Proteins 2010. © 2009 Wiley‐Liss, Inc.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号