Similar literature
1.
Circular dichroism (CD) spectroscopy is a valuable technique for the determination of protein secondary structures. Many linear and nonlinear algorithms have been developed for the empirical analysis of CD data, using reference databases derived from proteins of known structures. To date, the reference databases used by the various algorithms have all been derived from the spectra of soluble proteins. When applied to the analysis of soluble protein spectra, these methods generally produce calculated secondary structures that correspond well with crystallographic structures. In this study, however, it was shown that when applied to membrane protein spectra, the resulting calculations produce considerably poorer results. One source of this discrepancy may be the altered spectral peak positions (wavelength shifts) of membrane proteins due to the different dielectric of the membrane environment relative to that of water. These results have important consequences for studies that seek to use the existing soluble protein reference databases for the analyses of membrane proteins.

2.
We have expanded the reference set of proteins used in SELCON3 by including 11 additional proteins (selected from the reference sets of Yang and co-workers and Keiderling and co-workers). Depending on the wavelength range and whether or not denatured proteins are included in the reference set, five reference sets were constructed with the number of reference proteins varying from 29 to 48. The performance of three popular methods for estimating protein secondary structure fractions from CD spectra (implemented in software packages CONTIN, SELCON3, and CDSSTR) and of a variant of CONTIN, CONTIN/LL, that incorporates the variable selection method in the locally linearized model in CONTIN, was examined using the five reference sets described here and a 22-protein reference set. Secondary structure assignments from DSSP were used in the analysis. The performances of all three methods were comparable, in spite of the differences in the algorithms used in the three software packages. While CDSSTR performed the best with a smaller reference set and larger wavelength range, and CONTIN/LL performed the best with a larger reference set and smaller wavelength range, the performances for individual secondary structures were mixed. Analyzing protein CD spectra using all three methods should improve the reliability of predicted secondary structural fractions. The three programs are provided in the CDPro software package and have been modified for easier use with the different reference sets described in this paper. CDPro software is available at the website: http://lamar.colostate.edu/~sreeram/CDPro.

3.
Biophysical Journal, 2020, 118(7): 1665-1678
We have developed a computational method of atomistically refining the structural ensemble of intrinsically disordered peptides (IDPs) facilitated by experimental measurements using circular dichroism spectroscopy (CD). A major challenge surrounding this approach stems from the deconvolution of experimental CD spectra into secondary structure features of the IDP ensemble. Currently available algorithms for CD deconvolution were designed to analyze the spectra of proteins with stable secondary structures. Herein, our work aims to minimize any bias from the peptide deconvolution analysis by implementing a non-negative linear least-squares fitting algorithm in conjunction with a CD reference data set that contains soluble and denatured proteins (SDP48). The non-negative linear least-squares method yields the best results for deconvolution of proteins with higher disordered content than currently available methods, according to a validation analysis of a set of protein spectra with Protein Data Bank entries. We subsequently used this analysis to deconvolute our experimental CD data to refine our computational model of the peptide secondary structure ensemble produced by all-atom molecular dynamics simulations with implicit solvent. We applied this approach to determine the ensemble structures of a set of short IDPs that mimic the calmodulin binding domain of calcium/calmodulin-dependent protein kinase II and its 1-amino-acid and 3-amino-acid mutants. Our study offers a, to our knowledge, novel way to solve the ensemble secondary structures of IDPs in solution, which is important to advance the understanding of their roles in regulating signaling pathways through the formation of complexes with multiple partners.
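The non-negative least-squares fitting mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it uses plain projected gradient descent rather than a dedicated solver, and the three-component basis `TOY_BASIS` (helix, sheet, unordered at four wavelengths) holds invented numbers chosen only for illustration, not values from SDP48 or any real reference set. The function names are likewise hypothetical.

```python
# Toy non-negative least-squares deconvolution of a CD spectrum.
# ASSUMPTION: TOY_BASIS values are invented for illustration; they are not
# taken from SDP48 or any published reference set.

# Rows = wavelengths, columns = (helix, sheet, unordered) basis spectra.
TOY_BASIS = [
    [4.0, 1.0, -2.0],
    [-2.0, -1.0, 0.5],
    [-2.0, 0.5, 0.2],
    [-0.5, -0.3, 0.1],
]


def mix(basis, fractions):
    """Return the spectrum produced by a weighted sum of basis spectra."""
    return [sum(b * f for b, f in zip(row, fractions)) for row in basis]


def nnls_projected_gradient(basis, spectrum, iterations=5000):
    """Minimise ||basis @ f - spectrum||^2 subject to f >= 0."""
    n_comp = len(basis[0])
    # 1 / (squared Frobenius norm) is a safe step size: it never exceeds
    # 1 / (largest eigenvalue of basis^T basis).
    step = 1.0 / sum(v * v for row in basis for v in row)
    fractions = [0.0] * n_comp
    for _ in range(iterations):
        residual = [m - s for m, s in zip(mix(basis, fractions), spectrum)]
        # Gradient of half the squared residual: basis^T @ residual.
        gradient = [
            sum(row[j] * r for row, r in zip(basis, residual))
            for j in range(n_comp)
        ]
        # Gradient step, then projection onto the non-negative orthant.
        fractions = [max(0.0, f - step * g) for f, g in zip(fractions, gradient)]
    return fractions


if __name__ == "__main__":
    observed = mix(TOY_BASIS, [0.5, 0.2, 0.3])
    print([round(f, 4) for f in nnls_projected_gradient(TOY_BASIS, observed)])
```

The non-negativity constraint is what keeps the fitted fractions physically meaningful: an unconstrained fit can return negative weights for a component that is simply absent from the sample.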

4.
MOTIVATION: Circular dichroism (CD) spectroscopy has become established as a key method for determining the secondary structure contents of proteins which has had a significant impact on molecular biology. Many excellent mathematical protocols have been developed for this purpose and their quality is above question. However, reference database sets of proteins, with CD spectra matched to secondary structure components derived from X-ray structures, provide the key resource for this task. These databases were created many years ago, before most CD spectrophotometers became standardized and before it was commonplace to validate X-ray structures prior to publication. The analyses presented here were undertaken to investigate the overall quality of these reference databases in light of their extensive usage in determining protein secondary structure content from CD spectra. RESULTS: The analyses show that there are a number of significant problems associated with the CD reference database sets in current use. There are disparities between CD spectra for the same protein collected by different groups. These include differences in magnitudes, peak positions or both. However, many current reference sets are now amalgamations of spectra from these groups, introducing inconsistencies that can lead to inaccuracies in the determination of secondary structure components from the CD spectra. A number of the X-ray structures used fall short on the validation criteria now employed as standard for structure determination. Many have substantial percentages of residues in the disallowed regions of the Ramachandran plot. Hence their calculated secondary structure components, used as a foundation for the reference databases, are likely to be in error. Additionally, the coverage of secondary structure space in the reference datasets is poorly correlated to the secondary structure components found in the Protein Data Bank. 
A conclusion is that a new reference CD database with cross-correlated, machine-independent CD spectra and validated X-ray structures that cover more secondary structure components, including diverse protein folds, is now needed. However, that reasonably accurate values for the secondary structure content of proteins can be determined from spectra is a testament to CD spectroscopy being a very powerful technique.

5.
Circular dichroism (CD) spectroscopy is a widely used technique for the evaluation of protein secondary structures and has had a significant impact on the understanding of molecular biology. However, the quantitative analysis of protein secondary structures based on CD spectra remains difficult because the spectra corresponding to different structural motifs overlap heavily. Here, the Tchebichef image moment (TM) approach is introduced for the first time; it can effectively extract the chemical features in CD spectra for the quantitative analysis of protein secondary structures. The proposed approach was applied to a reference set, and the results were evaluated by strict statistical parameters such as the correlation coefficient, the cross-validation correlation coefficient, and the root mean squared error. Compared with several specialized prediction methods, the TM approach provided satisfactory results, especially for turns and unordered structures. Our study indicates that the TM approach can be regarded as a feasible tool for the analysis of the secondary structures of proteins based on CD spectra. A TMs package is provided and can be used directly for secondary structure prediction.
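Two of the evaluation statistics named in this abstract, the correlation coefficient and the root mean squared error, can be sketched as follows. This is a generic illustration of the standard formulas, not code from the TMs package; the function names and the example fractions are hypothetical, and the cross-validation variant is not shown.

```python
import math


def pearson_r(predicted, observed):
    """Pearson correlation between predicted and reference fractions."""
    n = len(predicted)
    mean_p = sum(predicted) / n
    mean_o = sum(observed) / n
    cov = sum((p - mean_p) * (o - mean_o) for p, o in zip(predicted, observed))
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    var_o = sum((o - mean_o) ** 2 for o in observed)
    return cov / math.sqrt(var_p * var_o)


def rmse(predicted, observed):
    """Root mean squared error between predicted and reference fractions."""
    n = len(predicted)
    return math.sqrt(sum((p - o) ** 2 for p, o in zip(predicted, observed)) / n)


if __name__ == "__main__":
    # Hypothetical helix fractions for five proteins (illustrative only).
    predicted = [0.42, 0.10, 0.65, 0.30, 0.05]
    observed = [0.45, 0.08, 0.60, 0.33, 0.09]
    print(pearson_r(predicted, observed), rmse(predicted, observed))
```

The two numbers answer different questions: a high correlation shows that predictions rank proteins correctly, while a low RMSE shows that the absolute fractions are close to the crystallographic values.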

6.
Circular dichroism (CD) is a spectroscopic technique commonly used to investigate the structure of proteins. Major secondary structure types, alpha-helices and beta-strands, produce distinctive CD spectra. Thus, by comparing the CD spectrum of a protein of interest to a reference set consisting of CD spectra of proteins of known structure, predictive methods can estimate the secondary structure of the protein. Currently available methods, including K2D2, use such experimental CD reference sets, which are very small in size when compared to the number of tertiary structures available in the Protein Data Bank (PDB). Conversely, given a PDB structure, it is possible to predict a theoretical CD spectrum from it. The methodological framework for this calculation was established long ago, but only recently has a convenient implementation called DichroCalc been developed. In this study, we set out to determine whether theoretically derived spectra could be used as a reference set for accurate CD-based predictions of secondary structure. We used DichroCalc to calculate the theoretical CD spectra of a nonredundant set of structures representing most proteins in the PDB, and applied a straightforward approach for predicting protein secondary structure content using these theoretical CD spectra as a reference set. We show that this method improves the predictions, particularly for the wavelength interval between 200 and 240 nm and for beta-strand content. We have implemented this method, called K2D3, in a publicly accessible web server at http://www.ogic.ca/projects/k2d3 . Proteins 2012. © 2011 Wiley Periodicals, Inc.

7.
We have expanded our reference set of proteins used in the estimation of protein secondary structure by CD spectroscopy from 29 to 37 proteins by including 3 additional globular proteins with known X-ray structure and 5 denatured proteins. We have also modified the self-consistent method for analyzing protein CD spectra, SELCON3, by including a new selection criterion developed by W. C. Johnson, Jr. (Proteins Struct. Funct. Genet. 35, 307-312, 1999). The secondary structure corresponding to the denatured proteins was approximated to be 90% unordered, owing to the spectral similarity of the denatured proteins and unordered structures. We examined the thermal denaturation of ribonuclease T1 by CD using both the original and expanded sets of reference proteins and obtained more consistent results with the expanded set. The expanded set of reference proteins will be helpful for the determination of protein secondary structure from protein CD spectra with higher reliability, especially of proteins with significant unordered structure content and/or in the course of denaturation.

8.
Circular dichroism (CD) spectroscopy is a valuable method for defining canonical secondary structure contents of proteins based on empirically-defined spectroscopic signatures derived from proteins with known three-dimensional structures. Many proteins identified as being “Intrinsically Disordered Proteins” have a significant amount of their structure that is neither sheet, helix, nor turn; this type of structure is often classified by CD as “other”, “random coil”, “unordered”, or “disordered”. However, the “other” category can also include polyproline II (PPII)-type structures, whose spectral properties have not been well-distinguished from those of unordered structures. In this study, synchrotron radiation circular dichroism spectroscopy was used to investigate the spectral properties of collagen and polyproline, which both contain PPII-type structures. Their native spectra were compared as representatives of PPII structures. In addition, their spectra before and after treatment with various conditions to produce unfolded or denatured structures were also compared, with the aim of defining the differences between CD spectra of PPII and disordered structures. We conclude that the spectral features of collagen are more appropriate than those of polyproline for use as the representative spectrum for PPII structures present in typical amino acid-containing proteins, and that the single most characteristic spectroscopic feature distinguishing a PPII structure from a disordered structure is the presence of a positive peak around 220 nm in the former but not in the latter. These spectra are now available for inclusion in new reference data sets used for CD analyses of the secondary structures of soluble proteins.

9.
A new algorithm, called convex constraint analysis, has been developed to deduce the chiral contribution of the common secondary structures directly from experimental CD curves of a large number of proteins. The analysis is based on CD data reported by Yang, J.T., Wu, C.-S.C. and Martinez, H.M. [Methods Enzymol., 130, 208-269 (1986)]. Application of the decomposition algorithm for simulated protein data sets resulted in component spectra [B (lambda, i)] identical to the originals and weights [C (i, k)] with excellent Pearson correlation coefficients (R) [Chang, C.T., Wu, C.-S.C. and Yang, J.T. (1978) Anal. Biochem., 91, 12-31]. Test runs were performed on sets of simulated protein spectra created by the Monte Carlo technique using poly-L-lysine-based pure component spectra. The significant correlation coefficients (R greater than 0.9) demonstrated the high power of the algorithm. The algorithm, applied to globular protein data, independent of X-ray data, revealed that the CD spectrum of a given protein is composed of at least four independent sources of chirality. Three of the computed component curves show remarkable resemblance to the CD spectra of known protein secondary structures. Judged against X-ray data, this approach yields a significant improvement in secondary structural evaluations over previous methods, and it yields a realistic set of pure component spectra. The new method is a useful tool for analyzing CD spectra of globular proteins and also has potential for the analysis of integral membrane proteins.

10.
MOTIVATION: Circular Dichroism (CD) spectroscopy is a long-established technique for studying protein secondary structures in solution. Empirical analyses of CD data rely on the availability of reference datasets comprised of far-UV CD spectra of proteins whose crystal structures have been determined. This article reports on the creation of a new reference dataset which effectively covers both secondary structure and fold space, and uses the higher information content available in synchrotron radiation circular dichroism (SRCD) spectra to more accurately predict secondary structure than has been possible with existing reference datasets. It also examines the effects of wavelength range, structural redundancy and different means of categorizing secondary structures on the accuracy of the analyses. In addition, it describes a novel use of hierarchical cluster analyses to identify protein relatedness based on spectral properties alone. The databases are shown to be applicable in both conventional CD and SRCD spectroscopic analyses of proteins. Hence, by combining new bioinformatics and biophysical methods, a database has been produced that should have wide applicability as a tool for structural molecular biology.

11.
Major advances have been made in the prediction of soluble protein structures, led by the knowledge-based modeling methods that extract useful structural trends from known protein structures and incorporate them into scoring functions. The same cannot be reported for the class of transmembrane proteins, primarily due to the lack of high-resolution structural data for transmembrane proteins, which renders many of the knowledge-based methods unreliable or invalid. We have developed a method that harnesses the vast structural knowledge available in soluble protein data for use in the modeling of transmembrane proteins. At the core of the method is a collection of transmembrane protein decoy sets that allow us to filter and train features recognized from soluble proteins for transmembrane protein modeling into a set of scoring functions. We have demonstrated that structures of soluble proteins can provide significant insight into transmembrane protein structures. A complementary novel two-stage modeling/selection process that mimics the two-stage helical membrane protein folding was developed. Combined with the scoring function, the method was successfully applied to model 5 transmembrane proteins. The root mean square deviations of the predicted models from the native structures ranged from 5.0 to 8.8 Å.

12.
13.
Many membrane proteins feature autonomously folded extramembranous domains which, when isolated from the intact protein, perform biochemical functions relevant to biological activity. Whereas intact membrane proteins usually require detergent solubilization for purification, most extramembranous fragments are soluble in aqueous solution. If appropriately constructed, such fragments are often crystallizable and the resulting atomic structures can lead to important biological insight. In most instances, these fragments are produced in recombinant expression systems. To be crystallizable, molecular fragments should be uniform in composition and conformation and be available in abundance. Considerations for the production of crystallizable fragments of membrane proteins include the definition of fragment boundaries, the control of nonuniformities introduced by glycosylation or phosphorylation, and optimization of expression systems. These aspects are addressed here in general terms and in the case studies of applications to CD4, CD8, the insulin receptor kinase, and N-cadherin.

14.
A Perczel, K Park, G D Fasman. Proteins, 1992, 13(1): 57-69
A recently developed algorithm, called Convex Constraint Analysis (CCA), was successfully applied to determine the circular dichroism (CD) spectra of the pure beta-pleated sheet in globular proteins. On the basis of X-ray diffraction determined secondary structures, the original data set used (Perczel, A., Hollosi, M., Tusnady, G., Fasman, G.D. Convex constraint analysis: A natural deconvolution of circular dichroism curves of proteins, Prot. Eng., 4:669-679, 1991) was improved by the addition of proteins with high beta-pleated sheet content. The analysis yielded CD curves of the pure components of the main secondary structural elements (alpha-helix, antiparallel beta-pleated sheet, beta-turns, and unordered conformation), as well as a curve attributed to the "aromatic contribution" in the wavelength range of 195-240 nm. Upon deconvolution the curves obtained were assigned to various secondary structures. The calculated weights (percentages determining the contributions of each pure component curve in the measured CD spectra of a given protein) were correlated with the X-ray diffraction determined percentages in an assignment procedure and were evaluated. The Pearson product correlation coefficients (R) are significant for all five components. The new pure component curves, which were obtained through deconvolution of the protein CD spectra alone, are promising candidates for determining the percentages of the secondary structural components in globular proteins without the necessity of adopting an X-ray database. The CD spectrum of the CheY protein was interesting because it has the characteristic shape associated with the alpha-helical structure, but upon analysis yielded a considerable amount of beta-sheet in agreement with the X-ray structure.

15.
16.
B A Clack, D M Gray. Biopolymers, 1989, 28(11): 1861-1873
The CD spectra of four filamentous bacteriophages (fd, IKe, Pf1, and Pf3) were analyzed to determine the alpha-helix contents of their major coat proteins. Measured spectra included the 192-nm band so that analyses could be carried out over the full wavelength range of the reference spectra for protein secondary structures available (a) from globular proteins [J.T. Yang, C.S.C. Wu, and H.M. Martinez (1986) Methods in Enzymology 130, 208-269] and (b) from poly(L-lysine) [N. Greenfield and G.D. Fasman (1969) Biochemistry 8, 4108-4116]. Extended analyses were also performed with the addition of the spectrum of a model beta-turn to the Greenfield and Fasman reference set, with the spectrum of a short alpha-helix in the Yang et al. reference set, and with an estimate of the spectrum of Trp added to both reference sets. The reference set based on the simple poly(L-lysine) polypeptide, plus a spectrum of a model beta-turn or of Trp, gave reasonably good fits to the measured spectra for all four phages and yielded the largest percentages of alpha-helix. The class I phages (fd and IKe) had large percentages of alpha-helix of 98 +/- 2 and 97 +/- 5%, respectively, while the two class II phages (Pf1 and Pf3) had similar but smaller alpha-helix contents of 83 +/- 6 and 84 +/- 2%, respectively. While these alpha-helix contents were within the ranges previously reported from CD spectra of these phages in solution, they were more precise, and they indicated that the coat proteins of the intact phages have CD spectra that are probably modeled better by the reference spectra of polypeptides than by those of globular proteins.

17.
A new method is proposed for determining protein secondary structure from CD spectra that takes into account the contribution of aromatic amino acid residues. New protein reference CD spectra for five secondary structures (alpha-helices, antiparallel and parallel beta-structures, beta-bends and irregular form) without the contribution of aromatic residues are obtained. The secondary structure of sixteen different proteins was analysed by this new method. The results correlate well with the X-ray data.

18.
An infrared (ir) method to determine the secondary structure of proteins in solution using the amide I region of the spectrum has been devised. The method is based on the circular dichroism (CD) matrix method for secondary structure analysis given by Compton and Johnson (L. A. Compton and W. C. Johnson, 1986, Anal. Biochem. 155, 155-167). The infrared data matrix was constructed from the normalized Fourier transform infrared spectra from 1700 to 1600 cm⁻¹ of 17 commercially available proteins. The secondary structure matrix was constructed from the X-ray data of the seventeen proteins with secondary structure elements of helix, beta-sheet, beta-turn, and other (random). The CD and ir methods were compared by analyzing the proteins of the CD and ir databases as unknowns. Both methods produce similar results compared to structures obtained by X-ray crystallographic means, with the CD slightly better for helix conformation and the ir slightly better for beta-sheet. The relatively good ir analysis for concanavalin A and alpha-chymotrypsin indicates that the ir method is less affected by the presence of aromatic groups. The concentration of the protein and the cell path length need not be known for the ir analysis since the spectra can be normalized to the total ir intensity in the amide I region. The ir spectra for helix, beta-sheet, beta-turn, and other, as extracted from the database, agree with the literature band assignments. The ir data matrix and the inverse matrix necessary to analyze unknown proteins are presented.
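The point about concentration and path length can be illustrated with a minimal sketch, assuming absorbance scales linearly with concentration times path length (Beer-Lambert behaviour). Dividing a spectrum by its total intensity over the amide I window then removes that common scale factor. The function name and the sample values are hypothetical and do not come from the paper's data matrix.

```python
def normalize_to_total_intensity(absorbances):
    """Scale a spectrum so its values over the chosen window sum to 1."""
    total = sum(absorbances)
    if total == 0:
        raise ValueError("spectrum has zero total intensity")
    return [a / total for a in absorbances]


if __name__ == "__main__":
    # Hypothetical amide I absorbances sampled at a few wavenumbers.
    spectrum = [0.10, 0.40, 0.30, 0.20]
    # Same protein at 2.5x the (concentration x path length) product.
    stronger = [2.5 * a for a in spectrum]
    print(normalize_to_total_intensity(spectrum))
    print(normalize_to_total_intensity(stronger))
```

Both calls print the same band shape up to floating-point rounding, which is why the normalized spectra can be compared without knowing the sample concentration or cell path length.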

19.
It has been shown that progress in the determination of membrane protein structures grows exponentially, with approximately the same growth rate as that of the water-soluble proteins. To investigate the effect of this growth on the performance of prediction algorithms for both α-helical and β-barrel membrane proteins, we conducted a prospective study based on historical records. We trained separate hidden Markov models with different sized training sets and evaluated their performance on topology pred...

20.
We have developed a holistic protein structure estimation technique using amide I band Raman spectroscopy. This technique combines the superposition of reference spectra for pure secondary structure elements with simultaneous aromatic, fluorescence, and solvent background subtraction, and is applicable to solution, suspension, and solid protein samples. A key component of this technique was the calculation of the reference spectra for ordered helix, unordered helix, sheet, turns, and unordered structures from a series of well-characterized reference proteins. We accurately account for the overlap between the amide I and non-amide I regions and allow for different scattering efficiencies for different secondary structures. For hydrated samples, we allowed for the possibility that bound water spectra differ from the bulk water spectra. Our computed reference spectra compare well with previous experimental and theoretical results in the literature. We have demonstrated the use of these reference spectra for the estimation of secondary structures of proteins in solution, suspension, and dry solid forms. The agreement between our structure estimates and the corresponding determinations from X-ray crystallography is good.
