Similar articles
20 similar articles found (search time: 31 ms)
1.
2.
Park H, Seok C. Proteins, 2012, 80(8): 1974-1986
Contemporary template-based modeling techniques allow modeling methods to be applied to a vast range of biological problems. However, they tend to fail to provide accurate structures for local regions that are less conserved in sequence, even when the overall structure can be modeled reliably. We call these regions unreliable local regions (ULRs). Accurate modeling of ULRs is of enormous value because they are frequently involved in functional specificity. In this article, we introduce a new method for modeling ULRs in template-based models by employing a sophisticated loop modeling technique. Combined with our previous study on protein termini, the method is applicable to refinement of both loop and terminus ULRs. A large-scale test carried out in a blind fashion in CASP9 (the 9th Critical Assessment of techniques for protein structure prediction) shows that ULR structures are improved over initial template-based models by refinement in more than 70% of the successfully detected ULRs. It is also notable that successful modeling of several long ULRs over 12 residues is achieved. Overall, the current results show that a careful application of loop and terminus modeling can be a promising tool for model refinement in template-based modeling.

3.
A systematic survey was carried out on an unbiased sample of 815 protein chains with a maximum of 20% homology selected from the Protein Data Bank, whose structures were solved at a resolution higher than 1.6 Å and with an R-factor lower than 25%. A set of 5556 subsequences with alpha-helix or 3(10)-helix motifs was extracted from the protein chains considered. Global and local propensities were then calculated for all possible amino acid pairs of the type (i, i + 1), (i, i + 2), (i, i + 3), and (i, i + 4), starting at the relevant helical positions N1, N2, N3, C3, C2, C1, and N-int (interior positions), and also at the first nonhelical positions in both termini of the helices, namely, N-cap and C-cap. The statistical analysis of the propensity values has shown that pairing is significantly dependent on the type of the amino acids and on the position of the pair. A few sequences of three and four amino acids were selected and their high prevalence in helices is outlined in this work. The Glu-Lys-Tyr-Pro sequence shows a peculiar distribution in proteins, which may suggest a relevant structural role in alpha-helices when Pro is located at the C-cap position. A bioinformatics tool was developed, which automatically and periodically updates the results and makes them available on a website.
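
The abstract does not say how the pair propensities were computed. The following is a minimal sketch under one common assumption: the propensity of a pair (a, b) at spacing k is the observed pair frequency divided by the product of the single-residue frequencies at the two positions. The function name `pair_propensity` and the toy sequences are hypothetical, not taken from the paper.

```python
from collections import Counter

def pair_propensity(sequences, k):
    """Propensity of each residue pair at positions (i, i + k).

    Assumed definition (not from the paper):
    P(a, b) = f_obs(a, b) / (f_first(a) * f_second(b)).
    A value above 1 means the pair occurs more often than expected
    if the two positions were independent.
    """
    pairs, first, second = Counter(), Counter(), Counter()
    for seq in sequences:
        for i in range(len(seq) - k):
            a, b = seq[i], seq[i + k]
            pairs[(a, b)] += 1
            first[a] += 1
            second[b] += 1
    total = sum(pairs.values())
    return {
        (a, b): (n / total) / ((first[a] / total) * (second[b] / total))
        for (a, b), n in pairs.items()
    }

# Toy helical fragments (hypothetical data), spacing k = 1.
toy = ["EK", "KE", "EK", "EE"]
propensities = pair_propensity(toy, 1)
```

In this toy set the (E, K) pair is over-represented and the (E, E) pair is under-represented relative to independence. A position-specific analysis such as the one described (N1, N-cap, and so on) would apply the same counting to pairs anchored at each helical position separately.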

4.
Park H, Ko J, Joo K, Lee J, Seok C, Lee J. Proteins, 2011, 79(9): 2725-2734
The rapid increase in the number of experimentally determined protein structures in recent years enables us to obtain more reliable protein tertiary structure models than ever by template-based modeling. However, refinement of template-based models beyond the limit available from the best templates is still needed for understanding protein function in atomic detail. In this work, we develop a new method for protein terminus modeling that can be applied to refinement of models with unreliable terminus structures. The energy function for terminus modeling consists of both physics-based and knowledge-based potential terms with carefully optimized relative weights. Effective sampling of both the framework and terminus is performed using the conformational space annealing technique. This method has been tested on a set of termini derived from a nonredundant structure database and two sets of termini from the CASP8 targets. The performance of the terminus modeling method is significantly improved over our previous method that does not employ terminus refinement. It is also comparable or superior to the best server methods tested in CASP8. The success of the current approach suggests that a similar strategy may be applied to other types of refinement problems such as loop modeling or secondary structure rearrangement.

5.
Methods for rapid and reliable design and structure prediction of linker loops would facilitate a variety of protein engineering applications. Circular permutation, in which the existing termini of a protein are linked by the polypeptide chain and new termini are created, is one such application that has been employed for decreasing proteolytic susceptibility and other functional purposes. The length and sequence of the linker can impact the expression level, solubility, structure and function of the permuted variants. Hence it is desirable to achieve atomic‐level accuracy in linker design. Here, we describe the use of RosettaRemodel for design and structure prediction of circular permutation linkers on a model protein. A crystal structure of one of the permuted variants confirmed the accuracy of the computational prediction, where the all‐atom rmsd of the linker region was 0.89 Å between the model and the crystal structure. This result suggests that RosettaRemodel may be generally useful for the design and structure prediction of protein loop regions for circular permutations or other structure‐function manipulations.
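
At the sequence level, the circular permutation described above amounts to a rotation of the chain with a linker joining the old termini. The sketch below illustrates only that sequence operation; it is not the RosettaRemodel procedure, and the function name, the 1-based position convention, the toy sequence, and the linker are all hypothetical.

```python
def circular_permutation(seq, new_start, linker):
    """Return the circularly permuted sequence.

    seq       : original amino acid sequence (one-letter codes)
    new_start : 1-based position of the residue that becomes the new N-terminus
    linker    : residues that join the original C-terminus to the original N-terminus
    """
    if not 1 <= new_start <= len(seq):
        raise ValueError("new_start must lie within the sequence")
    cut = new_start - 1
    # New chain: residues from the cut to the old C-terminus, then the linker,
    # then residues from the old N-terminus up to the cut.
    return seq[cut:] + linker + seq[:cut]

# Toy example (hypothetical sequence and linker).
permuted = circular_permutation("MKTAYI", 4, "GGS")
```

Because linker length and composition affect expression, solubility, and structure, a real design would evaluate many candidate linkers structurally rather than choosing one by sequence alone.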

6.
Abstract: At the C-terminal end of Rubisco's large subunit major differences in sequence length and in charges of the amino acid residues occur in unicellular organisms and in plants. This C-terminal segment of the large subunit participates in large movements during the catalytic cycle. It participates in the closing mechanism of the binding niche for the substrate RuBP, changing from an ordered structure in the "open" enzyme conformation to a position, stretched over the protein surface, in a "closed" conformation. We analyzed the sequence variability in the C terminus in rbcL to investigate whether this structurally important entity evolved in an ordered process. Cyanobacteria and chlorophytes show similar C-terminal sequences (DXX), whereby D-473 is the last strictly conserved amino acid residue for all rbcLs. Contrary to the gymnosperms (D + 2 residues), the C termini of the angiosperms show variable lengths from D + 2 to D + 17 residues. The plant orders of Asterales, Batales, Capparales, Caryophyllales, Fabales, Gentianales, Lamiales, Rubiales, Myrtales, Scrophulariales, and Solanales contain species with particularly elongated C termini. Recent studies regarding enzyme kinetics demonstrated that molecules with longer C termini are better adapted for a wider temperature range. We speculate that longer C termini confer properties to the enzyme that modulate the success of different species in different environments. This is supported by the fact that "modern" taxa (e.g., phylogenetically young taxa in an actual radiation process) generally display a long C terminus, while conservative taxa have a relatively short C terminus.

7.
Protein termini play important roles in biological processes, but there have been few methods for comprehensive terminal proteomics. We have developed a new method that can identify both the amino and the carboxyl termini of proteins. The method independently uses two proteases, lysyl endopeptidase (Lys-C) and peptidyl-Lys metalloendopeptidase (Lys-N), to digest proteins, followed by LC-MS/MS analysis of the two digests. Terminal peptides can be identified by comparing the peptide masses in the two digests as follows: (i) the amino terminal peptide of a protein in Lys-C digest is one lysine residue mass heavier than that in Lys-N digest; (ii) the carboxyl terminal peptide in Lys-N digest is one lysine residue mass heavier than that in Lys-C digest; and (iii) all internal peptides give exactly the same molecular masses in both the Lys-C and the Lys-N digest, although amino acid sequences of Lys-C and Lys-N peptides are different (Lys-C peptides end with lysine, whereas Lys-N peptides begin with lysine). The identification of terminal peptides was further verified by examining their MS/MS spectra to avoid misidentifying pairs as termini. In this study, we investigated the usefulness of this method using several protein and peptide mixtures. Known protein termini were successfully identified. N-terminal acetylation and protein isoforms with different termini were also determined. These results demonstrate that our new method can confidently identify terminal peptides in protein mixtures.
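
Rules (i) to (iii) above can be illustrated with an in-silico digest. This is a minimal sketch, not the authors' software: it assumes ideal cleavage at every lysine, an unmodified protein that neither starts nor ends with lysine, and a hypothetical toy sequence; the function names and the 0.01 Da tolerance are also assumptions.

```python
# Monoisotopic residue masses (Da) for the residues used in the toy sequence.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "L": 113.08406, "K": 128.09496,
    "E": 129.04259, "F": 147.06841,
}
WATER = 18.01056
LYS = RESIDUE_MASS["K"]

def digest_lys_c(seq):
    """Cleave after every K, so peptides end with lysine."""
    peptides, start = [], 0
    for i, aa in enumerate(seq):
        if aa == "K":
            peptides.append(seq[start:i + 1])
            start = i + 1
    if start < len(seq):
        peptides.append(seq[start:])
    return peptides

def digest_lys_n(seq):
    """Cleave before every K, so peptides begin with lysine."""
    peptides, start = [], 0
    for i, aa in enumerate(seq):
        if aa == "K" and i > start:
            peptides.append(seq[start:i])
            start = i
    peptides.append(seq[start:])
    return peptides

def mass(peptide):
    return sum(RESIDUE_MASS[aa] for aa in peptide) + WATER

def classify(seq, tol=0.01):
    """Assign peptides to N-terminal, C-terminal, or internal by mass comparison."""
    lys_c, lys_n = digest_lys_c(seq), digest_lys_n(seq)
    c_masses = [mass(p) for p in lys_c]
    n_masses = [mass(p) for p in lys_n]

    def found(m, masses):
        return any(abs(m - x) <= tol for x in masses)

    result = {"n_term": [], "c_term": [], "internal": []}
    for p in lys_c:
        m = mass(p)
        if found(m, n_masses):            # rule (iii): same mass in both digests
            result["internal"].append(p)
        elif found(m - LYS, n_masses):    # rule (i): Lys-C form is one Lys heavier
            result["n_term"].append(p)
    for p in lys_n:
        m = mass(p)
        if not found(m, c_masses) and found(m - LYS, c_masses):
            result["c_term"].append(p)    # rule (ii): Lys-N form is one Lys heavier
    return result

assignments = classify("GASKVTLKEFP")
```

Mass matching alone can produce coincidental pairs in a complex mixture, which is why the abstract adds verification by MS/MS spectra; that step is not modeled here.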

8.
Ichthyosis with confetti (IWC) is a genodermatosis associated with dominant‐negative variants in keratin 10 (KRT10) or keratin 1 (KRT1). These frameshift variants result in extended aberrant proteins, localized to the nucleus rather than the cytoplasm. This mislocalization is thought to occur as a result of the altered carboxy (C)‐terminus, from poly‐glycine to either a poly‐arginine or ‐alanine tail. Previous studies on the type of C‐terminus and subcellular localization of the respective mutant protein are divergent. In order to fully elucidate the pathomechanism of IWC, a greater understanding is critical. This study aimed to establish the consequences for localization and intermediate filament formation of altered keratin 10 (K10) C‐termini. To achieve this, plasmids expressing distinct KRT10 variants were generated. Sequences encoded all possible reading frames of the K10 C‐terminus as well as a nonsense variant. A keratinocyte line was transfected with these plasmids. Additionally, gene editing was utilized to introduce frameshift variants in exon 6 and exon 7 at the endogenous KRT10 locus. Cellular localization of aberrant K10 was observed via immunofluorescence using various antibodies. In each setting, immunofluorescence analysis demonstrated aberrant nuclear localization of K10 featuring an arginine‐rich C‐terminus. However, this was not observed with K10 featuring an alanine‐rich C‐terminus. Instead, the protein displayed cytoplasmic localization, consistent with wild‐type and truncated forms of K10. This study demonstrates that, of the various 3′ frameshift variants of KRT10, exclusively arginine‐rich C‐termini lead to nuclear localization of K10.

9.
Bahir I, Linial M. Proteins, 2006, 63(4): 996-1004
The two ends of each protein are known as the amino (N-) and carboxyl (C-) termini. Short signatures in a protein's termini often carry vital cellular function. No systematic research has been conducted to address the importance of short signatures (3 to 10 amino acids) in protein termini at the proteomic level. Specifically, it is unknown whether such signatures are evolutionarily conserved, and if so, whether this conservation confers shared biological functions. Current signature detection methods fail to detect such short signatures due to inadequate statistical scores. The findings presented in this study strongly support the notion that functional significance of protein sets may be captured by short signatures at their termini. A positional search method was applied to over one million proteins from the UniProt database. The result is a collection of about a thousand significant signature groups (SIGs) that include previously identified as well as many novel signatures in protein termini. These SIGs represent protein sets with minimal or no overall sequence similarity except for the similarity at their termini. The most significant SIGs are assigned by their strong correspondence to functional annotations derived from external databases such as Gene Ontology. Each of the SIGs is associated with the statistical significance of its functional association. These SIGs provide a valuable source for testing previously overlooked signatures in protein termini and allow for the investigation of the role played by such signatures throughout evolution. The SIGs archive and advanced search options are available at http://www.proteus.cs.huji.ac.il.

10.
Proteins sample multiple conformational substates in their native environment, but the process of crystallization selects the conformers that allow for close packing. The population of conformers can be shifted by varying the environment through a range of crystallization conditions, often resulting in different space groups and changes in the packing arrangements. Three high resolution structures of myoglobin (Mb) in different crystal space groups are presented, including one in a new space group P6(1)22 and two structures in space groups P2(1)2(1)2(1) and P6. We compare coordinates and anisotropic displacement parameters (ADPs) from these three structures plus an existing structure in space group P2(1). While the overall changes are small, there is substantial variation in several external regions with varying patterns of crystal contacts across the space group packing arrangements. The structural ensemble containing four different crystal forms displays greater conformational variance (Cα rmsd of 0.54-0.79 Å) in comparison to a collection of four Mb structures with different ligands and mutations in the same crystal form (Cα rmsd values of 0.28-0.37 Å). The high resolution of the data enables comparison of both the magnitudes and directions of ADPs, which are found to be suppressed by crystal contacts. A composite dynamic profile of Mb structural variation from the four structures was compared with an independent structural ensemble developed from NMR refinement. Despite the limitations and biases of each method, the ADPs of the crystallographic ensemble closely match the positional variance from the solution NMR ensemble with linear correlation of 0.8. This suggests that crystal packing selects conformers representative of the solution ensemble, and several different crystal forms give a more complete view of the plasticity of a protein structure.
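
The two quantities reported above, the Cα rmsd between structures and the linear correlation between per-residue profiles, can be sketched as follows. This is an illustration only: it assumes the two coordinate sets are already superposed and list the same residues in the same order, and the function names and toy numbers are hypothetical.

```python
import math

def ca_rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two equally long lists of (x, y, z)
    C-alpha coordinates. No superposition is performed here."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))

def pearson(xs, ys):
    """Pearson linear correlation coefficient between two per-residue profiles."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

# Toy data (hypothetical): two residues displaced by 1 Å along z.
rmsd = ca_rmsd([(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
               [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)])
```

A real comparison would first superpose the structures (for example with a least-squares fit) before computing the rmsd, since the value otherwise depends on the arbitrary placement of each model.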

11.
Interleukin-1β (IL-1β) is a cytokine crucial to the inflammatory and immune response. Two dominant routes are populated in the folding to native structure. These distinct routes are a result of the competition between early packing of the functional loops versus closure of the β-barrel to achieve efficient folding and have been observed both experimentally and computationally. Kinetic experiments on the WT protein established that the dominant route is characterized by early packing of geometrically frustrated functional loops. However, deletion of one of the functional loops, the β-bulge, switches the dominant route to an alternative, yet equally accessible, route, where the termini necessary for barrel closure form first. Here, we explore the effect of circular permutation of the WT sequence on the observed folding landscape with a combination of kinetic and thermodynamic experiments. Our experiments show that while the rate of formation of permutant protein is always slower than that observed for the WT sequence, the region of initial nucleation for all permutants is similar to that observed for the WT protein and occurs within a similar timescale. That is, even permutants with significant sequence rearrangement, in which the functional nucleus is placed at opposing ends of the polypeptide chain, fold by the dominant WT "functional loop-packing route", despite the entropic cost of having to fold the N- and C-termini early. Taken together, our results indicate that the early packing of the functional loops dominates the folding landscape in active proteins, and, despite the entropic penalty of coalescing the termini early, these proteins will populate an entropically unfavorable route in order to conserve function. More generally, circular permutation can elucidate the influence of local energetic stabilization of functional regions within a protein, where topological complexity creates a mismatch between energetics and topology in active proteins.

12.
Designing new protein folds requires a method for simultaneously optimizing the conformation of the backbone and the side-chains. One approach to this problem is the use of a parameterized backbone, which allows the systematic exploration of families of structures. We report the crystal structure of RH3, a right-handed, three-helix coiled coil that was designed using a parameterized backbone and detailed modeling of core packing. This crystal structure was determined using another rationally designed feature, a metal-binding site that permitted experimental phasing of the X-ray data. RH3 adopted the intended fold, which has not been observed previously in biological proteins. Unanticipated structural asymmetry in the trimer was a principal source of variation within the RH3 structure. The sequence of RH3 differs from that of a previously characterized right-handed tetramer, RH4, at only one position in each 11 amino acid sequence repeat. This close similarity indicates that the design method is sensitive to the core packing interactions that specify the protein structure. Comparison of the structures of RH3 and RH4 indicates that both steric overlap and cavity formation provide strong driving forces for oligomer specificity.

13.
BACKGROUND: Targeted gene repair is an attractive method to correct point-mutated genes at their natural chromosomal sites, but it is still rather inefficient. As revealed by earlier studies, successful gene correction requires a productive interaction of the repair molecule with the target locus. The work here set out to investigate whether DNA repair, e.g., mismatch repair, or a direct incorporation of the correction molecule is the step that follows the initial interaction. METHODS: Single-stranded 21mer oligodeoxynucleotides (ODNs) of sense orientation were directed towards point-mutated enhanced green fluorescent protein transgene loci in HEK-293-derived cell clones. First, gene repair assays compared ODNs carrying the canonical termini 5'-phosphate and 3'-OH with their respective variants harbouring non-canonical termini (5'-OH, 3'-H). Second, a protocol was established to allow efficient recovery of integrated short biotin-labelled ODNs from the genomes of gene-corrected cells using streptavidin-coated beads in order to test directly whether transfected ODNs become bona fide parts of the target locus DNA. RESULTS: Oligodeoxynucleotides with canonical termini were about 34-fold more efficient than their counterparts carrying non-canonical termini in a phosphorothioate-modified backbone. Furthermore, biotinylated fragments were successfully recovered from genomic DNAs of gene-corrected cells. CONCLUSIONS: The experiment showed that ODNs are incorporated into a mammalian genome. This unravels one early repair step and also sets an unexpected example of genome dynamics possibly relevant to other ODN-based cell techniques.

14.
The study of protein C‐termini remains challenging and lags far behind that of N‐termini. MS‐based C‐terminomics is often hampered by the low ionization efficiency of C‐terminal peptides and the lack of efficient enrichment methods. We previously optimized the C‐terminal amine‐based isotope labeling of substrates (C‐TAILS) method and identified 369 genuine protein C‐termini in Escherichia coli. A key limitation of C‐TAILS is that the prior protection of amines and carboxylic groups at the protein level makes Arg‐C the only specific enzyme usable in practice. Herein, we report an approach combining multi‐enzyme digestion and C‐TAILS, which significantly increases the identification rate of C‐terminal peptides and consequently improves the applicability of C‐TAILS in biological studies. We carry out a systematic study and confirm that the omission of the prior amine protection at the protein level has a negligible influence and allows the application of multi‐enzyme digestion. We successfully apply five different enzyme digestions to C‐TAILS, including trypsin, Arg‐C, Lys‐C, Lys‐N, and Lysarginase. As a result, we identify a total of 722 protein C‐termini in E. coli, which is at least 66% more than the results using any single enzyme. Moreover, the favored enzyme and enzyme combination are discovered. Data are available via ProteomeXchange with identifier PXD004275.

15.
Designed armadillo repeat proteins (dArmRP) are α‐helical solenoid repeat proteins with an extended peptide binding groove that were engineered to develop a generic modular technology for peptide recognition. In this context, the term “peptide” not only denotes a short unstructured chain of amino acids, but also an unstructured region of a protein, as they occur in termini, loops, or linkers between folded domains. Here we report two crystal structures of dArmRPs, in complex with peptides fused either to the N‐terminus of Green Fluorescent Protein or to the C‐terminus of a phage lambda protein D. These structures demonstrate that dArmRPs bind unfolded peptides in the intended conformation also when they constitute unstructured parts of folded proteins, which greatly expands possible applications of the dArmRP technology. Nonetheless, the structures do not fully reflect the binding behavior in solution, that is, some binding sites remain unoccupied in the crystal and even unexpected peptide residues appear to be bound. We show how these differences can be explained by restrictions of the crystal lattice or the composition of the crystallization solution. This illustrates that crystal structures have to be interpreted with caution when protein–peptide interactions are characterized, and should always be correlated with measurements in solution.

16.
Capping interactions associated with specific sequences at or near the ends of alpha-helices are important determinants of the stability of protein secondary and tertiary structure. We investigate here the role of the helix-capping motif Ser-X-X-Glu, a sequence that occurs frequently at the N termini of alpha helices in proteins, on the conformation and stability of the GCN4 leucine zipper. The 1.8 Å resolution crystal structure of the capped molecule reveals distinct conformations, packing geometries and hydrogen-bonding networks at the amino terminus of the two helices in the leucine zipper dimer. The free energy of helix stabilization associated with the hydrogen-bonding and hydrophobic interactions in this capping structure is -1.2 kcal/mol, evaluated from thermal unfolding experiments. A single cap thus contributes appreciably to stabilizing the terminated helix and thereby the native state. These results suggest that helix capping plays a further role in protein folding, providing a sensitive connector linking alpha-helix formation to the developing tertiary structure of a protein.

17.
We present an analysis of the water molecules immobilized at the protein-protein interfaces of 115 homodimeric proteins and 46 protein-protein complexes, and compare them with 173 large crystal packing interfaces representing nonspecific interactions. With an average of 15 waters per 1000 Å² of interface area, the crystal packing interfaces are more hydrated than the specific interfaces of homodimers and complexes, which have 10-11 waters per 1000 Å², reflecting the more hydrophilic composition of crystal packing interfaces. Very different patterns of hydration are observed: Water molecules may form a ring around interfaces that remain "dry," or they may permeate "wet" interfaces. A majority of the specific interfaces are dry and most of the crystal packing interfaces are wet, but counterexamples exist in both categories. Water molecules at interfaces form hydrogen bonds with protein groups, with a preference for the main-chain carbonyl and the charged side-chains of Glu, Asp, and Arg. These interactions are essentially the same in specific and nonspecific interfaces, and very similar to those observed elsewhere on the protein surface. Water-mediated polar interactions are as abundant at the interfaces as direct protein-protein hydrogen bonds, and they may contribute to the stability of the assembly.

18.
Protein-protein crystal-packing contacts. (Cited 3 times in total: 1 self-citation, 2 by others.)
Protein-protein contacts in monomeric protein crystal structures have been analyzed and compared to the physiological protein-protein contacts in oligomerization. A number of features differentiate the crystal-packing contacts from the natural contacts occurring in multimeric proteins. The area of the protein surface patches involved in packing contacts is generally smaller and its amino acid composition is indistinguishable from that of the protein surface accessible to the solvent. The fraction of protein surface in crystal contacts is very variable and independent of the number of packing contacts. The thermal motion at the crystal packing interface is intermediate between that of the solvent-accessible surface and that of the protein core, even for large packing interfaces, though the tendency is to be closer to that of the core. These results suggest that protein crystallization depends on random protein-protein interactions, which have little in common with physiological protein-protein recognition processes, and that the possibility of engineering macromolecular crystallization to improve crystal quality could be widened.

19.
20.
Circularly permuted fluorescent proteins (FPs) have a growing number of uses in live cell fluorescence biosensing applications. Most notably, they enable the construction of single fluorescent protein‐based biosensors for Ca²⁺ and other analytes of interest. Circularly permuted FPs are also of great utility in the optimization of fluorescence resonance energy transfer (FRET)‐based biosensors by providing a means for varying the critical dipole–dipole orientation. We have previously reported on our efforts to create circularly permuted variants of a monomeric red FP (RFP) known as mCherry. In our previous work, we had identified six distinct locations within mCherry that tolerated the insertion of a short peptide sequence. Creation of circularly permuted variants with new termini at the locations corresponding to the sites of insertion led to the discovery of three permuted variants that retained no more than 18% of the brightness of mCherry. We now report the extensive directed evolution of the variant with new termini at position 193 of the protein sequence for improved fluorescent brightness. The resulting variant, known as cp193g7, has 61% of the intrinsic brightness of mCherry and was found to be highly tolerant of circular permutation at other locations within the sequence. We have exploited this property to engineer an expanded series of circularly permuted variants with new termini located along the length of the 10th β‐strand of mCherry. These new variants may ultimately prove useful for the creation of single FP‐based Ca²⁺ biosensors.
