Similar literature
 Found 20 similar articles; search took 0 ms
1.
In this work, we discovered a fundamental connection between selection for protein stability and emergence of preferred structures of proteins. Using a standard exact three-dimensional lattice model we evolve sequences starting from random ones and determine the exact native structure after each mutation. Acceptance of mutations is biased to select for stable proteins. We found that certain structures, "wonderfolds", are independently discovered numerous times as native states of stable proteins in many unrelated runs of selection. The strong dependence of lattice fold usage on the structural determinant of designability quantitatively reproduces uneven fold usage in natural proteins. Diversity of sequences that fold into wonderfold structures gives rise to superfamilies, i.e. sets of dissimilar sequences that fold into the same or very similar structures. The present work establishes a model of pre-biotic structure selection, which identifies dominant structural patterns emerging upon optimization of proteins for survival in a hot environment. Convergently discovered pre-biotic initial superfamilies with wonderfold structures could have served as a seed for subsequent biological evolution involving gene duplications and divergence.  相似文献   

2.
3.
A series of 2-deoxy-hexopyranosyl derivatives of uridine have been synthesized as analogues of UDP-sugar. These compounds were tested as inhibitors against bovine β-1,4-galactosyltransferase I in fluorescent assays and showed no significant inhibition.  相似文献   

4.
Tungtur S  Parente DJ  Swint-Kruse L 《Proteins》2011,79(5):1589-1608
Concomitant with the genomic era, many bioinformatics programs have been developed to identify functionally important positions from sequence alignments of protein families. To evaluate these analyses, many have used the LacI/GalR family and determined whether positions predicted to be "important" are validated by published experiments. However, we previously noted that predictions do not identify all of the experimentally important positions present in the linker regions of these homologs. In an attempt to reconcile these differences, we corrected and expanded the LacI/GalR sequence set commonly used in sequence/function analyses. Next, a variety of analyses were carried out (1) for the entire LacI/GalR sequence set and (2) for a subset of homologs with functionally important "YxPxxxAxxL" motifs in their linkers. This strategy was devised to determine whether predictions could be improved by knowledge-based sequence sorting and, for some analyses, did increase the number of linker positions identified. However, two functionally important linker positions were not reliably identified by any analysis. Finally, we compared the new predictions to all known experimental data for E. coli LacI and three homologous linkers. From these, we estimate that >50% of positions are important to the functions of the LacI/GalR homologs. In corollary, neutral positions might occur less frequently and might be easier to detect in sequence analyses. Although analyses have successfully guided mutations that partially exchange protein functions, a better experimental understanding of the sequence/function relationships in protein families would be helpful for uncovering the remaining rules used by nature to evolve new protein functions.

5.
The structure of a previously calculated transition state (TS) was used to design the [tetrahydro-2-(methylthio)furan-2-yl]methyl phosphate dianion (1) as a new scaffold for transition-state analogs of reactions catalyzed by the inverting glycosyltransferases. This scaffold contains relevant features of the donor and acceptor and represents a new type of potential inhibitors for these enzymes. Available conformational space of 1 was explored using DFT quantum chemical methods by means of two-dimensional potential-energy maps calculated as a function of Phi, Psi, and omega dihedral angles at the B3LYP/6-31+G* level. The calculated potential energy surfaces revealed the existence of several low-energy domains. Structures from these regions were refined at the 6-311++G** level and led to 14 conformers. The stability of conformers is influenced by their environment, and in aqueous solution two conformers dominate the equilibrium. A superposition of calculated conformers with the predicted TS structure revealed that the preferred conformers in solution nicely mimic structural features of the TS. These results imply that 1 has structural properties required to mimic the TS and therefore can be used as a scaffold for further development of TS-analog inhibitors for retaining glycosyltransferases.  相似文献   

6.
Using a data set of aligned protein domain superfamilies of known three-dimensional structure, we compared the location of interdomain interfaces on the tertiary folds between members of distantly related protein domain superfamilies. The data set analyzed is comprised of interdomain interfaces, with domains occurring within a polypeptide chain and those between two polypeptide chains. We observe that, in general, the interfaces between protein domains are formed entirely in different locations on the tertiary folds in such pairs. This variation in the location of interface happens in protein domains involved in a wide range of functions, such as enzymes, adapters, and domains that bind protein ligands, or cofactors. While basic biochemical functionality is preserved at the domain superfamily level, the effect of biochemical function on protein assemblies is different in these protein domains related by superfamily. The divergence between proteins, in most cases, is coupled with domain recruitment, with different modes of interaction with the recruited domain. This is in complete contrast to the observation that in closely related homologous protein domains, almost always the interaction interfaces are topologically equivalent. In a small subset of interacting domains within proteins related by remote homology, we observe that the relative positioning of domains with respect to one another is preserved. Based on the analysis of multidomain proteins of known or unknown structure, we suggest that variation in protein-protein interactions in members within a superfamily could serve as diverging points in otherwise parallel metabolic or signaling pathways. We discuss a few representative cases of diverging pathways involving domains in a superfamily.  相似文献   

7.
O-antigens of Gram negative bacteria are polysaccharides covalently attached to lipopolysaccharides (LPS) that have roles as virulence factors. Due to the lack of defined substrates for in vitro assays only a few of the enzymes involved in the biosynthesis of O-antigens have been studied. Many O-antigens have GlcNAc at the reducing end of the oligosaccharide chain linked to pyrophosphate-lipid. We therefore designed and synthesized a series of GlcNAc-pyrophosphate-lipid analogs of the natural GlcNAc-pyrophosphate-undecaprenol acceptor substrate for studies of the acceptor specificities of O-antigen biosynthetic enzymes. We synthesized analogs with modifications of the pyrophosphate bond as well as the lipid chain. These compounds will be useful for the specificity studies of many bacterial glycosyltransferases. Knowledge of the substrate specificities is the basis for the development of specific glycosyltransferase inhibitors that could block O-antigen biosynthesis.  相似文献   

8.
Hydrophobic cores are fundamental structural properties of proteins typically associated with protein folding and stability; however, how the hydrophobic core shapes protein evolution and function is poorly understood. Here, we investigated the role of conserved hydrophobic cores in fold-A glycosyltransferases (GT-As), a large superfamily of enzymes that catalyze formation of glycosidic linkages between diverse donor and acceptor substrates through distinct catalytic mechanisms (inverting versus retaining). Using hidden Markov models and protein structural alignments, we identify similarities in the phosphate-binding cassette (PBC) of GT-As and unrelated nucleotide-binding proteins, such as UDP-sugar pyrophosphorylases. We demonstrate that GT-As have diverged from other nucleotide-binding proteins through structural elaboration of the PBC and its unique hydrophobic tethering to the F-helix, which harbors the catalytic base (xED-Asp). While the hydrophobic tethering is conserved across diverse GT-A fold enzymes, some families, such as B3GNT2, display variations in tethering interactions and core packing. We evaluated the structural and functional impact of these core variations through experimental mutational analysis and molecular dynamics simulations and find that some of the core mutations (T336I in B3GNT2) increase catalytic efficiency by modulating the conformational occupancy of the catalytic base between “D-in” and acceptor-accessible “D-out” conformation. Taken together, our studies support a model of evolution in which the GT-A core evolved progressively through elaboration upon an ancient PBC found in diverse nucleotide-binding proteins, and malleability of this core provided the structural framework for evolving new catalytic and substrate-binding functions in extant GT-A fold enzymes.  相似文献   

9.
It is often possible to identify sequence motifs that characterize a protein family in terms of its fold and/or function from aligned protein sequences. Such motifs can be used to search for new family members. Partitioning of sequence alignments into regions of similar amino acid variability is usually done by hand. Here, I present a completely automatic method for this purpose: one that is guaranteed to produce globally optimal solutions at all levels of partition granularity. The method is used to compare the tempo of sequence diversity across reliable three-dimensional (3D) structure-based alignments of 209 protein families (HOMSTRAD) and that for 69 superfamilies (CAMPASS). (The mean alignment lengths for HOMSTRAD and CAMPASS are very similar.) Surprisingly, the optimal segmentation distributions for the closely related proteins and distantly related ones are found to be very similar. Also, optimal segmentation identifies an unusual protein superfamily. Finally, protein 3D structure clues from the tempo of sequence diversity across alignments are examined. The method is general, and could be applied to any area of comparative biological sequence and 3D structure analysis where the constraint of the inherent linear organization of the data imposes an ordering on the set of objects to be clustered.
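Note: the abstract above does not state the algorithm or the objective it optimizes. As a rough illustration only, the sketch below shows one common way to obtain a globally optimal partition of an ordered profile into k contiguous segments: dynamic programming that minimizes the within-segment sum of squared deviations. The function name `optimal_segmentation`, the squared-error cost, and the toy variability values are all assumptions made for this example, not the author's method.

```python
def optimal_segmentation(values, k):
    """Split `values` into k contiguous segments minimizing the total
    within-segment sum of squared deviations from the segment mean.

    Returns (total_cost, [(start, end), ...]) with half-open index ranges.
    The cost function is an assumption chosen for illustration.
    """
    n = len(values)
    if not 1 <= k <= n:
        raise ValueError("k must be between 1 and len(values)")

    # Prefix sums of values and squared values give O(1) segment costs.
    s1, s2 = [0.0], [0.0]
    for v in values:
        s1.append(s1[-1] + v)
        s2.append(s2[-1] + v * v)

    def cost(i, j):
        # Sum of squared deviations for values[i:j], with j > i.
        total = s1[j] - s1[i]
        return (s2[j] - s2[i]) - total * total / (j - i)

    inf = float("inf")
    # best[m][j]: minimal cost of splitting values[:j] into m segments.
    best = [[inf] * (n + 1) for _ in range(k + 1)]
    back = [[0] * (n + 1) for _ in range(k + 1)]
    best[0][0] = 0.0
    for m in range(1, k + 1):
        for j in range(m, n + 1):
            for i in range(m - 1, j):
                candidate = best[m - 1][i] + cost(i, j)
                if candidate < best[m][j]:
                    best[m][j] = candidate
                    back[m][j] = i

    # Walk the back-pointers to recover the segment boundaries.
    segments = []
    j = n
    for m in range(k, 0, -1):
        i = back[m][j]
        segments.append((i, j))
        j = i
    segments.reverse()
    return best[k][n], segments


if __name__ == "__main__":
    # Toy per-column variability profile (made-up numbers).
    profile = [1, 1, 1, 5, 5, 5]
    print(optimal_segmentation(profile, 2))
```

Because the table is filled for every segment count up to k, the same pass yields optimal partitions at each coarser granularity, which is in the spirit of the "all levels of partition granularity" phrasing, though the published method may differ.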

10.
The dsDNA-tailed bacteriophages are probably the largest evolving group in the Biosphere and they are arguably very ancient. Comparative examination of genomes indicates that the hallmark of phage evolution is horizontal exchange of sequences. This is accomplished, first, by rampant non-homologous recombination between different genomes and, second, by reassortment of the variant sequences so created through homologous recombination. The comparative analysis suggests mechanisms by which new genes can be added to phage genomes and by which genes with novel functions may be assembled from parts. Horizontal exchange of sequences occurs most frequently among closely related phages, but it also extends across the entire global population at lower frequency. Bacteriophages also have probable ancestral connections with viruses of eukaryotes and archaea.  相似文献   

11.
An enzyme-linked lectin binding assay (ELBA) has been developed for the detection of soluble lectin binding substances (receptors) and the determination of their relative affinity for the lectin. The assay is based on competitive binding to enzyme-labeled lectin of a known lectin receptor, bound to a solid phase, and unknown sample receptors. In this paper the assay is exemplified with the mannose/glucose-specific pea lectin, with the glycoprotein ovalbumin as its receptor, and with horseradish peroxidase (EC 1.11.1.7) as the enzyme used for labeling. A method was also developed for the preparation of peroxidase-labeled lectin. Labeling was started by mixing equimolar amounts of lectin and periodate-oxidized enzyme at pH 4.5 at a final concentration of 10^-4 M, after which conjugation was started by raising the pH to 9.5. This resulted in complete conjugation, after which the product could be diluted 50-500 times for application in ELBA. For the ELBA, ovalbumin was adsorbed onto polystyrene microtiter plates. Sample receptors, added together with the enzyme-labeled lectin, inhibited binding of the latter to ovalbumin. Bound enzyme activity was colorimetrically determined after addition of o-phenylenediamine. Relative lectin affinity (KL) was expressed as (formula; see text) in which [X]50% is the concentration of sample receptor necessary to inhibit 50% of the binding of a certain amount of lectin, and [M]50% is the concentration of D-mannose necessary to inhibit 50% binding of the same amount of lectin. With this technique lectin affinity of both monovalent and polyvalent lectin binding substances can be estimated: low KL values mean high lectin affinity.

12.
A comprehensive analysis of the quaternary features of distantly related homo‐oligomeric proteins is the focus of the current study. This study has been performed at the levels of quaternary state, symmetry, and quaternary structure. Quaternary state and quaternary structure refer to the number of subunits and the spatial arrangement of subunits, respectively. Using a large dataset of available 3D structures of biologically relevant assemblies, we show that only 53% of the distantly related homo‐oligomeric proteins have the same quaternary state. Considering these homologous homo‐oligomers with the same quaternary state, conservation of quaternary structures is observed only in 38% of the pairs. In 36% of the pairs of distantly related homo‐oligomers with different quaternary states, the larger assembly in a pair shows high structural similarity with the entire quaternary structure of the related protein with lower quaternary state; this is referred to as the “Russian doll effect.” The differences in quaternary state and structure have been suggested to contribute to the functional diversity. Detailed investigations show that even though the gross functions of many distantly related homo‐oligomers are the same, finer level differences in molecular functions are manifested by differences in quaternary states and structures. Comparison of structures of biological assemblies in distantly and closely related homo‐oligomeric proteins throughout the study differentiates the effects of sequence divergence on the quaternary structures and function. Knowledge inferred from this study can provide insights for improved protein structure classification and function prediction of homo‐oligomers. Proteins 2016; 84:1190–1202. © 2016 Wiley Periodicals, Inc.

13.
The intron/exon organization of the human gene for glycogen phosphorylase has been determined. The segments of the polypeptide chain that correspond to the 19 exons of the gene are examined for relationships between the three-dimensional structure of the protein and gene structure. Only weak correlations are observed between domains of phosphorylase and exons. The nucleotide binding domains that are found in phosphorylase and other glycolytic enzymes are examined for relationships between exons of the genes and structures of the domains. When mapped to the three-dimensional structures, the intron/exon boundaries are shown to be widely distributed in this family of protein domains.

14.
Patterns of alternation of hydrophobic and polar residues are a profound aspect of amino acid sequences, but a feature not easily interpreted for soluble proteins. Here we report statistics of hydrophobicity patterns in proteins of known structure in a current protein database as compared with results from earlier, more limited structure sets. Previous studies indicated that long hydrophobic runs, common in membrane proteins, are underrepresented in soluble proteins. Long runs of hydrophobic residues remain significantly underrepresented in soluble proteins, with none longer than 16 residues observed. These long runs most commonly occur as buried alpha helices, with extended hydrophobic strands less common. Avoiding aggregation of partially folded intermediates during intracellular folding remains a viable explanation for the rarity of long hydrophobic runs in soluble proteins. Comparison between database editions reveals robustness of statistics on aqueous proteins despite an approximately twofold increase in nonredundant sequences. The expanded database does now allow us to explain several deviations of hydrophobicity statistics from models of random sequence in terms of requirements of specific secondary structure elements. Comparison to prior membrane-bound protein sequences, however, shows significant qualitative changes, with the average hydrophobicity and frequency of long runs of hydrophobic residues noticeably increasing between the database editions. These results suggest that the aqueous proteins of solved structure may represent an essentially complete sample of the universe of aqueous sequences, while the membrane proteins of known structure are not yet representative of the universe of membrane-associated proteins, even by relatively simple measures of hydrophobic patterns.  相似文献   
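Note: the statistic discussed in the abstract above, the length of runs of consecutive hydrophobic residues, can be illustrated with a short sketch. The hydrophobic residue set `HYDROPHOBIC` below is an assumption made for this example (the abstract does not say which residues were counted as hydrophobic), and the function names and test sequence are hypothetical.

```python
# Assumed hydrophobic alphabet, for illustration only; the study's actual
# residue classification is not given in the abstract.
HYDROPHOBIC = frozenset("AVLIMFWC")


def hydrophobic_runs(sequence, hydrophobic=HYDROPHOBIC):
    """Return the lengths of maximal runs of consecutive hydrophobic residues."""
    runs = []
    current = 0
    for residue in sequence.upper():
        if residue in hydrophobic:
            current += 1
        else:
            if current:
                runs.append(current)
            current = 0
    if current:
        # Close a run that extends to the end of the sequence.
        runs.append(current)
    return runs


def longest_run(sequence, hydrophobic=HYDROPHOBIC):
    """Length of the longest hydrophobic run, or 0 if there is none."""
    return max(hydrophobic_runs(sequence, hydrophobic), default=0)


if __name__ == "__main__":
    toy_sequence = "AVLKDIIMGF"  # made-up sequence
    print(hydrophobic_runs(toy_sequence))
    print(longest_run(toy_sequence))
```

Collecting the run lengths over a nonredundant set of sequences and comparing their distribution with that expected for shuffled sequences would be one way to look for under-representation of long runs; the abstract does not describe the exact null model used.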

15.
Frenkel ZM  Trifonov EN 《Proteins》2007,67(2):271-284
A new method is proposed to reveal apparent evolutionary relationships between protein fragments with similar 3D structures by finding "intermediate" sequences in the proteomic database. Instead of looking for homologies and intermediates for a whole protein domain, we build a chain of intermediate short sequences, which allows one to link similar structural modules of proteins belonging to the same or different families. Several such chains of intermediates can be combined into an evolutionary tree of structural protein modules. All calculations were made for protein fragments of 20 aa residues. Three evolutionary trees for different module structures are described. The aim of the paper is to introduce the new method and to demonstrate its potential for protein structural predictions. The approach also opens new perspectives for protein evolution studies.  相似文献   

16.
We have isolated four members of the Arabidopsis cyclophilin (CyP) gene family, designated ROC1 to ROC4 (rotamase CyP). Deduced peptides of ROC1, 2 and 3 are 75% to 91% identical to Brassica napus cytosolic CyP, contain no leader peptides and include a conserved seven amino-acid insertion relative to mammalian cytosolic CyPs. Two other Arabidopsis CyPs, ROC5 (43H1; ATCYP1) and ROC6 (ATCYP2), share these features. ROC1, ROC2, ROC3 and ROC5 are expressed in all tested organs of light-grown plants. ROC2 and ROC5 show elevated expression in flowers. Expression of ROC1, ROC2, and ROC3 decreases in darkness and these genes also exhibit small elevations in expression upon wounding. The five Arabidopsis genes encoding putative cytosolic CyPs (ROC1, 2, 3, 5 and 6) contain no introns. In contrast, ROC4, which encodes a chloroplast stromal CyP, is interrupted by six introns. ROC4 is not expressed in roots, and is strongly induced by light. Phylogenetic trees of all known CyPs and CyP-related proteins provide evidence of possible horizontal transfer of CyP genes between prokaryotes and eukaryotes and of a possible polyphyletic origin of these proteins within eukaryotes. These trees also show significant grouping of eukaryotic CyPs on the basis of subcellular localization and structure. Mitochondrial CyPs are closely related to cytosolic CyPs of the source organism, but endoplasmic reticulum CyPs form separate clades. Known plant CyPs fall into three clades, one including the majority of higher-plant cytosolic CyPs, one including only ROC2 and a related rice CyP, and one including only chloroplast CyPs.  相似文献   

17.
In the "fold approach" proteins with a similar fold but different sequences are compared in order to investigate the relationship between native state structure and folding behaviour. Here we compare the properties of the transition states for folding of TI I27, the 27th immunoglobulin domain from human cardiac titin, and that of TNfn3, the third fibronectin type III domain from human tenascin. Experimental phi-values were used as restraints in molecular dynamics simulations to determine the structures that make up the transition state ensembles (TSEs) for folding of the two proteins. The restrained simulations that we present allow a detailed structural comparison of the two TSEs to be made. Further calculations show explicitly that for both proteins the formation of the interactions involving the residues in the folding nucleus is sufficient for the establishment of the topology of the Ig-like fold. We found that, although the folding nuclei of the two proteins are similar, the packing of the folding nucleus of TI I27 is much tighter than that of TNfn3, reflecting the higher experimental phi-values and beta(T) (Tanford Beta) of TI I27. These results suggest that the folding nucleus can be significantly deformed to accommodate extensive sequence variation while conserving the same folding mechanism.  相似文献   

18.
Lateral gene transfer plays an important role in the evolution of life. Events of ancient gene transfer can transmit genetic novelties to descendent lineages and subsequently shape their genetic systems. We here present the analyses of the gene encoding tyrosyl-tRNA synthetase (tyrRS), which reveal two eukaryotic tyrRS lineages, one including the opisthokonts and the other the remaining eukaryotes. The different origins of tyrRS lineages between the opisthokonts and the remaining eukaryotes indicate a likely case of ancient lateral gene transfer of tyrRS from an archaeon to the opisthokonts, which lends further support for the monophyly of the latter group. Ancient paralogy followed by differential gene loss is an alternative, albeit less parsimonious explanation for the distribution of the two eukaryotic tyrRS types. In either case, the presence of a haloarchaeal tyrRS type in the opisthokonts marks this group as monophyletic. This finding also points to the potential utility of ancient gene transfer events as molecular markers for major organismal lineages.  相似文献   

19.
The soluble proteins of the eggs of the coleopteran insect Anthonomus grandis Boheman, the cotton boll weevil, consist almost entirely of two vitellin types with Mrs of 160,000 and 47,000. We sequenced their N-terminal ends and one internal cyanogen bromide fragment of the large vitellin and compared these sequences with the deduced amino acid sequence from the vitellogenin gene. The results suggest that both the boll weevil vitellin proteins are products of the proteolytic cleavage of a single precursor protein. The smaller 47,000 Mr vitellin protein is derived from the N-terminal portion of the precursor adjacent to an 18 amino acid signal peptide. The cleavage site between the large and small vitellins at amino acid 362 is adjacent to a pentapeptide sequence containing two pairs of arginine residues. Comparison of the boll weevil sequences with limited known sequences from the single 180,000 Mr honey bee protein shows that the honey bee vitellin N-terminal exhibits sequence homology to the N-terminal of the 47,000 Mr boll weevil vitellin. Treatment of the vitellins with an N-glycosidase results in a decrease in molecular weight of both proteins, from 47,000 to 39,000 and from 160,000 to 145,000, indicating that about 10–15% of the molecular weight of each vitellin consists of N-linked carbohydrate. The molecular weight of the deglycosylated large vitellin is smaller than that predicted from the gene sequence, indicating possible further proteolytic processing at the C-terminal of that protein. © 1993 Wiley-Liss, Inc.
  • 1 This article is a US Government work and, as such, is in the public domain in the United States of America.

    20.
    Nam K  Gao J  York DM 《RNA (New York, N.Y.)》2008,14(8):1501-1507
    Molecular dynamics simulations using a combined quantum mechanical/molecular mechanical potential are used to determine the two-dimensional free energy profiles for the mechanism of RNA transphosphorylation in solution and catalyzed by the hairpin ribozyme. A mechanism is explored whereby the reaction proceeds without explicit chemical participation by conserved nucleobases in the active site. The ribozyme lowers the overall free energy barrier by up to 16 kcal/mol, accounting for the majority of the observed rate enhancement. The barrier reduction in this mechanism is achieved mainly by the electrostatic environment provided by the ribozyme without recruitment of active site nucleobases as acid or base catalysts. The results establish a baseline mechanism that invokes only the solvation and specific hydrogen-bonding interactions present in the ribozyme active site and provide a departure point for the exploration of alternate mechanisms where nucleobases play an active chemical role.  相似文献   
