Similar articles
1.
Maps that relate all possible genotypes or phenotypes to fitness (fitness landscapes) are central to the evolution of life, but remain poorly known. An insertion or a deletion (indel) of one or several amino acids constitutes a substantial leap of a protein within the space of amino acid sequences, and it is unlikely that after such a leap the new sequence corresponds precisely to a fitness peak. Thus, one can expect an indel in the protein-coding sequence that gets fixed in a population to be followed by some number of adaptive amino acid substitutions, which move the new sequence towards a nearby fitness peak. Here, we study substitutions that occur after a frame-preserving indel in evolving proteins of Drosophila. An insertion triggers 1.03 ± 0.75 amino acid substitutions within the protein region centred at the site of insertion, and a deletion triggers 4.77 ± 1.03 substitutions within such a region. The difference between these values is probably owing to a higher fraction of effectively neutral insertions. Almost all of the triggered amino acid substitutions can be attributed to positive selection, and most of them occur relatively soon after the triggering indel and take place upstream of its site. A high fraction of substitutions that follow an indel occur at previously conserved sites, suggesting that an indel substantially changes selection that shapes the protein region around it. Thus, an indel is often followed by an adaptive walk whose length agrees with the theory of molecular adaptation.

2.
Journal of Molecular Biology, 2019, 431(10): 1981-1992
Interactions between mutations play a central role in shaping the fitness landscape, but a clear picture of intragenic epistasis has yet to emerge. To further reveal the prevalence and patterns of intragenic epistasis, we present a survey of epistatic interactions between sequential mutations in TEM-1 β-lactamase. We measured the fitness effect of ~12,000 pairs of consecutive amino acid substitutions and used our previous study of the fitness effects of single amino acid substitutions to calculate epistasis for over 8000 mutation pairs. Since sequential mutations are prone to physically interact, we postulated that our study would be surveying specific epistasis instead of nonspecific epistasis. We found widespread negative epistasis, especially in beta-strands, and a high frequency of negative sign epistasis among individually beneficial mutations. Negative epistasis (52%) occurred 7.6 times as frequently as positive epistasis (6.8%). Buried residues experienced more negative epistasis than surface-exposed residues. However, TEM-1 exhibited a couple of hotspots for positive epistasis, most notably L221/R222, at which many combinations of mutations positively interacted. This study is the first to systematically examine pairwise epistasis throughout an entire protein performing its native function in its native host.
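
To make the epistasis calculation in this abstract concrete, the following is a minimal sketch of how pairwise epistasis can be derived from single- and double-mutant fitness values. It assumes a multiplicative null model on fitness relative to wild type (wild type = 1.0) and a simplified definition of negative sign epistasis. The function names, the tolerance, and the example numbers are hypothetical and are not taken from the study.

```python
import math


def epistasis(w_a: float, w_b: float, w_ab: float) -> float:
    """Log-scale epistasis under an assumed multiplicative null model.

    All fitness values are relative to wild type (wild type = 1.0).
    A value of 0 means the double mutant is exactly as fit as the
    product of the two single-mutant fitness values predicts.
    """
    return math.log(w_ab) - (math.log(w_a) + math.log(w_b))


def classify(eps: float, tol: float = 0.05) -> str:
    """Label an epistasis value; tol is an arbitrary illustrative threshold."""
    if eps > tol:
        return "positive"
    if eps < -tol:
        return "negative"
    return "none"


def negative_sign_epistasis(w_a: float, w_b: float, w_ab: float) -> bool:
    """Simplified test: both single mutations are beneficial, yet adding one
    of them to the other's background lowers fitness."""
    both_beneficial = w_a > 1.0 and w_b > 1.0
    return both_beneficial and w_ab < max(w_a, w_b)


# Hypothetical pair: each mutation alone is beneficial, the pair is not.
eps = epistasis(1.2, 1.1, 1.0)
print(classify(eps), negative_sign_epistasis(1.2, 1.1, 1.0))
```

Working on the log scale makes the null expectation additive, so the sign of the result reads directly as positive or negative epistasis.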

3.
Callahan BJ. Fly, 2012, 6(1): 16-20
Central to the study of molecular evolution, and an area of long-standing debate, is the appropriate model for the fitness landscape of proteins. Much of this debate has focused on the strength and frequency of positive and purifying selection, but the form and frequency of selective correlations is also a vital element. The constituent amino acids within a protein generically interact and share selective pressures in predictable ways, which conflicts with the selective independence assumed by common caricatures of the fitness landscape. Here, I discuss a recent study by my coauthors and me that used whole-genome comparisons of orthologous molecular sequences from closely related drosophilids to explore the form of the selective correlations and selective interactions (epistasis) between the amino acids within a protein. I outline our results and highlight our finding of a selective length scale of ten amino acids within which individual amino acids are substantially and generically more likely to share selective pressures and interact epistatically. I then focus on the evidence presented in our study supporting a substantial role for epistasis in the process of molecular evolution, and discuss further the implications of this widespread epistasis for the overdispersion of the molecular clock and the efficacy of common tests for positive selection.

4.
Earlier studies of a group of monoclonal antibody-resistant (mar) mutants of herpes simplex virus type 1 glycoprotein C (gC) operationally defined two distinct antigenic sites on this molecule, each consisting of numerous overlapping epitopes. In this report, we further define epitopes of gC by sequence analysis of the mar mutant gC genes. In 18 mar mutants studied, the mar phenotype was associated with a single nucleotide substitution and a single predicted amino acid change. The mutations were localized to two regions within the coding sequence of the external domain of gC and correlated with the two previously defined antigenic sites. The predicted amino acid substitutions of site I mutants resided between residues Gln-307 and Pro-373, whereas those of site II mutants occurred between amino acids Arg-129 and Glu-247. Of the 12 site II mutations, 9 induced amino acid substitutions within an arginine-rich segment of 8 amino acids extending from residues 143 to 151. The clustering of the majority of substituted residues suggests that they contribute to the structure of the affected sites. Moreover, the patterns of substitutions which affected recognition by antibodies with similar epitope specificities provided evidence that epitope structures are physically linked and overlap within antigenic sites. Of the nine epitopes defined on the basis of mutations, three were located within site I and six were located within site II. Substituted residues affecting the site I epitopes did not overlap substituted residues of site II, supporting our earlier conclusion that sites I and II reside in spatially distinct antigenic domains. A computer analysis of the distribution of charged residues and the predicted secondary structural features of wild-type gC revealed that the two antigenic sites reside within the most hydrophilic regions of the molecule and that the antigenic residues are likely to be organized as beta sheets which loop out from the surface of the molecule. 
Together, these data and our previous studies support the conclusion that the mar mutations identified by sequence analysis very likely occur within or near the epitope structures themselves. Thus, two highly antigenic regions of gC have now been physically and genetically mapped to well-defined domains of the protein molecule.

5.
Understanding the patterns and causes of protein sequence evolution is a major challenge in evolutionary biology. One of the critical unresolved issues is the relative contribution of selection and genetic drift to the fixation of amino acid sequence differences between species. Molecular homoplasy, the independent evolution of the same amino acids at orthologous sites in different taxa, is one potential signature of selection; however, relatively little is known about its prevalence in eukaryotic proteomes. To quantify the extent and type of homoplasy among evolving proteins, we used phylogenetic methodology to analyze 8 genome-scale data matrices from clades of different evolutionary depths that span the eukaryotic tree of life. We found that the frequency of homoplastic amino acid substitutions in eukaryotic proteins was more than 2-fold higher than expected under neutral models of protein evolution. The overwhelming majority of homoplastic substitutions were parallelisms that involved the most frequently exchanged amino acids with similar physicochemical properties and that could be reached by a single-mutational step. We conclude that the role of homoplasy in shaping the protein record is much larger than generally assumed, and we suggest that its high frequency can be explained by both weak positive selection for certain substitutions and purifying selection that constrains substitutions to a small number of functionally equivalent amino acids.

6.
The pattern of amino acid substitutions and sequence conservation over many structure-based alignments of protein sequences was analyzed as a function of percentage sequence identity. The statistics of the amino acid substitutions were converted into the form of log-odds amino acid substitution matrices to which eigenvalue decomposition was applied. It was found that the most important component of the substitution matrices exhibited a sharp transition at the sequence identity of 30-35%, which coincides with the twilight zone. Above the transition point, the most dominant component is related to the mutability of amino acids and it acts to disfavor any substitutions, whereas below the transition point, the most dominant component is related to the hydrophobicity of amino acids and substitutions between residues of similar hydrophobic character are positively favored. Implications for protein evolution and sequence analysis are discussed.
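
The two steps named in this abstract, building a log-odds substitution matrix and extracting its dominant eigen-component, can be sketched as follows. This is an illustrative toy under stated assumptions, not the authors' procedure. It uses a hypothetical two-letter alphabet, invented pair counts, and no pseudocounts. Plain power iteration stands in for a full eigenvalue decomposition and returns only the component of largest magnitude.

```python
import math


def log_odds_matrix(pair_counts, alphabet):
    """Return S[a][b] = log2(q_ab / (p_a * p_b)) from symmetric aligned-pair counts.

    Assumes every residue pair has a nonzero count (no pseudocounts here).
    """
    total = sum(pair_counts[(a, b)] for a in alphabet for b in alphabet)
    q = {(a, b): pair_counts[(a, b)] / total for a in alphabet for b in alphabet}
    p = {a: sum(q[(a, b)] for b in alphabet) for a in alphabet}
    return [[math.log2(q[(a, b)] / (p[a] * p[b])) for b in alphabet]
            for a in alphabet]


def dominant_component(matrix, iterations=500):
    """Power iteration: eigenvalue of largest magnitude and its unit eigenvector.

    Assumes a symmetric matrix with a unique largest-magnitude eigenvalue.
    """
    n = len(matrix)
    v = [float(i + 1) for i in range(n)]  # deliberately non-uniform start vector
    for _ in range(iterations):
        w = [sum(matrix[i][j] * v[j] for j in range(n)) for i in range(n)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    mv = [sum(matrix[i][j] * v[j] for j in range(n)) for i in range(n)]
    eigenvalue = sum(v[i] * mv[i] for i in range(n))  # Rayleigh quotient
    return eigenvalue, v


# Hypothetical two-letter alphabet: H (hydrophobic) and P (polar).
counts = {("H", "H"): 40, ("H", "P"): 10, ("P", "H"): 10, ("P", "P"): 40}
S = log_odds_matrix(counts, ["H", "P"])
eigenvalue, vector = dominant_component(S)
print(round(eigenvalue, 6), [round(x, 3) for x in vector])
```

In this toy case the dominant eigenvector has opposite signs for H and P, which is the kind of pattern that lets a component be read as a physicochemical property such as hydrophobicity.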

7.
Jack da Silva. Genetics, 2009, 182(1): 265-275
The frequently reported amino acid covariation of the highly polymorphic human immunodeficiency virus type 1 (HIV-1) exterior envelope glycoprotein V3 region has been assumed to reflect fitness epistasis between residues. However, nonrandom association of amino acids, or linkage disequilibrium, has many possible causes, including population subdivision. If the amino acids at a set of sequence sites differ in frequencies between subpopulations, then analysis of the whole population may reveal linkage disequilibrium even if it does not exist in any subpopulation. HIV-1 has a complex population structure, and the effects of this structure on linkage disequilibrium were investigated by estimating within- and among-subpopulation components of variance in linkage disequilibrium. The amino acid covariation previously reported is explained by differences in amino acid frequencies among virus subpopulations in different patients and by nonsystematic disequilibrium among patients. Disequilibrium within patients appears to be entirely due to differences in amino acid frequencies among sampling time points and among chemokine coreceptor usage phenotypes of virus particles, but not source tissues. Positive selection explains differences in allele frequencies among time points and phenotypes, indicating that these differences are adaptive rather than due to genetic drift. However, the absence of a correlation between linkage disequilibrium and phenotype suggests that fitness epistasis is an unlikely cause of disequilibrium. Indeed, when population structure is removed by analyzing sequences from a single time point and phenotype, no disequilibrium is detectable within patients. These results caution against interpreting amino acid covariation and coevolution as evidence for fitness epistasis.
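
The central argument of this abstract, that pooling subpopulations with different residue frequencies produces linkage disequilibrium even when none exists within any subpopulation, can be checked with a small numerical sketch. The two-site, two-residue model, the frequencies, and the function names below are hypothetical. The disequilibrium coefficient used is the standard D = p(XY) − p(X)·p(Y).

```python
def haplotype_freqs(p_x, p_y, d=0.0):
    """Frequencies of the four two-site haplotypes, given the frequency of
    residue X at site 1, residue Y at site 2, and disequilibrium d."""
    return {
        "XY": p_x * p_y + d,
        "Xy": p_x * (1 - p_y) - d,
        "xY": (1 - p_x) * p_y - d,
        "xy": (1 - p_x) * (1 - p_y) + d,
    }


def disequilibrium(freqs):
    """D = p(XY) - p(X) * p(Y), computed from haplotype frequencies."""
    p_x = freqs["XY"] + freqs["Xy"]
    p_y = freqs["XY"] + freqs["xY"]
    return freqs["XY"] - p_x * p_y


def pool(populations, weights):
    """Weighted mixture of haplotype frequencies across subpopulations."""
    total = sum(weights)
    return {
        hap: sum(w * pop[hap] for pop, w in zip(populations, weights)) / total
        for hap in populations[0]
    }


# Two hypothetical subpopulations, each in linkage equilibrium (d = 0),
# but with very different residue frequencies.
sub1 = haplotype_freqs(0.9, 0.9)
sub2 = haplotype_freqs(0.1, 0.1)
pooled = pool([sub1, sub2], [1, 1])

for name, pop in [("sub1", sub1), ("sub2", sub2), ("pooled", pooled)]:
    print(name, f"D = {disequilibrium(pop):.3f}")
```

With these numbers each subpopulation has D of essentially zero while the pooled sample has D of about 0.16, which is why covariation in a structured sample is not by itself evidence of epistasis.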

8.
Acetyltransferase enzymes target specific lysine residues in substrate proteins. While the list of histone and nonhistone substrates is growing, the mechanisms of substrate selection remain unclear. Here, we describe a mass spectrometric approach to examine the site selection of the acetyltransferase p300 in the HIV-1 protein Tat. Tat is acetylated by p300 at a single lysine (K50) within its basic RNA-binding domain. To determine the sequence requirements for K50 recognition within this domain, we synthesized mixtures of "degenerated" Tat peptides, in which one of the surrounding residues was substituted by all proteinogenic amino acids. Peptide mixtures were assembled based on nonoverlapping peptide masses and acetylated by p300 in a standard in vitro acetylation reaction. Analysis by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry identified amino acid substitutions that prevented acetylation by p300. This approach represents a fast and comprehensive screening method that was applied to the six surrounding residues of K50 in Tat. It can be applied to any known acetyltransferase substrate and might help to define consensus recognition sequences for individual acetyltransferase enzymes.

9.
Biochemical activity and core stability are essential properties of proteins, maintained usually by conserved amino acids. Structural dynamics emerged in recent years as another essential aspect of protein functionality. Structural dynamics enable the protein to adapt to binding substrates and to undergo allosteric transitions, while maintaining the native fold. Key residues that mediate structural dynamics would thus be expected to be conserved or exhibit coevolutionary patterns at least. However, the correlation between sequence evolution and structural dynamics has yet to be established. With recent advances in efficient characterization of structural dynamics, we are now in a position to perform a systematic analysis. In the present study, a set of 34 enzymes representing various folds and functional classes is analyzed using information theory and elastic network models. Our analysis shows that the structural regions distinguished by their coevolution propensity as well as high mobility are predisposed to serve as substrate recognition sites, whereas residues acting as global hinges during collective dynamics are often supported by conserved residues. We propose a mobility scale for different types of amino acids, which tends to vary inversely with amino acid conservation. Our findings suggest the balance between physical adaptability (enabled by structure-encoded motions) and chemical specificity (conferred by correlated amino acid substitutions) underlies the selection of a relatively small set of versatile folds by proteins.

10.
The nucleotide sequence of the nucleoprotein gene of influenza A/NT/60/68 was established after using improved cloning methods to obtain full length cDNA clones in pBR322. The gene is 1565 residues long and codes for a basic protein of 498 amino acids. There are only 30 amino acid differences between it and the homologous sequence in A/PR/8/35, all occurring as point mutations. Assuming a common lineage, the evolutionary rate of divergence of the two strains is 0.18% amino acid per year. This confirms there is a slow but significant rate of evolution.
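
The quoted divergence rate can be reproduced from the numbers in this abstract with simple arithmetic. The sketch below assumes the isolation years can be read from the strain names as written here (1935 and 1968) and that the rate is the uncorrected percentage of differing residues divided by the years separating the two isolates. Whether the authors computed it exactly this way is an assumption, and the function name is hypothetical.

```python
def divergence_rate_percent_per_year(n_differences, protein_length, year_a, year_b):
    """Uncorrected amino acid divergence (percent) per year between two isolates."""
    percent_divergence = 100.0 * n_differences / protein_length
    return percent_divergence / abs(year_b - year_a)


# 30 differences over 498 residues, isolates assumed to date from 1935 and 1968.
rate = divergence_rate_percent_per_year(30, 498, 1935, 1968)
print(f"{rate:.2f}% amino acid divergence per year")  # prints 0.18%
```

The 30 differences amount to about 6.0% of the 498 residues, spread over 33 years.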

11.
The amino acid sequence of a polypeptide defines both the folding pathway and the final three-dimensional structure of a protein. Eighteen amino acid substitutions have been identified in bacteriophage P22 coat protein that are defective in folding and cause their folding intermediates to be substrates for GroEL and GroES. These temperature-sensitive folding (tsf) substitutions identify amino acids that are critical for directing the folding of coat protein. Additional amino acid residues that are critical to the folding process of P22 coat protein were identified by isolating second site suppressors of the tsf coat proteins. Suppressor substitutions isolated from the phage carrying the tsf coat protein substitutions included global suppressors, which are substitutions capable of alleviating the folding defects of numerous tsf coat protein mutants. In addition, potential global and site-specific suppressors were isolated, as well as a group of same site amino acid substitutions that had a less severe phenotype than the tsf parent. The global suppressors were located at positions 163, 166, and 170 in the coat protein sequence and were 8-190 amino acid residues away from the tsf parent. Although the folding of coat proteins with tsf amino acid substitutions was improved by the global suppressor substitutions, GroEL remained necessary for folding. Therefore, we believe that the global suppressor sites identify a region that is critical to the folding of coat protein.

12.
Escherichia coli heat-stable enterotoxin STp is presumed from its DNA sequence to be synthesized in vivo as a 72-amino-acid residue precursor that is cleaved to generate mature STp consisting of the 18 carboxy-terminal amino acid residues. There are two methionine residues in the inferred STp sequence in addition to the methionine residue at position 1. In order to confirm production of the STp 72-amino-acid residue precursor, we substituted the additional methionine residues by oligonucleotide-directed site-specific mutagenesis. Since these substitutions did not cause a significant change in STp production, it can be concluded that STp is normally synthesized as the 72-amino-acid residue precursor. The length of the STp precursor indicated the existence of a pro sequence between the signal peptide and the mature protein. In order to identify the pro sequence and determine its role in protein secretion, deletion and fusion proteins were made. A deletion mutant in which the gene fragment encoding amino acid residues 22 to 53 of STp was removed was made. STp activity was found in the culture supernatant of cells. Amino acid sequence analysis of the purified STp deletion mutant revealed that the pro sequence encompasses amino acid residues 20 to 54. A hybrid protein consisting of STp amino acids 1 to 53 fused in frame from residue 53 to nuclease A was not secreted into the culture supernatant. These results indicate that the pro sequence does not function to guide periplasmic protein into the extracellular milieu.

13.
A mammalian cytoplasmic protein TCP-1, encoded by a gene within the mouse t-complex, has been found to exhibit highly significant (p ≪ 0.00001) sequence homology to the 'chaperonin' family of bacterial and eukaryotic proteins (viz. groEL protein of E. coli, rubisco subunit binding protein of plant chloroplasts, yeast hsp58 and mammalian P1 proteins and 60-65 kDa mycobacterial antigen). With the introduction of a few gaps, the amino acid sequence of TCP-1 shows 60-63% similarity (17-20% identical residues and 42-45% conserved substitutions) throughout its length to various chaperonin proteins, indicating a common evolutionary origin. The sequence data also suggest that in contrast to the endosymbiotic origin of mitochondrial and chloroplast chaperonins, the cytoplasmic TCP-1 may have directly descended from the common universal ancestor via eukaryotic lineage. The observed similarity between TCP-1 and the 60-65 kDa bacterial 'common antigen' is also of importance from the viewpoint of immune/autoimmune response.

14.
MOTIVATION: The ability of human immunodeficiency virus-1 (HIV-1) protease to develop mutations that confer multi-drug resistance (MDR) has been a major obstacle in designing rational therapies against HIV. Resistance is usually imparted by a cooperative mechanism that can be elucidated by a covariance analysis of sequence data. Identification of such correlated substitutions of amino acids may be obscured by evolutionary noise. RESULTS: HIV-1 protease sequences from patients subjected to different specific treatments (set 1), and from untreated patients (set 2) were subjected to sequence covariance analysis by evaluating the mutual information (MI) between all residue pairs. Spectral clustering of the resulting covariance matrices disclosed two distinctive clusters of correlated residues: the first, observed in set 1 but absent in set 2, contained residues involved in MDR acquisition; and the second included those residues differentiated in the various HIV-1 protease subtypes, referred to for short as the phylogenetic cluster. The MDR cluster occupies sites close to the central symmetry axis of the enzyme, which overlap with the global hinge region identified from coarse-grained normal-mode analysis of the enzyme structure. The phylogenetic cluster, on the other hand, occupies solvent-exposed and highly mobile regions. This study demonstrates (i) the possibility of distinguishing between the correlated substitutions resulting from neutral mutations and those induced by MDR upon appropriate clustering analysis of sequence covariance data and (ii) a connection between global dynamics and functional substitution of amino acids.
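
The mutual information (MI) step described in this abstract can be illustrated with a short sketch that scores every pair of columns in a multiple sequence alignment. It is a bare-bones illustration under stated assumptions: the toy alignment is invented, gaps and sequence weighting are ignored, no correction for finite sample size or phylogenetic background is applied, and the spectral clustering stage is not shown. Function names are hypothetical.

```python
import math
from collections import Counter


def mutual_information(col_i: str, col_j: str) -> float:
    """Mutual information (in bits) between two alignment columns,
    each given as a string with one residue per sequence."""
    n = len(col_i)
    if n == 0 or n != len(col_j):
        raise ValueError("columns must be non-empty and of equal length")
    count_i = Counter(col_i)
    count_j = Counter(col_j)
    count_ij = Counter(zip(col_i, col_j))
    mi = 0.0
    for (a, b), c in count_ij.items():
        p_ab = c / n
        # p_ab / (p_a * p_b) rewritten in terms of raw counts
        mi += p_ab * math.log2(c * n / (count_i[a] * count_j[b]))
    return mi


def mi_matrix(alignment):
    """Symmetric matrix of MI values for all column pairs of an alignment
    given as a list of equal-length sequences."""
    length = len(alignment[0])
    columns = ["".join(seq[k] for seq in alignment) for k in range(length)]
    return [[mutual_information(columns[i], columns[j]) for j in range(length)]
            for i in range(length)]


# Toy alignment: column 0 is invariant, columns 1 and 2 covary perfectly.
toy_alignment = ["ALV", "ALV", "AMI", "AMI"]
matrix = mi_matrix(toy_alignment)
for row in matrix:
    print([round(value, 3) for value in row])
```

An invariant column carries no information about any other column, so its MI with everything is zero, while the two perfectly covarying columns share one full bit.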

15.
We have identified p10 as a fifth gag protein of avian sarcoma and leukemia viruses. Amino-terminal protein sequencing of this polypeptide purified from the Prague C strain of Rous sarcoma virus and from avian myeloblastosis virus implies that it is encoded within a stretch of 64 amino acid residues between p19 and p27 on the gag precursor polypeptide. For p10 from the Prague C strain of Rous sarcoma virus the first 30 residues were found to be identical with the amino acid sequence predicted from the DNA sequence of the Prague C strain of Rous sarcoma virus, whereas for p10 from avian myeloblastosis virus the protein sequence for the same region showed two amino acid substitutions. Amino acid composition data indicate that there are no gross composition changes beyond the region sequenced. The amino terminus of p10 is located two amino acid residues past the carboxy terminus of p19, whereas its carboxy terminus probably is located immediately adjacent to the first amino acid residue of p27.

16.
Substitutions of individual amino acids in proteins may be under very different evolutionary restraints depending on their structural and functional roles. The Environment Specific Substitution Table (ESST) describes the pattern of substitutions in terms of amino acid location within elements of secondary structure, solvent accessibility, and the existence of hydrogen bonds between side chains and neighbouring amino acid residues. Clearly amino acids that have very different local environments in their functional state compared to those in the protein analysed will give rise to inconsistencies in the calculation of amino acid substitution tables. Here, we describe how the calculation of ESSTs can be improved by discarding the functional residues from the calculation of substitution tables. Four categories of functions are examined in this study: protein–protein interactions, protein–nucleic acid interactions, protein–ligand interactions, and catalytic activity of enzymes. Their contributions to residue conservation are measured and investigated. We test our new ESSTs using the program CRESCENDO, designed to predict functional residues by exploiting knowledge of amino acid substitutions, and compare the benchmark results with proteins whose functions have been defined experimentally. The new methodology increases the Z-score by 98% at the active site residues and finds 16% more active sites compared with the old ESST. We also find that discarding amino acids responsible for protein–protein interactions helps in the prediction of those residues although they are not as conserved as the residues of active sites. Our methodology can make the substitution tables better reflect and describe the substitution patterns of amino acids that are under structural restraints only.

17.
The fire ant Solenopsis invicta exists in two social forms, one with colonies headed by a single reproductive queen (monogyne form) and the other with colonies containing multiple queens (polygyne form). This variation in social organization is associated with variation at the gene Gp-9, with monogyne colonies harboring only the B allelic variant and polygyne colonies containing b-like variants as well. We generated new Gp-9 sequences from 15 Solenopsis species and combined these with previously published sequences to conduct a comprehensive, phylogenetically based study of the molecular evolution of this important gene. The exon/intron structure and the respective lengths of the five exons of Gp-9 are identical across all species examined, and we detected no evidence for intragenic recombination. These data conform to a previous suggestion that Gp-9 lies in a genomic region with low recombination, and they indicate that evolution of the coding region in Solenopsis has involved point substitutions only. Our results confirm a link between the presence of b-like alleles and the expression of polygyny in all South American fire ant species known to possess colonies of both social forms. Moreover, phylogenetic analyses show that b-like alleles comprise a derived clade of Gp-9 sequences within the socially polymorphic species, lending further support to the hypothesis that monogyny preceded polygyny in this group of fire ants. Site-specific maximum likelihood tests identified several amino acids that have experienced positive selection, two of which are adjacent to the inferred binding-pocket residues in the GP-9 protein. Four other binding-pocket residues are variable among fire ant species, although selection is not implicated in this variation. Branch-specific tests revealed strong positive selection on the stem lineage of the b-like allele clade, as expected if selection drove the amino acid replacements crucial to the expression of polygyne social organization. 
Such selection may have operated via the ligand-binding properties of GP-9, as one of the two amino acids uniquely shared by all b-like alleles is predicted to be a binding-pocket residue.

18.
Molecular cloning and sequence analysis of human placental ferredoxin (total citations: 2; self-citations: 0; citations by others: 2)
We have characterized several clones specific for the human iron-sulfur protein, ferredoxin, which is involved in electron transfer to mitochondrial cytochromes P-450. Clones were isolated from a human placental cDNA expression library in lambda gt11 by immunoscreening with antibody to bovine adrenal ferredoxin. One clone contained the entire amino acid coding sequence (552 bp) together with 27 bp at the 5'-terminus and approximately 0.9 kb at the 3'-terminus; this form appears to correspond to the major mRNA species of approximately 1.7 kb observed on Northern blots of placental mRNA. The deduced amino acid sequence suggests that human ferredoxin is synthesized as a precursor of 184 amino acids (Mr 19,371) which is cleaved to yield a polypeptide of 124 amino acids (Mr 13,546). The mature protein is highly acidic, and the sequence is very similar to those of bovine and porcine adrenodoxins with the exception of substitutions and variations in length at the C-terminus. The N-terminal precursor segment, on the other hand, is considerably diverged from that determined for bovine adrenodoxin, but is similar in overall basicity and the pattern of occurrence of arginine residues.

19.
Summary: Tat, an 86-amino-acid protein involved in the replication of Human Immunodeficiency Virus type 1 (HIV-1), is able to translocate efficiently through the plasma membrane and to reach the nucleus to transactivate the viral genome. The region 37–72 of the Tat protein, centered on a cluster of basic amino acids, has been assigned to this translocation activity. Recent data from our group have attributed this membrane translocating activity to a peptide extending from residues 48 to 60, which contains a cluster of eight basic amino acids within a linear sequence of nine residues. Internalization of this peptide into cells occurred within minutes at concentrations as low as 100 nM. In order to define more precisely the involvement of these basic amino acids in peptide translocation, several analogues carrying deletions or substitutions of one or several of the basic residues were synthesized and tested for their cellular uptake and nuclear translocation. A direct correlation between the overall charge of the peptide and its cell internalization was found. In addition, the covalent linkage of this short basic peptide allows the efficient translocation of a non-membrane-permeant peptide.

20.
Invariant sites are a common feature of amino acid sequence evolution. The presence of invariant sites is frequently attributed to the need to preserve function through site-specific conservation of amino acid residues. Amino acid substitution models without a provision for invariant sites often fit the data significantly worse than those that allow for an excess of invariant sites beyond those predicted by models that only incorporate rate variation among sites (e.g., a Gamma distribution). An alternative is epistasis between sites to preserve residue interactions that can create invariant sites. Through computer-simulated sequence evolution, we evaluated the relative effects of site-specific preferences and site-site couplings in the generation of invariant sites and the modulation of the rate of molecular evolution. In an analysis of ten major families of protein domains with diverse sequence and functional properties, we find that the negative selection imposed by epistasis creates many more invariant sites than site-specific residue preferences alone. Further, epistasis plays an increasingly larger role in creating invariant sites over longer evolutionary periods. Epistasis also dictates rates of domain evolution over time by exerting significant additional purifying selection to preserve site couplings. These patterns illuminate the mechanistic role of epistasis in the processes underlying observed site invariance and evolutionary rates.
