首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Correlated changes of nucleic or amino acids have provided strong information about the structures and interactions of molecules. Despite the rich literature in coevolutionary sequence analysis, previous methods often have to trade off between generality, simplicity, phylogenetic information, and specific knowledge about interactions. Furthermore, despite the evidence of coevolution in selected protein families, a comprehensive screening of coevolution among all protein domains is still lacking. We propose an augmented continuous-time Markov process model for sequence coevolution. The model can handle different types of interactions, incorporate phylogenetic information and sequence substitution, has only one extra free parameter, and requires no knowledge about interaction rules. We employ this model to large-scale screenings on the entire protein domain database (Pfam). Strikingly, with 0.1 trillion tests executed, the majority of the inferred coevolving protein domains are functionally related, and the coevolving amino acid residues are spatially coupled. Moreover, many of the coevolving positions are located at functionally important sites of proteins/protein complexes, such as the subunit linkers of superoxide dismutase, the tRNA binding sites of ribosomes, the DNA binding region of RNA polymerase, and the active and ligand binding sites of various enzymes. The results suggest sequence coevolution manifests structural and functional constraints of proteins. The intricate relations between sequence coevolution and various selective constraints are worth pursuing at a deeper level.  相似文献   

2.
The strength and pattern of coevolution between amino acid residues vary depending on their structural and functional environment. This context dependence, along with differences in analytical technique, is responsible for the different results among coevolutionary analyses of different proteins. It is thus important to perform detailed study of individual proteins to gain better insight into how context dependence can affect coevolutionary patterns even within individual proteins, and to unravel the details of context dependence with respect to structure and function. Here we extend our previous study by presenting further analysis of residue coevolution in cytochrome c oxidase subunit I sequences from 231 vertebrates using a statistically robust phylogeny-based maximum likelihood ratio method. As in previous studies, a strong overall coevolutionary signal was detected, and coevolution within structural regions was significantly related to the Cα distances between residues. While the strong selection for adjacent residues among predicted coevolving pairs in the surface region indicates that the statistical method is highly selective for biologically relevant interactions, the coevolutionary signal was strongest in the transmembrane region, although the distances between coevolving residues were greater. This indicates that coevolution may act to maintain more global structural and functional constraints in the transmembrane region. In the transmembrane region, sites that coevolved according to polarity and hydrophobicity rather than volume had a greater tendency to colocalize with just one of the predicted proton channels (channel H). Thus, the details of coevolution in cytochrome c oxidase subunit I depend greatly on domain structure and residue physicochemical characteristics, but proximity to function appears to play a critical role. We hypothesize that coevolution is indicative of a more important functional role for this channel. Electronic Supplementary Material The online version of this article (doi:) contains supplementary material, which is available to authorized users.  相似文献   

3.
The antigenic peptide, major histocompatibility complex molecule (MHC; also called human leukocyte antigen, HLA), coreceptor CD8, or CD4 and T‐cell receptor (TCR) function as a complex to initiate effectors’ mechanisms of the immune system. The tight functional and physical interaction among these molecules may have involved strong coevolution links among domains within and between proteins. Despite the importance of unraveling such dependencies to understand the arms race of host–pathogen interaction, no previous studies have aimed at achieving such an objective. Here, we perform an exhaustive coevolution analysis and show that indeed such dependencies are strongly shaping the evolution and probably the function of these molecules. We identify intramolecular coevolution in HLA class I and II at domains important for their immune activity. Most of the amino acid sites identified to be coevolving in HLAI have been also detected to undergo positive Darwinian selection highlighting therefore their adaptive value. We also identify coevolution among antigen‐binding pockets (P1‐P9) and among these and TCR‐binding sites. Conversely to HLAI, coevolution is weaker in HLAII. Our results support that such coevolutionary patterns are due to selective pressures of host–pathogen coevolution and cooperative binding of TCRs, antigenic peptides, and CD8/CD4 to HLAI and HLAII.  相似文献   

4.
The impact of coevolutionary interaction between species on adaptive radiation processes is analysed with reference to pollination biology case studies. Occasional colonization of archipelagos can bring together coevolving partners and cause coradiation of the colonizing species, e.g. the drepanidids and the lobelioids on Hawaii. Permanent reciprocal selective pressure between pairs of coevolving species can lead to a coevolutionary race and rapid evolutionary change. This is exemplified by spurred flowers and long-tongued flower-visitors. The geographic patterning of diffuse coevolution systems can lead to dramatic changes in species interactions. In different populations, interaction between pollinating and seed-parasitizing Greya moths and their host plants varies from mutualism to commensalism and antagonism, depending on the presence of copollinators. Asymmetrical coevolution between angiosperms and oligolectic flower-visitors may facilitate rapid reproductive isolation of populations following a food-plant switch, if the oligoleges use their specific food plants as the rendezvous sites. Diffuse coevolution between angiosperm species and pollinating insects may cause frequent convergent evolution of floral traits such as nectar reward instead of pollen reward, floral guides, zygomorphic flowers, or mimicry of pollen signals, since the multiple plant species experience similar selective pressures via the coevolving partners. Patterns of angiosperm adaptive radiation are highlighted in the context of coevolution with pollinators.  相似文献   

5.
Amino acids do not occur randomly in proteins; rather, their occurrence at any given site is strongly influenced by the amino acid composition at other sites, the structural and functional aspects of the region of the protein in which they occur, and the evolutionary history of the protein. The goal of our research study is to identify networks of coevolving sites within the serpin proteins (serine protease inhibitors) and classify them as being caused by structural-functional constraints or by evolutionary history. To address this, a matrix of pairwise normalized mutual information (NMI) values was computed among amino acid sites for the serpin proteins. The NMI matrix was partitioned into orthogonal patterns of amino acid variability by factor analysis. Each common factor pattern was interpreted as having phylogenetic and/or structural-functional explanations. In addition, we used a bootstrap factor analysis technique to limit the effects of phylogenetic history on our factor patterns. Our results show an extensive network of correlations among amino acid sites in key functional regions (reactive center loop, shutter, and breach). Additionally, we have discovered long-range coevolution for packed amino acids within the serpin protein core. Lastly, we have discovered a group of serpin sites which coevolve in the hydrophobic core region (s5B and s4B) and appear to represent sites important for formation of the "native" instead of the "latent" serpin structure. This research provides a better understanding on how protein structure evolves; in particular, it elucidates the selective forces creating coevolution among protein sites.  相似文献   

6.
Comprehensive sampling of genomic biodiversity is fast becoming a reality for some genomic regions and complete organelle genomes. Genomic biodiversity is defined as large genomic sequences from many species, and here some recent work is reviewed that demonstrates the potential benefits of genomic biodiversity for molecular evolutionary analysis and phylogenetic reconstruction. This work shows that using likelihood-based approaches, taxon addition can dramatically improve phylogenetic reconstruction. Features or dynamics of the evolutionary process are much more easily inferred with large numbers of taxa, and large numbers are essential for discriminating differences in evolutionary patterns between sites. Accurate prediction of site-specific patterns can improve phylogenetic reconstruction by an amount equivalent to quadrupling sequence length. Genomic biodiversity is particularly central to research relating patterns of evolution, adaptation and coevolution to structural and functional features of proteins. Research on detecting coevolution between amino acid residues in proteins demonstrates a clear need for much greater numbers of closely related taxa to better discriminate site-specific patterns of interaction, and to allow more detailed analysis of coevolutionary interactions between subunits in protein complexes. It is argued that parsing out coevolutionary and other context-dependent substitution probabilities is essential for discriminating between coevolution and adaptation, and for more realistically modelling the evolution of proteins. Also reviewed is research that argues for increasing the efficiency of acquiring genomic biodiversity, and suggests that this might be done by simultaneously shotgun cloning and sequencing genomic mixtures from many species. Increased efficiency is a prerequisite if genomic biodiversity levels are to rapidly increase by orders of magnitude, and thus lead to dramatically improved understanding of interactions between protein structure, function and sequence evolution.  相似文献   

7.
Reproductive proteins are among the fastest evolving in the proteome, often due to the consequences of positive selection, and their rapid evolution is frequently attributed to a coevolutionary process between interacting female and male proteins. Such a process could leave characteristic signatures at coevolving genes. One signature of coevolution, predicted by sexual selection theory, is an association of alleles between the two genes. Another predicted signature is a correlation of evolutionary rates during divergence due to compensatory evolution. We studied female–male coevolution in the abalone by resequencing sperm lysin and its interacting egg coat protein, VERL, in populations of two species. As predicted, we found intergenic linkage disequilibrium between lysin and VERL, despite our demonstration that they are not physically linked. This finding supports a central prediction of sexual selection using actual genotypes, that of an association between a male trait and its female preference locus. We also created a novel likelihood method to show that lysin and VERL have experienced correlated rates of evolution. These two signatures of coevolution can provide statistical rigor to hypotheses of coevolution and could be exploited for identifying coevolving proteins a priori. We also present polymorphism-based evidence for positive selection and implicate recent selective events at the specific structural regions of lysin and VERL responsible for their species-specific interaction. Finally, we observed deep subdivision between VERL alleles in one species, which matches a theoretical prediction of sexual conflict. Thus, abalone fertilization proteins illustrate how coevolution can lead to reproductive barriers and potentially drive speciation.  相似文献   

8.
Compensatory substitutions happen when one mutation is advantageously selected because it restores the loss of fitness induced by a previous deleterious mutation. How frequent such mutations occur in evolution and what is the structural and functional context permitting their emergence remain open questions. We built an atlas of intra-protein compensatory substitutions using a phylogenetic approach and a dataset of 1,630 bacterial protein families for which high-quality sequence alignments and experimentally derived protein structures were available. We identified more than 51,000 positions coevolving by the mean of predicted compensatory mutations. Using the evolutionary and structural properties of the analyzed positions, we demonstrate that compensatory mutations are scarce (typically only a few in the protein history) but widespread (the majority of proteins experienced at least one). Typical coevolving residues are evolving slowly, are located in the protein core outside secondary structure motifs, and are more often in contact than expected by chance, even after accounting for their evolutionary rate and solvent exposure. An exception to this general scheme is residues coevolving for charge compensation, which are evolving faster than noncoevolving sites, in contradiction with predictions from simple coevolutionary models, but similar to stem pairs in RNA. While sites with a significant pattern of coevolution by compensatory mutations are rare, the comparative analysis of hundreds of structures ultimately permits a better understanding of the link between the three-dimensional structure of a protein and its fitness landscape.  相似文献   

9.

Background  

The strength of selective constraints operating on amino acid sites of proteins has a multifactorial nature. In fact, amino acid sites within proteins coevolve due to their functional and/or structural relationships. Different methods have been developed that attempt to account for the evolutionary dependencies between amino acid sites. Researchers have invested a significant effort to increase the sensitivity of such methods. However, the difficulty in disentangling functional co-dependencies from historical covariation has fuelled the scepticism over their power to detect biologically meaningful results. In addition, the biological parameters connecting linear sequence evolution to structure evolution remain elusive. For these reasons, most of the evolutionary studies aimed at identifying functional dependencies among protein domains have focused on the structural properties of proteins rather than on the information extracted from linear multiple sequence alignments (MSA). Non-parametric methods to detect coevolution have been reported to be especially susceptible to produce false positive results based on the properties of MSAs. However, no formal statistical analysis has been performed to definitively test the differential effects of these properties on the sensitivity of such methods.  相似文献   

10.
N Cheng  Y Mao  Y Shi  S Tao 《PloS one》2012,7(9):e44376
Understanding intra-molecular coevolution helps to elucidate various structural and functional constraints acting on molecules and might have practical applications in predicting molecular structure and interactions. In this study, we used 5S rRNA as a template to investigate how selective constraints have shaped the RNA evolution. We have observed the nonrandom occurrence of paired differences along the phylogenetic trees, the high rate of compensatory evolution, and the high TIR scores (the ratio of the numbers of terminal to intermediate states), all of which indicate that significant positive selection has driven the evolution of 5S rRNA. We found three mechanisms of compensatory evolution: Watson-Crick interaction (the primary one), complex interactions between multiple sites within a stem, and interplay of stems and loops. Coevolutionary interactions between sites were observed to be highly dependent on the structural and functional environment in which they occurred. Coevolution occurred mostly in those sites closest to loops or bulges within structurally or functionally important helices, which may be under weaker selective constraints than other stem positions. Breaking these pairs would directly increase the size of the adjoining loop or bulge, causing a partial or total structural rearrangement. In conclusion, our results indicate that sequence coevolution is a direct result of maintaining optimal structural and functional integrity.  相似文献   

11.
Gloor GB  Martin LC  Wahl LM  Dunn SD 《Biochemistry》2005,44(19):7156-7165
Information theory was used to identify nonconserved coevolving positions in multiple sequence alignments from a variety of protein families. Coevolving positions in these alignments fall into two general categories. One set is composed of positions that coevolve with only one or two other positions. These positions often display direct amino acid side-chain interactions with their coevolving partner. The other set comprises positions that coevolve with many others and are frequently located in regions critical for protein function, such as active sites and surfaces involved in intermolecular interactions and recognition. We find that coevolving positions are more likely to change protein function when mutated than are positions showing little coevolution. These results imply that information theory may be applied generally to find coevolving, nonconserved positions that are part of functional sites in uncharacterized protein families. We propose that these coevolving positions compose an important subset of the positions in an alignment, and may be as important to the structure and function of the protein family as are highly conserved positions.  相似文献   

12.
Coevolution commonly occurs in spatially heterogeneous environments, resulting in variable selection pressures acting on coevolving species. Dispersal across such environments is predicted to have a major impact on local coevolutionary dynamics. Here, we address how co‐dispersal of coevolving populations of host and parasite across an environmental productivity gradient affected coevolution in experimental populations of bacteria and their parasitic viruses (phages). The rate of coevolution between bacteria and phages was greater in high‐productivity environments. High‐productivity immigrants (~2% of the recipient population) caused coevolutionary dynamics (rates of coevolution and degree of generalist evolution) in low‐productivity environments to be largely indistinguishable from high‐productivity environments, whereas immigration from low‐productivity environments (~0.5% of the population) had no discernable impact. These results could not be explained by demography alone, but rather high‐productivity immigrants had a selective advantage in low‐productivity environments, but not vice versa. Coevolutionary interactions in high‐productivity environments are therefore likely to have a disproportionate impact on coevolution across the landscape as a whole.  相似文献   

13.
Biochemical activity and core stability are essential properties of proteins, maintained usually by conserved amino acids. Structural dynamics emerged in recent years as another essential aspect of protein functionality. Structural dynamics enable the adaptation of the protein to binding substrates and to undergo allosteric transitions, while maintaining the native fold. Key residues that mediate structural dynamics would thus be expected to be conserved or exhibit coevolutionary patterns at least. Yet, the correlation between sequence evolution and structural dynamics is yet to be established. With recent advances in efficient characterization of structural dynamics, we are now in a position to perform a systematic analysis. In the present study, a set of 34 enzymes representing various folds and functional classes is analyzed using information theory and elastic network models. Our analysis shows that the structural regions distinguished by their coevolution propensity as well as high mobility are predisposed to serve as substrate recognition sites, whereas residues acting as global hinges during collective dynamics are often supported by conserved residues. We propose a mobility scale for different types of amino acids, which tends to vary inversely with amino acid conservation. Our findings suggest the balance between physical adaptability (enabled by structure-encoded motions) and chemical specificity (conferred by correlated amino acid substitutions) underlies the selection of a relatively small set of versatile folds by proteins.  相似文献   

14.
Our knowledge on the mode of evolution of the multifunctional viral proteins remains incomplete. To tackle this problem, here, we have investigated the evolutionary dynamics of the potyvirus multifunctional protein HC-Pro, with particular focus on its functional domains. The protein was partitioned into the three previously described functional domains, and each domain was analyzed separately and assembled. We searched for signatures of adaptive evolution and evolutionary dependencies of amino acid sites within and between the three domains using the entire set of available potyvirus sequences in GenBank. Interestingly, we identified strongly significant patterns of co-occurrence of adaptive events along the phylogenetic tree in the three domains. These patterns suggest that Domain I, whose main function is to mediate aphid transmission, has likely been coevolving with the other two domains, which are involved in different functions but all requiring the capacity to bind RNA. By contrast, episodes of positive selection on Domains II and III did not correlate, reflecting a trade-off between their evolvability and their evolutionary dependency likely resulting from their functional overlap. Covariation analyses have identified several groups of amino acids with evidence of concerted variation within each domain, but interdomain significant covariations were only found for Domains II and III, further reflecting their functional overlapping.  相似文献   

15.
Evaluating the importance of coevolution for a wide range of evolutionary questions, such as the role parasites play in the evolution of sexual reproduction, requires that we understand the genetic basis of coevolutionary interactions. Despite its importance, little progress has been made identifying the genetic basis of coevolution, largely because we lack tools designed specifically for this purpose. Instead, coevolutionary studies are often forced to re‐purpose single species techniques. Here, we propose a novel approach for identifying the genes mediating locally adapted coevolutionary interactions that relies on spatial correlations between genetic marker frequencies in the interacting species. Using individual‐based multi‐locus simulations, we quantify the performance of our approach across a range of coevolutionary genetic models. Our results show that when one species is strongly locally adapted to the other and a sufficient number of populations can be sampled, our approach accurately identifies functionally coupled host and parasite genes. Although not a panacea, the approach we outline here could help to focus the search for coevolving genes in a wide variety of well‐studied systems for which substantial local adaptation has been demonstrated.  相似文献   

16.
It frequently has been postulated that intersexual coevolution between the male ejaculate and the female reproductive tract is a driving force in the rapid evolution of reproductive proteins. The dearth of research on female tracts, however, presents a major obstacle to empirical tests of this hypothesis. Here, we employ a comparative EST approach to identify 241 candidate female reproductive proteins in Drosophila arizonae, a repleta group species in which physiological ejaculate-female coevolution has been documented. Thirty-one of these proteins exhibit elevated amino acid substitution rates, making them candidates for molecular coevolution with the male ejaculate. Strikingly, we also discovered 12 unique digestive proteases whose expression is specific to the D. arizonae lower female reproductive tract. These enzymes belong to classes most commonly found in the gastrointestinal tracts of a diverse array of organisms. We show that these proteases are associated with recent, lineage-specific gene duplications in the Drosophila repleta species group, and exhibit strong signatures of positive selection. Observation of adaptive evolution in several female reproductive tract proteins indicates they are active players in the evolution of reproductive tract interactions. Additionally, pervasive gene duplication, adaptive evolution, and rapid acquisition of a novel digestive function by the female reproductive tract points to a novel coevolutionary mechanism of ejaculate-female interaction.  相似文献   

17.
Correlated mutation analysis (CMA) has been used to investigate protein functional sites. However, CMA has suffered from low signal-to-noise ratio caused by meaningless phylogenetic signals or structural constraints. We present a new method, Structure-based Correlated Mutation Analysis (SCMA), which encodes coevolution scores into a protein structure network. A path-based network model is adapted to describe information transfer between residues, and the statistical significance is estimated by network shuffling. This model intrinsically assumes that residues in physical contact have a more reliable coevolution score than distant residues, and that coevolution in distant residues likely arises from a series of contacting and coevolving residues. In addition, coevolutionary coupling is statistically controlled to remove the structural effects. When applied to the rhodopsin structure, the SCMA method identified a much higher percentage of functional residues than the typical coevolution score (61% vs. 22%). In addition, statistically significant residues are used to construct the coevolved residue-residue subnetwork. The network has one highly connected node (retinal bound Lys296), indicating that Lys296 can induce and regulate most other coevolved residues in a variety of locations. The coevolved network consists of a few modular clusters which have distinct functional roles. This article is part of a Special Issue entitled: Computational Methods for Protein Interaction and Structural Prediction.  相似文献   

18.
The relative functional and/or structural importance of different amino acid sites in a protein can be assessed by evaluating the selective constraints to which they have been subjected during the course of evolution. Here we explore such constraints at the linear and three-dimensional levels for the movement protein (MP) and coat protein (CP) encoded by RNA 3 of prunus necrotic ringspot ilarvirus (PNRSV). By a maximum-parsimony approach, the nucleotide sequences from 46 isolates of PNRSV varying in symptomatology, host tree, and geographic origin have been analyzed and sites under different selective pressures have been identified in both proteins. We have also performed covariation analyses to explore whether changes in certain amino acid sites condition subsequent variation in other sites of the same protein or the other protein. These covariation analyses shed light on which particular amino acids should be involved in the physical and functional interaction between MP and CP. Finally, we discuss these findings in the light of what is already known about the implication of certain sites and domains in structure and protein-protein and RNA-protein interactions.  相似文献   

19.
Communication between distant sites often defines the biological role of a protein: amino acid long-range interactions are as important in binding specificity, allosteric regulation and conformational change as residues directly contacting the substrate. The maintaining of functional and structural coupling of long-range interacting residues requires coevolution of these residues. Networks of interaction between coevolved residues can be reconstructed, and from the networks, one can possibly derive insights into functional mechanisms for the protein family. We propose a combinatorial method for mapping conserved networks of amino acid interactions in a protein which is based on the analysis of a set of aligned sequences, the associated distance tree and the combinatorics of its subtrees. The degree of coevolution of all pairs of coevolved residues is identified numerically, and networks are reconstructed with a dedicated clustering algorithm. The method drops the constraints on high sequence divergence limiting the range of applicability of the statistical approaches previously proposed. We apply the method to four protein families where we show an accurate detection of functional networks and the possibility to treat sets of protein sequences of variable divergence.  相似文献   

20.
Antagonistic coevolution between hosts and parasites is probably ubiquitous. However, very little is known of the genetic changes associated with parasite infectivity evolution during adaptation to a coevolving host. We followed the phenotypic and genetic changes in a lytic virus population (bacteriophage; phage Φ2) that coevolved with its bacterial host, Pseudomonas fluorescens SBW25. First, we show the rapid evolution of numerous unique phage infectivity phenotypes, and that both phage host range and bacterial resistance to individual phage increased over coevolutionary time. Second, each of the distinct phage phenotypes in our study had a unique genotype, and molecular evolution did not act uniformly across the phage genome during coevolution. In particular, we detected numerous substitutions on the tail fibre gene, which is involved in the first step of the host-parasite interaction: host adsorption. None of the observed mutations could be directly linked with infection against a particular host, suggesting that the phenotypic effects of infectivity mutations are probably epistatic. However, phage genotypes with the broadest host ranges had the largest number of nonsynonymous amino acid changes on genes implicated in infectivity evolution. An understanding of the molecular genetics of phage infectivity has helped to explain the complex phenotypic coevolutionary dynamics in this system.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号