首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
It has been shown that gene body DNA methylation is associated with gene expression. However, whether and how deviation of gene body DNA methylation between duplicate genes can influence their divergence remains largely unexplored. Here, we aim to elucidate the potential role of gene body DNA methylation in the fate of duplicate genes. We identified paralogous gene pairs from Arabidopsis and rice (Oryza sativa ssp. japonica) genomes and reprocessed their single-base resolution methylome data. We show that methylation in paralogous genes nonlinearly correlates with several gene properties including exon number/gene length, expression level and mutation rate. Further, we demonstrated that divergence of methylation level and pattern in paralogs indeed positively correlate with their sequence and expression divergences. This result held even after controlling for other confounding factors known to influence the divergence of paralogs. We observed that methylation level divergence might be more relevant to the expression divergence of paralogs than methylation pattern divergence. Finally, we explored the mechanisms that might give rise to the divergence of gene body methylation in paralogs. We found that exonic methylation divergence more closely correlates with expression divergence than intronic methylation divergence. We show that genomic environments (e.g., flanked by transposable elements and repetitive sequences) of paralogs generated by various duplication mechanisms are associated with the methylation divergence of paralogs. Overall, our results suggest that the changes in gene body DNA methylation could provide another avenue for duplicate genes to develop differential expression patterns and undergo different evolutionary fates in plant genomes.  相似文献   

2.
Males and females share most of the same genes, so selection in one sex will typically produce a correlated response in the other sex. Yet, the sexes have evolved to differ in a multitude of behavioral, morphological, and physiological traits. How did this sexual dimorphism evolve despite the presence of a common underlying genome? We investigated the potential role of gene duplication in the evolution of sexual dimorphism. Because duplication events provide extra genetic material, the sexes each might use this redundancy to facilitate sex‐specific gene expression, permitting the evolution of dimorphism. We investigated this hypothesis at the genome‐wide level in Drosophila melanogaster, using the presence of sex‐biased expression as a proxy for the sex‐specific specialization of gene function. We expected that if sexually antagonistic selection is a potent force acting upon individual genes, duplication will result in paralog families whose members differ in sex‐biased expression. Gene members of the same duplicate family can have different expression patterns in males versus females. In particular, duplicate pairs containing a male‐biased gene are found more frequently than expected, in agreement with previous studies. Furthermore, when the singleton ortholog is unbiased, duplication appears to allow one of the paralog copies to acquire male‐biased expression. Conversely, female‐biased expression is not common among duplicates; fewer duplicate genes are expressed in the female‐soma and ovaries than in the male‐soma and testes. Expression divergence exists more in older than in younger duplicates pairs, but expression divergence does not correlate with protein sequence divergence. Finally, genomic proximity may have an effect on whether paralogs differ in sex‐biased expression. We conclude that the data are consistent with a role of gene duplication in fostering male‐biased, but not female‐biased, gene expression, thereby aiding the evolution of sexual dimorphism.  相似文献   

3.
Gene duplication provides raw material for functional innovation, but gene duplicability varies considerably. Previous studies have found widespread asymmetrical sequence evolution between paralogs. However, it remains unknown whether the rate of evolution among paralogs affects their propensity of being retained after another round of whole-genome duplication (WGD). In this study, we investigated gene groups that have experienced two successive WGDs to determine which of two older duplicates with different evolutionary rates was more likely to retain both younger duplicates. To uncouple the measurement of evolutionary rates from any assignment of duplicate or singleton status, we measured the evolutionary rates of singleton genes in out-lineages but classified these singleton genes according to whether they are retained or not in a crown group of species. We found that genes that retained younger duplicates in the crown group of genomes were more constrained prior to the younger duplication event than those that failed to leave duplicates. In addition, we also found that the retained clades have more genes in out-lineages. Subsequent analyses showed that genes in the retained clades were expressed more broadly and highly than genes in the singleton clades. We concluded that the set of repeatedly retained genes after two WGDs is biased toward slowly evolving genes in angiosperms, suggesting that the potential of genes for both functional conservation and divergence likely affects their propensity of being retained after WGD in angiosperms.  相似文献   

4.
Expression divergence of duplicate genes is widely believedto be important for their retention and evolution of new function,although the mechanism that determines their expression divergenceremains unclear. We use a genetical genomics approach to exploredivergence in genetical control of yeast duplicate genes createdby a whole-genome duplication that occurred about 100 MYA andthose with a younger duplication age. The analysis reveals thatduplicate genes have a significantly higher probability of sharingcommon genetic control than pairs of singleton genes. The expressionquantitative trait loci (eQTLs) have diverged completely fora high proportion of duplicate pairs, whereas a substantiallylarger proportion of duplicates share common regulatory motifsafter 100 Myr of divergent evolution. The similarity in bothgenetical control and cis motif structure for a duplicate pairis a reflection of its evolutionary age. This study revealsthat up to 20% of variation in expression between ancient duplicategene pairs in the yeast genome can be explained by both cismotif divergence (8%) and by trans eQTL divergence (10%). Initially,divergence in all 3 aspects of cis motif structure, trans-geneticalcontrol, and expression evolves coordinately with the codingsequence divergence of both young and old duplicate pairs. Thesefindings highlight the importance of divergence in both cismotif structure and trans-genetical control in the diverse setof mechanisms underlying the expression divergence of yeastduplicate genes.  相似文献   

5.
6.
7.
Guan Y  Dunham MJ  Troyanskaya OG 《Genetics》2007,175(2):933-943
Gene duplication can occur on two scales: whole-genome duplications (WGD) and smaller-scale duplications (SSD) involving individual genes or genomic segments. Duplication may result in functionally redundant genes or diverge in function through neofunctionalization or subfunctionalization. The effect of duplication scale on functional evolution has not yet been explored, probably due to the lack of global knowledge of protein function and different times of duplication events. To address this question, we used integrated Bayesian analysis of diverse functional genomic data to accurately evaluate the extent of functional similarity and divergence between paralogs on a global scale. We found that paralogs resulting from the whole-genome duplication are more likely to share interaction partners and biological functions than smaller-scale duplicates, independent of sequence similarity. In addition, WGD paralogs show lower frequency of essential genes and higher synthetic lethality rate, but instead diverge more in expression pattern and upstream regulatory region. Thus, our analysis demonstrates that WGD paralogs generally have similar compensatory functions but diverging expression patterns, suggesting a potential of distinct evolutionary scenarios for paralogs that arose through different duplication mechanisms. Furthermore, by identifying these functional disparities between the two types of duplicates, we reconcile previous disputes on the relationship between sequence divergence and expression divergence or essentiality.  相似文献   

8.
Duplicate loci offer a very powerful system for understanding the complicated genome structure and adaptive evolution of a gene family. In this study, the genetic variation at paralogs AtHVA22d and AtHVA22e, members of an ABA- and stress-inducible gene family, is examined in the selfing Arabidopsis thaliana. Population genetic analysis indicates contrasting levels of nucleotide diversity at overall exon sequence and nonsynonymous sites between AtHVA22d (pi = 0.00337, pi(rep) = 0.00158) and AtHVA22e (pi = 0.00054, pi(rep) = 0.00023). The fact of Ka/Ks ratios significantly less than 1 in all sequences indicates that both genes are functional and subjected to purifying selection. In addition, rooted at barley HVA22, accelerated evolution is detected at replacement changes in the AtHVA22d locus, indicating relaxation of purifying selection after gene duplication. However, relative rate tests reveal no deviation from the neutrality at synonymous sites between the two paralogs. Based on clock-like evolution, the rate of synonymous substitution is estimated at 1.83 x 10(-9) substitutions per site per year; and the divergence of the two paralogs is traced to 90 MYA, coinciding with a period of the diversification of angiosperms. Given no codon usage bias in both genes, natural selection alone cannot account for the 6.4-fold differences in the nucleotide variation at synonymous sites between the two paralogs. Random processes resulting in different coalescence times, 3.65 MYA at AtHVA22d vs. 1.20 MYA at AtHVA22e, may have predominantly contributed to the evident differences of the genetic diversity. Partially nonoverlapping modes of expression between the two functional paralogs suggest a subfunctionalization hypothesis for explaining the fates of duplicate loci.  相似文献   

9.

Background  

The mechanism by which duplicate genes originate – whether by duplication of a whole genome or of a genomic segment – influences their genetic fates. To study events that trigger duplicate gene persistence after whole genome duplication in vertebrates, we have analyzed molecular evolution and expression of hundreds of persistent duplicate gene pairs in allopolyploid clawed frogs (Xenopus and Silurana). We collected comparative data that allowed us to tease apart the molecular events that occurred soon after duplication from those that occurred later on. We also quantified expression profile divergence of hundreds of paralogs during development and in different tissues.  相似文献   

10.
We characterized rates and patterns of synonymous and nonsynonymous substitution in 242 duplicated gene pairs on chromosomes 2 and 4 of Arabidopsis thaliana. Based on their collinear order along the two chromosomes, the gene pairs were likely duplicated contemporaneously, and therefore comparison of genetic distances among gene pairs provides insights into the distribution of nucleotide substitution rates among plant nuclear genes. Rates of synonymous substitution varied 13.8-fold among the duplicated gene pairs, but 90% of gene pairs differed by less than 2.6-fold. Average nonsynonymous rates were approximately fivefold lower than average synonymous rates; this rate difference is lower than that of previously studied nonplant lineages. The coefficient of variation of rates among genes was 0.65 for nonsynonymous rates and 0.44 for synonymous rates, indicating that synonymous and nonsynonymous rates vary among genes to roughly the same extent. The causes underlying rate variation were explored. Our analyses tentatively suggest an effect of physical location on synonymous substitution rates but no similar effect on nonsynonymous rates. Nonsynonymous substitution rates were negatively correlated with GC content at synonymous third codon positions, and synonymous substitution rates were negatively correlated with codon bias, as observed in other systems. Finally, the 242 gene pairs permitted investigation of the processes underlying divergence between paralogs. We found no evidence of positive selection, little evidence that paralogs evolve at different rates, and no evidence of differential codon usage or third position GC content.  相似文献   

11.
For more than 30 years, expression divergence has been considered as a major reason for retaining duplicated genes in a genome, but how often and how fast duplicate genes diverge in expression has not been studied at the genomic level. Using yeast microarray data, we show that expression divergence between duplicate genes is significantly correlated with their synonymous divergence (KS) and also with their nonsynonymous divergence (KA) if KA ≤ 0.3. Thus, expression divergence increases with evolutionary time, and KA is initially coupled with expression divergence. More interestingly, a large proportion of duplicate genes have diverged quickly in expression and the vast majority of gene pairs eventually become divergent in expression. Indeed, more than 40% of gene pairs show expression divergence even when KS is ≤ 0.10, and this proportion becomes >80% for KS > 1.5. Only a small fraction of ancient gene pairs do not show expression divergence.  相似文献   

12.
Gene duplication plays an important role in evolution because it is the primary source of new genes. Many recent studies showed that gene duplicability varies considerably among genes. Several considerations led us to hypothesize that less important genes have higher rates of successful duplications, where gene importance is measured by the fitness reduction caused by the deletion of the gene. Here, we test this hypothesis by comparing the importance of two groups of singleton genes in the yeast Saccharomyces cerevisiae (Sce). Group S genes did not duplicate in four other yeast species examined, whereas group D experienced duplication in these species. Consistent with our hypothesis, we found group D genes to be less important than group S genes. Specifically, 17% of group D genes are essential in Sce, compared to 28% for group S. Furthermore, deleting a group D gene in Sce reduces the fitness by 24% on average, compared to 38% for group S. Our subsequent analysis showed that less important genes have more cis-regulatory motifs, which could lead to a higher chance of subfunctionalization of duplicate genes and result in an enhanced rate of gene retention. Less important genes may also have weaker dosage imbalance effects and cause fewer genetic perturbations when duplicated. Regardless of the cause, our observation indicates that the previous finding of a less severe fitness consequence of deleting a duplicate gene than deleting a singleton gene is at least in part due to the fact that duplicate genes are intrinsically less important than singleton genes and suggests that the contribution of duplicate genes to genetic robustness has been overestimated.  相似文献   

13.
14.
15.
Gene duplication provides a window of opportunity for biological variants to persist under the protection of a co-expressed copy with similar or redundant function. Duplication catalyzes innovation (neofunctionalization), subfunction degeneration (subfunctionalization), and genetic buffering (redundancy), and the genetic survival of each paralog is triggered by mechanisms that add, compromise, or do not alter protein function. We tested the applicability of three types of mechanisms for promoting the retained expression of duplicated genes in 290 expressed paralogs of the tetraploid clawed frog, Xenopus laevis. Tests were based on explicit expectations concerning the ka/ks ratio, and the number and location of nonsynonymous substitutions after duplication. Functional constraints on the majority of paralogs are not significantly different from a singleton ortholog. However, we recover strong support that some of them have an asymmetric rate of nonsynonymous substitution: 6% match predictions of the neofunctionalization hypothesis in that (1) each paralog accumulated nonsynonymous substitutions at a significantly different rate and (2) the one that evolves faster has a higher ka/ks ratio than the other paralog and than a singleton ortholog. Fewer paralogs (3%) exhibit a complementary pattern of substitution at the protein level that is predicted by enhancement or degradation of different functional domains, and the remaining 13% have a higher average ka/ks ratio in both paralogs that is consistent with altered functional constraints, diversifying selection, or activity-reducing mutations after duplication. We estimate that these paralogs have been retained since they originated by genome duplication between 21 and 41 million years ago. Multiple mechanisms operate to promote the retained expression of duplicates in the same genome, in genes in the same functional class, over the same period of time following duplication, and sometimes in the same pair of paralogs. None of these paralogs are superfluous; degradation or enhancement of different protein subfunctions and neofunctionalization are plausible hypotheses for the retained expression of some of them. Evolution of most X. laevis paralogs, however, is consistent with retained expression via mechanisms that do not radically alter functional constraints, such as selection to preserve post-duplication stoichiometry or temporal, quantitative, or spatial subfunctionalization.  相似文献   

16.
Current hypotheses of gene duplicate divergence propose that surviving members of a gene duplicate pair may evolve, under conditions of purifying or nearly neutral selection, in one of two ways: with new function arising in one duplicate while the other retains original function (neofunctionalization [NF]) or partitioning of the original function between the 2 paralogs (subfunctionalization [SF]). More recent studies propose that SF followed by NF (subneofunctionalization [SNF]) explains the divergence of many duplicate genes. In this analysis, we evaluate these hypotheses in the context of the large monosaccharide transporter (MST) gene families in Arabidopsis and rice. MSTs have an ancient origin, predating plants, and have evolved in the seed plant lineage to comprise 7 subfamilies. In Arabidopsis, 53 putative MST genes have been identified, with one subfamily greatly expanded by tandem gene duplications. We searched the rice genome for members of the MST gene family and compared them with the MST gene family in Arabidopsis to determine subfamily expansion patterns and estimate gene duplicate divergence times. We tested hypotheses of gene duplicate divergence in 24 paralog pairs by comparing protein sequence divergence rates, estimating positive selection on codon sites, and analyzing tissue expression patterns. Results reveal the MST gene family to be significantly larger (65) in rice with 2 subfamilies greatly expanded by tandem duplications. Gene duplicate divergence time estimates indicate that early diversification of most subfamilies occurred in the Proterozoic (2500-540 Myr) and that expansion of large subfamilies continued through the Cenozoic (65-0 Myr). Two-thirds of paralog pairs show statistically symmetric rates of sequence evolution, most consistent with the SF model, with half of those showing evidence for positive selection in one or both genes. Among 8 paralog pairs showing asymmetric divergence rates, most consistent with the NF model, nearly half show evidence of positive selection. Positive selection does not appear in any duplicate pairs younger than approximately 34 Myr. Our data suggest that the NF, SF, and SNF models describe different outcomes along a continuum of divergence resulting from initial conditions of relaxed constraint after duplication.  相似文献   

17.
Gene dosage and gene duplicability   总被引:2,自引:0,他引:2       下载免费PDF全文
Qian W  Zhang J 《Genetics》2008,179(4):2319-2324
The evolutionary process leading to the fixation of newly duplicated genes is not well understood. It was recently proposed that the fixation of duplicate genes is frequently driven by positive selection for increased gene dosage (i.e., the gene dosage hypothesis), because haploinsufficient genes were reported to have more paralogs than haplosufficient genes in the human genome. However, the previous analysis incorrectly assumed that the presence of dominant abnormal alleles of a human gene means that the gene is haploinsufficient, ignoring the fact that many dominant abnormal alleles arise from gain-of-function mutations. Here we show in both humans and yeast that haploinsufficient genes generally do not duplicate more frequently than haplosufficient genes. Yeast haploinsufficient genes do exhibit enhanced retention after whole-genome duplication compared to haplosufficient genes if they encode members of stable protein complexes, but the same phenomenon is absent if the genes do not encode protein complex members, suggesting that the dosage balance effect rather than the dosage effect is the underlying cause of the phenomenon. On the basis of these and other results, we conclude that selection for higher gene dosage does not play a major role in driving the fixation of duplication genes.  相似文献   

18.
Insertions and deletions (indels) in protein-coding genes are important sources of genetic variation. Their role in creating new proteins may be especially important after gene duplication. However, little is known about how indels affect the divergence of duplicate genes. We here study thousands of duplicate genes in five fish (teleost) species with completely sequenced genomes. The ancestor of these species has been subject to a fish-specific genome duplication (FSGD) event that occurred approximately 350 Ma. We find that duplicate genes contain at least 25% more indels than single-copy genes. These indels accumulated preferentially in the first 40 my after the FSGD. A lack of widespread asymmetric indel accumulation indicates that both members of a duplicate gene pair typically experience relaxed selection. Strikingly, we observe a 30-80% excess of deletions over insertions that is consistent for indels of various lengths and across the five genomes. We also find that indels preferentially accumulate inside loop regions of protein secondary structure and in regions where amino acids are exposed to solvent. We show that duplicate genes with high indel density also show high DNA sequence divergence. Indel density, but not amino acid divergence, can explain a large proportion of the tertiary structure divergence between proteins encoded by duplicate genes. Our observations are consistent across all five fish species. Taken together, they suggest a general pattern of duplicate gene evolution in which indels are important driving forces of evolutionary change.  相似文献   

19.
20.
Gene duplication is a widespread phenomenon in genome evolution, and it has been proposed to serve as an engine of evolutionary innovation. In the present study, we performed the first comprehensive analysis of duplicate genes in the bovine genome. A total of 3131 putative duplicated gene pairs were identified, including 712 cattle-specific duplicate gene pairs unevenly distributed across the genome, which are significantly enriched for specific biological functions including immunity, growth, digestion, reproduction, embryonic development, inflammatory response, and defense response to bacterium. Around 97.1% (87.8%) of (cattle-specific) duplicate gene pairs were found to have distinct exon-intron structures. Analysis of gene expression by RNA-Seq and sequence divergence (synonymous or non-synonymous) revealed that expression divergence is correlated with sequence divergence, as has been previously observed in other species. This analysis also led to the identification of a subset of cattle-specific duplicate gene pairs exhibiting very high expression divergence. Interestingly, further investigation revealed a significant relationship between structural and expression divergence while controlling for the effect of synonymous sequence divergence. Together these results provide further insight into duplicate gene sequence and expression divergence in cattle, and their potential contributions to phenotypic divergence.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号