首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
Preservation of duplicate genes by complementary, degenerative mutations   总被引:106,自引:0,他引:106  
Force A  Lynch M  Pickett FB  Amores A  Yan YL  Postlethwait J 《Genetics》1999,151(4):1531-1545
The origin of organismal complexity is generally thought to be tightly coupled to the evolution of new gene functions arising subsequent to gene duplication. Under the classical model for the evolution of duplicate genes, one member of the duplicated pair usually degenerates within a few million years by accumulating deleterious mutations, while the other duplicate retains the original function. This model further predicts that on rare occasions, one duplicate may acquire a new adaptive function, resulting in the preservation of both members of the pair, one with the new function and the other retaining the old. However, empirical data suggest that a much greater proportion of gene duplicates is preserved than predicted by the classical model. Here we present a new conceptual framework for understanding the evolution of duplicate genes that may help explain this conundrum. Focusing on the regulatory complexity of eukaryotic genes, we show how complementary degenerative mutations in different regulatory elements of duplicated genes can facilitate the preservation of both duplicates, thereby increasing long-term opportunities for the evolution of new gene functions. The duplication-degeneration-complementation (DDC) model predicts that (1) degenerative mutations in regulatory elements can increase rather than reduce the probability of duplicate gene preservation and (2) the usual mechanism of duplicate gene preservation is the partitioning of ancestral functions rather than the evolution of new functions. We present several examples (including analysis of a new engrailed gene in zebrafish) that appear to be consistent with the DDC model, and we suggest several analytical and experimental approaches for determining whether the complementary loss of gene subfunctions or the acquisition of novel functions are likely to be the primary mechanisms for the preservation of gene duplicates. For a newly duplicated paralog, survival depends on the outcome of the race between entropic decay and chance acquisition of an advantageous regulatory mutation.Sidow 1996(p. 717) On one hand, it may fix an advantageous allele giving it a slightly different, and selectable, function from its original copy. This initial fixation provides substantial protection against future fixation of null mutations, allowing additional mutations to accumulate that refine functional differentiation. Alternatively, a duplicate locus can instead first fix a null allele, becoming a pseudogene.Walsh 1995 (p. 426) Duplicated genes persist only if mutations create new and essential protein functions, an event that is predicted to occur rarely.Nadeau and Sankoff 1997 (p. 1259) Thus overall, with complex metazoans, the major mechanism for retention of ancient gene duplicates would appear to have been the acquisition of novel expression sites for developmental genes, with its accompanying opportunity for new gene roles underlying the progressive extension of development itself.Cooke et al. 1997 (p. 362)  相似文献   

2.
After gene duplication, mutations cause the gene copies to diverge. The classical model predicts that these mutations will generally lead to the loss of function of one gene copy; rarely, new functions will be created and both duplicate genes are conserved. In contrast, under the subfunctionalization model both duplicates are preserved due to the partition of different functions between the duplicates. A recent study provides support for the subfunctionalization model, identifying several expressed gene duplicates common to humans and mice that contain regions conserved in one duplicate but variable in the other (and vice versa). We discuss both the methodology used in this study and also how gene phylogeny may lead to additional evidence for the importance of subfunctionalization in the evolution of new genes.  相似文献   

3.
Yang J  Xie Z  Glover BJ 《The New phytologist》2005,165(2):623-632
NF-Y is a ubiquitous CCAAT-binding factor composed of NF-YA, NF-YB and NF-YC. Multiple genes encoding NF-Y subunits have been identified in plant genomes. It remains unclear whether the duplicate genes underwent different evolutionary patterns. Likelihood-ratio tests were used to examine whether the amino acid substitution rates are the same between duplicate genes. The influences of selection on evolution were evaluated by comparing the conservative and radical amino acid substitution rates, as well as maximum-likelihood analysis. Some NF-YB and NF-YC duplicates showed significant evidence of asymmetric evolution but not the NF-YA duplicates. Most amino acid replacements in the NF-YB and NF-YC duplicates result in changes in hydropathy, polar requirement and polarity. The physicochemical changes in the sequences of NF-YB seem to be coupled to asymmetric divergence in gene function. Plant NF-Y genes have evolved in different patterns. Relaxed selective constraints following gene duplication are most likely responsible for the unequal evolutionary rates and distinct divergence patterns of duplicate NF-Y genes. Positive selection may have promoted amino acid hydropathy changes in the NF-YC duplicates.  相似文献   

4.
5.
Current hypotheses of gene duplicate divergence propose that surviving members of a gene duplicate pair may evolve, under conditions of purifying or nearly neutral selection, in one of two ways: with new function arising in one duplicate while the other retains original function (neofunctionalization [NF]) or partitioning of the original function between the 2 paralogs (subfunctionalization [SF]). More recent studies propose that SF followed by NF (subneofunctionalization [SNF]) explains the divergence of many duplicate genes. In this analysis, we evaluate these hypotheses in the context of the large monosaccharide transporter (MST) gene families in Arabidopsis and rice. MSTs have an ancient origin, predating plants, and have evolved in the seed plant lineage to comprise 7 subfamilies. In Arabidopsis, 53 putative MST genes have been identified, with one subfamily greatly expanded by tandem gene duplications. We searched the rice genome for members of the MST gene family and compared them with the MST gene family in Arabidopsis to determine subfamily expansion patterns and estimate gene duplicate divergence times. We tested hypotheses of gene duplicate divergence in 24 paralog pairs by comparing protein sequence divergence rates, estimating positive selection on codon sites, and analyzing tissue expression patterns. Results reveal the MST gene family to be significantly larger (65) in rice with 2 subfamilies greatly expanded by tandem duplications. Gene duplicate divergence time estimates indicate that early diversification of most subfamilies occurred in the Proterozoic (2500-540 Myr) and that expansion of large subfamilies continued through the Cenozoic (65-0 Myr). Two-thirds of paralog pairs show statistically symmetric rates of sequence evolution, most consistent with the SF model, with half of those showing evidence for positive selection in one or both genes. Among 8 paralog pairs showing asymmetric divergence rates, most consistent with the NF model, nearly half show evidence of positive selection. Positive selection does not appear in any duplicate pairs younger than approximately 34 Myr. Our data suggest that the NF, SF, and SNF models describe different outcomes along a continuum of divergence resulting from initial conditions of relaxed constraint after duplication.  相似文献   

6.
7.
TATA box, the core promoter element, exists in a broad range of eukaryotes, and the expression of TATA-containing genes usually responds to various environmental stresses. Hence, the evolution of TATA-box in duplicate genes may provide some clues for the interrelationship among environmental stress, expression differentiation, and duplicate gene preservation. In the present study, we observed that the TATA box is significantly overrepresented in duplicate genes compared with singletons in human, worm, Arabidopsis, and yeast genomes. We then conducted an extensive functional genomic analysis to investigate the evolution of TATA box along over 700 yeast gene family phylogenies. After reconstructing the ancestral TATA-box states (presence or absence), we found that significantly higher numbers of TATA box gain events than loss events had occurred after yeast gene duplications-the overall gain-loss ratio is about 3-4 to 1. Interestingly, these TATA-gain duplicate genes on average have experienced greater expression divergence from the ancestral expression states than their most closely related TATA-less duplicate partners, but only under environmental stress conditions (asymmetric evolution); indeed, under normal physiological conditions, they have similar expression divergence (symmetric evolution). Moreover, we showed that TATA-gain duplicates are enriched in stress-associated functional categories but that is not the case for TATA-ancestral duplicates (those inherited from their ancestors prior to duplication). Together, we conclude that after the gene duplication, gain of the TATA box in duplicate promoters may have played an important role in yeast duplicate preservation by accelerating expression divergence that may facilitate the adaptive evolution of the organism in response to environmental changes.  相似文献   

8.
Chen Y  Ding Y  Zhang Z  Wang W  Chen JY  Ueno N  Mao B 《遗传学报》2011,38(12):577-584
The evolution of the central nervous system (CNS) is one of the most striking changes during the transition from invertebrates to vertebrates. As a major source of genetic novelties, gene duplication might play an important role in the functional innovation of vertebrate CNS. In this study, we focused on a group of CNS-biased genes that duplicated during early vertebrate evolution. We investigated the tempo-spatial expression patterns of 33 duplicate gene families and their orthologs during the embryonic development of the vertebrate Xenopus laevis and the cephalochordate Brachiostoma belcheri. Almost all the identified duplicate genes are differentially expressed in the CNS in Xenopus embryos, and more than 50% and 30% duplicate genes are expressed in the telencephalon and mid-hindbrain boundary, respectively, which are mostly considered as two innovations in the vertebrate CNS. Interestingly, more than 50% of the amphioxus orthologs do not show apparent expression in the CNS in amphioxus embryos as detected by in situ hybridization, indicating that some of the vertebrate CNS-biased duplicate genes might arise from non-CNS genes in invertebrates. Our data accentuate the functional contribution of gene duplication in the CNS evolution of vertebrate and uncover an invertebrate non-CNS history for some vertebrate CNS-biased duplicate genes.  相似文献   

9.
Summary In the 50 million years since the polyploidization event that gave rise to the catostomid family of fishes the duplicate genes encoding isozymes have undergone different fates. Ample opportunity has been available for regulatory evolution of these duplicate genes. Approximately half the duplicate genes have lost their expressions during this time. Of the duplicate genes remaining, the majority have diverged to different extents in their expression within and among adult tissues. The pattern of divergence of duplicate gene expression is consistent with the accumulation of mutations at regulatory genes. The absence of a correlation of extent of divergence of gene expression with the level of genetic variability for isozymes at these loci is consistent with the view that the rates of regulatory gene and structural gene evolution are uncoupled. The magnitude of divergence of duplicate gene expressions varies among tissues, enzymes, and species. Little correlation was found with the extent of divergence of duplicate gene expression within a species and its degree of morphological conservatism, although species pairs which are increasingly taxonomically distant are less likely to share specific patterns of differential gene expression. Probable phylogenetic times of origin of several patterns of differential gene expression have been proposed. Some patterns of differential gene expression have evolved in recent evolutionary times and are specific to one or a few species, whereas at least one pattern of differential gene expression is present in nearly all species and probably arose soon after the polyploidization event. Multilocus isozymes, formed by polyploidization, provide a useful model system for studying the forces responsible for the maintenance of duplicate genes and the evolution of these once identical genes to new spatially and temporally specific patterns of regulation.  相似文献   

10.
Historically, duplicate genes have been regarded as a major source of novel genetic material. However, recent work suggests that chimeric genes formed through the fusion of pieces of different genes may also contribute to the evolution of novel functions. To compare the contribution of chimeric and duplicate genes to genome evolution, we measured their prevalence and persistence within Drosophila melanogaster. We find that ~80.4 duplicates form per million years, but most are rapidly eliminated from the genome, leaving only 4.1% to be preserved by natural selection. Chimeras form at a comparatively modest rate of ~11.4 per million years but follow a similar pattern of decay, with ultimately only 1.4% of chimeras preserved. We propose two mechanisms of chimeric gene formation, which rely entirely on local, DNA-based mutations to explain the structure and placement of the youngest chimeric genes observed. One involves imprecise excision of an unpaired duplication during large-loop mismatch repair, while the other invokes a process akin to replication slippage to form a chimeric gene in a single event. Our results paint a dynamic picture of both chimeras and duplicate genes within the genome and suggest that chimeric genes contribute substantially to genomic novelty.  相似文献   

11.
Plant genomes appear to exploit the process of gene duplication as a primary means of acquiring biochemical and developmental flexibility. Thus, for example, most of the enzymatic components of plant secondary metabolism are encoded by small families of genes that originated through duplication over evolutionary time. The dynamics of gene family evolution are well illustrated by the genes that encode chalcone synthase (CHS), the first committed step in flavonoid biosynthesis. We review pertinent facts about CHS evolution in flowering plants with special reference to the morning glory genus, Ipomoea. Our review shows that new CHS genes are recruited recurrently in flowering plant evolution. Rates of nucleotide substitution are frequently accelerated in new duplicate genes, and there is clear evidence for repeated shifts in enzymatic function among duplicate copies of CHS genes. In addition, we present new data on expression patterns of CHS genes as a function of tissue and developmental stage in the common morning glory (I. purpurea). These data show extensive differentiation in gene expression among duplicate copies of CHS genes. We also show that a single mutation which blocks anthocyanin biosynthesis in the floral limb is correlated with a loss of expression of one of the six duplicate CHS genes present in the morning glory genome. This suggests that different duplicate copies of CHS have acquired specialized functional roles over the course of evolution. We conclude that recurrent gene duplication and subsequent differentiation is a major adaptive strategy in plant genome evolution.  相似文献   

12.
Learning about the roles that duplicate genes play in the origins of novel phenotypes requires an understanding of how their functions evolve. A previous method for achieving this goal, CDROM, employs gene expression distances as proxies for functional divergence and then classifies the evolutionary mechanisms retaining duplicate genes from comparisons of these distances in a decision tree framework. However, CDROM does not account for stochastic shifts in gene expression or leverage advances in contemporary statistical learning for performing classification, nor is it capable of predicting the parameters driving duplicate gene evolution. Thus, here we develop CLOUD, a multi-layer neural network built on a model of gene expression evolution that can both classify duplicate gene retention mechanisms and predict their underlying evolutionary parameters. We show that not only is the CLOUD classifier substantially more powerful and accurate than CDROM, but that it also yields accurate parameter predictions, enabling a better understanding of the specific forces driving the evolution and long-term retention of duplicate genes. Further, application of the CLOUD classifier and predictor to empirical data from Drosophila recapitulates many previous findings about gene duplication in this lineage, showing that new functions often emerge rapidly and asymmetrically in younger duplicate gene copies, and that functional divergence is driven by strong natural selection. Hence, CLOUD represents a major advancement in classifying retention mechanisms and predicting evolutionary parameters of duplicate genes, thereby highlighting the utility of incorporating sophisticated statistical learning techniques to address long-standing questions about evolution after gene duplication.  相似文献   

13.
Actin is a highly conserved cytoskeletal protein that is a key component of cells. Genes encoding actin occur in single copies in most green algae, in 2–3 copies in bryophytes, and in increasingly more complex gene families in ferns and seed plants. We use the well-resolved phylogenetic frameworks of the Streptophyta as a guide to reconstruct the patterns of actin gene duplication in early diverging land plants. Our working hypothesis is that the origin of novel tissues in the bryophytes (e.g. multicellular sporophyte) may be reflected in the functional diversification of duplicate actin genes in these taxa. Actin is used as a model cytoskeletal protein with the assumption that its evolutionary history represents those of other cytoskeletal elements and the coevolved binding proteins. Here we provide a phylogenetic perspective on the origin of green algal and land plant actin genes and use this information to speculate on the role of plant actin in early plant evolution.  相似文献   

14.
With about 24,000 extant species, teleosts are the largest group of vertebrates. They constitute more than 99% of the ray-finned fishes (Actinopterygii) that diverged from the lobe-finned fish lineage (Sarcopterygii) about 450 MYA. Although the role of genome duplication in the evolution of vertebrates is now established, its role in structuring the teleost genomes has been controversial. At least two hypotheses have been proposed: a whole-genome duplication in an ancient ray-finned fish and independent gene duplications in different lineages. These hypotheses are, however, based on small data sets and lack adequate statistical and phylogenetic support. In this study, we have made a systematic comparison of the draft genome sequences of Fugu and humans to identify paralogous chromosomal regions ("paralogons") in the Fugu that arose in the ray-finned fish lineage ("fish-specific"). We identified duplicate genes in the Fugu by phylogenetic analyses of the Fugu, human, and invertebrate sequences. Our analyses provide evidence for 425 fish-specific duplicate genes in the Fugu and show that at least 6.6% of the genome is represented by fish-specific paralogons. We estimated the ages of Fugu duplicate genes and paralogons using the molecular clock. Remarkably, the ages of duplicate genes and paralogons are clustered, with a peak around 350 MYA. These data strongly suggest a whole-genome duplication event early during the evolution of ray-finned fishes, probably before the origin of teleosts.  相似文献   

15.
Hu S  Wang H  Knisely AA  Reddy S  Kovacevic D  Liu Z  Hoffman SM 《Genetica》2008,133(2):215-226
The evolution of gene families can be best understood by studying the modern organization and functions of family members, and by comparing parallel families in different species. In this study, the CYP2ABFGST gene cluster has been characterized in rat and compared to the syntenic clusters in mouse and human, providing an interesting example of gene family evolution. In the rat, 18 loci from six subfamilies have been identified by specifically amplifying and sequencing gene fragments from cloned DNA, and have been exactly placed on chromosome 1. The overall organization of the gene cluster in rat is relatively simple, with genes from each subfamily in tandem, and is more similar to the mouse than to the human cluster. We have reconstructed the probable structure of the CYP2ABFGST cluster in the common ancestor of primates and rodents, and inferred a model of the evolution of this gene cluster in the three species. Numerous nontandem and block duplications, inversions, and translocations have occurred entirely inside the cluster, indicating that pairing between duplicate genes is keeping the rearrangements within the cluster region. The initial tandem duplication of a CYP2 gene in an early mammalian ancestor has made this region particularly subject to such localized rearrangements. Even if duplicated genes do not have a large-scale effect on chromosomal rearrangements, on a local level clustered gene families may have contributed significantly to the genomic complexity of modern mammals.  相似文献   

16.
Zhang Z  Kishino H 《Genetics》2004,166(4):1995-1999
Gene duplication with subsequent divergence plays a central role in the acquisition of genes with novel function and complexity during the course of evolution. With reduced functional constraints or through positive selection, these duplicated genes may experience accelerated evolution. Under the model of subfunctionalization, loss of subfunctions leads to complementary acceleration at sites with two copies, and the difference in average rate between the sequences may not be obvious. On the other hand, the classical model of neofunctionalization predicts that the evolutionary rate in one of the two duplicates is accelerated. However, the classical model does not tell which of the duplicates experiences the acceleration in evolutionary rate. Here, we present evidence from the Saccharomyces cerevisiae genome that a duplicate located in a genomic region with a low-recombination rate is likely to evolve faster than a duplicate in an area of high recombination. This observation is consistent with population genetics theory that predicts that purifying selection is less effective in genomic regions of low recombination (Hill-Robertson effect). Together with previous studies, our results suggest the genomic background (e.g., local recombination rate) as a potential force to drive the divergence between nontandemly duplicated genes. This implies the importance of structure and complexity of genomes in the diversification of organisms via gene duplications.  相似文献   

17.
18.
Over 3,000 human diseases are known to be linked to heritable genetic variation, mapping to over 1,700 unique genes. Dating of the evolutionary age of these disease-associated genes has suggested that they have a tendency to be ancient, specifically coming into existence with early metazoa. The approach taken by past studies, however, assumes that the age of a disease is the same as the age of its common ancestor, ignoring the fundamental contribution of duplication events in the evolution of new genes and function. Here, we date both the common ancestor and the duplication history of known human disease-associated genes. We find that the majority of disease genes (80%) are genes that have been duplicated in their evolutionary history. Periods for which there are more disease-associated genes, for example, at the origins of bony vertebrates, are explained by the emergence of more genes at that time, and the majority of these are duplicates inferred to have arisen by whole-genome duplication. These relationships are similar for different disease types and the disease-associated gene's cellular function. This indicates that the emergence of duplication-associated diseases has been ongoing and approximately constant (relative to the retention of duplicate genes) throughout the evolution of life. This continued until approximately 390 Ma from which time relatively fewer novel genes came into existence on the human lineage, let alone disease genes. For single-copy genes associated with disease, we find that the numbers of disease genes decreases with recency. For the majority of duplicates, the disease-associated mutation is associated with just one of the duplicate copies. A universal explanation for heritable disease is, thus, that it is merely a by-product of the evolutionary process; the evolution of new genes (de novo or by duplication) results in the potential for new diseases to emerge.  相似文献   

19.
Duplicate genes emerge as copy-number variations (CNVs) at the population level, and remain copy-number polymorphic until they are fixed or lost. The successful establishment of such structural polymorphisms in the genome plays an important role in evolution by promoting genetic diversity, complexity and innovation. To characterize the early evolutionary stages of duplicate genes and their potential adaptive benefits, we combine comparative genomics with population genomics analyses to evaluate the distribution and impact of CNVs across natural populations of an eco-genomic model, the three-spined stickleback. With whole genome sequences of 66 individuals from populations inhabiting three distinct habitats, we find that CNVs generally occur at low frequencies and are often only found in one of the 11 populations surveyed. A subset of CNVs, however, displays copy-number differentiation between populations, showing elevated within-population frequencies consistent with local adaptation. By comparing teleost genomes to identify lineage-specific genes and duplications in sticklebacks, we highlight rampant gene content differences among individuals in which over 30% of young duplicate genes are CNVs. These CNV genes are evolving rapidly at the molecular level and are enriched with functional categories associated with environmental interactions, depicting the dynamic early copy-number polymorphic stage of genes during population differentiation.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号