首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
Plant genomes appear to exploit the process of gene duplication as a primary means of acquiring biochemical and developmental flexibility. Thus, for example, most of the enzymatic components of plant secondary metabolism are encoded by small families of genes that originated through duplication over evolutionary time. The dynamics of gene family evolution are well illustrated by the genes that encode chalcone synthase (CHS), the first committed step in flavonoid biosynthesis. We review pertinent facts about CHS evolution in flowering plants with special reference to the morning glory genus, Ipomoea. Our review shows that new CHS genes are recruited recurrently in flowering plant evolution. Rates of nucleotide substitution are frequently accelerated in new duplicate genes, and there is clear evidence for repeated shifts in enzymatic function among duplicate copies of CHS genes. In addition, we present new data on expression patterns of CHS genes as a function of tissue and developmental stage in the common morning glory (I. purpurea). These data show extensive differentiation in gene expression among duplicate copies of CHS genes. We also show that a single mutation which blocks anthocyanin biosynthesis in the floral limb is correlated with a loss of expression of one of the six duplicate CHS genes present in the morning glory genome. This suggests that different duplicate copies of CHS have acquired specialized functional roles over the course of evolution. We conclude that recurrent gene duplication and subsequent differentiation is a major adaptive strategy in plant genome evolution.  相似文献   

3.
Learning about the roles that duplicate genes play in the origins of novel phenotypes requires an understanding of how their functions evolve. A previous method for achieving this goal, CDROM, employs gene expression distances as proxies for functional divergence and then classifies the evolutionary mechanisms retaining duplicate genes from comparisons of these distances in a decision tree framework. However, CDROM does not account for stochastic shifts in gene expression or leverage advances in contemporary statistical learning for performing classification, nor is it capable of predicting the parameters driving duplicate gene evolution. Thus, here we develop CLOUD, a multi-layer neural network built on a model of gene expression evolution that can both classify duplicate gene retention mechanisms and predict their underlying evolutionary parameters. We show that not only is the CLOUD classifier substantially more powerful and accurate than CDROM, but that it also yields accurate parameter predictions, enabling a better understanding of the specific forces driving the evolution and long-term retention of duplicate genes. Further, application of the CLOUD classifier and predictor to empirical data from Drosophila recapitulates many previous findings about gene duplication in this lineage, showing that new functions often emerge rapidly and asymmetrically in younger duplicate gene copies, and that functional divergence is driven by strong natural selection. Hence, CLOUD represents a major advancement in classifying retention mechanisms and predicting evolutionary parameters of duplicate genes, thereby highlighting the utility of incorporating sophisticated statistical learning techniques to address long-standing questions about evolution after gene duplication.  相似文献   

4.
Raquel Assis 《Fly》2014,8(2):91-94
Gene duplication is thought to play a key role in phenotypic innovation. While several processes have been hypothesized to drive the retention and functional evolution of duplicate genes, their genomic contributions have never been determined. We recently developed the first genome-wide method to classify these processes by comparing distances between expression profiles of duplicate genes and their ancestral single-copy orthologs. Application of our approach to spatial gene expression profiles in two Drosophila species revealed that a majority of young duplicate genes possess new functions, and that new functions are acquired rapidly—often within a few million years. Surprisingly, new functions tend to arise in younger copies of duplicate gene pairs. Moreover, we found that young duplicates are often specifically expressed in testes, whereas old duplicates are broadly expressed across several tissues, providing strong support for the hypothetical “out-of-testes” origin of new genes. In this Extra View, I discuss our findings in the context of theoretical predictions about gene duplication, with a particular emphasis on the importance of natural selection in the evolution of novel phenotypes.  相似文献   

5.
Chen Y  Ding Y  Zhang Z  Wang W  Chen JY  Ueno N  Mao B 《遗传学报》2011,38(12):577-584
The evolution of the central nervous system (CNS) is one of the most striking changes during the transition from invertebrates to vertebrates. As a major source of genetic novelties, gene duplication might play an important role in the functional innovation of vertebrate CNS. In this study, we focused on a group of CNS-biased genes that duplicated during early vertebrate evolution. We investigated the tempo-spatial expression patterns of 33 duplicate gene families and their orthologs during the embryonic development of the vertebrate Xenopus laevis and the cephalochordate Brachiostoma belcheri. Almost all the identified duplicate genes are differentially expressed in the CNS in Xenopus embryos, and more than 50% and 30% duplicate genes are expressed in the telencephalon and mid-hindbrain boundary, respectively, which are mostly considered as two innovations in the vertebrate CNS. Interestingly, more than 50% of the amphioxus orthologs do not show apparent expression in the CNS in amphioxus embryos as detected by in situ hybridization, indicating that some of the vertebrate CNS-biased duplicate genes might arise from non-CNS genes in invertebrates. Our data accentuate the functional contribution of gene duplication in the CNS evolution of vertebrate and uncover an invertebrate non-CNS history for some vertebrate CNS-biased duplicate genes.  相似文献   

6.
Thornton KR 《Genetics》2007,177(2):987-1000
I describe a method for simulating samples from gene families of size two under a neutral coalescent process, for the case where the duplicate gene either has fixed recently in the population or is still segregating. When a duplicate locus has recently fixed by genetic drift, diversity in the new gene is expected to be reduced, and an excess of rare alleles is expected, relative to the predictions of the standard coalescent model. The expected patterns of polymorphism in segregating duplicates ("copy-number variants") depend both on the frequency of the duplicate in the sample and on the rate of crossing over between the two loci. When the crossover rate between the ancestral gene and the copy-number variant is low, the expected pattern of variability in the ancestral gene will be similar to the predictions of models of either balancing or positive selection, if the frequency of the duplicate in the sample is intermediate or high, respectively. Simulations are used to investigate the effect of crossing over between loci, and gene conversion between the duplicate loci, on levels of variability and the site-frequency spectrum.  相似文献   

7.
8.
A key question in molecular genetics is why severe gene mutations often do not result in a detectable abnormal phenotype. Alternative networks are known to be a gene compensation mechanism. Gene redundancy, i.e. the presence of a duplicate gene (or paralog) elsewhere in the genome, also underpins many cases of gene dispensability. Here, we investigated the role of partial duplicate genes on dispensability, where a partial duplicate is defined as a gene that has no paralog but which codes for a protein made of domains, each of which belongs to at least another protein. The rationale behind this investigation is that, as a partial duplicate codes for a domain redundant protein, we hypothesised that its deletion might have a less severe phenotypic effect than the deletion of other genes. This prompted us to (re)address the topic of gene dispensability by focusing on domain redundancy rather than on gene redundancy. Using fitness data of single-gene deletion mutants of Saccharomyces cerevisiae, we will show that domain redundancy is a compensation mechanism, the strength of which is lower than that of gene redundancy. Finally, we shall discuss the molecular basis of this new compensation mechanism.  相似文献   

9.
10.
Unusual phenotypic distributions at the muscle-specific, duplicate aspartate aminotransferase (AAT) loci were found in wild populations of brook trout (Salvelinus fontinalis), a species of the tetraploid-derivative Salmonidae. Analysis of these phenotypic distributions ruled out disparate gene frequencies, nonrandom association between the two loci, and inbreeding as possible explanations; however, models incorporating a null allele fit the data. Inheritance data from hatchery populations of brook trout also indicated a null allele polymorphism. This proposed AAT null allele, along with other null allele polymorphisms in salmonids, is evidence that loss of duplicate gene expression is still occurring. In contrast, there is no such evidence of ongoing loss of duplicate gene expression in the Catostomidae, another tetraploid-derivative lineage. We interpret this and other differences between salmonids and catostomids as reflecting an autotetraploid origin for salmonids and an allotetraploid origin for catostomids. The significance of these findings is also considered with respect to current models of the rate of loss of duplicate gene expression in tetraploid-derivative organisms.  相似文献   

11.
In 50 million years the tetraploid catostomid fishes have lost the expression of approximately half of their duplicate genes, with species rich taxa having lost more than species poor taxa. We have constructed a phylogenetic tree of the catostomids based primarily on morphological data, and have estimated the divergence times from the fossil record and genetic distances. The losses of duplicate gene expression were then analyzed conditionally given this tree. Three probabilistic models were generated to describe the process of loss of gene expression: gene dysfunction depends on (1) time alone, (2) the number of speciation events alone, or (3) a combination of speciation and time. A maximum likelihood analysis revealed that the two component model fits the data better than the other models. The loss of duplicate gene expression is mediated by null mutations at structural and/or regulatory genes, and the rate of fixation of these nulls might have been enhanced by any reductions in population size accompanying speciation events. This reduction may explain the lower number of duplicate genes expressed in the more speciose taxa.  相似文献   

12.
Previous work was shown that strains of Aspergillus nidulans with a chromosome segment in duplicate (one in normal position, one translocated to another chromosone) are unstable. Deletions occur from either duplicate segment. The present work has shown that when a chromosome I duplication and a chromosome III duplication are together in a haploid, deletions from the intact III duplication generally precede deletions from particular sections of the I duplication. Furthermore, the III duplication can enhance to some (but not major) extent the frequency of deletions from the I duplication. After the III duplication becomes reduced in size as a result of the loss of chromosomal material from the translocated duplicate III segment, such a reduced III duplication can greatly enhance the frequency of deletions from the I duplication. In other words a III duplication of reduced size can promote far more deletions from the I duplication than the intact III duplication. The major increase in the deletional instability of the duplication as promoted by the reduced III duplication is confined to the translocated duplicate I segment. The reduced III duplication can induce deletions from a section of the translocated duplicate I segment in accord with a temporal programme, and it appears that a particular region of the I duplication is far more under the mutagenic influence of the reduced III duplication than another region. Moreover, there is indication that there is a differential effet of two generally different genetic backgrounds on the susceptibility of duplication-regions to deletion.  相似文献   

13.
We conducted a detailed analysis of duplicate genes in three complete genomes: yeast, Drosophila, and Caenorhabditis elegans. For two proteins belonging to the same family we used the criteria: (1) their similarity is > or =I (I = 30% if L > or = 150 a.a. and I = 0.01n + 4.8L(-0.32(1 + exp(-L/1000))) if L < 150 a.a., where n = 6 and L is the length of the alignable region), and (2) the length of the alignable region between the two sequences is > or = 80% of the longer protein. We found it very important to delete isoforms (caused by alternative splicing), same genes with different names, and proteins derived from repetitive elements. We estimated that there were 530, 674, and 1,219 protein families in yeast, Drosophila, and C. elegans, respectively, so, as expected, yeast has the smallest number of duplicate genes. However, for the duplicate pairs with the number of substitutions per synonymous site (K(S)) < 0.01, Drosophila has only seven pairs, whereas yeast has 58 pairs and nematode has 153 pairs. After considering the possible effects of codon usage bias and gene conversion, these numbers became 6, 55, and 147, respectively. Thus, Drosophila appears to have much fewer young duplicate genes than do yeast and nematode. The larger numbers of duplicate pairs with K(S) < 0.01 in yeast and C. elegans were probably largely caused by block duplications. At any rate, it is clear that the genome of Drosophila melanogaster has undergone few gene duplications in the recent past and has much fewer gene families than C. elegans.  相似文献   

14.
Duplicate genes are believed to be a major source of new gene functions over evolutionary time. In order to evaluate the evolutionary dynamics of rice duplicate genes, formed principally by paleoployploidization prior to the speciation of the Poaceae family, we have employed a public microarray dataset including 155 gene expression omnibus sample plates and bioinformatics tools. At least 57.4% of old ~70 million years ago (MYA) duplicate gene pairs exhibit divergences in expression over the given experimental set, whereas at least 50.9% of young ~7.7-MYA duplicate gene pairs were shown to be divergent. When grouping the rice duplicate genes according to functional categories, we noted a striking and significant enrichment of divergent duplicate metabolism-associated genes, as compared to that observed in non-divergent duplicate genes. While both non-synonymous substitution (Ka) and synonymous substitution (Ks) values between non- and divergent duplicate gene pairs evidenced significant differences, the Ka/Ks values between them exhibited no significant differences. Interestingly, the average numbers of conserved motifs of the duplicate gene pairs revealed a pattern of decline along with an increase in expression diversity, partially supporting the subfunctionalization model with degenerative complementation in regulatory motifs. Duplicate gene pairs with high local similarity (HLS) segments, which might be formed via conversion between rice paleologs, evidenced higher expression correlations than were observed in the gene pairs without the HLS segments; this probably resulted in an increased likelihood of gene conversion in promoters of the gene pairs harboring HLS segments. More than 60% of the rice gene families exhibited similar high expression diversity between members as compared to that of randomly selected gene pairs. These findings are likely reflective of the evolutionary dynamics of rice duplicate genes for gene retention. An erratum to this article can be found at  相似文献   

15.
Allopolyploidization (hybridization and whole-genome duplication) is a common phenomenon in plant evolution with immediate saltational effects on genome structure and gene expression. New technologies have allowed rapid progress over the past decade in our understanding of the consequences of allopolyploidy. A major question, raised by early pioneer of this field Leslie Gottlieb, concerned the extent to which gene expression differences among duplicate genes present in an allopolyploid are a legacy of expression differences that were already present in the progenitor diploid species. Addressing this question necessitates phylogenetically well-understood natural study systems, appropriate technology, availability of genomic resources and a suitable analytical framework, including a sufficiently detailed and generally accepted terminology. Here, we review these requirements and illustrate their application to a natural study system that Gottlieb worked on and recommended for this purpose: recent allopolyploids of Tragopogon (Asteraceae). We reanalyse recent data from this system within the conceptual framework of parental legacies on duplicate gene expression in allopolyploids. On a broader level, we highlight the intellectual connection between Gottlieb''s phrasing of this issue and the more contemporary framework of cis- versus trans-regulation of duplicate gene expression in allopolyploid plants.  相似文献   

16.
Redundant duplication among putative Nordic spring barley material held at 12 gene banks worldwide was studied using 35 microsatellite primer pairs covering the entire barley genome. These microsatellite markers revealed an average of 7.1 alleles per locus, and a range of 1 to 17 different alleles per locus. Similarity of accession name was initially used to partition the 174 repatriated accessions into 36 potential duplicate groups, and one group containing 36 apparently unique or unrelated accessions. This partitioning was efficient to produce a distribution of mainly small average genetic distances within potential duplicate groups compared to distances from the group of unique accessions. However, comparisons within potential duplicate groups still contained large genetic distances of the same size as distances between unique accessions indicating classification errors. A bootstrap approach based on re-sampling of both microsatellite markers and alleles within marker loci was used to test for homogeneity within potential duplicate groups. The test was used in each group for sequential elimination of accessions with a significantly large average genetic distance to identify a homogeneous group. Such genetically homogeneous groups of two or more accessions were identified in 22 among the 36 potential duplicate groups studied. Results from the genetic analysis of some potential duplicate groups supported previous conclusions based on passport data through inclusion of the historically most-original accession in the genetically homogeneous group. In other potential duplicate groups the apparently most-original accession according to passport data was not included in the homogeneous set of accessions, indicating that this most-original accession does not have duplicate accessions in the group. During the present study the largest average genetic distance accepted in any homogeneous group was smaller than the smallest distance declared significant in any group, with a threshold average genetic distance of approximately 0.14. The results are discussed with respect to the identification of duplicate accessions within potential duplicate groups, as well as the elimination of genetic off types in such groups. Furthermore, large barley gene bank collections may be screened for potential duplicates with genetic distances below the suggested threshold of 0.14.  相似文献   

17.
Despite much recent interest, it remains unclear what determines the rate of evolution of gene expression. To study this issue we develop a new measure, called "Expression Conservation Index" (ECI), to quantify the degree of tissue-expression conservation between two homologous genes. Applying this measure to a large set of gene expression data from human and mouse, we show that tissue expression tends to evolve rapidly for genes that are expressed in only a limited number of tissues, whereas tissue expression can be conserved for a long time for genes expressed in a large number of tissues. Therefore, expression breadth is an important determinant for evolutionary conservation of tissue expression. In addition, we find a rapid decrease in ECI with the synonymous divergence between duplicate genes, suggesting fast divergence in tissue expression between duplicate genes.  相似文献   

18.

Background  

During gene expression analysis by Serial Analysis of Gene Expression (SAGE), duplicate ditags are routinely removed from the data analysis, because they are suspected to stem from artifacts during SAGE library construction. As a consequence, naturally occurring duplicate ditags are also removed from the analysis leading to an error of measurement.  相似文献   

19.
We examine robustness to mutations in the nematode worm Caenorhabditis elegans and the role of single-copy and duplicate genes in it. We do so by integrating complete genome sequence and microarray gene expression data with results from a genome-scale study using RNA interference (RNAi) to temporarily eliminate the functions of more than 16000 worm genes. We found that 89% of single-copy and 96% of duplicate genes show no detectable phenotypic effect in an RNAi knock-down experiment. We find that mutational robustness is greatest for closely related gene duplicates, large gene families and similarly expressed genes. We discuss the different causes of mutational robustness in single-copy and duplicate genes, as well as its evolutionary origin.  相似文献   

20.
Current hypotheses of gene duplicate divergence propose that surviving members of a gene duplicate pair may evolve, under conditions of purifying or nearly neutral selection, in one of two ways: with new function arising in one duplicate while the other retains original function (neofunctionalization [NF]) or partitioning of the original function between the 2 paralogs (subfunctionalization [SF]). More recent studies propose that SF followed by NF (subneofunctionalization [SNF]) explains the divergence of many duplicate genes. In this analysis, we evaluate these hypotheses in the context of the large monosaccharide transporter (MST) gene families in Arabidopsis and rice. MSTs have an ancient origin, predating plants, and have evolved in the seed plant lineage to comprise 7 subfamilies. In Arabidopsis, 53 putative MST genes have been identified, with one subfamily greatly expanded by tandem gene duplications. We searched the rice genome for members of the MST gene family and compared them with the MST gene family in Arabidopsis to determine subfamily expansion patterns and estimate gene duplicate divergence times. We tested hypotheses of gene duplicate divergence in 24 paralog pairs by comparing protein sequence divergence rates, estimating positive selection on codon sites, and analyzing tissue expression patterns. Results reveal the MST gene family to be significantly larger (65) in rice with 2 subfamilies greatly expanded by tandem duplications. Gene duplicate divergence time estimates indicate that early diversification of most subfamilies occurred in the Proterozoic (2500-540 Myr) and that expansion of large subfamilies continued through the Cenozoic (65-0 Myr). Two-thirds of paralog pairs show statistically symmetric rates of sequence evolution, most consistent with the SF model, with half of those showing evidence for positive selection in one or both genes. Among 8 paralog pairs showing asymmetric divergence rates, most consistent with the NF model, nearly half show evidence of positive selection. Positive selection does not appear in any duplicate pairs younger than approximately 34 Myr. Our data suggest that the NF, SF, and SNF models describe different outcomes along a continuum of divergence resulting from initial conditions of relaxed constraint after duplication.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号