首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 29 毫秒
1.
Recent years have witnessed a breathtaking increase in the availability of genome sequence data, providing evidence of the highly duplicate nature of eukaryotic genomes. Plants are exceptional among eukaryotic organisms in that duplicate loci compose a large fraction of their genomes, partly because of the frequent occurrence of polyploidy (or whole-genome duplication) events. Tandem gene duplication and transposition have also contributed to the large number of duplicated genes in plant genomes. Evolutionary analyses allowed the dynamics of duplicate gene evolution to be studied and several models were proposed. It seems that, over time, many duplicated genes were lost and some of those that were retained gained new functions and/or expression patterns (neofunctionalization) or subdivided their functions and/or expression patterns between them (subfunctionalization). Recent studies have provided examples of genes that originated by duplication with successive diversification within plants. In this review, we focused on the TEL (TERMINAL EAR1-like) genes to illustrate such mechanisms. Emerged from the mei2 gene family, these TEL genes are likely to be land plant-specific. Phylogenetic analyses revealed one or two TEL copies per diploid genome. TEL gene degeneration and loss in several Angiosperm species such as in poplar and maize seem to have occurred. In Arabidopsis thaliana, whose genome experienced at least three polyploidy events followed by massive gene loss and genomic reorganization, two TEL genes were retained and two new shorter TEL-like (MCT) genes emerged. Molecular and expression analyses suggest for these genes sub- and neofunctionalization events, but confirmation will come from their functional characterization.  相似文献   

2.
Conant GC  Wolfe KH 《Genetics》2008,179(3):1681-1692
Identification of orthologous genes across species becomes challenging in the presence of a whole-genome duplication (WGD). We present a probabilistic method for identifying orthologs that considers all possible orthology/paralogy assignments for a set of genomes with a shared WGD (here five yeast species). This approach allows us to estimate how confident we can be in the orthology assignments in each genomic region. Two inferences produced by this model are indicative of purifying selection acting to prevent duplicate gene loss. First, our model suggests that there are significant differences (up to a factor of seven) in duplicate gene half-life. Second, we observe differences between the genes that the model infers to have been lost soon after WGD and those lost more recently. Gene losses soon after WGD appear uncorrelated with gene expression level and knockout fitness defect. However, later losses are biased toward genes whose paralogs have high expression and large knockout fitness defects, as well as showing biases toward certain functional groups such as ribosomal proteins. We suggest that while duplicate copies of some genes may be lost neutrally after WGD, another set of genes may be initially preserved in duplicate by natural selection for reasons including dosage.  相似文献   

3.
Evolutionary change of the numbers of homeobox genes in bilateral animals   总被引:6,自引:0,他引:6  
It has been known that the conservation or diversity of homeobox genes is responsible for the similarity and variability of some of the morphological or physiological characters among different organisms. To gain some insights into the evolutionary pattern of homeobox genes in bilateral animals, we studied the change of the numbers of these genes during the evolution of bilateral animals. We analyzed 2,031 homeodomain sequences compiled from 11 species of bilateral animals ranging from Caenorhabditis elegans to humans. Our phylogenetic analysis using a modified reconciled-tree method suggested that there were at least about 88 homeobox genes in the common ancestor of bilateral animals. About 50-60 genes of them have left at least one descendant gene in each of the 11 species studied, suggesting that about 30-40 genes were lost in a lineage-specific manner. Although similar numbers of ancestral genes have survived in each species, vertebrate lineages gained many more genes by duplication than invertebrate lineages, resulting in more than 200 homeobox genes in vertebrates and about 100 in invertebrates. After these gene duplications, a substantial number of old duplicate genes have also been lost in each lineage. Because many old duplicate genes were lost, it is likely that lost genes had already been differentiated from other groups of genes at the time of gene loss. We conclude that both gain and loss of homeobox genes were important for the evolutionary change of phenotypic characters in bilateral animals.  相似文献   

4.
The relationship between speciation times and the corresponding times of gene divergence is of interest in phylogenetic inference as a means of understanding the past evolutionary dynamics of populations and of estimating the timing of speciation events. It has long been recognized that gene divergence times might substantially pre-date speciation events. Although the distribution of the difference between these has previously been studied for the case of two populations, this distribution has not been explicitly computed for larger species phylogenies. Here we derive a simple method for computing this distribution for trees of arbitrary size. A two-stage procedure is proposed which (i) considers the probability distribution of the time from the speciation event at the root of the species tree to the gene coalescent time conditionally on the number of gene lineages available at the root; and (ii) calculates the probability mass function for the number of gene lineages at the root. This two-stage approach dramatically simplifies numerical analysis, because in the first step the conditional distribution does not depend on an underlying species tree, while in the second step the pattern of gene coalescence prior to the species tree root is irrelevant. In addition, the algorithm provides intuition concerning the properties of the distribution with respect to the various features of the underlying species tree. The methodology is complemented by developing probabilistic formulae and software, written in R. The method and software are tested on five-taxon species trees with varying levels of symmetry. The examples demonstrate that more symmetric species trees tend to have larger mean coalescent times and are more likely to have a unimodal gamma-like distribution with a long right tail, while asymmetric trees tend to have smaller mean coalescent times with an exponential-like distribution. In addition, species trees with longer branches generally have shorter mean coalescent times, with branches closest to the root of the tree being most influential.  相似文献   

5.
Gene duplication and gene loss as well as other biological events can result in multiple copies of genes in a given species. Because of these gene duplication and loss dynamics, in addition to variation in sequence evolution and other sources of uncertainty, different gene trees ultimately present different evolutionary histories. All of this together results in gene trees that give different topologies from each other, making consensus species trees ambiguous in places. Other sources of data to generate species trees are also unable to provide completely resolved binary species trees. However, in addition to gene duplication events, speciation events have provided some underlying phylogenetic signal, enabling development of algorithms to characterize these processes. Therefore, a soft parsimony algorithm has been developed that enables the mapping of gene trees onto species trees and modification of uncertain or weakly supported branches based on minimizing the number of gene duplication and loss events implied by the tree. The algorithm also allows for rooting of unrooted trees and for removal of in-paralogues (lineage-specific duplicates and redundant sequences masquerading as such). The algorithm has also been made available for download as a software package, Softparsmap.  相似文献   

6.
A nuclear low-copy gene phylogeny provides strong evidence for the hybrid origin of seven polyploid species in Geinae (Rosaceae). In a gene tree, alleles at homologous loci in an allopolyploid species are expected to be sisters to orthologues in the ancestral taxa rather than to each other. Alleles at a duplicated locus in an autopolyploid, however, are expected to be more closely related to each other than they are to any orthologous copies in closely related species. We cloned and sequenced about 1.9 kilobases from the 5' end of the GBSSI-1 gene from two diploid, one tetraploid, and six hexaploid species. Each of the three loci in the hexaploid species forms a separate group, two of which are more closely related to copies in other species than they are to each other. This finding indicates that the hexaploid lineage evolved through two consecutive allopolyploidization events. Based on the GBSSI-1 gene tree, we hypothesized that there was an initial hybridization between a diploid species from the ancestral lineage of Coluria and Waldsteinia and an unknown diploid species to form the tetraploid Geum heterocarpum lineage. Backcrossing of G. heterocarpum with a representative of the unknown diploid lineage then resulted in a hexaploid lineage that has radiated considerably since its origin, comprising at least 40 extant species with various morphologies. A penalized likelihood analysis indicated that Geinae may be about 17 million years old, implying that the hypothesized allopolyploid speciation events are relatively ancient. Six of the 22 cloned Geinae GBSSI-1 copies in this study, which all are duplicate copies in polyploid taxa, may have become pseudogenes. We compared the GBSSI-1 phylogeny with one from chloroplast data and explored implications for the evolution of some fruit characters.  相似文献   

7.
We studied the phylogenetic relationships among Japanese Leptocarabus ground beetles, which show extensive trans-species polymorphisms in mitochondrial gene genealogies. Simultaneous analysis of combined nuclear data with partial sequences from the long-wavelength rhodopsin, wingless, phosphoenolpyruvate carboxykinase, and 28S rRNA genes resolved the relationships among the five species, although separate analyses of these genes provided topologies with low resolution. For both the nuclear gene tree resulting from the combined data from four genes and a mitochondrial cytochrome oxidase subunit I (COI) gene tree, we applied a Bayesian divergence time estimation using a common calibration method to identify mitochondrial introgression events that occurred after speciation. Three mitochondrial lineages shared by two or three species were likely subject to introgression due to interspecific hybridization because the coalescent times for these lineages were much shorter than the corresponding speciation times estimated from nuclear gene sequences. We demonstrated that when species phylogeny is fully resolved with nuclear gene sequence data, comparative analysis of nuclear and mitochondrial gene trees can be used to infer introgressive hybridization events that might cause trans-species polymorphisms in mitochondrial gene trees.  相似文献   

8.
RPB1 and RPB2, which encode the largest and second largest subunits of RNA polymerase II, respectively, are essential single copy genes in fungi, animals and most plants. Two paralogs of the RPB2 gene have been found in some groups of angioperms [Oxelman, B., Yoshikawa, N., McConaughy, B.L., Luo, J., Denton, A.L., Hall, B.D., 2004. RPB2 gene phylogeny in flowering plants, with particular emphasis on asterids. Mol. Phylogenet. Evol. 32, 462-479]. Here, we report the results of experiments designed to identify the evolutionary origin of the RPB2 duplicate copies. Through careful sampling and phylogenetic analysis, we were able to construct the RPB2 gene tree in angiosperms and infer the phylogenetic positions of the gene duplication and gene loss events that occurred. Our study shows that an RPB2 gene duplication occurred early in core eudicot evolution, at or near the time of the Buxaceae/Trochodendraceae divergence. Subsequently, multiple gene duplication and paralog sorting events happened independently in different core eudicot taxa. Differential expression of the two RPB2 gene paralogs may explain the preservation of both paralogs in the asterids. One gene (RPB2-i) accounts for most of the RPB2 mRNA made in the flower organs while the other gene (RPB2-d) is predominantly used in the vegetative tissues. We also found two paralogs of the RPB1 gene in some core eudicot species. The RPB1 gene duplication occurred before core eudicot divergence, around the time of RPB2 gene duplication. Several independent RPB1 paralog sorting events happened in different core eudicot taxa; their occurrence was independent of the RPB2 paralog sorting events. Our results suggest that a polyploidization event happened at or near the time of the Buxaceae/Trochodendraceae divergence. We propose that this polyploidization and the partial diploidization processes thereafter may have been the driving force of core eudicot radiation.  相似文献   

9.
Background

The number of species with completed genomes, including those with evidence for recent whole genome duplication events has exploded. The recently sequenced Atlantic salmon genome has been through two rounds of whole genome duplication since the divergence of teleost fish from the lineage that led to amniotes. This quadrupoling of the number of potential genes has led to complex patterns of retention and loss among gene families.

Results

Methods have been developed to characterize the interplay of duplicate gene retention processes across both whole genome duplication events and additional smaller scale duplication events. Further, gene expression divergence data has become available as well for Atlantic salmon and the closely related, pre-whole genome duplication pike and methods to describe expression divergence are also presented. These methods for the characterization of duplicate gene retention and gene expression divergence that have been applied to salmon are described.

Conclusions

With the growth in available genomic and functional data, the opportunities to extract functional inference from large scale duplicates using comparative methods have expanded dramatically. Recently developed methods that further this inference for duplicated genes have been described.

  相似文献   

10.
Estimates of the timing of divergence are central to testing the underlying causes of speciation. Relaxed molecular clocks and fossil calibration have improved these estimates; however, these advances are implemented in the context of gene trees, which can overestimate divergence times. Here we couple recent innovations for dating speciation events with the analytical power of species trees, where multilocus data are considered in a coalescent context. Divergence times are estimated in the bird genus Aphelocoma to test whether speciation in these jays coincided with mountain uplift or glacial cycles. Gene trees and species trees show general agreement that diversification began in the Miocene amid mountain uplift. However, dates from the multilocus species tree are more recent, occurring predominately in the Pleistocene, consistent with theory that divergence times can be significantly overestimated with gene‐tree based approaches that do not correct for genetic divergence that predates speciation. In addition to coalescent stochasticity, Haldane's rule could account for some differences in timing estimates between mitochondrial DNA and nuclear genes. By incorporating a fossil calibration applied to the species tree, in addition to the process of gene lineage coalescence, the present approach provides a more biologically realistic framework for dating speciation events, and hence for testing the links between diversification and specific biogeographic and geologic events.  相似文献   

11.
Summary In the 50 million years since the polyploidization event that gave rise to the catostomid family of fishes the duplicate genes encoding isozymes have undergone different fates. Ample opportunity has been available for regulatory evolution of these duplicate genes. Approximately half the duplicate genes have lost their expressions during this time. Of the duplicate genes remaining, the majority have diverged to different extents in their expression within and among adult tissues. The pattern of divergence of duplicate gene expression is consistent with the accumulation of mutations at regulatory genes. The absence of a correlation of extent of divergence of gene expression with the level of genetic variability for isozymes at these loci is consistent with the view that the rates of regulatory gene and structural gene evolution are uncoupled. The magnitude of divergence of duplicate gene expressions varies among tissues, enzymes, and species. Little correlation was found with the extent of divergence of duplicate gene expression within a species and its degree of morphological conservatism, although species pairs which are increasingly taxonomically distant are less likely to share specific patterns of differential gene expression. Probable phylogenetic times of origin of several patterns of differential gene expression have been proposed. Some patterns of differential gene expression have evolved in recent evolutionary times and are specific to one or a few species, whereas at least one pattern of differential gene expression is present in nearly all species and probably arose soon after the polyploidization event. Multilocus isozymes, formed by polyploidization, provide a useful model system for studying the forces responsible for the maintenance of duplicate genes and the evolution of these once identical genes to new spatially and temporally specific patterns of regulation.  相似文献   

12.
Recently diverged taxa may continue to exchange genes. A number of models of speciation with gene flow propose that the frequency of gene exchange will be lower in genomic regions of low recombination and that these regions will therefore be more differentiated. However, several population-genetic models that focus on selection at linked sites also predict greater differentiation in regions of low recombination simply as a result of faster sorting of ancestral alleles even in the absence of gene flow. Moreover, identifying the actual amount of gene flow from patterns of genetic variation is tricky, because both ancestral polymorphism and migration lead to shared variation between recently diverged taxa. New analytic methods have been developed to help distinguish ancestral polymorphism from migration. Along with a growing number of datasets of multi-locus DNA sequence variation, these methods have spawned a renewed interest in speciation models with gene flow. Here, we review both speciation and population-genetic models that make explicit predictions about how the rate of recombination influences patterns of genetic variation within and between species. We then compare those predictions with empirical data of DNA sequence variation in rabbits and mice. We find strong support for the prediction that genomic regions experiencing low levels of recombination are more differentiated. In most cases, reduced gene flow appears to contribute to the pattern, although disentangling the relative contribution of reduced gene flow and selection at linked sites remains a challenge. We suggest fruitful areas of research that might help distinguish between different models.  相似文献   

13.
Polyploidy and genome evolution in plants   总被引:2,自引:0,他引:2  
Genome doubling (polyploidy) has been and continues to be a pervasive force in plant evolution. Modern plant genomes harbor evidence of multiple rounds of past polyploidization events, often followed by massive silencing and elimination of duplicated genes. Recent studies have refined our inferences of the number and timing of polyploidy events and the impact of these events on genome structure. Many polyploids experience extensive and rapid genomic alterations, some arising with the onset of polyploidy. Survivorship of duplicated genes are differential across gene classes, with some duplicate genes more prone to retention than others. Recent theory is now supported by evidence showing that genes that are retained in duplicate typically diversify in function or undergo subfunctionalization. Polyploidy has extensive effects on gene expression, with gene silencing accompanying polyploid formation and continuing over evolutionary time.  相似文献   

14.
15.
16.
Under a coalescent model for within-species evolution, gene trees may differ from species trees to such an extent that the gene tree topology most likely to evolve along the branches of a species tree can disagree with the species tree topology. Gene tree topologies that are more likely to be produced than the topology that matches that of the species tree are termed anomalous, and the region of branch-length space that gives rise to anomalous gene trees (AGTs) is the anomaly zone. We examine the occurrence of anomalous gene trees for the case of five taxa, the smallest number of taxa for which every species tree topology has a nonempty anomaly zone. Considering all sets of branch lengths that give rise to anomalous gene trees, the largest value possible for the smallest branch length in the species tree is greater in the five-taxon case (0.1934 coalescent time units) than in the previously studied case of four taxa (0.1568). The five-taxon case demonstrates the existence of three phenomena that do not occur in the four-taxon case. First, anomalous gene trees can have the same unlabeled topology as the species tree. Second, the anomaly zone does not necessarily enclose a ball centered at the origin in branch-length space, in which all branches are short. Third, as a branch length increases, it is possible for the number of AGTs to increase rather than decrease or remain constant. These results, which help to describe how the properties of anomalous gene trees increase in complexity as the number of taxa increases, will be useful in formulating strategies for evading the problem of anomalous gene trees during species tree inference from multilocus data.  相似文献   

17.
The ancient duplication of the Saccharomyces cerevisiae genome and subsequent massive loss of duplicated genes is apparent when it is compared to the genomes of related species that diverged before the duplication event. To learn more about the evolutionary effects of the duplication event, we compared the S. cerevisiae genome to other Saccharomyces genomes. We demonstrate that the whole genome duplication occurred before S. castellii diverged from S. cerevisiae. In addition to more accurately dating the duplication event, this finding allowed us to study the effects of the duplication on two separate lineages. Analyses of the duplication regions of the genomes indicate that most of the duplicated genes (approximately 85%) were lost before the speciation. Only a small amount of paralogous gene loss (4-6%) occurred after speciation. On the other hand, S. castellii appears to have lost several hundred genes that were not retained as duplicated paralogs. These losses could be related to genomic rearrangements that reduced the number of chromosomes from 16 to 9. In addition to S. castellii, other Saccharomyces sensu lato species likely diverged from S. cerevisiae after the duplication. A thorough analysis of these species will likely reveal other important outcomes of the whole genome duplication.  相似文献   

18.
Most plant phylogenetic inference has used DNA sequence data from the plastid genome. This genome represents a single genealogical sample with no recombination among genes, potentially limiting the resolution of evolutionary relationships in some contexts. In contrast, nuclear DNA is inherently more difficult to employ for phylogeny reconstruction because major mutational events in the genome, including polyploidization, gene duplication, and gene extinction can result in homologous gene copies that are difficult to identify as orthologs or paralogs. Gene tree parsimony (GTP) can be used to infer the rooted species tree by fitting gene genealogies to species trees while simultaneously minimizing the estimated number of duplications needed to reconcile conflicts among them. Here, we use GTP for five nuclear gene families and a previously published plastid data set to reconstruct the phylogenetic backbone of the aquatic plant family Pontederiaceae. Plastid-based phylogenetic studies strongly supported extensive paraphyly of Eichhornia (one of the four major genera) but also depicted considerable ambiguity concerning the true root placement for the family. Our results indicate that species trees inferred from the nuclear genes (alone and in combination with the plastid data) are highly congruent with gene trees inferred from plastid data alone. Consideration of optimal and suboptimal gene tree reconciliations place the root of the family at (or near) a branch leading to the rare and locally restricted E. meyeri. We also explore methods to incorporate uncertainty in individual gene trees during reconciliation by considering their individual bootstrap profiles and relate inferred excesses of gene duplication events on individual branches to whole-genome duplication events inferred for the same branches. Our study improves understanding of the phylogenetic history of Pontederiaceae and also demonstrates the utility of GTP for phylogenetic analysis.  相似文献   

19.
The volcanic archipelago of the Canary Islands, 100 km off the northwestern coast of Africa, harbors 43 endemic species of the mostly circum-Mediterranean spider genus Dysdera (Araneae, Dysderidae). This amounts to approximately one-fourth of all known Dysdera species in an area that represents 0.1% of the range of the genus. In order to address the origin of this extraordinary number of endemic species, the phylogenetic relationships among all the endemic taxa and a sample of 27 continental species were reconstructed. A simultaneous cladistic analysis was performed on 66 morphological characters, 471 bp of the cytochrome oxidase I and 424 bp of the 16S rRNA mitochondrial genes. The preferred most parsimonious tree supports a single origin for most of the endemic species (84%), although this tree is ambiguous regarding the total number of overseas colonizations (allowing a minimum of two and a maximum of four colonization events). Our data suggest that the Canary Islands have been the source of the colonizers of some of the remaining Macaronesian archipelagoes (certainly for the Selvagem Islands and the Cape Verdes and possibly for Madeira); the Azores have been independently colonized by dysderids from the continent. The present study provides a phylogenetic framework for an exceptional case of insular species radiation, an essential tool for unraveling the factors that have promoted this amazing diversification. Species radiations in oceanic archipelagoes are excellent models for the study of speciation processes.  相似文献   

20.
H. A. Orr 《Genetics》1995,139(4):1805-1813
Speciation often results from the accumulation of ``complementary genes,' i.e., from genes that, while having no deleterious effect within species, cause inviability or sterility when brought together with genes from another species. Here I model speciation as the accumulation of genic incompatibilities between diverging populations. Several results are obtained. First, and most important, the number of genic incompatibilities between taxa increases much faster than linearly with time. In particular, the probability of speciation increases at least as fast as the square of the time since separation between two taxa. Second, as Muller realized, all hybrid incompatibilities must initially be asymmetric. Third, at loci that have diverged between taxa, evolutionarily derived alleles cause hybrid problems far more often than ancestral alleles. Last, it is ``easier' to evolve complex hybrid incompatibilities requiring the simultaneous action of three or more loci than to evolve simple incompatibilities between pairs of genes. These results have several important implications for genetic analyses of speciation.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号