首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 234 毫秒
1.
Molecular dating of phylogenetic trees is a growing discipline using sequence data to co‐estimate the timing of evolutionary events and rates of molecular evolution. All molecular‐dating methods require converting genetic divergence between sequences into absolute time. Historically, this could only be achieved by associating externally derived dates obtained from fossil or biogeographical evidence to internal nodes of the tree. In some cases, notably for fast‐evolving genomes such as viruses and some bacteria, the time span over which samples were collected may cover a significant proportion of the time since they last shared a common ancestor. This situation allows phylogenetic trees to be calibrated by associating sampling dates directly to the sequences representing the tips (terminal nodes) of the tree. The increasing availability of genomic data from ancient DNA extends the applicability of such tip‐based calibration to a variety of taxa including humans, extinct megafauna and various microorganisms which typically have a scarce fossil record. The development of statistical models accounting for heterogeneity in different aspects of the evolutionary process while accommodating very large data sets (e.g. whole genomes) has allowed using tip‐dating methods to reach inferences on divergence times, substitution rates, past demography or the age of specific mutations on a variety of spatiotemporal scales. In this review, we summarize the current state of the art of tip dating, discuss some recent applications, highlight common pitfalls and provide a ‘how to’ guide to thoroughly perform such analyses.  相似文献   

2.
We present a model for genome evolution, comprising biologically plausible events such as transpositions inside the genome and insertions of exogenous sequences. This model attempts to formulate a minimal proposition accounting for key statistical properties of genomes, avoiding, as far as possible, unsupportable hypotheses for the remote evolutionary past. The statistical properties that are observed in genomic sequences and are reproduced by the proposed model are: (i) deviations from randomness at different length scales, measured by suitable algorithms, (ii) a special form of size distribution (power law distribution) characterising different levels of genome organisation in the non-coding, and (iii) extensive resemblance in the alternation of coding and non-coding regions at several length scales (self-similarity) in long genomic sequences of higher eukaryotes.  相似文献   

3.
Microhomology-mediated break-induced replication (MMBIR) is a DNA repair pathway initiated by polymerase template switching at microhomology, which can produce templated insertions that initiate chromosomal rearrangements leading to neurological and metabolic diseases, and promote complex genomic rearrangements (CGRs) found in cancer. Yet, how often templated insertions accumulate from processes like MMBIR in genomes is poorly understood due to difficulty in directly identifying these events by whole genome sequencing (WGS). Here, by using our newly developed MMBSearch software, we directly detect such templated insertions (MMB-TIs) in human genomes and report substantial differences in frequency and complexity of MMB-TI events between normal and cancer cells. Through analysis of 71 cancer genomes from The Cancer Genome Atlas (TCGA), we observed that MMB-TIs readily accumulate de novo across several cancer types, with particularly high accumulation in some breast and lung cancers. By contrast, MMB-TIs appear only as germline variants in normal human fibroblast cells, and do not accumulate as de novo somatic mutations. Finally, we performed WGS on a lung adenocarcinoma patient case and confirmed MMB-TI-initiated chromosome fusions that disrupted potential tumor suppressors and induced chromothripsis-like CGRs. Based on our findings we propose that MMB-TIs represent a trigger for widespread genomic instability and tumor evolution.  相似文献   

4.
Evolution operates on whole genomes through direct rearrangements of genes, such as inversions, transpositions, and inverted transpositions, as well as through operations, such as duplications, losses, and transfers, that also affect the gene content of the genomes. Because these events are rare relative to nucleotide substitutions, gene order data offer the possibility of resolving ancient branches in the tree of life; the combination of gene order data with sequence data also has the potential to provide more robust phylogenetic reconstructions, since each can elucidate evolution at different time scales. Distance corrections greatly improve the accuracy of phylogeny reconstructions from DNA sequences, enabling distance-based methods to approach the accuracy of the more elaborate methods based on parsimony or likelihood at a fraction of the computational cost. This paper focuses on developing distance correction methods for phylogeny reconstruction from whole genomes. The main question we investigate is how to estimate evolutionary histories from whole genomes with equal gene content, and we present a technique, the empirically derived estimator (EDE), that we have developed for this purpose. We study the use of EDE on whole genomes with identical gene content, and we explore the accuracy of phylogenies inferred using EDE with the neighbor joining and minimum evolution methods under a wide range of model conditions. Our study shows that tree reconstruction under these two methods is much more accurate when based on EDE distances than when based on other distances previously suggested for whole genomes. Electronic Supplementary Material Electronic Supplementary material is available for this article at and accessible for authorised users. [Reviewing Editor: Dr. Martin Kreitman]  相似文献   

5.
6.
MicroRNAs (miRNAs) are a class of small non-coding RNAs that can play important regulatory roles in many important biological processes. Although clustering patterns of miRNA clusters have been uncovered in animals, the origin and evolution of miRNA clusters in vertebrates are still poorly understood. Here, we performed comparative genomic analyses to construct 51 sets of orthologous miRNA clusters (SOMCs) across seven test vertebrate species, a collection of miRNA clusters from two or more species that are likely to have evolved from a common ancestral miRNA cluster, and used these to systematically examine the evolutionary characteristics and patterns of miRNA clusters in vertebrates. We found that miRNA clusters are continuously generated, and most of them tend to be conserved and maintained in vertebrate genomes, although some adaptive gains and losses of miRNA cluster have occurred during evolution. Furthermore, miRNA clusters appeared relatively early in the evolutionary history might suffer from more complicated adaptive gain-and-loss than those young miRNA clusters. Detailed analysis showed that genomic duplication events of ancestral miRNAs or miRNA clusters are likely to be major driving force and apparently contribute to origin and evolution of miRNA clusters. Comparison of conserved with lineage-specific miRNA clusters revealed that the contribution of duplication events for the formation of miRNA cluster appears to be more important for conserved miRNA clusters than lineage-specific. Our study provides novel insights for further exploring the origins and evolution of miRNA clusters in vertebrates at a genome scale.  相似文献   

7.
Accurate estimates of genome-wide rates and fitness effects of new mutations are essential for an improved understanding of molecular evolutionary processes. Although eukaryotic genomes generally contain a large noncoding fraction, functional noncoding regions and fitness effects of mutations in such regions are still incompletely characterized. A promising approach to characterize functional noncoding regions relies on identifying accessible chromatin regions (ACRs) tightly associated with regulatory DNA. Here, we applied this approach to identify and estimate selection on ACRs in Capsella grandiflora, a crucifer species ideal for population genomic quantification of selection due to its favorable population demography. We describe a population-wide ACR distribution based on ATAC-seq data for leaf samples of 16 individuals from a natural population. We use population genomic methods to estimate fitness effects and proportions of positively selected fixations (α) in ACRs and find that intergenic ACRs harbor a considerable fraction of weakly deleterious new mutations, as well as a significantly higher proportion of strongly deleterious mutations than comparable inaccessible intergenic regions. ACRs are enriched for expression quantitative trait loci (eQTL) and depleted of transposable element insertions, as expected if intergenic ACRs are under selection because they harbor regulatory regions. By integrating empirical identification of intergenic ACRs with analyses of eQTL and population genomic analyses of selection, we demonstrate that intergenic regulatory regions are an important source of nearly neutral mutations. These results improve our understanding of selection on noncoding regions and the role of nearly neutral mutations for evolutionary processes in outcrossing Brassicaceae species.  相似文献   

8.
P elements, a family of DNA transposons, are known as aggressive intruders into the hitherto uninfected gene pool of Drosophila melanogaster. Invading through horizontal transmission from an external source they managed to spread rapidly through natural populations within a few decades. Owing to their propensity for rapid propagation within genomes as well as within populations, they are considered as the classic example of selfish DNA, causing havoc in a genomic environment permissive for transpositional activity. Tracing the fate of P transposons on an evolutionary scale we describe different stages in their evolutionary life history. Starting from horizontal transfer events, which now appear to be rather a common phenomenon, the initial transpositional burst in the new host is slowed down by the accumulation of defective copies as well as host-directed epigenetic silencing. This leads to the loss of mobility and, finally, to molecular erosion by random mutations. Possible escape routes from genomic extinction are the reactivation within the original host genome by recombination or suspension of the repressing regime, horizontal emigration to a virgin gene pool, or genomic integration and acquisition of a novel function as a domesticated host gene.  相似文献   

9.
Polymerase chain reaction (PCR)-based genome walking techniques are commonly used to clone unknown genomic regions flanking known sequences. However, these methods are typically problematic when applied to highly complex DNA templates isolated from plants with large genomes. Here we describe a reliable and efficient genome walking method that is particularly effective for plants with large genomes. Our ligation-mediated PCR method, Straight Walk, has improved sensitivity and specificity due to optimization of sequences of adaptors and adaptor primers. Successful genome walking in lily, which has one of the largest genomes in plants, indicates that Straight Walk is applicable for most plant species.  相似文献   

10.
Forward Genomics – a comparative genomics approach to link phenotype to genotype Despite availability of several sequenced genomes, we know very little about the specific changes in the DNA that underlie phenotypic differences between species. The main reason is that species differ by both numerous genomic and phenotypic changes. A new comparative genomics method addresses this question by for phenotypes with independent evolutionary losses by searching for genomic regions that exhibit an elevated number of mutations in exactly these phenotype‐loss species. The near future sequencing of thousands of novel genomes will make it possible to use comparative genomics to systematically search for such DNA changes that are associated with phenotypic differences.  相似文献   

11.
12.
13.
Eukaryotic transposable elements and genome evolution   总被引:54,自引:0,他引:54  
The changes in DNA sequence that have taken place during the evolution of eukaryotic genomes cannot be accounted for simply by base substitutions; some more complex mutations must have taken place as well. Transposable elements can affect gene structure and expression in several ways that suggest that they may have contributed to these evolutionary events.  相似文献   

14.
Comparative analysis is one of the most powerful methods available for understanding the diverse and complex systems found in biology, but it is often limited by a lack of comprehensive taxonomic sampling. Despite the recent development of powerful genome technologies capable of producing sequence data in large quantities (witness the recently completed first draft of the human genome), there has been relatively little change in how evolutionary studies are conducted. The application of genomic methods to evolutionary biology is a challenge, in part because gene segments from different organisms are manipulated separately, requiring individual purification, cloning, and sequencing. We suggest that a feasible approach to collecting genome-scale data sets for evolutionary biology (i.e., evolutionary genomics) may consist of combination of DNA samples prior to cloning and sequencing, followed by computational reconstruction of the original sequences. This approach will allow the full benefit of automated protocols developed by genome projects to be realized; taxon sampling levels can easily increase to thousands for targeted genomes and genomic regions. Sequence diversity at this level will dramatically improve the quality and accuracy of phylogenetic inference, as well as the accuracy and resolution of comparative evolutionary studies. In particular, it will be possible to make accurate estimates of normal evolution in the context of constant structural and functional constraints (i.e., site-specific substitution probabilities), along with accurate estimates of changes in evolutionary patterns, including pairwise coevolution between sites, adaptive bursts, and changes in selective constraints. These estimates can then be used to understand and predict the effects of protein structure and function on sequence evolution and to predict unknown details of protein structure, function, and functional divergence. In order to demonstrate the practicality of these ideas and the potential benefit for functional genomic analysis, we describe a pilot project we are conducting to simultaneously sequence large numbers of vertebrate mitochondrial genomes.  相似文献   

15.
16.
Horizontal gene transfer in prokaryotes is rampant on short and intermediate evolutionary time scales. It poses a fundamental problem to our ability to reconstruct the evolutionary tree of life. Is it also frequent over long evolutionary distances? To address this question, we analyzed the evolution of 2,091 insertion sequences from all 20 major families in 438 completely sequenced prokaryotic genomes. Specifically, we mapped insertion sequence occurrence on a 16S rDNA tree of the genomes we analyzed, and we also constructed phylogenetic trees of the insertion sequence transposase coding sequences. We found only 30 cases of likely horizontal transfer among distantly related prokaryotic clades. Most of these horizontal transfer events are ancient. Only seven events are recent. Almost all of these transfer events occur between pairs of human pathogens or commensals. If true also for other, non-mobile DNA, the rarity of distant horizontal transfer increases the odds of reliable phylogenetic inference from sequence data.  相似文献   

17.
Comparative genome sequence analysis is powerful, but sequencing genomes is expensive. It is desirable to be able to predict how many genomes are needed for comparative genomics, and at what evolutionary distances. Here I describe a simple mathematical model for the common problem of identifying conserved sequences. The model leads to some useful rules of thumb. For a given evolutionary distance, the number of comparative genomes needed for a constant level of statistical stringency in identifying conserved regions scales inversely with the size of the conserved feature to be detected. At short evolutionary distances, the number of comparative genomes required also scales inversely with distance. These scaling behaviors provide some intuition for future comparative genome sequencing needs, such as the proposed use of “phylogenetic shadowing” methods using closely related comparative genomes, and the feasibility of high-resolution detection of small conserved features.  相似文献   

18.
Rates of evolutionary change in viruses: patterns and determinants   总被引:1,自引:0,他引:1  
Understanding the factors that determine the rate at which genomes generate and fix mutations provides important insights into key evolutionary mechanisms. We review our current knowledge of the rates of mutation and substitution, as well as their determinants, in RNA viruses, DNA viruses and retroviruses. We show that the high rate of nucleotide substitution in RNA viruses is matched by some DNA viruses, suggesting that evolutionary rates in viruses are explained by diverse aspects of viral biology, such as genomic architecture and replication speed, and not simply by polymerase fidelity.  相似文献   

19.
Microcystis aeruginosa is one of the most common bloom-forming cyanobacteria in freshwater ecosystems worldwide. This species produces numerous secondary metabolites, including microcystins, which are harmful to human health. We sequenced the genomes of ten strains of M. aeruginosa in order to explore the genomic basis of their ability to occupy varied environments and proliferate. Our findings show that M. aeruginosa genomes are characterized by having a large open pangenome, and that each genome contains similar proportions of core and flexible genes. By comparing the GC content of each gene to the mean value of the whole genome, we estimated that in each genome, around 11% of the genes seem to result from recent horizontal gene transfer events. Moreover, several large gene clusters resulting from HGT (up to 19 kb) have been found, illustrating the ability of this species to integrate such large DNA molecules. It appeared also that all M. aeruginosa displays a large genomic plasticity, which is characterized by a high proportion of repeat sequences and by low synteny values between the strains. Finally, we identified 13 secondary metabolite gene clusters, including three new putative clusters. When comparing the genomes of Microcystis and Prochlorococcus, one of the dominant picocyanobacteria living in marine ecosystems, our findings show that they are characterized by having almost opposite evolutionary strategies, both of which have led to ecological success in their respective environments.  相似文献   

20.
Comparing chromosomal gene order in two or more related species is an important approach to studying the forces that guide genome organization and evolution. Linked clusters of similar genes found in related genomes are often used to support arguments of evolutionary relatedness or functional selection. However, as the gene order and the gene complement of sister genomes diverge progressively due to large scale rearrangements, horizontal gene transfer, gene duplication and gene loss, it becomes increasingly difficult to determine whether observed similarities in local genomic structure are indeed remnants of common ancestral gene order, or are merely coincidences. A rigorous comparative genomics requires principled methods for distinguishing chance commonalities, within or between genomes, from genuine historical or functional relationships. In this paper, we construct tests for significant groupings against null hypotheses of random gene order, taking incomplete clusters, multiple genomes, and gene families into account. We consider both the significance of individual clusters of prespecified genes and the overall degree of clustering in whole genomes.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号