首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
A whole‐genome duplication (WGD) doubles the entire genomic content of a species and is thought to have catalysed adaptive radiation in some polyploid‐origin lineages. However, little is known about general consequences of a WGD because gene duplicates (i.e., paralogs) are commonly filtered in genomic studies; such filtering may remove substantial portions of the genome in data sets from polyploid‐origin species. We demonstrate a new method that enables genome‐wide scans for signatures of selection at both nonduplicated and duplicated loci by taking locus‐specific copy number into account. We apply this method to RAD sequence data from different ecotypes of a polyploid‐origin salmonid (Oncorhynchus nerka) and reveal signatures of divergent selection that would have been missed if duplicated loci were filtered. We also find conserved signatures of elevated divergence at pairs of homeologous chromosomes with residual tetrasomic inheritance, suggesting that joint evolution of some nondiverged gene duplicates may affect the adaptive potential of these genes. These findings illustrate that including duplicated loci in genomic analyses enables novel insights into the evolutionary consequences of WGDs and local segmental gene duplications.  相似文献   

2.
Ubiquitin (Ub) and Ub‐like proteins, collectively forming the ubiquiton family, regulate nearly all aspects of cellular processes via post‐translational modifications. Studies devoted to specific members suggested a large expansion of this family in plants; however, a lack of systematic analysis hinders the comparison of individual members at both evolutionary history and functional divergence levels, which may provide new insight into biological functions. In this work, we first retrieved a total of 5856 members of 17 known ubiquiton subfamilies in 50 plant genomes by searching both prior annotations and missing loci in each genome. We then applied this list to analyze the duplication history of major ubiquiton subfamilies in plants. We show that autophagy‐related protein 8 (ATG8), membrane‐anchored Ub‐fold (MUB), small Ub‐like modifier (SUMO) and Ub loci encode 88% of the plant ubiquiton family. Although whole genome duplications (WGDs) significantly expanded the family, we discovered contrasting duplication patterns both in species and in subfamilies. Within the family, the ATG8 and MUB members were primarily duplicated through WGDs, whereas a significant number of Ub and SUMO loci were generated through retroposition and tandem duplications, respectively. Although Ub coding regions are highly conserved in plants, promoter activity analysis demonstrated lineage‐specific expression patterns of polyUb genes in Oryza sativa (rice) and Arabidopsis, confirming their retroposition origin. Based on the theory of dosage balance constraints, our study suggests that ubiquiton members duplicated through WGDs play crucial roles in plants, and that the regulatory pathways involving ATG8 and MUB are more conserved than those controlled by Ub and SUMO.  相似文献   

3.
Whole‐genome duplications are major evolutionary events with a lasting impact on genome structure. Duplication events complicate genetic analyses as paralogous sequences are difficult to distinguish; consequently, paralogs are often excluded from studies. The effects of an ancient whole‐genome duplication (approximately 88 MYA) are still evident in salmonids through the persistence of numerous paralogous gene sequences and partial tetrasomic inheritance. We use restriction site‐associated DNA sequencing on 10 collections of chum salmon from the Salish Sea in the USA and Canada to investigate genetic diversity and population structure in both tetrasomic and rediploidized regions of the genome. We use a pedigree and high‐density linkage map to identify paralogous loci and to investigate genetic variation across the genome. By applying multivariate statistical methods, we show that it is possible to characterize paralogous loci and that they display similar patterns of population structure as the diploidized portion of the genome. We find genetic associations with the adaptively important trait of run‐timing in both sets of loci. By including paralogous loci in genome scans, we can observe evolutionary signals in genomic regions that have routinely been excluded from population genetic studies in other polyploid‐derived species.  相似文献   

4.
Many eukaryotic genomes contain a large fraction of gene duplicates (or paralogs) as a result of ancient or recent whole‐genome duplications (Ohno 1970 ; Jaillon et al. 2004 ; Kellis et al. 2004 ). Identifying paralogs with NGS data is a pervasive problem in both ancient polyploids and neopolyploids. Likewise, paralogs are often treated as a nuisance that has to be detected and removed (Everett et al. 2012 ). In this issue of Molecular Ecology Resources, Waples et al. ( 2015 ) show that exclusion might not be necessary and how we may miss out on important genomic information in doing so. They present a novel statistical approach to detect paralogs based on the segregation of RAD loci in haploid offspring and test their method by constructing linkage maps with and without these duplicated loci in chum salmon, Oncorhynchus keta (Fig.  1 ). Their linkage map including the resolved paralogs shows that these are mostly located in the distal regions of several linkage groups. Particularly intriguing is their finding that these homoeologous regions appear impoverished in transposable elements (TE). Given the role that TE play in genome remodelling, it is noteworthy that these elements are of low abundance in regions showing residual tetrasomic inheritance. This raises the question whether re‐diploidization is constrained in these regions and whether they might have a role to play in salmonid speciation. This study provides an original approach to identifying duplicated loci in species with a pedigree, as well as providing a dense linkage map for chum salmon, and interesting insights into the retention of gene duplicates in an ancient polyploid.  相似文献   

5.
Whole‐genome duplications have occurred in the recent ancestors of many plants, fish, and amphibians, resulting in a pervasiveness of paralogous loci and the potential for both disomic and tetrasomic inheritance in the same genome. Paralogs can be difficult to reliably genotype and are often excluded from genotyping‐by‐sequencing (GBS) analyses; however, removal requires paralogs to be identified which is difficult without a reference genome. We present a method for identifying paralogs in natural populations by combining two properties of duplicated loci: (i) the expected frequency of heterozygotes exceeds that for singleton loci, and (ii) within heterozygotes, observed read ratios for each allele in GBS data will deviate from the 1:1 expected for singleton (diploid) loci. These deviations are often not apparent within individuals, particularly when sequence coverage is low; but, we postulated that summing allele reads for each locus over all heterozygous individuals in a population would provide sufficient power to detect deviations at those loci. We identified paralogous loci in three species: Chinook salmon (Oncorhynchus tshawytscha) which retains regions with ongoing residual tetrasomy on eight chromosome arms following a recent whole‐genome duplication, mountain barberry (Berberis alpina) which has a large proportion of paralogs that arose through an unknown mechanism, and dusky parrotfish (Scarus niger) which has largely rediploidized following an ancient whole‐genome duplication. Importantly, this approach only requires the genotype and allele‐specific read counts for each individual, information which is readily obtained from most GBS analysis pipelines.  相似文献   

6.
Pervasive hybridization and whole-genome duplications (WGDs) influenced genome evolution in several eukaryotic lineages. Although frequent and recurrent hybridizations may result in reticulate phylogenies, the evolutionary events underlying these reticulations, including detailed structure of the ancestral diploid and polyploid genomes, were only rarely reconstructed. Here, we elucidate the complex genomic history of a monophyletic clade from the mustard family (Brassicaceae), showing contentious relationships to the early-diverging clades of this model plant family. Genome evolution in the crucifer tribe Biscutelleae (∼60 species, 5 genera) was dominated by pervasive hybridizations and subsequent genome duplications. Diversification of an ancestral diploid genome into several divergent but crossable genomes was followed by hybridizations between these genomes. Whereas a single genus (Megadenia) remained diploid, the four remaining genera originated by allopolyploidy (Biscutella, Lunaria, Ricotia) or autopolyploidy (Heldreichia). The contentious relationships among the Biscutelleae genera, and between the tribe and other early diverged crucifer lineages, are best explained by close genomic relatedness among the recurrently hybridizing ancestral genomes. By using complementary cytogenomics and phylogenomics approaches, we demonstrate that the origin of a monophyletic plant clade can be more complex than a parsimonious assumption of a single WGD spurring postpolyploid cladogenesis. Instead, recurrent hybridization among the same and/or closely related parental genomes may phylogenetically interlink diploid and polyploid genomes despite the incidence of multiple independent WGDs. Our results provide new insights into evolution of early-diverging Brassicaceae lineages and elucidate challenges in resolving the contentious relationships within and between land plant lineages with pervasive hybridization and WGDs.  相似文献   

7.
Gene duplication is a major mechanism to create new genes. After gene duplication, some duplicated genes undergo functionalization, whereas others largely maintain redundant functions. Duplicated genes comprise various degrees of functional diversification in plants. However, the evolutionary fate of high and low diversified duplicates is unclear at genomic scale. To infer high and low diversified duplicates in Arabidopsis thaliana genome, we generated a prediction method for predicting whether a pair of duplicate genes was subjected to high or low diversification based on the phenotypes of knock-out mutants. Among 4,017 pairs of recently duplicated A. thaliana genes, 1,052 and 600 are high and low diversified duplicate pairs, respectively. The predictions were validated based on the phenotypes of generated knock-down transgenic plants. We determined that the high diversified duplicates resulting from tandem duplications tend to have lineage-specific functions, whereas the low diversified duplicates produced by whole-genome duplications are related to essential signaling pathways. To assess the evolutionary impact of high and low diversified duplicates in closely related species, we compared the retention rates and selection pressures on the orthologs of A. thaliana duplicates in two closely related species. Interestingly, high diversified duplicates resulting from tandem duplications tend to be retained in multiple lineages under positive selection. Low diversified duplicates by whole-genome duplications tend to be retained in multiple lineages under purifying selection. Taken together, the functional diversities determined by different duplication mechanisms had distinct effects on plant evolution.  相似文献   

8.
Genome sequencing has demonstrated that besides frequent small-scale duplications, large-scale duplication events such as whole genome duplications (WGDs) are found on many branches of the evolutionary tree of life. Especially in the plant lineage, there is evidence for recurrent WGDs, and the ancestor of all angiosperms was in fact most likely a polyploid species. The number of WGDs found in sequenced plant genomes allows us to investigate questions about the roles of WGDs that were hitherto impossible to address. An intriguing observation is that many plant WGDs seem associated with periods of increased environmental stress and/or fluctuations, a trend that is evident for both present-day polyploids and palaeopolyploids formed around the Cretaceous–Palaeogene (K–Pg) extinction at 66 Ma. Here, we revisit the WGDs in plants that mark the K–Pg boundary, and discuss some specific examples of biological innovations and/or diversifications that may be linked to these WGDs. We review evidence for the processes that could have contributed to increased polyploid establishment at the K–Pg boundary, and discuss the implications on subsequent plant evolution in the Cenozoic.  相似文献   

9.
Whole‐genome duplications have occurred in the recent ancestors of many plants, fish and amphibians. Signals of these whole‐genome duplications still exist in the form of paralogous loci. Recent advances have allowed reliable identification of paralogs in genotyping‐by‐sequencing (GBS) data such as that generated from restriction‐site‐associated DNA sequencing (RADSeq); however, excluding paralogs from analyses is still routine due to difficulties in genotyping. This exclusion of paralogs may filter a large fraction of loci, including loci that may be adaptively important or informative for population genetic analyses. We present a maximum‐likelihood method for inferring allele dosage in paralogs and assess its accuracy using simulated GBS, empirical RADSeq and amplicon sequencing data from Chinook salmon. We accurately infer allele dosage for some paralogs from a RADSeq data set and show how accuracy is dependent upon both read depth and allele frequency. The amplicon sequencing data set, using RADSeq‐derived markers, achieved sufficient depth to infer allele dosage for all paralogs. This study demonstrates that RADSeq locus discovery combined with amplicon sequencing of targeted loci is an effective method for incorporating paralogs into population genetic analyses.  相似文献   

10.
Ferns, the second largest group of vascular plants, originated ~400 mil ion years ago(Mya). They became dominant in the ancient Earth landscape before the angiosperms and are stil important in current ecosystems.Many ferns have exceptional y high chromosome numbers,possibly resulting from whole-genome duplications(WGDs).However, WGDs have not been investigated molecularly across fern diversity. Here we detected and dated fern WGDs using a phylogenomic approach and by calculating synonymous substitution rates(Ks). We also investigated a possible correlation between proposed WGDs and shifts in species diversification rates. We identified 19 WGDs: three ancient events along the fern phylogenetic backbone that are shared by 66%–97% of extant ferns, with additional lineage-specific WGDs for eight orders, providing strongevidence for recurring genome duplications across fern evolutionary history. We also observed similar Ks peak values for more than half of these WGDs, with multiple WGDs occurring close to the Cretaceous(~145–66 Mya). Despite the repeated WGD events, the biodiversity of ferns declined during the Cretaceous, implying that other factors probably contributed to the floristic turnover from ferns to angiosperms. This study provides molecular evidence for recurring WGDs in ferns and offers important clues to the genomic evolutionary history of ferns.  相似文献   

11.
Using a data set of protein translations associated with map positions in the human genome, we identified 1520 mapped highly conserved gene families. By comparing sharing of families between genomic windows, we identified 92 potentially duplicated blocks in the human genome containing 422 duplicated members of these families. Using branching order in the phylogenetic trees, we timed gene duplication events in these families relative to the primate-rodent divergence, the amniote-amphibian divergence, and the deuterostome-protostome divergence. The results showed similar patterns of gene duplication times within duplicated blocks and outside duplicated blocks. Both within and outside duplicated blocks, numerous duplications were timed prior to the deuterostome-protostome divergence, whereas others occurred after the amniote-amphibian divergence. Thus, neither gene duplication in general nor duplication of genomic blocks could be attributed entirely to polyploidization early in vertebrate history. The strongest signal in the data was a tendency for intrachromosomal duplications to be more recent than interchromosomal duplications, consistent with a model whereby tandem duplication-whether of single genes or of genomic blocks-may be followed by eventual separation of duplicates due to chromosomal rearrangements. The rate of separation of tandemly duplicated gene pairs onto separated chromosomes in the human lineage was estimated at 1.7 x 10(-9) per gene-pair per year.  相似文献   

12.
Estimating the evolutionary potential of quantitative traits and reliably predicting responses to selection in wild populations are important challenges in evolutionary biology. The genomic revolution has opened up opportunities for measuring relatedness among individuals with precision, enabling pedigree‐free estimation of trait heritabilities in wild populations. However, until now, most quantitative genetic studies based on a genomic relatedness matrix (GRM) have focused on long‐term monitored populations for which traditional pedigrees were also available, and have often had access to knowledge of genome sequence and variability. Here, we investigated the potential of RAD‐sequencing for estimating heritability in a free‐ranging roe deer (Capreolous capreolus) population for which no prior genomic resources were available. We propose a step‐by‐step analytical framework to optimize the quality and quantity of the genomic data and explore the impact of the single nucleotide polymorphism (SNP) calling and filtering processes on the GRM structure and GRM‐based heritability estimates. As expected, our results show that sequence coverage strongly affects the number of recovered loci, the genotyping error rate and the amount of missing data. Ultimately, this had little effect on heritability estimates and their standard errors, provided that the GRM was built from a minimum number of loci (above 7,000). Genomic relatedness matrix‐based heritability estimates thus appear robust to a moderate level of genotyping errors in the SNP data set. We also showed that quality filters, such as the removal of low‐frequency variants, affect the relatedness structure of the GRM, generating lower h2 estimates. Our work illustrates the huge potential of RAD‐sequencing for estimating GRM‐based heritability in virtually any natural population.  相似文献   

13.
Gene and genome duplications are the primary source of new genes and novel functions and have played a pivotal role in the evolution of genomic and organismal complexity. The spontaneous rate of gene duplication is a critical parameter for understanding the evolutionary dynamics of gene duplicates; yet few direct empirical estimates exist and differ widely. The presence of a large population of recently derived gene duplicates in sequenced genomes suggests a high rate of spontaneous origin, also evidenced by population genomic studies reporting rampant copy-number polymorphism at the intraspecific level. An analysis of long-term mutation accumulation lines of Caenorhabditis elegans for gene copy-number changes with array comparative genomic hybridization yields the first direct estimate of the genome-wide rate of gene duplication in a multicellular eukaryote. The gene duplication rate in C. elegans is quite high, on the order of 10(-7) duplications/gene/generation. This rate is two orders of magnitude greater than the spontaneous rate of point mutation per nucleotide site in this species and also greatly exceeds an earlier estimate derived from the frequency distribution of extant gene duplicates in the sequenced C. elegans genome.  相似文献   

14.
15.
High‐throughput sequencing has revolutionized population and conservation genetics. RAD sequencing methods, such as 2b‐RAD, can be used on species lacking a reference genome. However, transferring protocols across taxa can potentially lead to poor results. We tested two different IIB enzymes (AlfI and CspCI) on two species with different genome sizes (the loggerhead turtle Caretta caretta and the sharpsnout seabream Diplodus puntazzo) to build a set of guidelines to improve 2b‐RAD protocols on non‐model organisms while optimising costs. Good results were obtained even with degraded samples, showing the value of 2b‐RAD in studies with poor DNA quality. However, library quality was found to be a critical parameter on the number of reads and loci obtained for genotyping. Resampling analyses with different number of reads per individual showed a trade‐off between number of loci and number of reads per sample. The resulting accumulation curves can be used as a tool to calculate the number of sequences per individual needed to reach a mean depth ≥20 reads to acquire good genotyping results. Finally, we demonstrated that selective‐base ligation does not affect genomic differentiation between individuals, indicating that this technique can be used in species with large genome sizes to adjust the number of loci to the study scope, to reduce sequencing costs and to maintain suitable sequencing depth for a reliable genotyping without compromising the results. Here, we provide a set of guidelines to improve 2b‐RAD protocols on non‐model organisms with different genome sizes, helping decision‐making for a reliable and cost‐effective genotyping.  相似文献   

16.
Population genetic studies in nonmodel organisms are often hampered by a lack of reference genomes that are essential for whole‐genome resequencing. In the light of this, genotyping methods have been developed to effectively eliminate the need for a reference genome, such as genotyping by sequencing or restriction site‐associated DNA sequencing (RAD‐seq). However, what remains relatively poorly studied is how accurately these methods capture both average and variation in genetic diversity across an organism's genome. In this issue of Molecular Ecology Resources, Dutoit et al. (2016) use whole‐genome resequencing data from the collard flycatcher to assess what factors drive heterogeneity in nucleotide diversity across the genome. Using these data, they then simulate how well different sequencing designs, including RAD sequencing, could capture most of the variation in genetic diversity. They conclude that for evolutionary and conservation‐related studies focused on the estimating genomic diversity, researchers should emphasize the number of loci analysed over the number of individuals sequenced.  相似文献   

17.
Zheng D 《Genome biology》2008,9(7):R105-13

Background

Sequencing and annotation of several mammalian genomes have revealed that segmental duplications are a common architectural feature of primate genomes; in fact, about 5% of the human genome is composed of large blocks of interspersed segmental duplications. These segmental duplications have been implicated in genomic copy-number variation, gene novelty, and various genomic disorders. However, the molecular processes involved in the evolution and regulation of duplicated sequences remain largely unexplored.

Results

In this study, the profile of about 20 histone modifications within human segmental duplications was characterized using high-resolution, genome-wide data derived from a ChIP-Seq study. The analysis demonstrates that derivative loci of segmental duplications often differ significantly from the original with respect to many histone methylations. Further investigation showed that genes are present three times more frequently in the original than in the derivative, whereas pseudogenes exhibit the opposite trend. These asymmetries tend to increase with the age of segmental duplications. The uneven distribution of genes and pseudogenes does not, however, fully account for the asymmetry in the profile of histone modifications.

Conclusion

The first systematic analysis of histone modifications between segmental duplications demonstrates that two seemingly 'identical' genomic copies are distinct in their epigenomic properties. Results here suggest that local chromatin environments may be implicated in the discrimination of derived copies of segmental duplications from their originals, leading to a biased pseudogenization of the new duplicates. The data also indicate that further exploration of the interactions between histone modification and sequence degeneration is necessary in order to understand the divergence of duplicated sequences.  相似文献   

18.
The study of the evolutionary origin of vertebrates has been linked to the study of genome duplications since Susumo Ohno suggested that the successful diversification of vertebrate innovations was facilitated by two rounds of whole-genome duplication (2R-WGD) in the stem vertebrate. Since then, studies on the functional evolution of many genes duplicated in the vertebrate lineage have provided the grounds to support experimentally this link. This article reviews cases of gene duplications derived either from the 2R-WGD or from local gene duplication events in vertebrates, analyzing their impact on the evolution of developmental innovations. We analyze how gene regulatory networks can be rewired by the activity of transposable elements after genome duplications, discuss how different mechanisms of duplication might affect the fate of duplicated genes, and how the loss of gene duplicates might influence the fate of surviving paralogs. We also discuss the evolutionary relationships between gene duplication and alternative splicing, in particular in the vertebrate lineage. Finally, we discuss the role that the 2R-WGD might have played in the evolution of vertebrate developmental gene networks, paying special attention to those related to vertebrate key features such as neural crest cells, placodes, and the complex tripartite brain. In this context, we argue that current evidences points that the 2R-WGD may not be linked to the origin of vertebrate innovations, but to their subsequent diversification in a broad variety of complex structures and functions that facilitated the successful transition from peaceful filter-feeding non-vertebrate ancestors to voracious vertebrate predators.  相似文献   

19.
20.
Zhang Z  Kishino H 《Genetics》2004,166(4):1995-1999
Gene duplication with subsequent divergence plays a central role in the acquisition of genes with novel function and complexity during the course of evolution. With reduced functional constraints or through positive selection, these duplicated genes may experience accelerated evolution. Under the model of subfunctionalization, loss of subfunctions leads to complementary acceleration at sites with two copies, and the difference in average rate between the sequences may not be obvious. On the other hand, the classical model of neofunctionalization predicts that the evolutionary rate in one of the two duplicates is accelerated. However, the classical model does not tell which of the duplicates experiences the acceleration in evolutionary rate. Here, we present evidence from the Saccharomyces cerevisiae genome that a duplicate located in a genomic region with a low-recombination rate is likely to evolve faster than a duplicate in an area of high recombination. This observation is consistent with population genetics theory that predicts that purifying selection is less effective in genomic regions of low recombination (Hill-Robertson effect). Together with previous studies, our results suggest the genomic background (e.g., local recombination rate) as a potential force to drive the divergence between nontandemly duplicated genes. This implies the importance of structure and complexity of genomes in the diversification of organisms via gene duplications.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号