首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Homoeologous exchanges (HEs) have been shown to generate novel gene combinations and phenotypes in a range of polyploid species. Gene presence/absence variation (PAV) is also a major contributor to genetic diversity. In this study, we show that there is an association between these two events, particularly in recent Brassica napus synthetic accessions, and that these represent a novel source of genetic diversity, which can be captured for the improvement of this important crop species. By assembling the pangenome of B. napus, we show that 38% of the genes display PAV behaviour, with some of these variable genes predicted to be involved in important agronomic traits including flowering time, disease resistance, acyl lipid metabolism and glucosinolate metabolism. This study is a first and provides a detailed characterization of the association between HEs and PAVs in B. napus at the pangenome level.  相似文献   

2.
The Tasmanian devil (Sarcophilus harrisii) is threatened with extinction due to the spread of devil facial tumour disease. Polymorphisms in immune genes can provide adaptive potential to resist diseases. Previous studies in diversity at immune loci in wild species have almost exclusively focused on genes of the major histocompatibility complex (MHC); however, these genes only account for a fraction of immune gene diversity. Devils lack diversity at functionally important immunity loci, including MHC and Toll‐like receptor genes. Whether there are polymorphisms at devil immune genes outside these two families is unknown. Here, we identify polymorphisms in a wide range of key immune genes, and develop assays to type single nucleotide polymorphisms (SNPs) within a subset of these genes. A total of 167 immune genes were examined, including cytokines, chemokines and natural killer cell receptors. Using genome‐level data from ten devils, SNPs within coding regions, introns and 10 kb flanking genes of interest were identified. We found low polymorphism across 167 immune genes examined bioinformatically using whole‐genome data. From this data, we developed long amplicon assays to target nine genes. These amplicons were sequenced in 29–220 devils and found to contain 78 SNPs, including eight SNPS within exons. Despite the extreme paucity of genetic diversity within these genes, signatures of balancing selection were exhibited by one chemokine gene, suggesting that remaining diversity may hold adaptive potential. The low functional diversity may leave devils highly vulnerable to infectious disease, and therefore, monitoring and preserving remaining diversity will be critical for the long‐term management of this species. Examining genetic variation in diverse immune genes should be a priority for threatened wildlife species. This study can act as a model for broad‐scale immunogenetic diversity analysis in threatened species.  相似文献   

3.
参考基因组是现代功能基因组学的核心框架,以此为基础的现代基因组学技术在过去20年对植物遗传变异发掘、功能基因克隆等研究起了巨大的推动作用.然而,越来越多的研究发现,单一或少数参考基因组不能完整代表和呈现物种或特定群体内的所有基因组变异,因此其在功能基因组学研究中应用存在很大的局限性,甚至会导致错误的结果.泛基因组是指物...  相似文献   

4.
Brassica oleracea is an important agricultural species encompassing many vegetable crops including cabbage, cauliflower, broccoli and kale; however, it can be susceptible to a variety of fungal diseases such as clubroot, blackleg, leaf spot and downy mildew. Resistance to these diseases is meditated by specific disease resistance genes analogs (RGAs) which are differently distributed across B. oleracea lines. The sequenced reference cultivar does not contain all B. oleracea genes due to gene presence/absence variation between individuals, which makes it necessary to search for RGA candidates in the B. oleracea pangenome. Here we present a comparative analysis of RGA candidates in the pangenome of B. oleracea. We show that the presence of RGA candidates differs between lines and suggests that in B. oleracea, SNPs and presence/absence variation drive RGA diversity using separate mechanisms. We identified 59 RGA candidates linked to Sclerotinia, clubroot, and Fusarium wilt resistance QTL, and these findings have implications for crop breeding in B. oleracea, which may also be applicable in other crops species.  相似文献   

5.
Methods based on single nucleotide polymorphism (SNP), copy number variation (CNV) and presence/absence variation (PAV) discovery provide a valuable resource to study gene structure and evolution. However, as a result of these structural variations, a single reference genome is unable to cover the entire gene content of a species. Therefore, pangenomics analysis is needed to ensure that the genomic diversity within a species is fully represented. Brassica napus is one of the most important oilseed crops in the world and exhibits variability in its resistance genes across different cultivars. Here, we characterized resistance gene distribution across 50 B. napus lines. We identified a total of 1749 resistance gene analogs (RGAs), of which 996 are core and 753 are variable, 368 of which are not present in the reference genome (cv. Darmor‐bzh). In addition, a total of 15 318 SNPs were predicted within 1030 of the RGAs. The results showed that core R‐genes harbour more SNPs than variable genes. More nucleotide binding site‐leucine‐rich repeat (NBS‐LRR) genes were located in clusters than as singletons, with variable genes more likely to be found in clusters. We identified 106 RGA candidates linked to blackleg resistance quantitative trait locus (QTL). This study provides a better understanding of resistance genes to target for genomics‐based improvement and improved disease resistance.  相似文献   

6.
The tubercle complex consists of closely related mycobacterium species which appear to be variants of a single species. Comparative genome analysis of different strains could provide useful clues and insights into the genetic diversity of the species. We integrated genome assemblies of 96 strains from Mycobacterium tuberculosis complex (MTBC), which included 8 Indian clinical isolates sequenced and assembled in this study, to understand its pangenome architecture. We predicted genes for all the 96 strains and clustered their respective CDSs into homologous gene clusters (HGCs) to reveal a hard-core, soft-core and accessory genome component of MTBC. The hard-core (HGCs shared amongst 100% of the strains) was comprised of 2,066 gene clusters whereas the soft-core (HGCs shared amongst at least 95% of the strains) comprised of 3,374 gene clusters. The change in the core and accessory genome components when observed as a function of their size revealed that MTBC has an open pangenome. We identified 74 HGCs that were absent from reference strains H37Rv and H37Ra but were present in most of clinical isolates. We report PCR validation on 9 candidate genes depicting 7 genes completely absent from H37Rv and H37Ra whereas 2 genes shared partial homology with them accounting to probable insertion and deletion events. The pangenome approach is a promising tool for studying strain specific genetic differences occurring within species. We also suggest that since selecting appropriate target genes for typing purposes requires the expected target gene be present in all isolates being typed, therefore estimating the core-component of the species becomes a subject of prime importance.  相似文献   

7.
As an increasing number of plant genome sequences become available, it is clear that gene content varies between individuals, and the challenge arises to predict the gene content of a species. However, genome comparison is often confounded by variation in assembly and annotation. Differentiating between true gene absence and variation in assembly or annotation is essential for the accurate identification of conserved and variable genes in a species. Here, we present the de novo assembly of the B. napus cultivar Tapidor and comparison with an improved assembly of the Brassica napus cultivar Darmor‐bzh. Both cultivars were annotated using the same method to allow comparison of gene content. We identified genes unique to each cultivar and differentiate these from artefacts due to variation in the assembly and annotation. We demonstrate that using a common annotation pipeline can result in different gene predictions, even for closely related cultivars, and repeat regions which collapse during assembly impact whole genome comparison. After accounting for differences in assembly and annotation, we demonstrate that the genome of Darmor‐bzh contains a greater number of genes than the genome of Tapidor. Our results are the first step towards comparison of the true differences between B. napus genomes and highlight the potential sources of error in future production of a B. napus pangenome.  相似文献   

8.
Brassica rapa is grown worldwide as economically important vegetable and oilseed crop. However, its production is challenged by yield-limiting pathogens. The sustainable control of these pathogens mainly relies on the deployment of genetic resistance primarily driven by resistance gene analogues (RGAs). While several studies have identified RGAs in B. rapa, these were mainly based on a single genome reference and do not represent the full range of RGA diversity in B. rapa. In this study, we utilized the B. rapa pangenome, constructed from 71 lines encompassing 12 morphotypes, to describe a comprehensive repertoire of RGAs in B. rapa. We show that 309 RGAs were affected by presence-absence variation (PAV) and 223 RGAs were missing from the reference genome. The transmembrane leucine-rich repeat (TM-LRR) RGA class had more core gene types than variable genes, while the opposite was observed for nucleotide-binding site leucine-rich repeats (NLRs). Comparative analysis with the B. napus pangenome revealed significant RGA conservation (93%) between the two species. We identified 138 candidate RGAs located within known B. rapa disease resistance QTL, of which the majority were under negative selection. Using blackleg gene homologues, we demonstrated how these genes in B. napus were derived from B. rapa. This further clarifies the genetic relationship of these loci, which may be useful in narrowing-down candidate blackleg resistance genes. This study provides a novel genomic resource towards the identification of candidate genes for breeding disease resistance in B. rapa and its relatives.  相似文献   

9.
Recent pangenome studies have revealed a large fraction of the gene content within a species exhibits presence–absence variation (PAV). However, coding regions alone provide an incomplete assessment of functional genomic sequence variation at the species level. Little to no attention has been paid to noncoding regulatory regions in pangenome studies, though these sequences directly modulate gene expression and phenotype. To uncover regulatory genetic variation, we generated chromosome-scale genome assemblies for thirty Arabidopsis thaliana accessions from multiple distinct habitats and characterized species level variation in Conserved Noncoding Sequences (CNS). Our analyses uncovered not only PAV and positional variation (PosV) but that diversity in CNS is nonrandom, with variants shared across different accessions. Using evolutionary analyses and chromatin accessibility data, we provide further evidence supporting roles for conserved and variable CNS in gene regulation. Additionally, our data suggests that transposable elements contribute to CNS variation. Characterizing species-level diversity in all functional genomic sequences may later uncover previously unknown mechanistic links between genotype and phenotype.  相似文献   

10.
Maize is a diverse paleotetraploid species with considerable presence/absence variation and copy number variation. One mechanism through which presence/absence variation can arise is differential fractionation. Fractionation refers to the loss of duplicate gene pairs from one of the maize subgenomes during diploidization. Differential fractionation refers to non‐shared gene loss events between individuals following a whole‐genome duplication event. We investigated the prevalence of presence/absence variation resulting from differential fractionation in the syntenic portion of the genome using two whole‐genome de novo assemblies of the inbred lines B73 and PH207. Between these two genomes, syntenic genes were highly conserved with less than 1% of syntenic genes being subject to differential fractionation. The few variably fractionated syntenic genes that were identified are unlikely to contribute to functional phenotypic variation, as there is a significant depletion of these genes in annotated gene sets. In further comparisons of 60 diverse inbred lines, non‐syntenic genes were six times more likely to be variable than syntenic genes, suggesting that comparisons among additional genome assemblies are not likely to result in the discovery of large‐scale presence/absence variation among syntenic genes.  相似文献   

11.
12.
13.
At the genome level, microorganisms are highly adaptable both in terms of allele and gene composition. Such heritable traits emerge in response to different environmental niches and can have a profound influence on microbial community dynamics. As a consequence, any individual genome or population will contain merely a fraction of the total genetic diversity of any operationally defined “species”, whose ecological potential can thus be only fully understood by studying all of their genomes and the genes therein. This concept, known as the pangenome, is valuable for studying microbial ecology and evolution, as it partitions genomes into core (present in all the genomes from a species, and responsible for housekeeping and species-level niche adaptation among others) and accessory regions (present only in some, and responsible for intra-species differentiation). Here we present SuperPang, an algorithm producing pangenome assemblies from a set of input genomes of varying quality, including metagenome-assembled genomes (MAGs). SuperPang runs in linear time and its results are complete, non-redundant, preserve gene ordering and contain both coding and non-coding regions. Our approach provides a modular view of the pangenome, identifying operons and genomic islands, and allowing to track their prevalence in different populations. We illustrate this by analysing intra-species diversity in Polynucleobacter, a bacterial genus ubiquitous in freshwater ecosystems, characterized by their streamlined genomes and their ecological versatility. We show how SuperPang facilitates the simultaneous analysis of allelic and gene content variation under different environmental pressures, allowing us to study the drivers of microbial diversification at unprecedented resolution.  相似文献   

14.
15.
Populus euphratica is well adapted to extreme desert environments and is an important model species for elucidating the mechanisms of abiotic stress resistance in trees. The current assembly of P. euphratica genome is highly fragmented with many gaps and errors, thereby impeding downstream applications. Here, we report an improved chromosome‐level reference genome of P. euphratica (v2.0) using single‐molecule sequencing and chromosome conformation capture (Hi‐C) technologies. Relative to the previous reference genome, our assembly represents a nearly 60‐fold improvement in contiguity, with a scaffold N50 size of 28.59 Mb. Using this genome, we have found that extensive expansion of Gypsy elements in P. euphratica led to its rapid increase in genome size compared to any other Salicaceae species studied to date, and potentially contributed to adaptive divergence driven by insertions near genes involved in stress tolerance. We also detected a wide range of unique structural rearrangements in P. euphratica, including 2,549 translocations, 454 inversions, 121 tandem and 14 segmental duplications. Several key genes likely to be involved in tolerance to abiotic stress were identified within these regions. This high‐quality genome represents a valuable resource for poplar breeding and genetic improvement in the future, as well as comparative genomic analysis with other Salicaceae species.  相似文献   

16.
Increasing our understanding of how evolutionary processes drive the genomic landscape of variation is fundamental to a better understanding of the genomic consequences of speciation. However, genome‐wide patterns of within‐ and between‐ species variation have not been fully investigated in most forest tree species despite their global ecological and economic importance. Here, we use whole‐genome resequencing data from four Populus species spanning the speciation continuum to reconstruct their demographic histories and investigate patterns of diversity and divergence within and between species. Using Populus trichocarpa as an outgroup species, we further infer the genealogical relationships and estimate the extent of ancient introgression among the three aspen species (Populus tremula, Populus davidiana and Populus tremuloides) throughout the genome. Our results show substantial variation in these patterns along the genomes with this variation being strongly predicted by local recombination rates and the density of functional elements. This implies that the interaction between recurrent selection and intrinsic genomic features has dramatically sculpted the genomic landscape over long periods of time. In addition, our findings provide evidence that, apart from background selection, recent positive selection and long‐term balancing selection have also been crucial components in shaping patterns of genome‐wide variation during the speciation process.  相似文献   

17.
18.
Selection processes are believed to be an important evolutionary driver behind the successful establishment of nonindigenous species, for instance through adaptation for invasiveness (e.g. dispersal mechanisms and reproductive allocation). However, evidence supporting this assumption is still scarce. Genome scans have often identified loci with atypical patterns of genetic differentiation (i.e. outliers) indicative of selection processes. Using microsatellite‐ and AFLP‐based genome scans, we looked for evidence of selection following the introduction of the mollusc Crepidula fornicata. Native to the northwestern Atlantic, this gastropod has become an emblematic invader since its introduction during the 19th and 20th centuries in the northeastern Atlantic and northeastern Pacific. We examined 683 individuals from seven native and 15 introduced populations spanning the latitudinal introduction and native ranges of the species. Our results confirmed the previously documented high genetic diversity in native and introduced populations with little genetic structure between the two ranges, a pattern typical of marine invaders. Analysing 344 loci, no outliers were detected between the introduced and native populations or in the introduced range. The genomic sampling may have been insufficient to reveal selection especially if it acts on traits determined by a few genes. Eight outliers were, however, identified within the native range, underlining a genetic singularity congruent with a well‐known biogeographical break along the Florida. Our results call into question the relevance of AFLP genome scans in detecting adaptation on the timescale of biological invasions: genome scans often reveal long‐term adaptation involving numerous genes throughout the genome but seem less effective in detecting recent adaptation from pre‐existing variation on polygenic traits. This study advocates other methods to detect selection effects during biological invasions—for example on phenotypic traits, although genome scans may remain useful for elucidating introduction histories.  相似文献   

19.
Advanced resources for genome‐assisted research in barley (Hordeum vulgare) including a whole‐genome shotgun assembly and an integrated physical map have recently become available. These have made possible studies that aim to assess genetic diversity or to isolate single genes by whole‐genome resequencing and in silico variant detection. However such an approach remains expensive given the 5 Gb size of the barley genome. Targeted sequencing of the mRNA‐coding exome reduces barley genomic complexity more than 50‐fold, thus dramatically reducing this heavy sequencing and analysis load. We have developed and employed an in‐solution hybridization‐based sequence capture platform to selectively enrich for a 61.6 megabase coding sequence target that includes predicted genes from the genome assembly of the cultivar Morex as well as publicly available full‐length cDNAs and de novo assembled RNA‐Seq consensus sequence contigs. The platform provides a highly specific capture with substantial and reproducible enrichment of targeted exons, both for cultivated barley and related species. We show that this exome capture platform provides a clear path towards a broader and deeper understanding of the natural variation residing in the mRNA‐coding part of the barley genome and will thus constitute a valuable resource for applications such as mapping‐by‐sequencing and genetic diversity analyzes.  相似文献   

20.
Cryptosporidium is an apicomplexan protozoan that lives in most vertebrates, including humans. Its gp60 gene is functionally involved in its attachment to host cells, and its high level of genetic variation has made it the reference marker for sample typing in epidemiological studies. To understand the origin of such high diversity and to determine the extent to which this classification applies to the rest of the genome, we analysed the patterns of variation at gp60 and nine other nuclear loci in isolates of three Cryptosporidium species. Most loci showed low genetic polymorphism (πS <1%) and similar levels of between‐species divergence. Contrastingly, gp60 exhibited very different characteristics: (i) it was nearly ten times more variable than the other loci; (ii) it displayed a significant excess of polymorphisms relative to between‐species differences in a maximum‐likelihood Hudson–Kreitman–Aguadé test; (iii) gp60 subtypes turned out to be much older than the species they were found in; and (iv) showed a significant excess of polymorphic variants shared across species from random expectations. These observations suggest that this locus evolves under balancing selection and specifically under negative frequency‐dependent selection (FDS). Interestingly, genetic variation at the other loci clusters very well within the groups of isolates defined by gp60 subtypes, which may provide new tools to understand the genome‐wide patterns of genetic variation of the parasite in the wild. These results suggest that gp60 plays an active and essential role in the life cycle of the parasite and that genetic variation at this locus might be essential for the parasite's long‐term success.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号