共查询到20条相似文献,搜索用时 15 毫秒
1.
Senanu M. Spring-Pearson Joshua K. Stone Adina Doyle Christopher J. Allender Richard T. Okinaka Mark Mayo Stacey M. Broomall Jessica M. Hill Mark A. Karavis Kyle S. Hubbard Joseph M. Insalaco Lauren A. McNew C. Nicole Rosenzweig Henry S. Gibbons Bart J. Currie David M. Wagner Paul Keim Apichai Tuanyok 《PloS one》2015,10(10)
The pangenomic diversity in Burkholderia pseudomallei is high, with approximately 5.8% of the genome consisting of genomic islands. Genomic islands are known hotspots for recombination driven primarily by site-specific recombination associated with tRNAs. However, recombination rates in other portions of the genome are also high, a feature we expected to disrupt gene order. We analyzed the pangenome of 37 isolates of B. pseudomallei and demonstrate that the pangenome is ‘open’, with approximately 136 new genes identified with each new genome sequenced, and that the global core genome consists of 4568±16 homologs. Genes associated with metabolism were statistically overrepresented in the core genome, and genes associated with mobile elements, disease, and motility were primarily associated with accessory portions of the pangenome. The frequency distribution of genes present in between 1 and 37 of the genomes analyzed matches well with a model of genome evolution in which 96% of the genome has very low recombination rates but 4% of the genome recombines readily. Using homologous genes among pairs of genomes, we found that gene order was highly conserved among strains, despite the high recombination rates previously observed. High rates of gene transfer and recombination are incompatible with retaining gene order unless these processes are either highly localized to specific sites within the genome, or are characterized by symmetrical gene gain and loss. Our results demonstrate that both processes occur: localized recombination introduces many new genes at relatively few sites, and recombination throughout the genome generates the novel multi-locus sequence types previously observed while preserving gene order. 相似文献
2.
Fernando Puente-Sánchez Matthias Hoetzinger Moritz Buck Stefan Bertilsson 《Molecular ecology resources》2023,23(7):1724-1736
At the genome level, microorganisms are highly adaptable both in terms of allele and gene composition. Such heritable traits emerge in response to different environmental niches and can have a profound influence on microbial community dynamics. As a consequence, any individual genome or population will contain merely a fraction of the total genetic diversity of any operationally defined “species”, whose ecological potential can thus be only fully understood by studying all of their genomes and the genes therein. This concept, known as the pangenome, is valuable for studying microbial ecology and evolution, as it partitions genomes into core (present in all the genomes from a species, and responsible for housekeeping and species-level niche adaptation among others) and accessory regions (present only in some, and responsible for intra-species differentiation). Here we present SuperPang, an algorithm producing pangenome assemblies from a set of input genomes of varying quality, including metagenome-assembled genomes (MAGs). SuperPang runs in linear time and its results are complete, non-redundant, preserve gene ordering and contain both coding and non-coding regions. Our approach provides a modular view of the pangenome, identifying operons and genomic islands, and allowing to track their prevalence in different populations. We illustrate this by analysing intra-species diversity in Polynucleobacter, a bacterial genus ubiquitous in freshwater ecosystems, characterized by their streamlined genomes and their ecological versatility. We show how SuperPang facilitates the simultaneous analysis of allelic and gene content variation under different environmental pressures, allowing us to study the drivers of microbial diversification at unprecedented resolution. 相似文献
3.
《Fungal biology》2019,123(5):351-363
The overall goal of this study was to determine whether the genome of an important plant pathogen in Africa, Ceratocystis albifundus, is structured into subgenomic compartments, and if so, to establish how these compartments are distributed across the genome. For this purpose, the publicly available genome of C. albifundus was complemented with the genome sequences for four additional isolates using the Illumina HiSeq platform. In addition, a reference genome for one of the individuals was assembled using both PacBio and Illumina HiSeq technologies. Our results showed a high degree of synteny between the five genomes, although several regions lacked detectable long-range synteny. These regions were associated with the presence of accessory genes, lower genetic similarity, variation in read-map depth, as well as transposable elements and genes associated with host-pathogen interactions (e.g. effectors and CAZymes). Such patterns are regarded as hallmarks of accelerated evolution, particularly of accessory subgenomic compartments in fungal pathogens. Our findings thus showed that the genome of C. albifundus is made-up of core and accessory subgenomic compartments, which is an important step towards characterizing its pangenome. This study also highlights the value of comparative genomics for understanding mechanisms that may underly and influence the biology and evolution of pathogens. 相似文献
4.
5.
《Microbiological research》2014,169(1):2-17
The need for sustainable agricultural practices is revitalizing the interest in biological nitrogen fixation and rhizobia-legumes symbioses, particularly those involving economically important legume crops in terms of food and forage. The genus Mesorhizobium includes species with high geographical dispersion and able to nodulate a wide variety of legumes, including important crop species, like chickpea or biserrula. Some cases of legume-mesorhizobia inoculant introduction represent exceptional opportunities to study the rhizobia genomes evolution and the evolutionary relationships among species. Complete genome sequences revealed that mesorhizobia typically harbour chromosomal symbiosis islands. The phylogenies of symbiosis genes, such as nodC, are not congruent with the phylogenies based on core genes, reflecting rhizobial host range, rather than species affiliation. This agrees with studies showing that Mesorhizobium species are able to exchange symbiosis genes through lateral transfer of chromosomal symbiosis islands, thus acquiring the ability to nodulate new hosts. Phylogenetic analyses of the Mesorhizobium genus based on core and accessory genes reveal complex evolutionary relationships and a high genomic plasticity, rendering the Mesorhizobium genus as a good model to investigate rhizobia genome evolution and adaptation to different host plants. Further investigation of symbiosis genes as well as stress response genes will certainly contribute to understand mesorhizobia-legume symbiosis and to develop more effective mesorhizobia inoculants. 相似文献
6.
Bessen DE Kumar N Hall GS Riley DR Luo F Lizano S Ford CN McShan WM Nguyen SV Dunning Hotopp JC Tettelin H 《Journal of bacteriology》2011,193(23):6651-6663
Group A Streptococcus (GAS) has a rich evolutionary history of horizontal transfer among its core genes. Yet, despite extensive genetic mixing, GAS strains have discrete ecological phenotypes. To further our understanding of the molecular basis for ecological phenotypes, comparative genomic hybridization of a set of 97 diverse strains to a GAS pangenome microarray was undertaken, and the association of accessory genes with emm genotypes that define tissue tropisms for infection was determined. Of the 22 nonprophage accessory gene regions (AGRs) identified, only 3 account for all statistically significant linkage disequilibrium among strains having the genotypic biomarkers for throat versus skin infection specialists. Networked evolution and population structure analyses of loci representing each of the AGRs reveal that most strains with the skin specialist and generalist biomarkers form discrete clusters, whereas strains with the throat specialist biomarker are highly diverse. To identify coinherited and coselected accessory genes, the strength of genetic associations was determined for all possible pairwise combinations of accessory genes among the 97 GAS strains. Accessory genes showing very strong associations provide the basis for an evolutionary model, which reveals that a major transition between many throat and skin specialist haplotypes correlates with the gain or loss of genes encoding fibronectin-binding proteins. This study employs a novel synthesis of tools to help delineate the major genetic changes associated with key adaptive shifts in an extensively recombined bacterial species. 相似文献
7.
Vinita Periwal Ashok Patowary Shamsudheen Karuthedath Vellarikkal Anju Gupta Meghna Singh Ashish Mittal Shamini Jeyapaul Rajendra Kumar Chauhan Ajay Vir Singh Pravin Kumar Singh Parul Garg Viswa Mohan Katoch Kiran Katoch Devendra Singh Chauhan Sridhar Sivasubbu Vinod Scaria 《PloS one》2015,10(4)
The tubercle complex consists of closely related mycobacterium species which appear to be variants of a single species. Comparative genome analysis of different strains could provide useful clues and insights into the genetic diversity of the species. We integrated genome assemblies of 96 strains from Mycobacterium tuberculosis complex (MTBC), which included 8 Indian clinical isolates sequenced and assembled in this study, to understand its pangenome architecture. We predicted genes for all the 96 strains and clustered their respective CDSs into homologous gene clusters (HGCs) to reveal a hard-core, soft-core and accessory genome component of MTBC. The hard-core (HGCs shared amongst 100% of the strains) was comprised of 2,066 gene clusters whereas the soft-core (HGCs shared amongst at least 95% of the strains) comprised of 3,374 gene clusters. The change in the core and accessory genome components when observed as a function of their size revealed that MTBC has an open pangenome. We identified 74 HGCs that were absent from reference strains H37Rv and H37Ra but were present in most of clinical isolates. We report PCR validation on 9 candidate genes depicting 7 genes completely absent from H37Rv and H37Ra whereas 2 genes shared partial homology with them accounting to probable insertion and deletion events. The pangenome approach is a promising tool for studying strain specific genetic differences occurring within species. We also suggest that since selecting appropriate target genes for typing purposes requires the expected target gene be present in all isolates being typed, therefore estimating the core-component of the species becomes a subject of prime importance. 相似文献
8.
9.
Alan E. Yocca Zefu Lu Robert J. Schmitz Michael Freeling Patrick P. Edger 《Molecular biology and evolution》2021,38(7):2692
Recent pangenome studies have revealed a large fraction of the gene content within a species exhibits presence–absence variation (PAV). However, coding regions alone provide an incomplete assessment of functional genomic sequence variation at the species level. Little to no attention has been paid to noncoding regulatory regions in pangenome studies, though these sequences directly modulate gene expression and phenotype. To uncover regulatory genetic variation, we generated chromosome-scale genome assemblies for thirty Arabidopsis thaliana accessions from multiple distinct habitats and characterized species level variation in Conserved Noncoding Sequences (CNS). Our analyses uncovered not only PAV and positional variation (PosV) but that diversity in CNS is nonrandom, with variants shared across different accessions. Using evolutionary analyses and chromatin accessibility data, we provide further evidence supporting roles for conserved and variable CNS in gene regulation. Additionally, our data suggests that transposable elements contribute to CNS variation. Characterizing species-level diversity in all functional genomic sequences may later uncover previously unknown mechanistic links between genotype and phenotype. 相似文献
10.
Shosei Kubota Takaya Iwasaki Kousuke Hanada Atsushi J. Nagano Asao Fujiyama Atsushi Toyoda Sumio Sugano Yutaka Suzuki Kouki Hikosaka Motomi Ito Shin-Ichi Morinaga 《PLoS genetics》2015,11(7)
Adaptive divergence at the microgeographic scale has been generally disregarded because high gene flow is expected to disrupt local adaptation. Yet, growing number of studies reporting adaptive divergence at a small spatial scale highlight the importance of this process in evolutionary biology. To investigate the genetic basis of microgeographic local adaptation, we conducted a genome-wide scan among sets of continuously distributed populations of Arabidopsis halleri subsp. gemmifera that show altitudinal phenotypic divergence despite gene flow. Genomic comparisons were independently conducted in two distinct mountains where similar highland ecotypes are observed, presumably as a result of convergent evolution. Here, we established a de novo reference genome and employed an individual-based resequencing for a total of 56 individuals. Among 527,225 reliable SNP loci, we focused on those showing a unidirectional allele frequency shift across altitudes. Statistical tests on the screened genes showed that our microgeographic population genomic approach successfully retrieve genes with functional annotations that are in line with the known phenotypic and environmental differences between altitudes. Furthermore, comparison between the two distinct mountains enabled us to screen out those genes that are neutral or adaptive only in either mountain, and identify the genes involved in the convergent evolution. Our study demonstrates that the genomic comparison among a set of genetically connected populations, instead of the commonly-performed comparison between two isolated populations, can also offer an effective screening for the genetic basis of local adaptation. 相似文献
11.
A pangenome is the complete set of genes (core and accessory) present in a phylogenetic clade. We hypothesize that a pangenome’s accessory gene content is structured and maintained by selection. To test this hypothesis, we interrogated the genomes of 40 Pseudomonas species for statistically significant coincident (i.e., co-occurring/avoiding) gene patterns. We found that 86.7% of common accessory genes are involved in 1 coincident relationship. Further, genes that co-occur and/or avoid each other—but are not vertically inherited—are more likely to share functional categories, are more likely to be simultaneously transcribed, and are more likely to produce interacting proteins, than would be expected by chance. These results are not due to coincident genes being adjacent to one another on the chromosome. Together, these findings suggest that the accessory genome is structured into sets of genes that function together within a given strain. Given the similarity of the Pseudomonas pangenome with open pangenomes of other prokaryotic species, we speculate that these results are generalizable. 相似文献
12.
Alan McNally Yaara Oren Darren Kelly Ben Pascoe Steven Dunn Tristan Sreecharan Minna Vehkala Niko V?lim?ki Michael B. Prentice Amgad Ashour Oren Avram Tal Pupko Ulrich Dobrindt Ivan Literak Sebastian Guenther Katharina Schaufler Lothar H. Wieler Zong Zhiyong Samuel K. Sheppard James O. McInerney Jukka Corander 《PLoS genetics》2016,12(9)
The use of whole-genome phylogenetic analysis has revolutionized our understanding of the evolution and spread of many important bacterial pathogens due to the high resolution view it provides. However, the majority of such analyses do not consider the potential role of accessory genes when inferring evolutionary trajectories. Moreover, the recently discovered importance of the switching of gene regulatory elements suggests that an exhaustive analysis, combining information from core and accessory genes with regulatory elements could provide unparalleled detail of the evolution of a bacterial population. Here we demonstrate this principle by applying it to a worldwide multi-host sample of the important pathogenic E. coli lineage ST131. Our approach reveals the existence of multiple circulating subtypes of the major drug–resistant clade of ST131 and provides the first ever population level evidence of core genome substitutions in gene regulatory regions associated with the acquisition and maintenance of different accessory genome elements. 相似文献
13.
14.
Alexander S T Papadopulos Andrew J Helmstetter Owen G Osborne Aaron A Comeault Daniel P Wood Edward A Straw Laurence Mason Michael F Fay Joe Parker Luke T Dunning Andrew D Foote Rhian J Smith Jackie Lighten 《Molecular biology and evolution》2021,38(9):3724
The impact of human-mediated environmental change on the evolutionary trajectories of wild organisms is poorly understood. In particular, capacity of species to adapt rapidly (in hundreds of generations or less), reproducibly and predictably to extreme environmental change is unclear. Silene uniflora is predominantly a coastal species, but it has also colonized isolated, disused mines with phytotoxic, zinc-contaminated soils. To test whether rapid, parallel adaptation to anthropogenic pollution has taken place, we used reduced representation sequencing (ddRAD) to reconstruct the evolutionary history of geographically proximate mine and coastal population pairs and found largely independent colonization of mines from different coastal sites. Furthermore, our results show that parallel evolution of zinc tolerance has occurred without gene flow spreading adaptive alleles between mine populations. In genomic regions where signatures of selection were detected across multiple mine-coast pairs, we identified genes with functions linked to physiological differences between the putative ecotypes, although genetic differentiation at specific loci is only partially shared between mine populations. Our results are consistent with a complex, polygenic genetic architecture underpinning rapid adaptation. This shows that even under a scenario of strong selection and rapid adaptation, evolutionary responses to human activities (and other environmental challenges) may be idiosyncratic at the genetic level and, therefore, difficult to predict from genomic data. 相似文献
15.
《Genomics》2022,114(1):398-408
Here the pangenome analysis of Burkholderia sensu lato (s.l.) was performed for the first time, together with an updated analysis of the pangenome of Burkholderia sensu stricto, and Burkholderia cepacia complex (Bcc) focusing on the Bcc B. catarinensis specific features of its re-sequenced genome. The pangenome of Burkholderia s.l., Burkholderia s.s., and of the Bcc was open, composed of more than 96% of accessory genes, and more than 62% of unknown genes. Functional annotations showed that secondary metabolism genes belonged to the variable portion of genomes, which might explain their production of several compounds with varied bioactivities. Taken together, this work showed the great variability and uniqueness of these genomes and revealed an underexplored unknown potential in poorly characterized genes. Regarding B. catarinensis 89T, its genome harbors genes related to hydrolases production and plant growth promotion. This draft genome will be valuable for further investigation of its biotechnological potentials. 相似文献
16.
Background
Pseudomonas aeruginosa is an important opportunistic pathogen responsible for many infections in hospitalized and immunocompromised patients. Previous reports estimated that approximately 10% of its 6.6 Mbp genome varies from strain to strain and is therefore referred to as “accessory genome”. Elements within the accessory genome of P. aeruginosa have been associated with differences in virulence and antibiotic resistance. As whole genome sequencing of bacterial strains becomes more widespread and cost-effective, methods to quickly and reliably identify accessory genomic elements in newly sequenced P. aeruginosa genomes will be needed.Results
We developed a bioinformatic method for identifying the accessory genome of P. aeruginosa. First, the core genome was determined based on sequence conserved among the completed genomes of twelve reference strains using Spine, a software program developed for this purpose. The core genome was 5.84 Mbp in size and contained 5,316 coding sequences. We then developed an in silico genome subtraction program named AGEnt to filter out core genomic sequences from P. aeruginosa whole genomes to identify accessory genomic sequences of these reference strains. This analysis determined that the accessory genome of P. aeruginosa ranged from 6.9-18.0% of the total genome, was enriched for genes associated with mobile elements, and was comprised of a majority of genes with unknown or unclear function. Using these genomes, we showed that AGEnt performed well compared to other publically available programs designed to detect accessory genomic elements. We then demonstrated the utility of the AGEnt program by applying it to the draft genomes of two previously unsequenced P. aeruginosa strains, PA99 and PA103.Conclusions
The P. aeruginosa genome is rich in accessory genetic material. The AGEnt program accurately identified the accessory genomes of newly sequenced P. aeruginosa strains, even when draft genomes were used. As P. aeruginosa genomes become available at an increasingly rapid pace, this program will be useful in cataloging the expanding accessory genome of this bacterium and in discerning correlations between phenotype and accessory genome makeup. The combination of Spine and AGEnt should be useful in defining the accessory genomes of other bacterial species as well.Electronic supplementary material
The online version of this article (doi:10.1186/1471-2164-15-737) contains supplementary material, which is available to authorized users. 相似文献17.
18.
19.
Genome‐wide evidence for divergent selection between populations of a major agricultural pathogen
下载免费PDF全文
![点击此处可从《Molecular ecology》网站下载免费的PDF全文](/ch/ext_images/free.gif)
The genetic and environmental homogeneity in agricultural ecosystems is thought to impose strong and uniform selection pressures. However, the impact of this selection on plant pathogen genomes remains largely unknown. We aimed to identify the proportion of the genome and the specific gene functions under positive selection in populations of the fungal wheat pathogen Zymoseptoria tritici. First, we performed genome scans in four field populations that were sampled from different continents and on distinct wheat cultivars to test which genomic regions are under recent selection. Based on extended haplotype homozygosity and composite likelihood ratio tests, we identified 384 and 81 selective sweeps affecting 4% and 0.5% of the 35 Mb core genome, respectively. We found differences both in the number and the position of selective sweeps across the genome between populations. Using a XtX‐based outlier detection approach, we identified 51 extremely divergent genomic regions between the allopatric populations, suggesting that divergent selection led to locally adapted pathogen populations. We performed an outlier detection analysis between two sympatric populations infecting two different wheat cultivars to identify evidence for host‐driven selection. Selective sweep regions harboured genes that are likely to play a role in successfully establishing host infections. We also identified secondary metabolite gene clusters and an enrichment in genes encoding transporter and protein localization functions. The latter gene functions mediate responses to environmental stress, including interactions with the host. The distinct gene functions under selection indicate that both local host genotypes and abiotic factors contributed to local adaptation. 相似文献
20.