首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.

Background

Pseudomonas aeruginosa is an important opportunistic pathogen responsible for many infections in hospitalized and immunocompromised patients. Previous reports estimated that approximately 10% of its 6.6 Mbp genome varies from strain to strain and is therefore referred to as “accessory genome”. Elements within the accessory genome of P. aeruginosa have been associated with differences in virulence and antibiotic resistance. As whole genome sequencing of bacterial strains becomes more widespread and cost-effective, methods to quickly and reliably identify accessory genomic elements in newly sequenced P. aeruginosa genomes will be needed.

Results

We developed a bioinformatic method for identifying the accessory genome of P. aeruginosa. First, the core genome was determined based on sequence conserved among the completed genomes of twelve reference strains using Spine, a software program developed for this purpose. The core genome was 5.84 Mbp in size and contained 5,316 coding sequences. We then developed an in silico genome subtraction program named AGEnt to filter out core genomic sequences from P. aeruginosa whole genomes to identify accessory genomic sequences of these reference strains. This analysis determined that the accessory genome of P. aeruginosa ranged from 6.9-18.0% of the total genome, was enriched for genes associated with mobile elements, and was comprised of a majority of genes with unknown or unclear function. Using these genomes, we showed that AGEnt performed well compared to other publically available programs designed to detect accessory genomic elements. We then demonstrated the utility of the AGEnt program by applying it to the draft genomes of two previously unsequenced P. aeruginosa strains, PA99 and PA103.

Conclusions

The P. aeruginosa genome is rich in accessory genetic material. The AGEnt program accurately identified the accessory genomes of newly sequenced P. aeruginosa strains, even when draft genomes were used. As P. aeruginosa genomes become available at an increasingly rapid pace, this program will be useful in cataloging the expanding accessory genome of this bacterium and in discerning correlations between phenotype and accessory genome makeup. The combination of Spine and AGEnt should be useful in defining the accessory genomes of other bacterial species as well.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-737) contains supplementary material, which is available to authorized users.  相似文献   

2.
The pangenomic diversity in Burkholderia pseudomallei is high, with approximately 5.8% of the genome consisting of genomic islands. Genomic islands are known hotspots for recombination driven primarily by site-specific recombination associated with tRNAs. However, recombination rates in other portions of the genome are also high, a feature we expected to disrupt gene order. We analyzed the pangenome of 37 isolates of B. pseudomallei and demonstrate that the pangenome is ‘open’, with approximately 136 new genes identified with each new genome sequenced, and that the global core genome consists of 4568±16 homologs. Genes associated with metabolism were statistically overrepresented in the core genome, and genes associated with mobile elements, disease, and motility were primarily associated with accessory portions of the pangenome. The frequency distribution of genes present in between 1 and 37 of the genomes analyzed matches well with a model of genome evolution in which 96% of the genome has very low recombination rates but 4% of the genome recombines readily. Using homologous genes among pairs of genomes, we found that gene order was highly conserved among strains, despite the high recombination rates previously observed. High rates of gene transfer and recombination are incompatible with retaining gene order unless these processes are either highly localized to specific sites within the genome, or are characterized by symmetrical gene gain and loss. Our results demonstrate that both processes occur: localized recombination introduces many new genes at relatively few sites, and recombination throughout the genome generates the novel multi-locus sequence types previously observed while preserving gene order.  相似文献   

3.
The macronuclear genome of the ciliate Oxytricha trifallax displays an extreme and unique eukaryotic genome architecture with extensive genomic variation. During sexual genome development, the expressed, somatic macronuclear genome is whittled down to the genic portion of a small fraction (∼5%) of its precursor “silent” germline micronuclear genome by a process of “unscrambling” and fragmentation. The tiny macronuclear “nanochromosomes” typically encode single, protein-coding genes (a small portion, 10%, encode 2–8 genes), have minimal noncoding regions, and are differentially amplified to an average of ∼2,000 copies. We report the high-quality genome assembly of ∼16,000 complete nanochromosomes (∼50 Mb haploid genome size) that vary from 469 bp to 66 kb long (mean ∼3.2 kb) and encode ∼18,500 genes. Alternative DNA fragmentation processes ∼10% of the nanochromosomes into multiple isoforms that usually encode complete genes. Nucleotide diversity in the macronucleus is very high (SNP heterozygosity is ∼4.0%), suggesting that Oxytricha trifallax may have one of the largest known effective population sizes of eukaryotes. Comparison to other ciliates with nonscrambled genomes and long macronuclear chromosomes (on the order of 100 kb) suggests several candidate proteins that could be involved in genome rearrangement, including domesticated MULE and IS1595-like DDE transposases. The assembly of the highly fragmented Oxytricha macronuclear genome is the first completed genome with such an unusual architecture. This genome sequence provides tantalizing glimpses into novel molecular biology and evolution. For example, Oxytricha maintains tens of millions of telomeres per cell and has also evolved an intriguing expansion of telomere end-binding proteins. In conjunction with the micronuclear genome in progress, the O. trifallax macronuclear genome will provide an invaluable resource for investigating programmed genome rearrangements, complementing studies of rearrangements arising during evolution and disease.  相似文献   

4.
Pseudomonas aeruginosa is an opportunistic bacterial pathogen able to thrive in highly diverse ecological niches and to infect compromised patients. Its genome exhibits a mosaic structure composed of a core genome into which accessory genes are inserted en bloc at specific sites. The size and the content of the core genome are open for debate as their estimation depends on the set of genomes considered and the pipeline of gene detection and clustering. Here, we redefined the size and the content of the core genome of P. aeruginosa from fully re-analyzed genomes of 17 reference strains. After the optimization of gene detection and clustering parameters, the core genome was defined at 5,233 orthologs, which represented ~ 88% of the average genome. Extrapolation indicated that our panel was suitable to estimate the core genome that will remain constant even if new genomes are added. The core genome contained resistance determinants to the major antibiotic families as well as most metabolic, respiratory, and virulence genes. Although some virulence genes were accessory, they often related to conserved biological functions. Long-standing prophage elements were subjected to a genetic drift to eventually display a G+C content as higher as that of the core genome. This contrasts with the low G+C content of highly conserved ribosomal genes. The conservation of metabolic and respiratory genes could guarantee the ability of the species to thrive on a variety of carbon sources for energy in aerobiosis and anaerobiosis. Virtually all the strains, of environmental or clinical origin, have the complete toolkit to become resistant to the major antipseudomonal compounds and possess basic pathogenic mechanisms to infect humans. The knowledge of the genes shared by the majority of the P. aeruginosa isolates is a prerequisite for designing effective therapeutics to combat the wide variety of human infections.  相似文献   

5.
Previous evolutionary reconstructions have concluded that early eukaryotic ancestors including both the last common ancestor of eukaryotes and of all fungi had intron-rich genomes. By contrast, some extant eukaryotes have few introns, underscoring the complex histories of intron–exon structures, and raising the question as to why these few introns are retained. Here, we have used recently available fungal genomes to address a variety of questions related to intron evolution. Evolutionary reconstruction of intron presence and absence using 263 diverse fungal species supports the idea that massive intron reduction through intron loss has occurred in multiple clades. The intron densities estimated in various fungal ancestors differ from zero to 7.6 introns per 1 kb of protein-coding sequence. Massive intron loss has occurred not only in microsporidian parasites and saccharomycetous yeasts, but also in diverse smuts and allies. To investigate the roles of the remaining introns in highly-reduced species, we have searched for their special characteristics in eight intron-poor fungi. Notably, the introns of ribosome-associated genes RPL7 and NOG2 have conserved positions; both intron-containing genes encoding snoRNAs. Furthermore, both the proteins and snoRNAs are involved in ribosome biogenesis, suggesting that the expression of the protein-coding genes and noncoding snoRNAs may be functionally coordinated. Indeed, these introns are also conserved in three-quarters of fungi species. Our study shows that fungal introns have a complex evolutionary history and underappreciated roles in gene expression.  相似文献   

6.
Mitochondria are eukaryotic organelles supporting individual life-style via generation of proton motive force and cellular energy, and indispensable metabolic pathways. As part of genome sequencing of the white rot Basidiomycota species Phlebia radiata, we first assembled its mitochondrial genome (mtDNA). So far, the 156 348 bp mtDNA is the second largest described for fungi, and of considerable size among eukaryotes. The P. radiata mtDNA assembled as single circular dsDNA molecule containing genes for the large and small ribosomal RNAs, 28 transfer RNAs, and over 100 open reading frames encoding the 14 fungal conserved protein subunits of the mitochondrial complexes I, III, IV, and V. Two genes (atp6 and tRNA-IleGAU) were duplicated within 6.1 kbp inverted region, which is a unique feature of the genome. The large mtDNA size, however, is explained by the dominance of intronic and intergenic regions (sum 80% of mtDNA sequence). The intergenic DNA stretches harness short (≤200 nt) repetitive, dispersed and overlapping sequence elements in abundance. Long self-splicing introns of types I and II interrupt eleven of the conserved genes (cox1,2,3; cob; nad1,2,4,4L,5; rnl; rns). The introns embrace a total of 57 homing endonucleases with LAGLIDADGD and GYI-YIG core motifs, which makes P. radiata mtDNA to one of the largest known reservoirs of intron-homing endonucleases. The inverted duplication, intergenic stretches, and intronic features are indications of dynamics and genetic flexibility of the mtDNA, not fully recognized to this extent in fungal mitochondrial genomes previously, thus giving new insights for the evolution of organelle genomes in eukaryotes.  相似文献   

7.
8.
Ganoderma lucidum is one of the well-known medicinal basidiomycetes worldwide. The mitochondrion, referred to as the second genome, is an organelle found in most eukaryotic cells and participates in critical cellular functions. Elucidating the structure and function of this genome is important to understand completely the genetic contents of G. lucidum. In this study, we assembled the mitochondrial genome of G. lucidum and analyzed the differential expressions of its encoded genes across three developmental stages. The mitochondrial genome is a typical circular DNA molecule of 60,630 bp with a GC content of 26.67%. Genome annotation identified genes that encode 15 conserved proteins, 27 tRNAs, small and large rRNAs, four homing endonucleases, and two hypothetical proteins. Except for genes encoding trnW and two hypothetical proteins, all genes were located on the positive strand. For the repeat structure analysis, eight forward, two inverted, and three tandem repeats were detected. A pair of fragments with a total length around 5.5 kb was found in both the nuclear and mitochondrial genomes, which suggests the possible transfer of DNA sequences between two genomes. RNA-Seq data for samples derived from three stages, namely, mycelia, primordia, and fruiting bodies, were mapped to the mitochondrial genome and qualified. The protein-coding genes were expressed higher in mycelia or primordial stages compared with those in the fruiting bodies. The rRNA abundances were significantly higher in all three stages. Two regions were transcribed but did not contain any identified protein or tRNA genes. Furthermore, three RNA-editing sites were detected. Genome synteny analysis showed that significant genome rearrangements occurred in the mitochondrial genomes. This study provides valuable information on the gene contents of the mitochondrial genome and their differential expressions at various developmental stages of G. lucidum. The results contribute to the understanding of the functions and evolution of fungal mitochondrial DNA.  相似文献   

9.
《Fungal biology》2019,123(5):351-363
The overall goal of this study was to determine whether the genome of an important plant pathogen in Africa, Ceratocystis albifundus, is structured into subgenomic compartments, and if so, to establish how these compartments are distributed across the genome. For this purpose, the publicly available genome of C. albifundus was complemented with the genome sequences for four additional isolates using the Illumina HiSeq platform. In addition, a reference genome for one of the individuals was assembled using both PacBio and Illumina HiSeq technologies. Our results showed a high degree of synteny between the five genomes, although several regions lacked detectable long-range synteny. These regions were associated with the presence of accessory genes, lower genetic similarity, variation in read-map depth, as well as transposable elements and genes associated with host-pathogen interactions (e.g. effectors and CAZymes). Such patterns are regarded as hallmarks of accelerated evolution, particularly of accessory subgenomic compartments in fungal pathogens. Our findings thus showed that the genome of C. albifundus is made-up of core and accessory subgenomic compartments, which is an important step towards characterizing its pangenome. This study also highlights the value of comparative genomics for understanding mechanisms that may underly and influence the biology and evolution of pathogens.  相似文献   

10.
Transposable element (TE) amplification has been recognized as a driving force mediating genome size expansion and evolution, but the consequences for shaping 3D genomic architecture remains largely unknown in plants. Here, we report reference-grade genome assemblies for three species of cotton ranging 3-fold in genome size, namely Gossypium rotundifolium (K2), G. arboreum (A2), and G. raimondii (D5), using Oxford Nanopore Technologies. Comparative genome analyses document the details of lineage-specific TE amplification contributing to the large genome size differences (K2, 2.44 Gb; A2, 1.62 Gb; D5, 750.19 Mb) and indicate relatively conserved gene content and synteny relationships among genomes. We found that approximately 17% of syntenic genes exhibit chromatin status change between active (“A”) and inactive (“B”) compartments, and TE amplification was associated with the increase of the proportion of A compartment in gene regions (∼7,000 genes) in K2 and A2 relative to D5. Only 42% of topologically associating domain (TAD) boundaries were conserved among the three genomes. Our data implicate recent amplification of TEs following the formation of lineage-specific TAD boundaries. This study sheds light on the role of transposon-mediated genome expansion in the evolution of higher-order chromatin structure in plants.  相似文献   

11.
12.
The 2 465 177 bp genome of Sulfolobus islandicus LAL14/1, host of the model rudivirus SIRV2, was sequenced. Exhaustive comparative genomic analysis of S. islandicus LAL14/1 and the nine other completely sequenced S. islandicus strains isolated from Iceland, Russia and USA revealed a highly syntenic common core genome of approximately 2 Mb and a long hyperplastic region containing most of the strain-specific genes. In LAL14/1, the latter region is enriched in insertion sequences, CRISPR (clustered regularly interspaced short palindromic repeats), glycosyl transferase genes, toxin–antitoxin genes and MITE (miniature inverted-repeat transposable elements). The tRNA genes of LAL14/1 are preferential targets for the integration of mobile elements but clusters of atypical genes (CAG) are also integrated elsewhere in the genome. LAL14/1 carries five CRISPR loci with 10 per cent of spacers matching perfectly or imperfectly the genomes of archaeal viruses and plasmids found in the Icelandic hot springs. Strikingly, the CRISPR_2 region of LAL14/1 carries an unusually long 1.9 kb spacer interspersed between two repeat regions and displays a high similarity to pING1-like conjugative plasmids. Finally, we have developed a genetic system for S. islandicus LAL14/1 and created ΔpyrEF and ΔCRISPR_1 mutants using double cross-over and pop-in/pop-out approaches, respectively. Thus, LAL14/1 is a promising model to study virus–host interactions and the CRISPR/Cas defence mechanism in Archaea.  相似文献   

13.
Burkholderia glumae is the major causal agent of bacterial panicle blight of rice, a growing disease problem in global rice production. To better understand its genome-scale characteristics, the genome of the highly virulent B. glumae strain 336gr-1 isolated from Louisiana, USA was sequenced using the Illumina Genome Analyser II system. De novo assembled 336gr-1 contigs were aligned and compared with the previously sequenced genome of B. glumae strain BGR1, which was isolated from an infected rice plant in South Korea. Comparative analysis of the whole genomes of B. glumae 336gr-1 and B. glumae BGR1 revealed numerous unique genomic regions present only in one of the two strains. These unique regions contained accessory genes including mobile elements and phage-related genes, and some of the unique regions in B. glumae BGR1 corresponded to predicted genomic islands. In contrast, little variation was observed in known and potential virulence genes between the two genomes. The considerable amount of plasticity largely based on accessory genes and genome islands observed from the comparison of the genomes of these two strains of B. glumae may explain the versatility of this bacterial species in various environmental conditions and geographic locations.  相似文献   

14.
Mobile genetic elements are major contributing factors to the generation of genetic diversity in prokaryotic organisms. For example, insertion sequence (IS) elements have been shown to specifically contribute to niche adaptation by promoting a variety of genetic rearrangements. The complete genome sequence of the cheese culture Lactobacillus helveticus DPC 4571 was determined and revealed significant conservation compared to three nondairy gut lactobacilli. Despite originating from significantly different environments, 65 to 75% of the genes were conserved between the commensal and dairy lactobacilli, which allowed key niche-specific gene sets to be described. However, the primary distinguishing feature was 213 IS elements in the DPC 4571 genome, 10 times more than for the other lactobacilli. Moreover, genome alignments revealed an unprecedented level of genome stability between these four Lactobacillus species, considering the number of IS elements in the L. helveticus genome. Comparative analysis also indicated that the IS elements were not the primary agents of niche adaptation for the L. helveticus genome. A clear bias toward the loss of genes reported to be important for gut colonization was observed for the cheese culture, but there was no clear evidence of IS-associated gene deletion and decay for the majority of genes lost. Furthermore, an extraordinary level of sequence diversity exists between copies of certain IS elements in the DPC 4571 genome, indicating they may represent an ancient component of the L. helveticus genome. These data suggest a special unobtrusive relationship between the DPC 4571 genome and its mobile DNA complement.  相似文献   

15.
Neonatal Meningitis Escherichia coli (NMEC) is one of the most common causes of neonatal bacterial meningitis in the US and elsewhere resulting in mortality or neurologic deficits in survivors. Large plasmids have been shown experimentally to increase the virulence of NMEC in the rat model of neonatal meningitis. Here, 9 ExPEC-like plasmids were isolated from NMEC and sequenced to identify the core and accessory plasmid genes of ExPEC-like virulence plasmids in NMEC and create an expanded plasmid phylogeny. Results showed sequenced virulence plasmids carry a strongly conserved core of genes with predicted functions in five distinct categories including: virulence, metabolism, plasmid stability, mobile elements, and unknown genes. The major functions of virulence-associated and plasmid core genes serve to increase in vivo fitness by adding multiple iron uptake systems to the genetic repertoire to facilitate NMEC’s survival in the host’s low iron environment, and systems to enhance bacterial resistance to host innate immunity. Phylogenetic analysis based on these core plasmid genes showed that at least two lineages of ExPEC-like plasmids could be discerned. Further, virulence plasmids from Avian Pathogenic E. coli and NMEC plasmids could not be differentiated based solely on the genes of the core plasmid genome.  相似文献   

16.
In the ciliated protozoan Tetrahymena thermophila, extensive DNA elimination is associated with differentiation of the somatic macronucleus from the germline micronucleus. This study describes the isolation and complete characterization of Tlr elements, a family of approximately 30 micronuclear DNA sequences that are efficiently eliminated from the developing macronucleus. The data indicate that Tlr elements are comprised of an ~22 kb internal region flanked by complex and variable termini. The Tlr internal region is highly conserved among family members and contains 15 open reading frames, some of which resemble genes encoded by transposons and viruses. The Tlr termini appear to be long inverted repeats consisting of (i) a variable region containing multiple direct repeats which differ in number and sequence from element to element and (ii) a conserved terminal 47 bp sequence. Taken together, these results suggest that Tlr elements comprise a novel family of mobile genetic elements that are confined to the Tetrahymena germline genome. Possible mechanisms of developmentally programmed Tlr elimination are discussed.  相似文献   

17.

Background

Species of Bryopsidales form ecologically important components of seaweed communities worldwide. These siphonous macroalgae are composed of a single giant tubular cell containing millions of nuclei and chloroplasts, and harbor diverse bacterial communities. Little is known about the diversity of chloroplast genomes (cpDNAs) in this group, and about the possible consequences of intracellular bacteria on genome composition of the host. We present the complete cpDNAs of Bryopsis plumosa and Tydemania expeditiones, as well as a re-annotated cpDNA of B. hypnoides, which was shown to contain a higher number of genes than originally published. Chloroplast genomic data were also used to evaluate phylogenetic hypotheses in the Chlorophyta, such as monophyly of the Ulvophyceae (the class in which the order Bryopsidales is currently classified).

Results

Both DNAs are circular and lack a large inverted repeat. The cpDNA of B. plumosa is 106,859 bp long and contains 115 unique genes. A 13 kb region was identified with several freestanding open reading frames (ORFs) of putative bacterial origin, including a large ORF (>8 kb) closely related to bacterial rhs-family genes. The cpDNA of T. expeditiones is 105,200 bp long and contains 125 unique genes. As in B. plumosa, several regions were identified with ORFs of possible bacterial origin, including genes involved in mobile functions (transposases, integrases, phage/plasmid DNA primases), and ORFs showing close similarity with bacterial DNA methyltransferases. The cpDNA of B. hypnoides differs from that of B. plumosa mainly in the presence of long intergenic spacers, and a large tRNA region. Chloroplast phylogenomic analyses were largely inconclusive with respect to monophyly of the Ulvophyceae, and the relationship of the Bryopsidales within the Chlorophyta.

Conclusions

The cpDNAs of B. plumosa and T. expeditiones are amongst the smallest and most gene dense chloroplast genomes in the core Chlorophyta. The presence of bacterial genes, including genes typically found in mobile elements, suggest that these have been acquired through horizontal gene transfer, which may have been facilitated by the occurrence of obligate intracellular bacteria in these siphonous algae.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1418-3) contains supplementary material, which is available to authorized users.  相似文献   

18.
19.
20.
Conserved plant microRNAs (miRNAs) modulate important biological processes but little is known about conserved cis-regulatory elements (CREs) surrounding MIRNA genes. We developed a solution-based targeted genomic enrichment methodology to capture, enrich, and sequence flanking genomic regions surrounding conserved MIRNA genes with a locked-nucleic acid (LNA)-modified, biotinylated probe complementary to the mature miRNA sequence. Genomic DNA bound by the probe is captured by streptavidin-coated magnetic beads, amplified, sequenced and assembled de novo to obtain genomic DNA sequences flanking MIRNA locus of interest. We demonstrate the sensitivity and specificity of this enrichment methodology in Arabidopsis thaliana to enrich targeted regions spanning 10–20 kb surrounding known MIR166 and MIR165 loci. Assembly of the sequencing reads successfully recovered all targeted loci. While further optimization for larger, more complex genomes is needed, this method may enable determination of flanking genomic DNA sequence surrounding a known core (like a conserved mature miRNA) from multiple species that currently don''t have a full genome assembly available.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号