期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

共查询到20条相似文献，搜索用时 31 毫秒

Characterization of the core and accessory genomes of Pseudomonas aeruginosa using bioinformatic tools Spine and AGEnt

Egon A Ozer Jonathan P Allen Alan R Hauser 《BMC genomics》2014,15(1)

Background

Pseudomonas aeruginosa is an important opportunistic pathogen responsible for many infections in hospitalized and immunocompromised patients. Previous reports estimated that approximately 10% of its 6.6 Mbp genome varies from strain to strain and is therefore referred to as “accessory genome”. Elements within the accessory genome of P. aeruginosa have been associated with differences in virulence and antibiotic resistance. As whole genome sequencing of bacterial strains becomes more widespread and cost-effective, methods to quickly and reliably identify accessory genomic elements in newly sequenced P. aeruginosa genomes will be needed.

Results

We developed a bioinformatic method for identifying the accessory genome of P. aeruginosa. First, the core genome was determined based on sequence conserved among the completed genomes of twelve reference strains using Spine, a software program developed for this purpose. The core genome was 5.84 Mbp in size and contained 5,316 coding sequences. We then developed an in silico genome subtraction program named AGEnt to filter out core genomic sequences from P. aeruginosa whole genomes to identify accessory genomic sequences of these reference strains. This analysis determined that the accessory genome of P. aeruginosa ranged from 6.9-18.0% of the total genome, was enriched for genes associated with mobile elements, and was comprised of a majority of genes with unknown or unclear function. Using these genomes, we showed that AGEnt performed well compared to other publically available programs designed to detect accessory genomic elements. We then demonstrated the utility of the AGEnt program by applying it to the draft genomes of two previously unsequenced P. aeruginosa strains, PA99 and PA103.

Conclusions

The P. aeruginosa genome is rich in accessory genetic material. The AGEnt program accurately identified the accessory genomes of newly sequenced P. aeruginosa strains, even when draft genomes were used. As P. aeruginosa genomes become available at an increasingly rapid pace, this program will be useful in cataloging the expanding accessory genome of this bacterium and in discerning correlations between phenotype and accessory genome makeup. The combination of Spine and AGEnt should be useful in defining the accessory genomes of other bacterial species as well.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-737) contains supplementary material, which is available to authorized users. 相似文献

Maqsud Hossain Sharon A Egan Tracey Coffey Philip N Ward Ray Wilson James A Leigh Richard D Emes 《BMC genomics》2015,16(1)

Background

Streptococcus uberis, a Gram-positive, catalase-negative member of the family Streptococcaceae is an important environmental pathogen responsible for a significant proportion of subclinical and clinical bovine intramammary infections. Currently, the genome of only a single reference strain (0140J) has been described. Here we present a comparative analysis of complete draft genome sequences of an additional twelve S. uberis strains.

Results

Pan and core genome analysis revealed the core genome common to all strains to be 1,550 genes in 1,509 orthologous clusters, complemented by 115-246 accessory genes present in one or more S. uberis strains but absent in the reference strain 0140J. Most of the previously predicted virulent genes were present in the core genome of all 13 strains but gene gain/loss was observed between the isolates in CDS associated with clustered regularly interspaced short palindromic repeats (CRISPRs), prophage and bacteriocin production. Experimental challenge experiments confirmed strain EF20 as non-virulent; only able to infect in a transient manner that did not result in clinical mastitis. Comparison of the genome sequence of EF20 with the validated virulent strain 0140J identified genes associated with virulence, however these did not relate clearly with clinical/non-clinical status of infection.

Conclusion

The gain/loss of mobile genetic elements such as CRISPRs and prophage are a potential driving force for evolutionary change. This first “whole-genome” comparison of strains isolated from clinical vs non-clinical intramammary infections including the type virulent vs non-virulent strains did not identify simple gene gain/loss rules that readily explain, or be confidently associated with, differences in virulence. This suggests that a more complex dynamic determines infection potential and clinical outcome not simply gene content.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1512-6) contains supplementary material, which is available to authorized users. 相似文献

Comparative genomic analysis of Mycobacterium tuberculosis clinical isolates

Fei Liu Yongfei Hu Qi Wang Hong Min Li George F Gao Cui Hua Liu Baoli Zhu 《BMC genomics》2014,15(1)

Background

Due to excessive antibiotic use, drug-resistant Mycobacterium tuberculosis has become a serious public health threat and a major obstacle to disease control in many countries. To better understand the evolution of drug-resistant M. tuberculosis strains, we performed whole genome sequencing for 7 M. tuberculosis clinical isolates with different antibiotic resistance profiles and conducted comparative genomic analysis of gene variations among them.

Results

We observed that all 7 M. tuberculosis clinical isolates with different levels of drug resistance harbored similar numbers of SNPs, ranging from 1409–1464. The numbers of insertion/deletions (Indels) identified in the 7 isolates were also similar, ranging from 56 to 101. A total of 39 types of mutations were identified in drug resistance-associated loci, including 14 previously reported ones and 25 newly identified ones. Sixteen of the identified large Indels spanned PE-PPE-PGRS genes, which represents a major source of antigenic variability. Aside from SNPs and Indels, a CRISPR locus with varied spacers was observed in all 7 clinical isolates, suggesting that they might play an important role in plasticity of the M. tuberculosis genome. The nucleotide diversity (Л value) and selection intensity (dN/dS value) of the whole genome sequences of the 7 isolates were similar. The dN/dS values were less than 1 for all 7 isolates (range from 0.608885 to 0.637365), supporting the notion that M. tuberculosis genomes undergo purifying selection. The Л values and dN/dS values were comparable between drug-susceptible and drug-resistant strains.

Conclusions

In this study, we show that clinical M. tuberculosis isolates exhibit distinct variations in terms of the distribution of SNP, Indels, CRISPR-cas locus, as well as the nucleotide diversity and selection intensity, but there are no generalizable differences between drug-susceptible and drug-resistant isolates on the genomic scale. Our study provides evidence strengthening the notion that the evolution of drug resistance among clinical M. tuberculosis isolates is clearly a complex and diversified process.

Electronic supplementary material

The online version of this article (doi: 10.1186/1471-2164-15-469) contains supplementary material, which is available to authorized users. 相似文献

Extensive intra-phylotype diversity in lactobacilli and bifidobacteria from the honeybee gut

Kirsten M Ellegaard Daniel Tamarit Emelie Javelind Tobias C Olofsson Siv GE Andersson Alejandra Vásquez 《BMC genomics》2015,16(1)

Background

In the honeybee Apis mellifera, the bacterial gut community is consistently colonized by eight distinct phylotypes of bacteria. Managed bee colonies are of considerable economic interest and it is therefore important to elucidate the diversity and role of this microbiota in the honeybee. In this study, we have sequenced the genomes of eleven strains of lactobacilli and bifidobacteria isolated from the honey crop of the honeybee A. mellifera.

Results

Single gene phylogenies confirmed that the isolated strains represent the diversity of lactobacilli and bifidobacteria in the gut, as previously identified by 16S rRNA gene sequencing. Core genome phylogenies of the lactobacilli and bifidobacteria further indicated extensive divergence between strains classified as the same phylotype. Phylotype-specific protein families included unique surface proteins. Within phylotypes, we found a remarkably high level of gene content diversity. Carbohydrate metabolism and transport functions contributed up to 45% of the accessory genes, with some genomes having a higher content of genes encoding phosphotransferase systems for the uptake of carbohydrates than any previously sequenced genome. These genes were often located in highly variable genomic segments that also contained genes for enzymes involved in the degradation and modification of sugar residues. Strain-specific gene clusters for the biosynthesis of exopolysaccharides were identified in two phylotypes. The dynamics of these segments contrasted with low recombination frequencies and conserved gene order structures for the core genes. Hits for CRISPR spacers were almost exclusively found within phylotypes, suggesting that the phylotypes are associated with distinct phage populations.

Conclusions

The honeybee gut microbiota has been described as consisting of a modest number of phylotypes; however, the genomes sequenced in the current study demonstrated a very high level of gene content diversity within all three described phylotypes of lactobacilli and bifidobacteria, particularly in terms of metabolic functions and surface structures, where many features were strain-specific. Together, these results indicate niche differentiation within phylotypes, suggesting that the honeybee gut microbiota is more complex than previously thought.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1476-6) contains supplementary material, which is available to authorized users. 相似文献

Comparative genomic analysis of clinical and environmental strains provides insight into the pathogenicity and evolution of Vibrio parahaemolyticus

Lei Li Hin-chung Wong Wenyan Nong Man Kit Cheung Patrick Tik Wan Law Kai Man Kam Hoi Shan Kwan 《BMC genomics》2014,15(1)

Background

Vibrio parahaemolyticus is a Gram-negative halophilic bacterium. Infections with the bacterium could become systemic and can be life-threatening to immunocompromised individuals. Genome sequences of a few clinical isolates of V. parahaemolyticus are currently available, but the genome dynamics across the species and virulence potential of environmental strains on a genome-scale have not been described before.

Results

Here we present genome sequences of four V. parahaemolyticus clinical strains from stool samples of patients and five environmental strains in Hong Kong. Phylogenomics analysis based on single nucleotide polymorphisms revealed a clear distinction between the clinical and environmental isolates. A new gene cluster belonging to the biofilm associated proteins of V. parahaemolyticus was found in clincial strains. In addition, a novel small genomic island frequently found among clinical isolates was reported. A few environmental strains were found harboring virulence genes and prophage elements, indicating their virulence potential. A unique biphenyl degradation pathway was also reported. A database for V. parahaemolyticus (http://kwanlab.bio.cuhk.edu.hk/vp) was constructed here as a platform to access and analyze genome sequences and annotations of the bacterium.

Conclusions

We have performed a comparative genomics analysis of clinical and environmental strains of V. parahaemolyticus. Our analyses could facilitate understanding of the phylogenetic diversity and niche adaptation of this bacterium.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-1135) contains supplementary material, which is available to authorized users. 相似文献

Comprehensive molecular,genomic and phenotypic analysis of a major clone of Enterococcus faecalis MLST ST40

《BMC genomics》2015,16(1)

Background

Enterococcus faecalis is a multifaceted microorganism known to act as a beneficial intestinal commensal bacterium. It is also a dreaded nosocomial pathogen causing life-threatening infections in hospitalised patients. Isolates of a distinct MLST type ST40 represent the most frequent strain type of this species, distributed worldwide and originating from various sources (animal, human, environmental) and different conditions (colonisation/infection). Since enterococci are known to be highly recombinogenic we determined to analyse the microevolution and niche adaptation of this highly distributed clonal type.

Results

We compared a set of 42 ST40 isolates by assessing key molecular determinants, performing whole genome sequencing (WGS) and a number of phenotypic assays including resistance profiling, formation of biofilm and utilisation of carbon sources. We generated the first circular closed reference genome of an E. faecalis isolate D32 of animal origin and compared it with the genomes of other reference strains. D32 was used as a template for detailed WGS comparisons of high-quality draft genomes of 14 ST40 isolates. Genomic and phylogenetic analyses suggest a high level of similarity regarding the core genome, also demonstrated by similar carbon utilisation patterns. Distribution of known and putative virulence-associated genes did not differentiate between ST40 strains from a commensal and clinical background or an animal or human source. Further analyses of mobile genetic elements (MGE) revealed genomic diversity owed to: (1) a modularly structured pathogenicity island; (2) a site-specifically integrated and previously unknown genomic island of 138 kb in two strains putatively involved in exopolysaccharide synthesis; and (3) isolate-specific plasmid and phage patterns. Moreover, we used different cell-biological and animal experiments to compare the isolate D32 with a closely related ST40 endocarditis isolate whose draft genome sequence was also generated. D32 generally showed a greater capacity of adherence to human cell lines and an increased pathogenic potential in various animal models in combination with an even faster growth in vivo (not in vitro).

Conclusion

Molecular, genomic and phenotypic analysis of representative isolates of a major clone of E. faecalis MLST ST40 revealed new insights into the microbiology of a commensal bacterium which can turn into a conditional pathogen.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1367-x) contains supplementary material, which is available to authorized users. 相似文献

Genome-wide survey and analysis of microsatellites in giant panda (Ailuropoda melanoleuca), with a focus on the applications of a novel microsatellite marker system

Jie Huang Yu-Zhi Li Lian-Ming Du Bo Yang Fu-Jun Shen He-Min Zhang Zhi-He Zhang Xiu-Yue Zhang Bi-Song Yue 《BMC genomics》2015,16(1)

Background

The giant panda (Ailuropoda melanoleuca) is a critically endangered species endemic to China. Microsatellites have been preferred as the most popular molecular markers and proven effective in estimating population size, paternity test, genetic diversity for the critically endangered species. The availability of the giant panda complete genome sequences provided the opportunity to carry out genome-wide scans for all types of microsatellites markers, which now opens the way for the analysis and development of microsatellites in giant panda.

Results

By screening the whole genome sequence of giant panda in silico mining, we identified microsatellites in the genome of giant panda and analyzed their frequency and distribution in different genomic regions. Based on our search criteria, a repertoire of 855,058 SSRs was detected, with mono-nucleotides being the most abundant. SSRs were found in all genomic regions and were more abundant in non-coding regions than coding regions. A total of 160 primer pairs were designed to screen for polymorphic microsatellites using the selected tetranucleotide microsatellite sequences. The 51 novel polymorphic tetranucleotide microsatellite loci were discovered based on genotyping blood DNA from 22 captive giant pandas in this study. Finally, a total of 15 markers, which showed good polymorphism, stability, and repetition in faecal samples, were used to establish the novel microsatellite marker system for giant panda. Meanwhile, a genotyping database for Chengdu captive giant pandas (n = 57) were set up using this standardized system. What’s more, a universal individual identification method was established and the genetic diversity were analysed in this study as the applications of this marker system.

Conclusion

The microsatellite abundance and diversity were characterized in giant panda genomes. A total of 154,677 tetranucleotide microsatellites were identified and 15 of them were discovered as the polymorphic and stable loci. The individual identification method and the genetic diversity analysis method in this study provided adequate material for the future study of giant panda.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1268-z) contains supplementary material, which is available to authorized users. 相似文献

An assembly and alignment-free method of phylogeny reconstruction from next-generation sequencing data

Huan Fan Anthony R. Ives Yann Surget-Groba Charles H. Cannon 《BMC genomics》2015,16(1)

Background

Next-generation sequencing technologies are rapidly generating whole-genome datasets for an increasing number of organisms. However, phylogenetic reconstruction of genomic data remains difficult because de novo assembly for non-model genomes and multi-genome alignment are challenging.

Results

To greatly simplify the analysis, we present an Assembly and Alignment-Free (AAF) method (https://sourceforge.net/projects/aaf-phylogeny) that constructs phylogenies directly from unassembled genome sequence data, bypassing both genome assembly and alignment. Using mathematical calculations, models of sequence evolution, and simulated sequencing of published genomes, we address both evolutionary and sampling issues caused by direct reconstruction, including homoplasy, sequencing errors, and incomplete sequencing coverage. From these results, we calculate the statistical properties of the pairwise distances between genomes, allowing us to optimize parameter selection and perform bootstrapping. As a test case with real data, we successfully reconstructed the phylogeny of 12 mammals using raw sequencing reads. We also applied AAF to 21 tropical tree genome datasets with low coverage to demonstrate its effectiveness on non-model organisms.

Conclusion

Our AAF method opens up phylogenomics for species without an appropriate reference genome or high sequence coverage, and rapidly creates a phylogenetic framework for further analysis of genome structure and diversity among non-model organisms.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1647-5) contains supplementary material, which is available to authorized users. 相似文献

Comparative analyses of Legionella species identifies genetic features of strains causing Legionnaires’ disease

Laura Gomez-Valero Christophe Rusniok Monica Rolando Mario Neou Delphine Dervins-Ravault Jasmin Demirtas Zoe Rouy Robert J Moore Honglei Chen Nicola K Petty Sophie Jarraud Jerome Etienne Michael Steinert Klaus Heuner Simonetta Gribaldo Claudine Médigue Gernot Gl?ckner Elizabeth L Hartland Carmen Buchrieser 《Genome biology》2014,15(11)

Background

The genus Legionella comprises over 60 species. However, L. pneumophila and L. longbeachae alone cause over 95% of Legionnaires’ disease. To identify the genetic bases underlying the different capacities to cause disease we sequenced and compared the genomes of L. micdadei, L. hackeliae and L. fallonii (LLAP10), which are all rarely isolated from humans.

Results

We show that these Legionella species possess different virulence capacities in amoeba and macrophages, correlating with their occurrence in humans. Our comparative analysis of 11 Legionella genomes belonging to five species reveals highly heterogeneous genome content with over 60% representing species-specific genes; these comprise a complete prophage in L. micdadei, the first ever identified in a Legionella genome. Mobile elements are abundant in Legionella genomes; many encode type IV secretion systems for conjugative transfer, pointing to their importance for adaptation of the genus. The Dot/Icm secretion system is conserved, although the core set of substrates is small, as only 24 out of over 300 described Dot/Icm effector genes are present in all Legionella species. We also identified new eukaryotic motifs including thaumatin, synaptobrevin or clathrin/coatomer adaptine like domains.

Conclusions

Legionella genomes are highly dynamic due to a large mobilome mainly comprising type IV secretion systems, while a minority of core substrates is shared among the diverse species. Eukaryotic like proteins and motifs remain a hallmark of the genus Legionella. Key factors such as proteins involved in oxygen binding, iron storage, host membrane transport and certain Dot/Icm substrates are specific features of disease-related strains.

Electronic supplementary material

The online version of this article (doi:10.1186/s13059-014-0505-0) contains supplementary material, which is available to authorized users. 相似文献

10.

Comparative genomic analysis of Acinetobacter baumannii clinical isolates reveals extensive genomic variation and diverse antibiotic resistance determinants

Fei Liu Yuying Zhu Yong Yi Na Lu Baoli Zhu Yongfei Hu 《BMC genomics》2014,15(1)

Background

Acinetobacter baumannii is an important nosocomial pathogen that poses a serious health threat to immune-compromised patients. Due to its rapid ability to develop multidrug resistance (MDR), A. baumannii has increasingly become a focus of attention worldwide. To better understand the genetic variation and antibiotic resistance mechanisms of this bacterium at the genomic level, we reported high-quality draft genome sequences of 8 clinical isolates with various sequence types and drug susceptibility profiles.

Results

We sequenced 7 MDR and 1 drug-sensitive clinical A. baumannii isolates and performed comparative genomic analysis of these draft genomes with 16 A. baumannii complete genomes from GenBank. We found a high degree of variation in A. baumannii, including single nucleotide polymorphisms (SNPs) and large DNA fragment variations in the AbaR-like resistance island (RI) regions, the prophage and the type VI secretion system (T6SS). In addition, we found several new AbaR-like RI regions with highly variable structures in our MDR strains. Interestingly, we found a novel genomic island (designated as GI_BJ4) in the drug-sensitive strain BJ4 carrying metal resistance genes instead of antibiotic resistance genes inserted into the position where AbaR-like RIs commonly reside in other A. baumannii strains. Furthermore, we showed that diverse antibiotic resistance determinants are present outside the RIs in A. baumannii, including antibiotic resistance-gene bearing integrons, the bla_OXA-23-containing transposon Tn2009, and chromosomal intrinsic antibiotic resistance genes.

Conclusions

Our comparative genomic analysis revealed that extensive genomic variation exists in the A. baumannii genome. Transposons, genomic islands and point mutations are the main contributors to the plasticity of the A. baumannii genome and play critical roles in facilitating the development of antibiotic resistance in the clinical isolates.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-1163) contains supplementary material, which is available to authorized users. 相似文献

11.

Population structure of mitochondrial genomes in Saccharomyces cerevisiae

John F. Wolters Kenneth Chiu Heather L. Fiumera 《BMC genomics》2015,16(1)

Background

Rigorous study of mitochondrial functions and cell biology in the budding yeast, Saccharomyces cerevisiae has advanced our understanding of mitochondrial genetics. This yeast is now a powerful model for population genetics, owing to large genetic diversity and highly structured populations among wild isolates. Comparative mitochondrial genomic analyses between yeast species have revealed broad evolutionary changes in genome organization and architecture. A fine-scale view of recent evolutionary changes within S. cerevisiae has not been possible due to low numbers of complete mitochondrial sequences.

Results

To address challenges of sequencing AT-rich and repetitive mitochondrial DNAs (mtDNAs), we sequenced two divergent S. cerevisiae mtDNAs using a single-molecule sequencing platform (PacBio RS). Using de novo assemblies, we generated highly accurate complete mtDNA sequences. These mtDNA sequences were compared with 98 additional mtDNA sequences gathered from various published collections. Phylogenies based on mitochondrial coding sequences and intron profiles revealed that intraspecific diversity in mitochondrial genomes generally recapitulated the population structure of nuclear genomes. Analysis of intergenic sequence indicated a recent expansion of mobile elements in certain populations. Additionally, our analyses revealed that certain populations lacked introns previously believed conserved throughout the species, as well as the presence of introns never before reported in S. cerevisiae.

Conclusions

Our results revealed that the extensive variation in S. cerevisiae mtDNAs is often population specific, thus offering a window into the recent evolutionary processes shaping these genomes. In addition, we offer an effective strategy for sequencing these challenging AT-rich mitochondrial genomes for small scale projects.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1664-4) contains supplementary material, which is available to authorized users. 相似文献

12.

Genomic factors related to tissue tropism in Chlamydia pneumoniae infection

Thomas Weinmaier Jonathan Hoser Sebastian Eck Inga Kaufhold Kensuke Shima Tim M Strom Thomas Rattei Jan Rupp 《BMC genomics》2015,16(1)

Background

Chlamydia pneumoniae (Cpn) are obligate intracellular bacteria that cause acute infections of the upper and lower respiratory tract and have been implicated in chronic inflammatory diseases. Although of significant clinical relevance, complete genome sequences of only four clinical Cpn strains have been obtained. All of them were isolated from the respiratory tract and shared more than 99% sequence identity. Here we investigate genetic differences on the whole-genome level that are related to Cpn tissue tropism and pathogenicity.

Results

We have sequenced the genomes of 18 clinical isolates from different anatomical sites (e.g. lung, blood, coronary arteries) of diseased patients, and one animal isolate. In total 1,363 SNP loci and 184 InDels have been identified in the genomes of all clinical Cpn isolates. These are distributed throughout the whole chlamydial genome and enriched in highly variable regions. The genomes show clear evidence of recombination in at least one potential region but no phage insertions. The tyrP gene was always encoded as single copy in all vascular isolates. Phylogenetic reconstruction revealed distinct evolutionary lineages containing primarily non-respiratory Cpn isolates. In one of these, clinical isolates from coronary arteries and blood monocytes were closely grouped together. They could be distinguished from all other isolates by characteristic nsSNPs in genes involved in RB to EB transition, inclusion membrane formation, bacterial stress response and metabolism.

Conclusions

This study substantially expands the genomic data of Cpn and elucidates its evolutionary history. The translation of the observed Cpn genetic differences into biological functions and the prediction of novel pathogen-oriented diagnostic strategies have to be further explored.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1377-8) contains supplementary material, which is available to authorized users. 相似文献

13.

Identification of genome-wide single nucleotide polymorphisms in allopolyploid crop Brassica napus

Shunmou Huang Linbin Deng Mei Guan Jiana Li Kun Lu Hanzhong Wang Donghui Fu Annaliese S Mason Shengyi Liu Wei Hua 《BMC genomics》2013,14(1)

Background

Single nucleotide polymorphisms (SNPs) are the most common type of genetic variation. Identification of large numbers of SNPs is helpful for genetic diversity analysis, map-based cloning, genome-wide association analyses and marker-assisted breeding. Recently, identifying genome-wide SNPs in allopolyploid Brassica napus (rapeseed, canola) by resequencing many accessions has become feasible, due to the availability of reference genomes of Brassica rapa (2n = AA) and Brassica oleracea (2n = CC), which are the progenitor species of B. napus (2n = AACC). Although many SNPs in B. napus have been released, the objective in the present study was to produce a larger, more informative set of SNPs for large-scale and efficient genotypic screening. Hence, short-read genome sequencing was conducted on ten elite B. napus accessions for SNP discovery. A subset of these SNPs was randomly selected for sequence validation and for genotyping efficiency testing using the Illumina GoldenGate assay.

Results

A total of 892,536 bi-allelic SNPs were discovered throughout the B. napus genome. A total of 36,458 putative amino acid variants were located in 13,552 protein-coding genes, which were predicted to have enriched binding and catalytic activity as a result. Using the GoldenGate genotyping platform, 94 of 96 SNPs sampled could effectively distinguish genotypes of 130 lines from two mapping populations, with an average call rate of 92%.

Conclusions

Despite the polyploid nature of B. napus, nearly 900,000 simple SNPs were identified by whole genome resequencing. These SNPs were predicted to be effective in high-throughput genotyping assays (51% polymorphic SNPs, 92% average call rate using the GoldenGate assay, leading to an estimated >450 000 useful SNPs). Hence, the development of a much larger genotyping array of informative SNPs is feasible. SNPs identified in this study to cause non-synonymous amino acid substitutions can also be utilized to directly identify causal genes in association studies. 相似文献

14.

Intraspecies comparison of Streptomyces pratensis genomes reveals high levels of recombination and gene conservation between strains of disparate geographic origin

James R Doroghazi Daniel H Buckley 《BMC genomics》2014,15(1)

Background

Streptomyces are widespread bacteria that contribute to the terrestrial carbon cycle and produce the majority of clinically useful antibiotics. While interspecific genomic diversity has been investigated among Streptomyces, information is lacking on intraspecific genomic diversity. Streptomyces pratensis has high rates of homologous recombination but the impact of such gene exchange on genome evolution and the evolution of natural product gene clusters remains uncharacterized.

Results

We report draft genome sequences of four S. pratensis strains and compare to the complete genome of Streptomyces flavogriseus IAF-45-CD (=ATCC 33331), a strain recently reclassified to S. pratensis. Despite disparate geographic origins, the genomes are highly similar with 85.9% of genes present in the core genome and conservation of all natural product gene clusters. Natural products include a novel combination of carbapenem and beta-lactamase inhibitor gene clusters. While high intraspecies recombination rates abolish the phylogenetic signal across the genome, intraspecies recombination is suppressed in two genomic regions. The first region is centered on an insertion/deletion polymorphism and the second on a hybrid NRPS-PKS gene. Finally, two gene families accounted for over 25% of the divergent genes in the core genome. The first includes homologs of bldB (required for spore development and antibiotic production) while the second includes homologs of an uncharacterized protein with a helix-turn-helix motif (hpb). Genes from these families co-occur with fifteen pairs spread across the genome. These genes have evidence for co-evolution of co-localized pairs, supporting previous assertions that these genes may function akin to a toxin-antitoxin system.

Conclusions

S. pratensis genomes are highly similar with exceptional levels of recombination which erase phylogenetic signal among strains of the species. This species has a large core genome and variable terminal regions that are smaller than those found in interspecies comparisons. There is no geographic differentiation between these strains, but there is evidence for local linkage disequilibrium affecting two genomic regions. We have also shown further observational evidence that the DUF397-HTH (bldB and hpb) are a novel toxin-antitoxin pair. 相似文献

15.

Genome diversification within a clonal population of pandemic Vibrio parahaemolyticus seems to depend on the life circumstances of each individual bacteria

David E Loyola Cristell Navarro Paulina Uribe Katherine García Claudia Mella Diego Díaz Natalia Valdes Jaime Martínez-Urtaza Romilio T Espejo 《BMC genomics》2015,16(1)

Background

New strains of Vibrio parahaemolyticus that cause diarrhea in humans by seafood ingestion periodically emerge through continuous evolution in the ocean. Influx and expansion in the Southern Chilean ocean of a highly clonal V. parahaemolyticus (serotype O3:K6) population from South East Asia caused one of the largest seafood-related diarrhea outbreaks in the world. Here, genomics analyses of isolates from this rapidly expanding clonal population offered an opportunity to observe the molecular evolutionary changes often obscured in more diverse populations.

Results

Whole genome sequence comparison of eight independent isolates of this population from mussels or clinical cases (from different years) was performed. Differences of 1366 to 217,729 bp genome length and 13 to 164 bp single nucleotide variants (SNVs) were found. Most genomic differences corresponded to the presence of regions unique to only one or two isolates, and were probably acquired by horizontal gene transfer (HGT). Some DNA gain was chromosomal but most was in plasmids. One isolate had a large region (8,644 bp) missing, which was probably caused by excision of a prophage. Genome innovation by the presence of unique DNA, attributable to HGT from related bacteria, varied greatly among the isolates, with values of 1,366 (ten times the number of highest number of SNVs) to 217,729 (a thousand times more than the number of highest number of SNVs).

Conclusions

The evolutionary forces (SNVs, HGT) acting on each isolate of the same population were found to differ to an extent that probably depended on the ecological scenario and life circumstances of each bacterium.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1385-8) contains supplementary material, which is available to authorized users. 相似文献

16.

Biogeography and Virulence of Staphylococcus aureus

Juan Fan Min Shu Ge Zhang Wei Zhou Yongmei Jiang Yu Zhu Guihua Chen Sharon J. Peacock Chaomin Wan Wubin Pan Edward J. Feil 《PloS one》2009,4(7)

Background

Staphylococcus aureus is commonly carried asymptomatically in the human anterior nares and occasionally enters the bloodstream to cause invasive disease. Much of the global diversity of S. aureus remains uncharacterised, and is not clear how disease propensity varies between strains, and between host populations.

Methodology

We compared 147 isolates recovered from five kindergartens in Chengdu, China, with 51 isolates contemporaneously recovered from cases of pediatric infection from the main hospital serving this community. The samples were characterised by MLST, the presence/absence of PVL, and antibiotic resistance profiling.

Principal Findings

Genotype frequencies within individual kindergartens differ, but the sample recovered from cases of disease shows a general enrichment of certain MLST genotypes and PVL positive isolates. Genotypes under-represented in the disease sample tend to correspond to a single sequence cluster, and this cluster is more common in China than in other parts of the world.

Conclusions/Significance

Virulence propensity likely reflects a synergy between variation in the core genome (MLST) and accessory genome (PVL). By combining evidence form biogeography and virulence we demonstrate the existence of a “native” clade in West China which has lowered virulence, possibility due to acquired host immunity. 相似文献

17.

Divergences in gene repertoire among the reference Prevotella genomes derived from distinct body sites of human

Vinod Kumar Gupta Narendrakumar M Chaudhari Suchismitha Iskepalli Chitra Dutta 《BMC genomics》2015,16(1)

Background

The community composition of the human microbiome is known to vary at distinct anatomical niches. But little is known about the nature of variations, if any, at the genome/sub-genome levels of a specific microbial community across different niches. The present report aims to explore, as a case study, the variations in gene repertoire of 28 Prevotella reference genomes derived from different body-sites of human, as reported earlier by the Human Microbiome Consortium.

Results

The pan-genome for Prevotella remains “open”. On an average, 17% of predicted protein-coding genes of any particular Prevotella genome represent the conserved core genes, while the remaining 83% contribute to the flexible and singletons. The study reveals exclusive presence of 11798, 3673, 3348 and 934 gene families and exclusive absence of 17, 221, 115 and 645 gene families in Prevotella genomes derived from human oral cavity, gastro-intestinal tracts (GIT), urogenital tract (UGT) and skin, respectively. Distribution of various functional COG categories differs significantly among the habitat-specific genes. No niche-specific variations could be observed in distribution of KEGG pathways.

Conclusions

Prevotella genomes derived from different body sites differ appreciably in gene repertoire, suggesting that these microbiome components might have developed distinct genetic strategies for niche adaptation within the host. Each individual microbe might also have a component of its own genetic machinery for host adaptation, as appeared from the huge number of singletons.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1350-6) contains supplementary material, which is available to authorized users. 相似文献

18.

RiTE database: a resource database for genus-wide rice genomics and evolutionary biology

Dario Copetti Jianwei Zhang Moaine El Baidouri Dongying Gao Jun Wang Elena Barghini Rosa M. Cossu Angelina Angelova Carlos E. Maldonado L. Stefan Roffler Hajime Ohyanagi Thomas Wicker Chuanzhu Fan Andrea Zuccolo Mingsheng Chen Antonio Costa de Oliveira Bin Han Robert Henry Yue-ie Hsing Nori Kurata Wen Wang Scott A. Jackson Olivier Panaud Rod A. Wing 《BMC genomics》2015,16(1)

Background

Comparative evolutionary analysis of whole genomes requires not only accurate annotation of gene space, but also proper annotation of the repetitive fraction which is often the largest component of most if not all genomes larger than 50 kb in size.

Results

Here we present the Rice TE database (RiTE-db) - a genus-wide collection of transposable elements and repeated sequences across 11 diploid species of the genus Oryza and the closely-related out-group Leersia perrieri. The database consists of more than 170,000 entries divided into three main types: (i) a classified and curated set of publicly-available repeated sequences, (ii) a set of consensus assemblies of highly-repetitive sequences obtained from genome sequencing surveys of 12 species; and (iii) a set of full-length TEs, identified and extracted from 12 whole genome assemblies.

Conclusions

This is the first report of a repeat dataset that spans the majority of repeat variability within an entire genus, and one that includes complete elements as well as unassembled repeats. The database allows sequence browsing, downloading, and similarity searches. Because of the strategy adopted, the RiTE-db opens a new path to unprecedented direct comparative studies that span the entire nuclear repeat content of 15 million years of Oryza diversity.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1762-3) contains supplementary material, which is available to authorized users. 相似文献

19.

Identification of three extra-chromosomal replicons in Leptospira pathogenic strain and development of new shuttle vectors

Weinan Zhu Jin Wang Yongzhang Zhu Biao Tang Yunyi Zhang Ping He Yan Zhang Boyu Liu Xiaokui Guo Guoping Zhao Jinhong Qin 《BMC genomics》2015,16(1)

Background

The genome of pathogenic Leptospira interrogans contains two chromosomes. Plasmids and prophages are known to play specific roles in gene transfer in bacteria and can potentially serve as efficient genetic tools in these organisms. Although plasmids and prophage remnants have recently been reported in Leptospira species, their characteristics and potential applications in leptospiral genetic transformation systems have not been fully evaluated.

Results

Three extrachromosomal replicons designated lcp1 (65,732 bp), lcp2 (56,757 bp), and lcp3 (54,986 bp) in the L. interrogans serovar Linhai strain 56609 were identified through whole genome sequencing. All three replicons were stable outside of the bacterial chromosomes. Phage particles were observed in the culture supernatant of 56609 after mitomycin C induction, and lcp3, which contained phage-related genes, was considered to be an inducible prophage. L. interrogans–Escherichia coli shuttle vectors, constructed with the predicted replication elements of single rep or rep combined with parAB loci from the three plasmids were shown to successfully transform into both saprophytic and pathogenic Leptospira species, suggesting an essential function for rep genes in supporting auto-replication of the plasmids. Additionally, a wide distribution of homologs of the three rep genes was identified in L. interrogans isolates, and correlation tests showed that the transformability of the shuttle vectors in L. interrogans isolates depended, to certain extent, on genetic compatibility between the rep sequences of both plasmid and host.

Conclusions

Three extrachromosomal replicons co-exist in L. interrogans, one of which we consider to be an inducible prophage. The vectors constructed with the rep genes of the three replicons successfully transformed into saprophytic and pathogenic Leptospira species alike, but this was partly dependent on genetic compatibility between the rep sequences of both plasmid and host.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1321-y) contains supplementary material, which is available to authorized users. 相似文献

20.

Identification of prophages in bacterial genomes by dinucleotide relative abundance difference

Srividhya KV Alaguraj V Poornima G Kumar D Singh GP Raghavenderan L Katta AV Mehta P Krishnaswamy S 《PloS one》2007,2(11):e1193

Background

Prophages are integrated viral forms in bacterial genomes that have been found to contribute to interstrain genetic variability. Many virulence-associated genes are reported to be prophage encoded. Present computational methods to detect prophages are either by identifying possible essential proteins such as integrases or by an extension of this technique, which involves identifying a region containing proteins similar to those occurring in prophages. These methods suffer due to the problem of low sequence similarity at the protein level, which suggests that a nucleotide based approach could be useful.

Methodology

Earlier dinucleotide relative abundance (DRA) have been used to identify regions, which deviate from the neighborhood areas, in genomes. We have used the difference in the dinucleotide relative abundance (DRAD) between the bacterial and prophage DNA to aid location of DNA stretches that could be of prophage origin in bacterial genomes. Prophage sequences which deviate from bacterial regions in their dinucleotide frequencies are detected by scanning bacterial genome sequences. The method was validated using a subset of genomes with prophage data from literature reports. A web interface for prophage scan based on this method is available at http://bicmku.in:8082/prophagedb/dra.html. Two hundred bacterial genomes which do not have annotated prophages have been scanned for prophage regions using this method.

Conclusions

The relative dinucleotide distribution difference helps detect prophage regions in genome sequences. The usefulness of this method is seen in the identification of 461 highly probable loci pertaining to prophages which have not been annotated so earlier. This work emphasizes the need to extend the efforts to detect and annotate prophage elements in genome sequences. 相似文献