期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Sub classification and targeted characterization of prophage-encoded two-component cell lysis cassette

K. V. Srividhya S. Krishnaswamy 《Journal of biosciences》2007,32(1):979-990

Bacteriophage induced lysis of host bacterial cell is mediated by a two component cell lysis cassette comprised of holin and lysozyme. Prophages are integrated forms of bacteriophages in bacterial genomes providing a repertoire for bacterial evolution. Analysis using the prophage database ( http://bicmku.in:8082 ) constructed by us showed 47 prophages were associated with putative two component cell lysis genes. These proteins cluster into four different subgroups. In this process, a putative holin (essd) and endolysin (ybcS), encoded by the defective lambdoid prophage DLP12 was found to be similar to two component cell lysis genes in functional bacteriophages like p21 and P1. The holin essd was found to have a characteristic dual start motif with two transmembrane regions and C-terminal charged residues as in class II holins. Expression of a fusion construct of essd in Escherichia coli showed slow growth. However, under appropriate conditions, this protein could be over expressed and purified for structure function studies. The second component of the cell lysis cassette, ybcS, was found to have an N-terminal SAR (Signal Arrest Release) transmembrane domain. The construct of ybcS has been over expressed in E. coli and the purified protein was functional, exhibiting lytic activity against E. coli and Salmonella typhi cell wall substrate. Such targeted sequence-structure-function characterization of proteins encoded by cryptic prophages will help understand the contribution of prophage proteins to bacterial evolution. 相似文献

2.

Prophage-like elements present in Mycobacterium genomes

Xiangyu Fan Longxiang Xie Wu Li Jianping Xie 《BMC genomics》2014,15(1)

Background

Prophages, integral components of many bacterial genomes, play significant roles in cognate host bacteria, such as virulence, toxin biosynthesis and secretion, fitness cost, genomic variations, and evolution. Many prophages and prophage-like elements present in sequenced bacterial genomes, such as Bifidobacteria, Lactococcus and Streptococcus, have been described. However, information for the prophage of Mycobacterium remains poorly defined.

Results

In this study, based on the search of the complete genome database from GenBank, the Whole Genome Shotgun (WGS) databases, and some published literatures, thirty-three prophages were described in detail. Eleven of them were full-length prophages, and others were prophage-like elements. Eleven prophages were firstly revealed. They were phiMAV_1, phiMAV_2, phiMmcs_1, phiMmcs_2, phiMkms_1, phiMkms_2, phiBN42_1, phiBN44_1, phiMCAN_1, phiMycsm_1, and phiW7S_1. Their genomes and gene contents were firstly analyzed. Furthermore, comparative genomics analyses among mycobacterioprophages showed that full-length prophage phi172_2 belonged to mycobacteriophage Cluster A and the phiMmcs_1, phiMkms_1, phiBN44_1, and phiMCAN_1 shared high homology and could be classified into one group.

Conclusions

To our knowledge, this is the first systematic characterization of mycobacterioprophages, their genomic organization and phylogeny. This information will afford more understanding of the biology of Mycobacterium.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-243) contains supplementary material, which is available to authorized users. 相似文献

3.

High resolution assembly and characterization of genomes of Canadian isolates of Salmonella Enteritidis

Dele Ogunremi John Devenish Kingsley Amoako Hilary Kelly Andrée Ann Dupras Sebastien Belanger Lin Ru Wang 《BMC genomics》2014,15(1)

Background

There is a need to characterize genomes of the foodborne pathogen, Salmonella enterica serovar Enteritidis (SE) and identify genetic information that could be ultimately deployed for differentiating strains of the organism, a need that is yet to be addressed mainly because of the high degree of clonality of the organism. In an effort to achieve the first characterization of the genomes of SE of Canadian origin, we carried out massively parallel sequencing of the nucleotide sequence of 11 SE isolates obtained from poultry production environments (n = 9), a clam and a chicken, assembled finished genomes and investigated diversity of the SE genome.

Results

The median genome size was 4,678,683 bp. A total of 4,833 chromosomal genes defined the pan genome of our field SE isolates consisting of 4,600 genes present in all the genomes, i.e., core genome, and 233 genes absent in at least one genome (accessory genome). Genome diversity was demonstrable by the presence of 1,360 loci showing single nucleotide polymorphism (SNP) in the core genome which was used to portray the genetic distances by means of a phylogenetic tree for the SE isolates. The accessory genome consisted mostly of previously identified SE prophage sequences as well as two, apparently full- sized, novel prophages namely a 28 kb sequence provisionally designated as SE-OLF-10058 (3) prophage and a 43 kb sequence provisionally designated as SE-OLF-10012 prophage.

Conclusions

The number of SNPs identified in the relatively large core genome of SE is a reflection of substantial diversity that could be exploited for strain differentiation as shown by the development of an informative phylogenetic tree. Prophage sequences can also be exploited for SE strain differentiation and lineage tracking. This work has laid the ground work for further studies to develop a readily adoptable laboratory test for the subtyping of SE.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-713) contains supplementary material, which is available to authorized users. 相似文献

4.

Using the taxon-specific genes for the taxonomic classification of bacterial genomes

Ankit Gupta Vineet K Sharma 《BMC genomics》2015,16(1)

Background

The correct taxonomic assignment of bacterial genomes is a primary and challenging task. With the availability of whole genome sequences, the gene content based approaches appear promising in inferring the bacterial taxonomy. The complete genome sequencing of a bacterial genome often reveals a substantial number of unique genes present only in that genome which can be used for its taxonomic classification.

Results

In this study, we have proposed a comprehensive method which uses the taxon-specific genes for the correct taxonomic assignment of existing and new bacterial genomes. The taxon-specific genes identified at each taxonomic rank have been successfully used for the taxonomic classification of 2,342 genomes present in the NCBI genomes, 36 newly sequenced genomes, and 17 genomes for which the complete taxonomy is not yet known. This approach has been implemented for the development of a tool ‘Microtaxi’ which can be used for the taxonomic assignment of complete bacterial genomes.

Conclusion

The taxon-specific gene based approach provides an alternate valuable methodology to carry out the taxonomic classification of newly sequenced or existing bacterial genomes.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1542-0) contains supplementary material, which is available to authorized users. 相似文献

5.

Murasaki: a fast, parallelizable algorithm to find anchors from multiple genomes

Popendorf K Tsuyoshi H Osana Y Sakakibara Y 《PloS one》2010,5(9):e12651

Background

With the number of available genome sequences increasing rapidly, the magnitude of sequence data required for multiple-genome analyses is a challenging problem. When large-scale rearrangements break the collinearity of gene orders among genomes, genome comparison algorithms must first identify sets of short well-conserved sequences present in each genome, termed anchors. Previously, anchor identification among multiple genomes has been achieved using pairwise alignment tools like BLASTZ through progressive alignment tools like TBA, but the computational requirements for sequence comparisons of multiple genomes quickly becomes a limiting factor as the number and scale of genomes grows.

Methodology/Principal Findings

Our algorithm, named Murasaki, makes it possible to identify anchors within multiple large sequences on the scale of several hundred megabases in few minutes using a single CPU. Two advanced features of Murasaki are (1) adaptive hash function generation, which enables efficient use of arbitrary mismatch patterns (spaced seeds) and therefore the comparison of multiple mammalian genomes in a practical amount of computation time, and (2) parallelizable execution that decreases the required wall-clock and CPU times. Murasaki can perform a sensitive anchoring of eight mammalian genomes (human, chimp, rhesus, orangutan, mouse, rat, dog, and cow) in 21 hours CPU time (42 minutes wall time). This is the first single-pass in-core anchoring of multiple mammalian genomes. We evaluated Murasaki by comparing it with the genome alignment programs BLASTZ and TBA. We show that Murasaki can anchor multiple genomes in near linear time, compared to the quadratic time requirements of BLASTZ and TBA, while improving overall accuracy.

Conclusions/Significance

Murasaki provides an open source platform to take advantage of long patterns, cluster computing, and novel hash algorithms to produce accurate anchors across multiple genomes with computational efficiency significantly greater than existing methods. Murasaki is available under GPL at http://murasaki.sourceforge.net. 相似文献

6.

Subclassification and targeted characterization of prophage-encoded two-component cell lysis cassette

Srividhya KV Krishnaswamy S 《Journal of biosciences》2007,32(5):979-990

Bacteriophage induced lysis of host bacterial cell is mediated by a two component cell lysis cassette comprised of holin and lysozyme. Prophages are integrated forms of bacteriophages in bacterial genomes providing a repertoire for bacterial evolution. Analysis using the prophage database (http://bicmku.in:8082) constructed by us showed 47 prophages were associated with putative two component cell lysis genes. These proteins cluster into four different subgroups. In this process, a putative holin (essd) and endolysin (ybcS), encoded by the defective lambdoid prophage DLP12 was found to be similar to two component cell lysis genes in functional bacteriophages like p21 and P1. The holin essd was found to have a characteristic dual start motif with two transmembrane regions and C-terminal charged residues as in class II holins. Expression of a fusion construct of essd in Escherichia coli showed slow growth. However, under appropriate conditions, this protein could be over expressed and purified for structure function studies.The second component of the cell lysis cassette, ybcS, was found to have an N-terminal SAR (Signal Arrest Release) transmembrane domain. The construct of ybcS has been over expressed in E.coli and the purified protein was functional, exhibiting lytic activity against E.coli and Salmonella typhi cell wall substrate. Such targeted sequence- structure-function characterization of proteins encoded by cryptic prophages will help understand the contribution of prophage proteins to bacterial evolution. 相似文献

7.

Intergenomic single nucleotide polymorphisms as a tool for bacterial artificial chromosome contig building of homoeologous Brassica napus regions

Hieu Xuan Cao Renate Schmidt 《BMC genomics》2014,15(1)

Background

Homoeologous sequences pose a particular challenge if bacterial artificial chromosome (BAC) contigs shall be established for specific regions of an allopolyploid genome. Single nucleotide polymorphisms (SNPs) differentiating between homoeologous genomes (intergenomic SNPs) may represent a suitable screening tool for such purposes, since they do not only identify homoeologous sequences but also differentiate between them.

Results

Sequence alignments between Brassica rapa (AA) and Brassica oleracea (CC) sequences mapping to corresponding regions on chromosomes A1 and C1, respectively were used to identify single nucleotide polymorphisms between the A and C genomes. A large fraction of these polymorphisms was also present in Brassica napus (AACC), an allopolyploid species that originated from hybridisation of A and C genome species. Intergenomic SNPs mapping throughout homoeologous chromosome segments spanning approximately one Mbp each were included in Illumina’s GoldenGate® Genotyping Assay and used to screen multidimensional pools of a Brassica napus bacterial artificial chromosome library with tenfold genome coverage. Based on the results of 50 SNP assays, a BAC contig for the Brassica napus A subgenome was established that spanned the entire region of interest. The C subgenome region was represented in three BAC contigs.

Conclusions

This proof-of-concept study shows that sequence resources of diploid progenitor genomes can be used to deduce intergenomic SNPs suitable for multiplex polymerase chain reaction (PCR)-based screening of multidimensional BAC pools of a polyploid organism. Owing to their high abundance and ease of identification, intergenomic SNPs represent a versatile tool to establish BAC contigs for homoeologous regions of a polyploid genome.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-560) contains supplementary material, which is available to authorized users. 相似文献

8.

UniDrug-target: a computational tool to identify unique drug targets in pathogenic bacteria

Chanumolu SK Rout C Chauhan RS 《PloS one》2012,7(3):e32833

Background

Targeting conserved proteins of bacteria through antibacterial medications has resulted in both the development of resistant strains and changes to human health by destroying beneficial microbes which eventually become breeding grounds for the evolution of resistances. Despite the availability of more than 800 genomes sequences, 430 pathways, 4743 enzymes, 9257 metabolic reactions and protein (three-dimensional) 3D structures in bacteria, no pathogen-specific computational drug target identification tool has been developed.

Methods

A web server, UniDrug-Target, which combines bacterial biological information and computational methods to stringently identify pathogen-specific proteins as drug targets, has been designed. Besides predicting pathogen-specific proteins essentiality, chokepoint property, etc., three new algorithms were developed and implemented by using protein sequences, domains, structures, and metabolic reactions for construction of partial metabolic networks (PMNs), determination of conservation in critical residues, and variation analysis of residues forming similar cavities in proteins sequences. First, PMNs are constructed to determine the extent of disturbances in metabolite production by targeting a protein as drug target. Conservation of pathogen-specific protein''s critical residues involved in cavity formation and biological function determined at domain-level with low-matching sequences. Last, variation analysis of residues forming similar cavities in proteins sequences from pathogenic versus non-pathogenic bacteria and humans is performed.

Results

The server is capable of predicting drug targets for any sequenced pathogenic bacteria having fasta sequences and annotated information. The utility of UniDrug-Target server was demonstrated for Mycobacterium tuberculosis (H37Rv). The UniDrug-Target identified 265 mycobacteria pathogen-specific proteins, including 17 essential proteins which can be potential drug targets.

Conclusions/Significance

UniDrug-Target is expected to accelerate pathogen-specific drug targets identification which will increase their success and durability as drugs developed against them have less chance to develop resistances and adverse impact on environment. The server is freely available at http://117.211.115.67/UDT/main.html. The standalone application (source codes) is available at http://www.bioinformatics.org/ftp/pub/bioinfojuit/UDT.rar. 相似文献

9.

High sensitivity TSS prediction: estimates of locations where TSS cannot occur

Schaefer U Kodzius R Kai C Kawai J Carninci P Hayashizaki Y Bajic VB 《PloS one》2010,5(11):e13934

相似文献

10.

YOC,A new strategy for pairwise alignment of collinear genomes

Raluca Uricaru Célia Michotey Hélène Chiapello Eric Rivals 《BMC bioinformatics》2015,16(1)

Background

Comparing and aligning genomes is a key step in analyzing closely related genomes. Despite the development of many genome aligners in the last 15 years, the problem is not yet fully resolved, even when aligning closely related bacterial genomes of the same species. In addition, no procedures are available to assess the quality of genome alignments or to compare genome aligners.

Results

We designed an original method for pairwise genome alignment, named YOC, which employs a highly sensitive similarity detection method together with a recent collinear chaining strategy that allows overlaps. YOC improves the reliability of collinear genome alignments, while preserving or even improving sensitivity. We also propose an original qualitative evaluation criterion for measuring the relevance of genome alignments. We used this criterion to compare and benchmark YOC with five recent genome aligners on large bacterial genome datasets, and showed it is suitable for identifying the specificities and the potential flaws of their underlying strategies.

Conclusions

The YOC prototype is available at https://github.com/ruricaru/YOC. It has several advantages over existing genome aligners: (1) it is based on a simplified two phase alignment strategy, (2) it is easy to parameterize, (3) it produces reliable genome alignments, which are easier to analyze and to use.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-015-0530-3) contains supplementary material, which is available to authorized users. 相似文献

11.

A Genome-Wide Analysis of Genetic Diversity in Trypanosoma cruzi Intergenic Regions

Leonardo G. Panunzi Fernán Agüero 《PLoS neglected tropical diseases》2014,8(5)

Background

Trypanosoma cruzi is the causal agent of Chagas Disease. Recently, the genomes of representative strains from two major evolutionary lineages were sequenced, allowing the construction of a detailed genetic diversity map for this important parasite. However this map is focused on coding regions of the genome, leaving a vast space of regulatory regions uncharacterized in terms of their evolutionary conservation and/or divergence.

Methodology

Using data from the hybrid CL Brener and Sylvio X10 genomes (from the TcVI and TcI Discrete Typing Units, respectively), we identified intergenic regions that share a common evolutionary ancestry, and are present in both CL Brener haplotypes (TcII-like and TcIII-like) and in the TcI genome; as well as intergenic regions that were conserved in only two of the three genomes/haplotypes analyzed. The genetic diversity in these regions was characterized in terms of the accumulation of indels and nucleotide changes.

Principal Findings

Based on this analysis we have identified i) a core of highly conserved intergenic regions, which remained essentially unchanged in independently evolving lineages; ii) intergenic regions that show high diversity in spite of still retaining their corresponding upstream and downstream coding sequences; iii) a number of defined sequence motifs that are shared by a number of unrelated intergenic regions. A fraction of indels explains the diversification of some intergenic regions by the expansion/contraction of microsatellite-like repeats. 相似文献

12.

The repetitive component of the A genome of peanut (Arachis hypogaea) and its role in remodelling intergenic sequence space since its evolutionary divergence from the B genome

David J. Bertioli Bruna Vidigal Stephan Nielen Milind B. Ratnaparkhe Tae-Ho Lee Soraya C. M. Leal-Bertioli Changsoo Kim Patricia M. Guimar?es Guillermo Seijo Trude Schwarzacher Andrew H. Paterson Pat Heslop-Harrison Ana C. G. Araujo 《Annals of botany》2013,112(3):545-559

Background and Aims

Peanut (Arachis hypogaea) is an allotetraploid (AABB-type genome) of recent origin, with a genome of about 2·8 Gb and a high repetitive content. This study reports an analysis of the repetitive component of the peanut A genome using bacterial artificial chromosome (BAC) clones from A. duranensis, the most probable A genome donor, and the probable consequences of the activity of these elements since the divergence of the peanut A and B genomes.

Methods

The repetitive content of the A genome was analysed by using A. duranensis BAC clones as probes for fluorescence in situ hybridization (BAC-FISH), and by sequencing and characterization of 12 genomic regions. For the analysis of the evolutionary dynamics, two A genome regions are compared with their B genome homeologues.

Key Results

BAC-FISH using 27 A. duranensis BAC clones as probes gave dispersed and repetitive DNA characteristic signals, predominantly in interstitial regions of the peanut A chromosomes. The sequences of 14 BAC clones showed complete and truncated copies of ten abundant long terminal repeat (LTR) retrotransposons, characterized here. Almost all dateable transposition events occurred <3·5 million years ago, the estimated date of the divergence of A and B genomes. The most abundant retrotransposon is Feral, apparently parasitic on the retrotransposon FIDEL, followed by Pipa, also non-autonomous and probably parasitic on a retrotransposon we named Pipoka. The comparison of the A and B genome homeologous regions showed conserved segments of high sequence identity, punctuated by predominantly indel regions without significant similarity.

Conclusions

A substantial proportion of the highly repetitive component of the peanut A genome appears to be accounted for by relatively few LTR retrotransposons and their truncated copies or solo LTRs. The most abundant of the retrotransposons are non-autonomous. The activity of these retrotransposons has been a very significant driver of genome evolution since the evolutionary divergence of the A and B genomes. 相似文献

13.

Identification of three extra-chromosomal replicons in Leptospira pathogenic strain and development of new shuttle vectors

Weinan Zhu Jin Wang Yongzhang Zhu Biao Tang Yunyi Zhang Ping He Yan Zhang Boyu Liu Xiaokui Guo Guoping Zhao Jinhong Qin 《BMC genomics》2015,16(1)

Background

The genome of pathogenic Leptospira interrogans contains two chromosomes. Plasmids and prophages are known to play specific roles in gene transfer in bacteria and can potentially serve as efficient genetic tools in these organisms. Although plasmids and prophage remnants have recently been reported in Leptospira species, their characteristics and potential applications in leptospiral genetic transformation systems have not been fully evaluated.

Results

Three extrachromosomal replicons designated lcp1 (65,732 bp), lcp2 (56,757 bp), and lcp3 (54,986 bp) in the L. interrogans serovar Linhai strain 56609 were identified through whole genome sequencing. All three replicons were stable outside of the bacterial chromosomes. Phage particles were observed in the culture supernatant of 56609 after mitomycin C induction, and lcp3, which contained phage-related genes, was considered to be an inducible prophage. L. interrogans–Escherichia coli shuttle vectors, constructed with the predicted replication elements of single rep or rep combined with parAB loci from the three plasmids were shown to successfully transform into both saprophytic and pathogenic Leptospira species, suggesting an essential function for rep genes in supporting auto-replication of the plasmids. Additionally, a wide distribution of homologs of the three rep genes was identified in L. interrogans isolates, and correlation tests showed that the transformability of the shuttle vectors in L. interrogans isolates depended, to certain extent, on genetic compatibility between the rep sequences of both plasmid and host.

Conclusions

Three extrachromosomal replicons co-exist in L. interrogans, one of which we consider to be an inducible prophage. The vectors constructed with the rep genes of the three replicons successfully transformed into saprophytic and pathogenic Leptospira species alike, but this was partly dependent on genetic compatibility between the rep sequences of both plasmid and host.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1321-y) contains supplementary material, which is available to authorized users. 相似文献

14.

Discovery of common sequences absent in the human reference genome using pooled samples from next generation sequencing

Yu Liu Mehmet Koyutürk Sean Maxwell Min Xiang Martina Veigl Richard S Cooper Bamidele O Tayo Li Li Thomas LaFramboise Zhenghe Wang Xiaofeng Zhu Mark R Chance 《BMC genomics》2014,15(1)

相似文献

15.

PCR primers for metazoan mitochondrial 12S ribosomal DNA sequences

Machida RJ Kweskin M Knowlton N 《PloS one》2012,7(4):e35887

Background

Assessment of the biodiversity of communities of small organisms is most readily done using PCR-based analysis of environmental samples consisting of mixtures of individuals. Known as metagenetics, this approach has transformed understanding of microbial communities and is beginning to be applied to metazoans as well. Unlike microbial studies, where analysis of the 16S ribosomal DNA sequence is standard, the best gene for metazoan metagenetics is less clear. In this study we designed a set of PCR primers for the mitochondrial 12S ribosomal DNA sequence based on 64 complete mitochondrial genomes and then tested their efficacy.

Methodology/Principal Findings

A total of the 64 complete mitochondrial genome sequences representing all metazoan classes available in GenBank were downloaded using the NCBI Taxonomy Browser. Alignment of sequences was performed for the excised mitochondrial 12S ribosomal DNA sequences, and conserved regions were identified for all 64 mitochondrial genomes. These regions were used to design a primer pair that flanks a more variable region in the gene. Then all of the complete metazoan mitochondrial genomes available in NCBI''s Organelle Genome Resources database were used to determine the percentage of taxa that would likely be amplified using these primers. Results suggest that these primers will amplify target sequences for many metazoans.

Conclusions/Significance

Newly designed 12S ribosomal DNA primers have considerable potential for metazoan metagenetic analysis because of their ability to amplify sequences from many metazoans. 相似文献

16.

Core and accessory genome architecture in a group of Pseudomonas aeruginosa Mu-like phages

Adrián Cazares Guillermo Mendoza-Hernández Gabriel Guarneros 《BMC genomics》2014,15(1)

Background

Bacteriophages that infect the opportunistic pathogen Pseudomonas aeruginosa have been classified into several groups. One of them, which includes temperate phage particles with icosahedral heads and long flexible tails, bears genomes whose architecture and replication mechanism, but not their nucleotide sequences, are like those of coliphage Mu. By comparing the genomic sequences of this group of P. aeruginosa phages one could draw conclusions about their ontogeny and evolution.

Results

Two newly isolated Mu-like phages of P. aeruginosa are described and their genomes sequenced and compared with those available in the public data banks. The genome sequences of the two phages are similar to each other and to those of a group of P. aeruginosa transposable phages. Comparing twelve of these genomes revealed a common genomic architecture in the group. Each phage genome had numerous genes with homologues in all the other genomes and a set of variable genes specific for each genome. The first group, which comprised most of the genes with assigned functions, was named “core genome”, and the second group, containing mostly short ORFs without assigned functions was called “accessory genome”. Like in other phage groups, variable genes are confined to specific regions in the genome.

Conclusion

Based on the known and inferred functions for some of the variable genes of the phages analyzed here, they appear to confer selective advantages for the phage survival under particular host conditions. We speculate that phages have developed a mechanism for horizontally acquiring genes to incorporate them at specific loci in the genome that help phage adaptation to the selective pressures imposed by the host.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-1146) contains supplementary material, which is available to authorized users. 相似文献

17.

Whole genome comparative analysis of transposable elements provides new insight into mechanisms of their inactivation in fungal genomes

Jo?lle Amselem Marc-Henri Lebrun Hadi Quesneville 《BMC genomics》2015,16(1)

Background

Transposable Elements (TEs) are key components that shape the organization and evolution of genomes. Fungi have developed defense mechanisms against TE invasion such as RIP (Repeat-Induced Point mutation), MIP (Methylation Induced Premeiotically) and Quelling (RNA interference). RIP inactivates repeated sequences by promoting Cytosine to Thymine mutations, whereas MIP only methylates TEs at C residues. Both mechanisms require specific cytosine DNA Methyltransferases (RID1/Masc1) of the Dnmt1 superfamily.

Results

We annotated TE sequences from 10 fungal genomes with different TE content (1-70%). We then used these TE sequences to carry out a genome-wide analysis of C to T mutations biases. Genomes from either Ascomycota or Basidiomycota that were massively invaded by TEs (Blumeria, Melampsora, Puccinia) were characterized by a low frequency of C to T mutation bias (10-20%), whereas other genomes displayed intermediate to high frequencies (25-75%). We identified several dinucleotide signatures at these C to T mutation sites (CpA, CpT, and CpG). Phylogenomic analysis of fungal Dnmt1 MTases revealed a previously unreported association between these dinucleotide signatures and the presence/absence of sub-classes of Dnmt1.

Conclusions

We identified fungal genomes containing large numbers of TEs with many C to T mutations associated with species-specific dinucleotide signatures. This bias suggests that a basic defense mechanism against TE invasion similar to RIP is widespread in fungi, although the efficiency and specificity of this mechanism differs between species. Our analysis revealed that dinucleotide signatures are associated with the presence/absence of specific Dnmt1 subfamilies. In particular, an RID1-dependent RIP mechanism was found only in Ascomycota.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1347-1) contains supplementary material, which is available to authorized users. 相似文献

18.

Prophage genomics.

Carlos Canchaya Caroline Proux Ghislain Fournous Anne Bruttin Harald Brüssow 《Microbiology and molecular biology reviews》2003,67(2):238-76, table of contents

The majority of the bacterial genome sequences deposited in the National Center for Biotechnology Information database contain prophage sequences. Analysis of the prophages suggested that after being integrated into bacterial genomes, they undergo a complex decay process consisting of inactivating point mutations, genome rearrangements, modular exchanges, invasion by further mobile DNA elements, and massive DNA deletion. We review the technical difficulties in defining such altered prophage sequences in bacterial genomes and discuss theoretical frameworks for the phage-bacterium interaction at the genomic level. The published genome sequences from three groups of eubacteria (low- and high-G+C gram-positive bacteria and gamma-proteobacteria) were screened for prophage sequences. The prophages from Streptococcus pyogenes served as test case for theoretical predictions of the role of prophages in the evolution of pathogenic bacteria. The genomes from further human, animal, and plant pathogens, as well as commensal and free-living bacteria, were included in the analysis to see whether the same principles of prophage genomics apply for bacteria living in different ecological niches and coming from distinct phylogenetical affinities. The effect of selection pressure on the host bacterium is apparently an important force shaping the prophage genomes in low-G+C gram-positive bacteria and gamma-proteobacteria. 相似文献

19.

Using comparative genomics to reorder the human genome sequence into a virtual sheep genome 总被引：4，自引：1，他引：3

Dalrymple BP Kirkness EF Nefedov M McWilliam S Ratnakumar A Barris W Zhao S Shetty J Maddox JF O'Grady M Nicholas F Crawford AM Smith T de Jong PJ McEwan J Oddy VH Cockett NE;International Sheep Genomics Consortium 《Genome biology》2007,8(7):R152-20

Background

Is it possible to construct an accurate and detailed subgene-level map of a genome using bacterial artificial chromosome (BAC) end sequences, a sparse marker map, and the sequences of other genomes?

Results

A sheep BAC library, CHORI-243, was constructed and the BAC end sequences were determined and mapped with high sensitivity and low specificity onto the frameworks of the human, dog, and cow genomes. To maximize genome coverage, the coordinates of all BAC end sequence hits to the cow and dog genomes were also converted to the equivalent human genome coordinates. The 84,624 sheep BACs (about 5.4-fold genome coverage) with paired ends in the correct orientation (tail-to-tail) and spacing, combined with information from sheep BAC comparative genome contigs (CGCs) built separately on the dog and cow genomes, were used to construct 1,172 sheep BAC-CGCs, covering 91.2% of the human genome. Clustered non-tail-to-tail and outsize BACs located close to the ends of many BAC-CGCs linked BAC-CGCs covering about 70% of the genome to at least one other BAC-CGC on the same chromosome. Using the BAC-CGCs, the intrachromosomal and interchromosomal BAC-CGC linkage information, human/cow and vertebrate synteny, and the sheep marker map, a virtual sheep genome was constructed. To identify BACs potentially located in gaps between BAC-CGCs, an additional set of 55,668 sheep BACs were positioned on the sheep genome with lower confidence. A coordinate conversion process allowed us to transfer human genes and other genome features to the virtual sheep genome to display on a sheep genome browser.

Conclusion

We demonstrate that limited sequencing of BACs combined with positioning on a well assembled genome and integrating locations from other less well assembled genomes can yield extensive, detailed subgene-level maps of mammalian genomes, for which genomic resources are currently limited. 相似文献

20.

Young,intact and nested retrotransposons are abundant in the onion and asparagus genomes

C. Vitte M. C. Estep J. Leebens-Mack J. L. Bennetzen 《Annals of botany》2013,112(5):881-889

Background and Aims

Although monocotyledonous plants comprise one of the two major groups of angiosperms and include >65 000 species, comprehensive genome analysis has been focused mainly on the Poaceae (grass) family. Due to this bias, most of the conclusions that have been drawn for monocot genome evolution are based on grasses. It is not known whether these conclusions apply to many other monocots.

Methods

To extend our understanding of genome evolution in the monocots, Asparagales genomic sequence data were acquired and the structural properties of asparagus and onion genomes were analysed. Specifically, several available onion and asparagus bacterial artificial chromosomes (BACs) with contig sizes >35 kb were annotated and analysed, with a particular focus on the characterization of long terminal repeat (LTR) retrotransposons.

Key Results

The results reveal that LTR retrotransposons are the major components of the onion and garden asparagus genomes. These elements are mostly intact (i.e. with two LTRs), have mainly inserted within the past 6 million years and are piled up into nested structures. Analysis of shotgun genomic sequence data and the observation of two copies for some transposable elements (TEs) in annotated BACs indicates that some families have become particularly abundant, as high as 4–5 % (asparagus) or 3–4 % (onion) of the genome for the most abundant families, as also seen in large grass genomes such as wheat and maize.

Conclusions

Although previous annotations of contiguous genomic sequences have suggested that LTR retrotransposons were highly fragmented in these two Asparagales genomes, the results presented here show that this was largely due to the methodology used. In contrast, this current work indicates an ensemble of genomic features similar to those observed in the Poaceae. 相似文献