Similar articles
Found 20 similar articles (search time: 31 ms)
1.
Diversity-generating retroelements (DGRs) are in vivo sequence diversification machines that are widely distributed in bacterial, phage, and plasmid genomes. They function to introduce vast amounts of targeted diversity into protein-encoding DNA sequences via mutagenic homing. Adenine residues are converted to random nucleotides in a retrotransposition process from a donor template repeat (TR) to a recipient variable repeat (VR). Using the Bordetella bacteriophage BPP-1 element as a prototype, we have characterized requirements for DGR target site function. Although sequences upstream of VR are dispensable, a 24 bp sequence immediately downstream of VR, which contains short inverted repeats, is required for efficient retrohoming. The inverted repeats form a hairpin or cruciform structure and mutational analysis demonstrated that, while the structure of the stem is important, its sequence can vary. In contrast, the loop has a sequence-dependent function. Structure-specific nuclease digestion confirmed the existence of a DNA hairpin/cruciform, and marker coconversion assays demonstrated that it influences the efficiency, but not the site of cDNA integration. Comparisons with other phage DGRs suggested that similar structures are a conserved feature of target sequences. Using a kanamycin resistance determinant as a reporter, we found that transplantation of the IMH and hairpin/cruciform-forming region was sufficient to target the DGR diversification machinery to a heterologous gene. In addition to furthering our understanding of DGR retrohoming, our results suggest that DGRs may provide unique tools for directed protein evolution via in vivo DNA diversification.
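The adenine-specific step described in this abstract can be illustrated with a toy simulation. This is only a sketch under stated assumptions: the template sequence is made up (it is not the BPP-1 TR), each adenine is replaced by a uniformly random nucleotide, and the names `mutagenic_homing` and `variable_positions` are hypothetical helpers, not part of any published tool.

```python
import random

NUCLEOTIDES = "ACGT"


def variable_positions(template_repeat):
    """Indices of adenines in the TR, i.e. the only sites allowed to vary."""
    return [i for i, base in enumerate(template_repeat) if base == "A"]


def mutagenic_homing(template_repeat, rng):
    """Return a VR-like copy of the TR.

    Every adenine is replaced by a randomly chosen nucleotide (which may
    again be A); all other bases are copied unchanged. Uniform sampling is
    an assumption made for illustration only.
    """
    return "".join(
        rng.choice(NUCLEOTIDES) if base == "A" else base
        for base in template_repeat
    )


rng = random.Random(0)          # seeded so the sketch is reproducible
tr = "GCAACTGGAACGTTAACC"       # hypothetical TR, not a real sequence
vr = mutagenic_homing(tr, rng)  # the TR itself is left unchanged

# Upper bound on distinct DNA variants under this toy model: 4 per adenine.
max_variants = 4 ** len(variable_positions(tr))
```

Under this model the donor TR stays intact, so it can seed further rounds of diversification, while variation in the recipient is confined to positions that are adenines in the TR.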

2.

Background

Diversity-generating retroelements (DGRs) provide organisms with a unique means for adaptation to a dynamic environment through massive protein sequence variation. The potential scope of this variation exceeds that of the vertebrate adaptive immune system. DGRs were known to exist only in viruses and bacteria until their recent discovery in archaea belonging to the ‘microbial dark matter’, specifically in organisms closely related to Nanoarchaeota. However, Nanoarchaeota DGR variable proteins were unassignable to known protein folds and apparently unrelated to characterized DGR variable proteins.

Results

To address the issue of how Nanoarchaeota DGR variable proteins accommodate massive sequence variation, we determined the 2.52 Å resolution limit crystal structure of one such protein, AvpA, which revealed a C-type lectin (CLec)-fold that organizes a putative ligand-binding site that is capable of accommodating 10^13 sequences. This fold is surprisingly reminiscent of the CLec-folds of viral and bacterial DGR variable proteins, but differs sufficiently to define a new CLec-fold subclass, which is consistent with early divergence between bacterial and archaeal DGRs. The structure also enabled identification of a group of AvpA-like proteins in multiple putative DGRs from uncultivated archaea. These variable proteins may aid Nanoarchaeota and these uncultivated archaea in symbiotic relationships.

Conclusions

Our results have uncovered the widespread conservation of the CLec-fold in viruses, bacteria, and archaea for accommodating massive sequence variation. In addition, to our knowledge, this is the first report of an archaeal CLec-fold protein.

3.
4.

Background

The mitochondrial genomes of snakes are characterized by an overall evolutionary rate that appears to be one of the most accelerated among vertebrates. They also possess other unusual features, including short tRNAs and other genes, and a duplicated control region that has been stably maintained since it originated more than 70 million years ago. Here, we provide a detailed analysis of evolutionary dynamics in snake mitochondrial genomes to better understand the basis of these extreme characteristics, and to explore the relationship between mitochondrial genome molecular evolution, genome architecture, and molecular function. We sequenced complete mitochondrial genomes from Slowinski's corn snake (Pantherophis slowinskii) and two cottonmouths (Agkistrodon piscivorus) to complement previously existing mitochondrial genomes, and to provide an improved comparative view of how genome architecture affects molecular evolution at contrasting levels of divergence.

Results

We present a Bayesian genetic approach that suggests that the duplicated control region can function as an additional origin of heavy strand replication. The two control regions also appear to have different intra-specific versus inter-specific evolutionary dynamics that may be associated with complex modes of concerted evolution. We find that different genomic regions have experienced substantial accelerated evolution along early branches in snakes, with different genes having experienced dramatic accelerations along specific branches. Some of these accelerations appear to coincide with, or to follow, the shortening of various mitochondrial genes and the duplication of the control region and flanking tRNAs.

Conclusion

Fluctuations in the strength and pattern of selection during snake evolution have had widely varying gene-specific effects on substitution rates, and these rate accelerations may have been functionally related to unusual changes in genomic architecture. The among-lineage and among-gene variation in rate dynamics observed in snakes is the most extreme thus far observed in animal genomes, and provides an important study system for further evaluating the biochemical and physiological basis of evolutionary pressures in vertebrate mitochondria.

5.
The concept of the genome tree depends on the potential evolutionary significance in the clustering of species according to similarities in the gene content of their genomes. In this respect, genome trees have often been identified with species trees. With the rapid expansion of genome sequence data it becomes of increasing importance to develop accurate methods for grasping global trends for the phylogenetic signals that mutually link the various genomes. We therefore derive here the methodological concept of genome trees based on protein conservation profiles in multiple species. The basic idea in this derivation is that the multi-component "presence-absence" protein conservation profiles permit tracking of common evolutionary histories of genes across multiple genomes. We show that a significant reduction in informational redundancy is achieved by considering only the subset of distinct conservation profiles. Beyond these basic ideas, we point out various pitfalls and limitations associated with the data handling, paving the way for further improvements. As an illustration for the methods, we analyze a genome tree based on the above principles, along with a series of other trees derived from the same data and based on pair-wise comparisons (ancestral duplication-conservation and shared orthologs). In all trees we observe a sharp discrimination between the three primary domains of life: Bacteria, Archaea, and Eukarya. The new genome tree, based on conservation profiles, displays a significant correspondence with classically recognized taxonomical groupings, along with a series of departures from such conventional clusterings.
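The idea of "presence-absence" conservation profiles, and of keeping only the distinct ones, can be sketched on toy data. Everything here is an assumption for illustration: the genome and family names are invented, and the Jaccard distance is a stand-in chosen for simplicity, since the abstract does not say which distance or tree-building method the authors used.

```python
# Hypothetical toy data: protein family -> genomes in which a homolog is present.
presence = {
    "famA": {"bact1", "bact2"},
    "famB": {"bact1", "bact2"},  # same profile as famA, hence redundant
    "famC": {"arch1", "arch2"},
    "famD": {"arch1", "arch2", "euk1"},
    "famE": {"bact1", "bact2", "arch1", "arch2", "euk1"},
    "famF": {"euk1"},
}
genomes = ["bact1", "bact2", "arch1", "arch2", "euk1"]


def conservation_profile(present_in, genomes):
    """Encode one family as a 0/1 tuple, one component per genome."""
    return tuple(int(g in present_in) for g in genomes)


def distinct_profiles(presence, genomes):
    """Collapse families that share an identical profile."""
    return sorted({conservation_profile(s, genomes) for s in presence.values()})


def genome_distance(profiles, i, j):
    """Jaccard distance between genomes i and j over the given profiles."""
    union = sum(1 for p in profiles if p[i] or p[j])
    shared = sum(1 for p in profiles if p[i] and p[j])
    return 1.0 - shared / union if union else 0.0


profiles = distinct_profiles(presence, genomes)
matrix = [
    [genome_distance(profiles, i, j) for j in range(len(genomes))]
    for i in range(len(genomes))
]
```

In this toy set six families collapse to five distinct profiles. The resulting distance matrix could be passed to any standard clustering or tree-building routine.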

6.
We have used mapping of large T1 oligonucleotides to examine the genome of Rous-associated virus-O (RAV-O), an endogenous virus of chickens, and to compare it with that of Prague strain Rous sarcoma virus, subgroup B, (Pr-RSV-B), an exogenous sarcoma virus. To extend the sensitivity of such comparisons, we have developed a system of nucleic acid hybridization and hybridization-competition combined with fingerprinting. This method allows us to estimate the relative degree of relatedness of various portions of the viral genomes. From the results of this study, we have concluded that the genomes of Pr-RSV-B and RAV-O are related in the following way. The 5'-terminal half of the genomes (corresponding to the gag and pol regions) is virtually identical, with only scattered single nucleotide differences. This region is followed by a region comprising 25 to 30% of the genome (the env region) which contains substantial nucleotide sequence differences, most or all of which are due to single base changes. The env-coding region can be further subdivided into three regions: a more variable region probably containing sequences coding for subgroup specificity, flanked by relatively common sequences on each side. To the 3' side of the env region, the RAV-O genome contains a very short sequence not found in Pr-RSV-B, whereas the Pr-RSV-B genome contains a much longer unrelated sequence. The central portion of this sequence comprises the src gene as defined by transformation-defective mutants. Particularly striking is the absence, in the RAV-O genome, of any nucleotide sequence related to the "c region" found very near the 3' end of all exogenous tumor viruses. Both the Pr-RSV-B and RAV-O genomes contain the identical terminally redundant sequence of 21 nucleotides near each end of the genome.

7.
8.
DNA gel-blot and in situ hybridization with genome-specific repeated sequences have proven to be valuable tools in analyzing genome structure and relationships in species with complex allopolyploid genomes such as hexaploid oat (Avena sativa L., 2n = 6x = 42; AACCDD genome). In this report, we describe a systematic approach for isolating genome-, chromosome-, and region-specific repeated and low-copy DNA sequences from oat that can presumably be applied to any complex genome species. Genome-specific DNA sequences were first identified in a random set of A. sativa genomic DNA cosmid clones by gel-blot hybridization using labeled genomic DNA from different Avena species. Because no repetitive sequences were identified that could distinguish between the A and D genomes, sequences specific to these two genomes are referred to as A/D genome specific. A/D or C genome specific DNA subfragments were used as screening probes to identify additional genome-specific cosmid clones in the A. sativa genomic library. We identified clustered and dispersed repetitive DNA elements for the A/D and C genomes that could be used as cytogenetic markers for discrimination of the various oat chromosomes. Some analyzed cosmids appeared to be composed entirely of genome-specific elements, whereas others represented regions with genome- and non-specific repeated sequences with interspersed low-copy DNA sequences. Thus, genome-specific hybridization analysis of restriction digests of random and selected A. sativa cosmids also provides insight into the sequence organization of the oat genome.

9.
10.
In the Neisseria spp., natural competence for transformation and homologous recombination generate antigenic variants through creation of mosaic genes (such as opas) and through recombination with silent cassettes (such as pilE/pilS) and gene-complement diversity through the horizontal exchange of whole genes or groups of genes, in minimal mobile elements (MMEs). An MME is a region encompassing 2 conserved genes between which different whole-gene cassettes are found in different strains, which are chromosomally incorporated solely through the action of homologous recombination. Comparative analyses of the neisserial genome sequences identified 39 potential MME sites, the contents of which were investigated in 11 neisserial strains. One hundred and eight different MME regions were identified, 20 of which contain novel sequences and these contain 12 newly identified neisserial coding sequences. Neisserial uptake signal sequences are associated with 38 of the 40 MMEs studied. In some sites, divergent dinucleotide signatures of the sequences between the flanking genes suggest relatively recent horizontal acquisition of some cassettes. The neisserial MMEs were used to interrogate all of the other available bacterial genome sequences, revealing frequent conservation of the flanking genes combined with the presence of different gene cassettes between them. In some cases, these sites can definitively be classified as MMEs in these other genera. These findings provide additional evidence for the MME model, indicate that MME-directed investigations are a good basis for the identification of novel strain-specific genes and differences within bacterial populations and demonstrate that these elements are probably ubiquitously involved in genetic exchange, particularly in naturally competent bacteria.

11.

Background

One of the most important global pathogens infecting all age groups is Streptococcus pneumoniae (the ‘pneumococcus’). Pneumococci reside in the paediatric nasopharynx, where they compete for space and resources, and one competition strategy is to produce a bacteriocin (antimicrobial peptide or protein) to attack other bacteria and an immunity protein to protect against self-destruction. We analysed a collection of 336 diverse pneumococcal genomes dating from 1916 onwards, identified bacteriocin cassettes, detailed their genetic composition and sequence diversity, and evaluated the data in the context of the pneumococcal population structure.

Results

We found that all genomes maintained a blp bacteriocin cassette and we identified several novel blp cassettes and genes. The composition of the ‘bacteriocin/immunity region’ of the blp cassette was highly variable: one cassette possessed six bacteriocin genes and eight putative immunity genes, whereas another cassette had only one of each. Both widely-distributed and highly clonal blp cassettes were identified. Most surprisingly, one-third of pneumococcal genomes also possessed a cassette encoding a novel circular bacteriocin that we called pneumocyclicin, which shared a similar genetic organisation to well-characterised circular bacteriocin cassettes in other bacterial species. Pneumocyclicin cassettes were mainly of one genetic cluster and largely found among seven major pneumococcal clonal complexes.

Conclusions

These detailed genomic analyses revealed a novel pneumocyclicin cassette and a wide variety of blp bacteriocin cassettes, suggesting that competition in the nasopharynx is a complex biological phenomenon.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1729-4) contains supplementary material, which is available to authorized users.

12.
Diversity-generating retroelements (total citations: 1; self-citations: 0; citations by others: 1)

13.
For the last 15 years molecular cytogenetic techniques have been extensively used to study primate evolution. Molecular probes were helpful to distinguish mammalian chromosomes and chromosome segments on the basis of their DNA content rather than solely on morphological features such as banding patterns. Various landmark rearrangements have been identified for most of the nodes in primate phylogeny while chromosome banding still provides helpful reference maps. Fluorescence in situ hybridization (FISH) techniques were used with probes of different complexity including chromosome painting probes, probes derived from chromosome sub-regions, and probes the size of a single gene. More recently, in silico techniques have been applied to track down evolutionarily derived chromosome rearrangements by searching the human and mouse genome sequence databases. More detailed breakpoint analyses of chromosome rearrangements that occurred during higher primate evolution also gave some insights into the molecular changes in chromosome rearrangements that occurred in evolution. Hardly any "fusion genes" as known from chromosome rearrangements in cancer cells or dramatic "position effects" of genes transferred to new sites in primate genomes have been reported yet. Most breakpoint regions have been identified within gene poor areas rich in repetitive elements and/or low copy repeats (segmental duplications). The progress in various molecular and molecular-cytogenetic approaches including the recently launched chimpanzee genome project suggests that these new tools will have a significant impact on the further understanding of human genome evolution.

14.
In comparison with retrotransposons, which comprise the majority of the Triticeae genomes, very few class 2 transposons have been described in these genomes. Based on the recent discovery of a local accumulation of CACTA elements at the Glu-A3 loci in the two wheat species Triticum monococcum and Triticum durum, we performed a database search for additional such elements in Triticeae spp. A combination of BLAST search and dot-plot analysis of publicly available Triticeae sequences led to the identification of 41 CACTA elements. Only seven of them encode a protein similar to known transposases, whereas the other 34 are considered to be deletion derivatives. A detailed characterization of the identified elements allowed a further classification into seven subgroups. The major subgroup, designated the "Caspar" family, was shown by hybridization to be present in at least 3,000 copies in the T. monococcum genome. The close association of numerous CACTA elements with genes and the identification of several similar elements in sorghum (Sorghum bicolor) and rice (Oryza sativa) led to the conclusion that CACTA elements contribute significantly to genome size and to organization and evolution of grass genomes.

15.
16.
In an effort to extend our understanding of the evolutionary relationship between the canine and human genomes, we have developed and positioned 52 new gene-associated polymorphic markers on the canine meiotic linkage map. Canine-specific PCR primers were developed from the consensus of published sequences of several mammalian genomes and were designed to span intronic regions, thus optimizing the probability that a polymorphic site was included. The resulting markers were analyzed on a panel of three-generation canine reference families and the data were incorporated into the current meiotic linkage map. The data were compared with those generated by three chromosome paint studies in an effort to understand the distribution and frequency of microrearrangements within the canine genome. Forty-eight of 52 genes map to a chromosomal region predicted to contain genes from the corresponding region of the human genome according to all published reciprocal chromosome paint studies. Meiotic linkage mapping data for three genes can be used to resolve discrepancies between the published reciprocal chromosome paint studies, and for an additional two genes, meiotic mapping data allow evolutionary breakpoints to be more precisely defined. We conclude that microrearrangements of evolutionarily conserved segments between the canine and human genomes are rare, occurring for less than 0.5% of gene data reported to date. In addition, we have found that the placement of genes on the meiotic linkage map is a useful mechanism for resolving discrepancies between existing data sets. Received: 7 February 2001 / Accepted: 9 May 2001

17.
Plant resistance (R) proteins are immune receptors that recognize pathogen effectors and trigger rapid defense responses, namely effector-triggered immunity. R protein-mediated pathogen resistance is usually race specific. During plant-pathogen coevolution, plant genomes accumulated large numbers of R genes. Even though plant R genes provide important natural resources for breeding disease-resistant crops, their presence in the plant genome comes at a cost. Misregulation of R genes leads to developmental defects, such as stunted growth and reduced fertility. In the past decade, many microRNAs (miRNAs) have been identified to target various R genes in plant genomes. miRNAs reduce R gene levels under normal conditions and allow induction of R gene expression under various stresses. For these reasons, we consider R genes to be double-edged "swords" and miRNAs as molecular "scabbards". In the present review, we summarize the contributions and potential problems of these "swords" and discuss the features and production of the "scabbards", as well as the mechanisms used to pull the "sword" from the "scabbard" when needed.

18.

Background

CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) is a prokaryotic adaptive defence system that provides resistance against alien replicons such as viruses and plasmids. Spacers in a CRISPR cassette confer immunity against viruses and plasmids containing regions complementary to the spacers and hence they retain a footprint of interactions between prokaryotes and their viruses in individual strains and ecosystems. The human gut is a rich habitat populated by numerous microorganisms, but a large fraction of these are unculturable and little is known about them in general and their CRISPR systems in particular.

Results

We used human gut metagenomic data from three open projects in order to characterize the composition and dynamics of CRISPR cassettes in the human-associated microbiota. Applying available CRISPR-identification algorithms and a previously designed filtering procedure to the assembled human gut metagenomic contigs, we found 388 CRISPR cassettes, 373 of which had repeats not observed previously in complete genomes or other datasets. Only 171 of 3,545 identified spacers were coupled with protospacers from the human gut metagenomic contigs. The number of matches to GenBank sequences was negligible, providing protospacers for 26 spacers. Reconstruction of CRISPR cassettes allowed us to track the dynamics of spacer content. In agreement with other published observations we show that spacers shared by different cassettes (and hence likely older ones) tend to the trailer ends, whereas spacers with matches in the metagenomes are distributed unevenly across cassettes, demonstrating a preference to form clusters closer to the active end of a CRISPR cassette, adjacent to the leader, and hence suggesting dynamical interactions between prokaryotes and viruses in the human gut. Remarkably, spacers match protospacers in the metagenome of the same individual with frequency comparable to a random control, but may match protospacers from metagenomes of other individuals.
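Pairing spacers with protospacers in assembled contigs can be sketched as follows. This is a simplified illustration, not the authors' pipeline: the repeat, spacers, and contigs are invented, the helper names are hypothetical, and only exact matches on either strand are considered, whereas real analyses typically tolerate mismatches and use dedicated CRISPR-identification tools.

```python
def reverse_complement(seq):
    """Reverse complement of an uppercase A/C/G/T string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def extract_spacers(cassette, repeat):
    """Split a cassette on exact repeat copies; return spacers leader-to-trailer.

    The first and last pieces are the flanking sequences outside the
    repeat array, so they are discarded.
    """
    pieces = cassette.split(repeat)
    return [p for p in pieces[1:-1] if p]


def find_protospacers(spacers, contigs):
    """Map spacer index -> names of contigs with an exact match on either strand."""
    hits = {}
    for idx, spacer in enumerate(spacers):
        rc = reverse_complement(spacer)
        matched = [name for name, seq in contigs.items()
                   if spacer in seq or rc in seq]
        if matched:
            hits[idx] = matched
    return hits


# Hypothetical toy data (not taken from the study).
repeat = "GTTTTAGA"
toy_spacers = ["ACGTACGTAA", "TTGACCGGTA", "CCATGGATCC"]
cassette = "AAAA" + repeat + repeat.join(toy_spacers) + repeat + "CCCC"
contigs = {
    "phage_contig": "GGG" + reverse_complement(toy_spacers[0]) + "TTT",
    "plasmid_contig": "AT" + toy_spacers[2] + "AT",
}

spacers = extract_spacers(cassette, repeat)
hits = find_protospacers(spacers, contigs)
```

Because spacers are returned in leader-to-trailer order, the index of each hit gives a rough position along the cassette, which is the kind of information needed to ask whether matched spacers cluster near the leader end.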

Conclusions

The analysis of assembled contigs is complementary to the approach based on the analysis of original reads and hence provides additional data about composition and evolution of CRISPR cassettes, revealing the dynamics of CRISPR-phage interactions in metagenomes.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-202) contains supplementary material, which is available to authorized users.

19.
During the lytic phase of infection, replication of herpesvirus genomes initiates at the lytic origin of replication, oriLyt. Many herpesviruses harbor more than one lytic origin, but so far, only one oriLyt has been identified for human cytomegalovirus (HCMV). Evidence for the existence of additional lytic origins of HCMV has remained elusive. On the basis of transient replication assays with cloned viral fragments, HCMV oriLyt was described as a core region of 1.5 kbp (minimal oriLyt) flanked by auxiliary sequences required for maximal replication activity (complete oriLyt). It remained unclear whether minimal oriLyt alone can drive the replication of HCMV in the absence of its accessory regions. To investigate the sequence requirements of oriLyt in the context of the viral genome, mutant genomes were constructed lacking either minimal or complete oriLyt. These genomes were not infectious, suggesting that HCMV contains only one lytic origin of replication. Either minimal or complete oriLyt was then ectopically reinserted into the oriLyt-depleted genomes. Only the mutant genomes carrying complete oriLyt led to infectious progeny. Remarkably, inversion of the 1.5-kbp core origin relative to its flanking regions resulted in a replication-defective genome. Mutant genomes carrying minimal oriLyt plus the left flanking region gave rise to minifoci, but genomes harboring minimal oriLyt together with the right flanking region were noninfectious. We conclude that the previously defined minimal lytic origin is not sufficient to drive replication of the HCMV genome. Rather, our results underline the importance of the accessory regions and their correct arrangement for the function of HCMV oriLyt.

20.
Diversity-generating retroelements (DGRs) recognize novel ligands through massive protein sequence variation, a property shared uniquely with the adaptive immune response. Little is known about how recognition is achieved by DGR variable proteins. Here, we present the structure of the Bordetella bacteriophage DGR variable protein major tropism determinant (Mtd) bound to the receptor pertactin, revealing remarkable adaptability in the static binding sites of Mtd. Despite large dissimilarities in ligand binding mode, principles underlying selective recognition were strikingly conserved between Mtd and immunoreceptors. Central to this was the differential amplification of binding strengths by avidity (i.e., multivalency), which not only relaxed the demand for optimal complementarity between Mtd and pertactin but also enhanced distinctions among binding events to provide selectivity. A quantitatively similar balance between complementarity and avidity was observed for Bordetella bacteriophage DGR as occurs in the immune system, suggesting that variable repertoires operate under a narrow set of conditions to recognize novel ligands.


Copyright©北京勤云科技发展有限公司  京ICP备09084417号