首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 62 毫秒
1.
Parvoviruses are rapidly evolving viruses that infect a wide range of hosts, including vertebrates and invertebrates. Extensive methylation of the parvovirus genome has been recently demonstrated. A global pattern of methylation of CpG dinucleotides is seen in vertebrate genomes, compared to “fractional” methylation patterns in invertebrate genomes. It remains unknown if the loss of CpG dinucleotides occurs in all viruses of a given DNA virus family that infect host species spanning across vertebrates and invertebrates. We investigated the link between the extent of CpG dinucleotide depletion among autonomous parvoviruses and the evolutionary lineage of the infected host. We demonstrate major differences in the relative abundance of CpG dinucleotides among autonomous parvoviruses which share similar genome organization and common ancestry, depending on the infected host species. Parvoviruses infecting vertebrate hosts had significantly lower relative abundance of CpG dinucleotides than parvoviruses infecting invertebrate hosts. The strong correlation of CpG dinucleotide depletion with the gain in TpG/CpA dinucleotides and the loss of TpA dinucleotides among parvoviruses suggests a major role for CpG methylation in the evolution of parvoviruses. Our data present evidence that links the relative abundance of CpG dinucleotides in parvoviruses to the methylation capabilities of the infected host. In sum, our findings support a novel perspective of host-driven evolution among autonomous parvoviruses.  相似文献   

2.
Stable isotope probing (SIP) of nucleic acids is a powerful tool that can identify the functional capabilities of noncultivated microorganisms as they occur in microbial communities. While it has been suggested previously that nucleic acid SIP can be performed with 15N, nearly all applications of this technique to date have used 13C. Successful application of SIP using 15N-DNA (15N-DNA-SIP) has been limited, because the maximum shift in buoyant density that can be achieved in CsCl gradients is approximately 0.016 g ml−1 for 15N-labeled DNA, relative to 0.036 g ml−1 for 13C-labeled DNA. In contrast, variation in genome G+C content between microorganisms can result in DNA samples that vary in buoyant density by as much as 0.05 g ml−1. Thus, natural variation in genome G+C content in complex communities prevents the effective separation of 15N-labeled DNA from unlabeled DNA. We describe a method which disentangles the effects of isotope incorporation and genome G+C content on DNA buoyant density and makes it possible to isolate 15N-labeled DNA from heterogeneous mixtures of DNA. This method relies on recovery of “heavy” DNA from primary CsCl density gradients followed by purification of 15N-labeled DNA from unlabeled high-G+C-content DNA in secondary CsCl density gradients containing bis-benzimide. This technique, by providing a means to enhance separation of isotopically labeled DNA from unlabeled DNA, makes it possible to use 15N-labeled compounds effectively in DNA-SIP experiments and also will be effective for removing unlabeled DNA from isotopically labeled DNA in 13C-DNA-SIP applications.  相似文献   

3.
4.
It is understood that DNA and amino acid substitution rates are highly sequence context-dependent, e.g., C --> T substitutions in vertebrates may occur much more frequently at CpG sites and that cysteine substitution rates may depend on support of the context for participation in a disulfide bond. Furthermore, many applications rely on quantitative models of nucleotide or amino acid substitution, including phylogenetic inference and identification of amino acid sequence positions involved in functional specificity. We describe quantification of the context dependence of nucleotide substitution rates using baboon, chimpanzee, and human genomic sequence data generated by the NISC Comparative Sequencing Program. Relative mutation rates are reported for the 96 classes of mutations of the form 5' alphabetagamma 3' --> 5' alphadeltagamma 3', where alpha, beta, gamma, and delta are nucleotides and beta not equal delta, based on maximum likelihood calculations. Our results confirm that C --> T substitutions are enhanced at CpG sites compared with other transitions, relatively independent of the identity of the preceding nucleotide. While, as expected, transitions generally occur more frequently than transversions, we find that the most frequent transversions involve the C at CpG sites (CpG transversions) and that their rate is comparable to the rate of transitions at non-CpG sites. A four-class model of the rates of context-dependent evolution of primate DNA sequences, CpG transitions > non-CpG transitions approximately CpG transversions > non-CpG transversions, captures qualitative features of the mutation spectrum. We find that despite qualitative similarity of mutation rates among different genomic regions, there are statistically significant differences.  相似文献   

5.
The vast majority of surface ocean bacteria are uncultivated. Compared with their cultured relatives, they frequently exhibit a streamlined genome, reduced G+C content and distinct gene repertoire. These genomic traits are relevant to environmental adaptation, and have generally been thought to become fixed in marine bacterial populations through selection. Using single-cell genomics, we sequenced four uncultivated cells affiliated with the ecologically relevant Roseobacter clade and used a composition-heterogeneous Bayesian phylogenomic model to resolve these single-cell genomes into a new clade. This lineage has no representatives in culture, yet accounts for ∼35% of Roseobacters in some surface ocean waters. Analyses of multiple genomic traits, including genome size, G+C content and percentage of noncoding DNA, suggest that these single cells are representative of oceanic Roseobacters but divergent from isolates. Population genetic analyses showed that substitution of physicochemically dissimilar amino acids and replacement of G+C-rich to G+C-poor codons are accelerated in the uncultivated clade, processes that are explained equally well by genetic drift as by the more frequently invoked explanation of natural selection. The relative importance of drift vs selection in this clade, and perhaps in other marine bacterial clades with streamlined G+C-poor genomes, remains unresolved until more evidence is accumulated.  相似文献   

6.

Background  

Molecular evolutionary studies in mammals often estimate nucleotide substitution rates within and outside CpG dinucleotides separately. Frequently, in alignments of two sequences, the division of sites into CpG and non-CpG classes is based simply on the presence or absence of a CpG dinucleotide in either sequence, a procedure that we refer to as CpG/non-CpG assignment. Although it likely that this procedure is biased, it is generally assumed that the bias is negligible if species are very closely related.  相似文献   

7.
Recently, prokaryotic DNAs containing unmethylated CpG motifs have been shown to be intrinsically immunostimulatory both in vitro and in vivo, tending to promote Th1-like responses. In contrast, CpG dinucleotides in mammalian DNAs are extensively methylated on cytosines and hence immunologically inert. Since the herpes simplex virus (HSV) genome is unmethylated and G+C rich, we predicted that CpG motifs would be highly prevalent in the HSV genome; hence, we examined the immunostimulatory potential of purified HSV DNA in vitro and in vivo. Mouse splenocyte cultures treated with HSV DNA or HSV-derived oligodeoxyribonucleotides (ODNs) showed strong proliferative responses and production of inflammatory cytokines (gamma interferon [IFN-γ], tumor necrosis factor [TNF], and interleukin-6 [IL-6]) in vitro, whereas splenocytes treated with mammalian CV-1 DNA or non-CpG ODN did not. After immunization with ovalbumin (OVA), only splenocytes from mice immunized with HSV DNA or HSV-ODN as the adjuvants proliferated strongly and produced typical Th1 responses, including CD8+ cytotoxic T-lymphocyte responses, upon restimulation with OVA. Furthermore, HSV-ODN synergized with IFN-γ to induce nitric oxide (NO), IL-6, and TNF production from macrophages. These results demonstrate that HSV DNA and HSV-ODN are immunostimulatory, driving potent Th1 responses both in vitro and in vivo. Considering that HSV DNA has been found to persist in nonneuronal cells, these results fuel speculation that HSV DNA might play a role in pathogenesis, in particular, in diseases like herpes stromal keratitis (HSK) that involve chronic inflammatory responses in the absence of virus or viral antigens.  相似文献   

8.
Alternating self-complementary oligonucleotides starting with a 5'-pyrimidine usually form left-handed Z-DNA; however, with a 5'-purine start sequence they form the right-handed A-DNA. Here we report the crystal structure of the decamer d(GCGCGCGCGC) with a 5'-purine start in the Z-DNA form. The decamer crystallizes in the hexagonal space group P6(5)22, unit cell dimensions a = b = 18.08 and c = 43.10 A, with one of the following four dinucleotide diphosphates in the asymmetric unit: d(pGpC)/d(GpCp)/d(pCpG)/d(CpGp). The molecular replacement method, starting with d(pGpC) of the isomorphous Z-DNA hexamer d(araC-dG)3 without the 2'-OH group of arabinose, was used in the structure analysis. The method gave the solution only after the sugar-phosphate conformation of the GpC step was manipulated. The refinement converged to a final R value of 18.6% for 340 unique reflections in the resolution range 8.0-1.9 A. A result of the sequence alternation is the alternation in the nucleotide conformation; guanosine is C3'-endo, syn, and cytidine is C2'-endo, anti. The CpG step phosphodiester conformation is the same as ZI or ZII, whereas that of the GpC step phosphodiester is "intermediate" in the sense that zeta (O3'-P bond) is the same as ZII but alpha (P-O5' bond) is the same as ZI. The duplexes generated from the dinucleotide asymmetric unit are stacked one on top of the other in the crystal to form an infinite pseudocontinuous helix. This renders it a quasi-polymerlike structure that has assumed the Z-DNA conformation further strengthened by the long inner Z-forming stretch d(CG)4. An interesting feature of the structure is the presence of water strings in both the major and the minor grooves. In the minor groove the cytosine carbonyl oxygen atoms of the GpC and CpG steps are cross-bridged by water molecules that are not themselves hydrogen bonded but are enclosed by the water rings in the mouth of the minor groove. In the major groove three independent water molecules form a zigzagging continuous water string that runs throughout the duplex.  相似文献   

9.
Regulatory change has long been hypothesized to drive the delineation of the human phenotype from other closely related primates. Here we provide evidence that CpG dinucleotides play a special role in this process. CpGs enable epigenome variability via DNA methylation, and this epigenetic mark functions as a regulatory mechanism. Therefore, species-specific CpGs may influence species-specific regulation. We report non-polymorphic species-specific CpG dinucleotides (termed “CpG beacons”) as a distinct genomic feature associated with CpG island (CGI) evolution, human traits and disease. Using an inter-primate comparison, we identified 21 extreme CpG beacon clusters (≥ 20/kb peaks, empirical p < 1.0 × 10−3) in humans, which include associations with four monogenic developmental and neurological disease related genes (Benjamini-Hochberg corrected p = 6.03 × 10−3). We also demonstrate that beacon-mediated CpG density gain in CGIs correlates with reduced methylation in these species in orthologous CGIs over time, via human, chimpanzee and macaque MeDIP-seq. Therefore mapping into both the genomic and epigenomic space the identified CpG beacon clusters define points of intersection where a substantial two-way interaction between genetic sequence and epigenetic state has occurred. Taken together, our data support a model for CpG beacons to contribute to CGI evolution from genesis to tissue-specific to constitutively active CGIs.  相似文献   

10.

Background  

Neighboring nucleotides exert a striking influence on mutation, with the hypermutability of CpG dinucleotides in many genomes being an exemplar. Among the approaches employed to measure the relative importance of sequence neighbors on molecular evolution have been continuous-time Markov process models for substitutions that treat sequences as a series of independent tuples. The most widely used examples are the codon substitution models. We evaluated the suitability of derivatives of the nucleotide frequency weighted (hereafter NF) and tuple frequency weighted (hereafter TF) models for measuring sequence context dependent substitution. Critical properties we address are their relationships to an independent nucleotide process and the robustness of parameter estimation to changes in sequence composition. We then consider the impact on inference concerning dinucleotide substitution processes from application of these two forms to intron sequence alignments from primates.  相似文献   

11.
Why does the human factor IX gene have a G + C content of 40%?   总被引:20,自引:2,他引:18       下载免费PDF全文
The factor IX gene has a G + C content of approximately 40% in all mammalian species examined. In human factor IX, C----T and G----A transitions at the dinucleotide CpG are elevated at least 24-fold relative to other transitions. Can the G + C content be explained solely by this hot spot of mutation? Using our mathematical model, we show that the elevation of mutation at CpG cannot alone lower the G + C content below 45%. To search for other hot spots of mutation that might contribute to the reduction of G + C content, we assessed the relative rates of base substitution in our sample of 160 families with hemophilia B. Seventeen independent single-base substitutions are reported herein for a total of 96 independent point mutations in our sample. The following conclusions emerge from the analysis of our data and, where appropriate, the data of others: (1) Transversions at CpG are elevated an estimated 7.7-fold relative to other transversions. (2) The mutation rates at non-CpG dinucleotides are remarkably uniform; none of the observed rates are either more than twofold above the median for transitions or more than threefold above the median for transversions. (3) The pattern of recent mutation is compatible with the pattern during mammalian evolution that has maintained the G + C content of the factor IX gene at approximately 40%.  相似文献   

12.
Using data from primates, we show that molecular clocks in sites that have been part of a CpG dinucleotide in recent past (CpG sites) and non-CpG sites are of markedly different nature, reflecting differences in their molecular origins. Notably, single nucleotide substitutions at non-CpG sites show clear generation-time dependency, indicating that most of these substitutions occur by errors during DNA replication. On the other hand, substitutions at CpG sites occur relatively constantly over time, as expected from their primary origin due to methylation. Therefore, molecular clocks are heterogeneous even within a genome. Furthermore, we propose that varying frequencies of CpG dinucleotides in different genomic regions may have contributed significantly to conflicting earlier results on rate constancy of mammalian molecular clock. Our conclusion that different regions of genomes follow different molecular clocks should be considered when inferring divergence times using molecular data and in phylogenetic analysis.  相似文献   

13.
Amino acid substitutions at nonconserved protein positions can have noncanonical and “long-distance” outcomes on protein function. Such outcomes might arise from changes in the internal protein communication network, which is often accompanied by changes in structural flexibility. To test this, we calculated flexibilities and dynamic coupling for positions in the linker region of the lactose repressor protein. This region contains nonconserved positions for which substitutions alter DNA-binding affinity. We first chose to study 11 substitutions at position 52. In computations, substitutions showed long-range effects on flexibilities of DNA-binding positions, and the degree of flexibility change correlated with experimentally measured changes in DNA binding. Substitutions also altered dynamic coupling to DNA-binding positions in a manner that captured other experimentally determined functional changes. Next, we broadened calculations to consider the dynamic coupling between 17 linker positions and the DNA-binding domain. Experimentally, these linker positions exhibited a wide range of substitution outcomes: Four conserved positions tolerated hardly any substitutions (“toggle”), ten nonconserved positions showed progressive changes from a range of substitutions (“rheostat”), and three nonconserved positions tolerated almost all substitutions (“neutral”). In computations with wild-type lactose repressor protein, the dynamic couplings between the DNA-binding domain and these linker positions showed varied degrees of asymmetry that correlated with the observed toggle/rheostat/neutral substitution outcomes. Thus, we propose that long-range and noncanonical substitutions outcomes at nonconserved positions arise from rewiring long-range communication among functionally important positions. Such calculations might enable predictions for substitution outcomes at a range of nonconserved positions.  相似文献   

14.
Myosin binding protein C (MyBP-C) is a thick-filament protein that limits cross-bridge cycling rates and reduces myocyte power output. To investigate mechanisms by which MyBP-C affects contraction, we assessed effects of recombinant N-terminal domains of cardiac MyBP-C (cMyBP-C) on contractile properties of permeabilized rat cardiac trabeculae. Here, we show that N-terminal fragments of cMyBP-C that contained the first three immunoglobulin domains of cMyBP-C (i.e., C0, C1, and C2) plus the unique linker sequence termed the MyBP-C “motif” or “m-domain” increased Ca2+ sensitivity of tension and increased rates of tension redevelopment (i.e., ktr) at submaximal levels of Ca2+. At concentrations ≥20 μM, recombinant proteins also activated force in the absence of Ca2+ and inhibited maximum Ca2+-activated force. Recombinant proteins that lacked the combination of C1 and the motif did not affect contractile properties. These results suggest that the C1 domain plus the motif constitute a functional unit of MyBP-C that can activate the thin filament.  相似文献   

15.
16.
We develop here an analytical evolution model based on a dinucleotide mutation matrix 16 x 16 with six substitution parameters associated with the three types of substitutions in the two dinucleotide sites. It generalizes the previous models based on the nucleotide mutation matrices 4 x 4. It determines at some time t the exact occurrence probabilities of dinucleotides mutating randomly according to these six substitution parameters. Furthermore, several properties and two applications of this model allow to derive 16 evolutionary analytical solutions of dinucleotides and also a dinucleotide phylogenetic distance. Finally, based on this mathematical model, the SED (Stochastic Evolution of Dinucleotides) web server has been developed for deriving evolutionary analytical solutions of dinucleotides.  相似文献   

17.
Previous studies have shown that the guanine plus cytosine (G+C) content of ribosomal RNAs (rRNAs) is highly correlated with bacterial growth temperatures. This correlation is strongest in the double-stranded stem regions of the rRNA, a fact that can be explained by selection for increased structural stability at high growth temperatures. In this study, we examined the single-stranded regions of 16S rRNAs. We reasoned that, since these regions of the molecule are subject to less structural constraint than the stem regions, their nucleotide content might simply reflect the overall nucleotide content of the genome. Contrary to this expectation, however, we found that all of the single-stranded regions are characterized by very high adenine (A) and relatively low cytosine (C) contents. Moreover, the nucleotide content of these single-stranded regions is surprisingly constant between species, despite dramatic differences in optimal growth temperatures, and despite large differences in the overall genomic G+C content. This provides compelling evidence for strong stabilizing selection acting on 16S rRNA single-stranded regions. We found that selection favors purines (A+G), and especially adenine (A), in the single-stranded regions of these rRNAs.  相似文献   

18.
Asymmetrical distribution of CpG in an 'average' mammalian gene.   总被引:24,自引:7,他引:17       下载免费PDF全文
The frequency and distribution of the rare dinucleotide CpG was examined in 15 mammalian genes. CpG is highly methylated at cytosine in mammalian DNA (1,2) and 5-methylcytosine (5mC) is thought to undergo a transition mutation via deamination to produce thymine (3). This would result in the accumulation of TpG and CpA and depletion of CpG during evolution (4). Consistent with this hypothesis, the gene sample of 26,541 dinucleotides contained CpG at 40% the frequency expected by base composition and the CpG transition products, TpG+CpA, were significantly elevated at 124% of expected random frequency. However, because CpG occurs at only 25% of expected random frequency in the genome, the sampled genes were considerably enriched in this dinucleotide. CpGs were asymmetrically distributed in sequences flanking the genes. 5'-flanking sequences were enriched in CpG at 135% of the frequency expected assuming a symmetrical distribution of all the CpGs in the sampled genes (p less than 0.01), while 3'-flanking regions were depleted in CpG at 40% of expected values (p less than 0.0001). This asymmetry may reflect the role of 5-methylcytosine in gene expression. In contrast the frequencies of GpC and GpT+ ApC did not differ significantly from that predicted by base composition and these dinucleotides were not asymmetrically distributed.  相似文献   

19.
Heterotrimeric G-proteins are a class of signal transduction proteins highly conserved throughout evolution that serve as dynamic molecular switches regulating the intracellular communication initiated by extracellular signals including sensory information. This property is achieved by a guanine nucleotide cycle wherein the inactive, signaling-incompetent Gα subunit is normally bound to GDP; activation to signaling-competent Gα occurs through the exchange of GDP for GTP (typically catalyzed via seven-transmembrane domain G-protein coupled receptors [GPCRs]), which dissociates the Gβγ dimer from Gα-GTP and initiates signal transduction. The hydrolysis of GTP, greatly accelerated by “Regulator of G-protein Signaling” (RGS) proteins, returns Gα to its inactive GDP-bound form and terminates signaling. Through extensive characterization of mammalian Gα isoforms, the rate-limiting step in this cycle is currently considered to be the GDP/GTP exchange rate, which can be orders of magnitude slower than the GTP hydrolysis rate. However, we have recently demonstrated that, in Arabidopsis, the guanine nucleotide cycle appears to be limited by the rate of GTP hydrolysis rather than nucleotide exchange. This finding has important implications for the mechanism of sugar sensing in Arabidopsis. We also discuss these data on Arabidopsis G-protein nucleotide cycling in relation to recent reports of putative plant GPCRs and heterotrimeric G-protein effectors in Arabidopsis.Key words: Arabidopsis, glucose, G-protein, nucleotide exchange, RGS protein  相似文献   

20.
Recently, Fryxell and Moon (2005) examined methylation-dependent transition rates (5mC deamination rates), which were calculated by the difference between the CpG transition and GpC transition rates, using 4,437 transition mutations in CpG or GpC dinucleotides. They concluded that 5mC deamination rates were highly dependent on local GC content but not on local sequence lengths over which GC content was calculated or the genomic regions where the mutations occurred. Here, we reexamined these statements by using 292,216 CpG-->TpG/CpA and GpC-->GpT/ApC mutations, an increase of 66 times as much data. Contrary to Fryxell and Moon's conclusions, our analysis indicated that 5mC deamination rates in the human genome were dependent on both the local sequence length and the genomic region. Some explanations for their conclusions were provided.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号