首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
CpG dinucleotides mutate at a high rate because cytosine is vulnerable to deamination, cytosines in CpG dinucleotides are often methylated, and deamination of 5-methylcytosine (5mC) produces thymidine. Previous experiments have shown that DNA melting is the rate-limiting step in cytosine deamination. Here we show, through the analysis of human single-nucleotide polymorphisms (SNPs), that the mutation rate produced by 5mC deamination is highly dependent on local GC content. In fact, linear regression analysis showed that the log(10) of the 5mC mutation rates (inferred from SNP frequencies) had slopes of -3 when graphed with respect to the GC content of neighboring sequences. This is the ideal slope that would be expected if the correlation between CpG underrepresentation and GC content had been solely caused by DNA melting. Moreover, this same result was obtained regardless of the SNP locations (all SNPs versus only SNPs in noncoding intergenic regions, excluding CpG islands) and regardless of the lengths over which GC content was calculated (SNP sequences with a modal length of 564 bp versus genomic contigs with a modal length of 163 kb). Several alternative interpretations are discussed.  相似文献   

2.
Vertebrate genomes are characterized with CpG deficiency, particularly for GCpoor regions. The GC content-related CpG deficiency is probably caused by context-dependent deamination of methylated CpG sites. This hypothesis was examined in this study by comparing nucleotide frequencies at CpG flanking positions among invertebrate and vertebrate genomes. The finding is a transition of nucleotide preference of 5' T to 5' A at the invertebrate-vertebrate boundary, indicating that a large number of CpG sites with 5' Ts were depleted because of global DNA methylation developed in vertebrates. At genome level, we investigated CpG observed/expected (obs/exp) values in 500 bp fragments, and found that higher CpG obs/exp value is shown in GC-poor regions of invertebrate genomes (except sea urchin) but in GC-rich sequences of vertebrate genomes. We next compared GC content at CpG flanking positions with genomic average, showing that the GC content is lower than the average in invertebrate genomes, but higher than that in vertebrate genomes. These results indicate that although 5' T and 5' A are different in inducing deamination of methylated CpG sites, GC content is even more important in affecting the deamination rate. In all the tests, the results of sea urchin are similar to vertebrates perhaps due to its fractional DNA methylation. CpG deficiency is therefore suggested to be mainly a result of high mutation rates of methylated CpG sites in GC-poor regions.  相似文献   

3.
Summary Sequence data from regions of five vertebrate vitellogenin genes were used to examine the frequency, distribution, and mutability of the dinucleotide CpG, the preferred modification site for eukaryotic DNA methyltransferases. The observed level of the CpG dinucleotide in all five genes was markedly lower than that expected from the known mononucleotide frequencies. CpG suppression was greater in introns than in exons. CpG-containing codons were found to be avoided in the vitellogenin genes, but not completely despite the redundancy of the genetic code. Frequency and distribution patterns of this dinucleotide varied dramatically among these otherwise closely related genes. Dense clusters of CpG dinucleotides tended to appear in regions of either functional or structural interest (e.g., in the transposon-like Vi-element ofXenopus) and these clusters contained 5-methylcytosine (5 mC). 5 mC is known to undergo deamination to form thymidine, but the extent to which this transition occurs in the heavily methylated genomes of vertebrates and its contribution to CpG suppression are still unclear. Sequence comparison of the methylated vitellogenin gene regions identified CT and GA substitutions that were found to occur at relatively high frequencies. The predicted products of CpG deamination, TpG and CpA, were elevated. These findings are consistent with the view that CpG distribution and methylation are interdependent and that deamination of 5 mC plays an important role in promoting evolutionary change at the nucleotide sequence level.  相似文献   

4.
Cytosine methylation at CpG dinucleotides is a central component of epigenetic regulation in vertebrates, and the base excision repair (BER) pathway is important for maintaining both the genetic stability and the methylation status of CpG sites. This perspective focuses on two enzymes that are of particular importance for the genetic and epigenetic integrity of CpG sites, methyl binding domain 4 (MBD4) and thymine DNA glycosylase (TDG). We discuss their capacity for countering C to T mutations at CpG sites, by initiating base excision repair of G·T mismatches generated by deamination of 5-methylcytosine (5mC). We also consider their role in active DNA demethylation, including pathways that are initiated by oxidation and/or deamination of 5mC.  相似文献   

5.
Asymmetrical distribution of CpG in an 'average' mammalian gene.   总被引:24,自引:7,他引:17       下载免费PDF全文
The frequency and distribution of the rare dinucleotide CpG was examined in 15 mammalian genes. CpG is highly methylated at cytosine in mammalian DNA (1,2) and 5-methylcytosine (5mC) is thought to undergo a transition mutation via deamination to produce thymine (3). This would result in the accumulation of TpG and CpA and depletion of CpG during evolution (4). Consistent with this hypothesis, the gene sample of 26,541 dinucleotides contained CpG at 40% the frequency expected by base composition and the CpG transition products, TpG+CpA, were significantly elevated at 124% of expected random frequency. However, because CpG occurs at only 25% of expected random frequency in the genome, the sampled genes were considerably enriched in this dinucleotide. CpGs were asymmetrically distributed in sequences flanking the genes. 5'-flanking sequences were enriched in CpG at 135% of the frequency expected assuming a symmetrical distribution of all the CpGs in the sampled genes (p less than 0.01), while 3'-flanking regions were depleted in CpG at 40% of expected values (p less than 0.0001). This asymmetry may reflect the role of 5-methylcytosine in gene expression. In contrast the frequencies of GpC and GpT+ ApC did not differ significantly from that predicted by base composition and these dinucleotides were not asymmetrically distributed.  相似文献   

6.
7.
Methylation of cytosine at the 5-carbon position (5 mC) is observed in both prokaryotes and eukaryotes. In humans, DNA methylation at CpG sites plays an important role in gene regulation and has been implicated in development, gene silencing, and cancer. In addition, the CpG dinucleotide is a known hot spot for pathologic mutations genome-wide. CpG tracts may adopt left-handed Z-DNA conformations, which have also been implicated in gene regulation and genomic instability. Methylation facilitates this B-Z transition but the underlying mechanism remains unclear. Herein, four structural models of the dinucleotide d(GC)(5) repeat sequence in B-, methylated B-, Z-, and methylated Z-DNA forms were constructed and an aggregate 100 nanoseconds of molecular dynamics simulations in explicit solvent under physiological conditions was performed for each model. Both unmethylated and methylated B-DNA were found to be more flexible than Z-DNA. However, methylation significantly destabilized the BII, relative to the BI, state through the Gp5mC steps. In addition, methylation decreased the free energy difference between B- and Z-DNA. Comparisons of α/γ backbone torsional angles showed that torsional states changed marginally upon methylation for B-DNA, and Z-DNA. Methylation-induced conformational changes and lower energy differences may contribute to the transition to Z-DNA by methylated, over unmethylated, B-DNA and may be a contributing factor to biological function.  相似文献   

8.
Transitions at CpG dinucleotides, referred to as “CpG substitutions”, are a major mutational input into vertebrate genomes and a leading cause of human genetic disease. The prevalence of CpG substitutions is due to their mutational origin, which is dependent on DNA methylation. In comparison, other single nucleotide substitutions (for example those occurring at GpC dinucleotides) mainly arise from errors during DNA replication. Here we analyzed high quality BAC-based data from human, chimpanzee, and baboon to investigate regional variation of CpG substitution rates.

We show that CpG substitutions occur approximately 15 times more frequently than other single nucleotide substitutions in primate genomes, and that they exhibit substantial regional variation. Patterns of CpG rate variation are consistent with differences in methylation level and susceptibility to subsequent deamination. In particular, we propose a “distance-decaying” hypothesis, positing that due to the molecular mechanism of a CpG substitution, rates are correlated with the stability of double-stranded DNA surrounding each CpG dinucleotide, and the effect of local DNA stability may decrease with distance from the CpG dinucleotide.

Consistent with our “distance-decaying” hypothesis, rates of CpG substitution are strongly (negatively) correlated with regional G+C content. The influence of G+C content decays as the distance from the target CpG site increases. We estimate that the influence of local G+C content extends up to 1,500~2,000 bps centered on each CpG site. We also show that the distance-decaying relationship persisted when we controlled for the effect of long-range homogeneity of nucleotide composition. GpC sites, in contrast, do not exhibit such “distance-decaying” relationship. Our results highlight an example of the distinctive properties of methylation-dependent substitutions versus substitutions mostly arising from errors during DNA replication. Furthermore, the negative relationship between G+C content and CpG rates may provide an explanation for the observation that GC-rich SINEs show lower CpG rates than other repetitive elements.

  相似文献   

9.
Jiang C  Zhao Z 《Genomics》2006,88(5):527-534
So far, there is no genome-wide estimation of the mutational spectrum in humans. In this study, we systematically examined the directionality of the point mutations and maintenance of GC content in the human genome using approximately 1.8 million high-quality human single nucleotide polymorphisms and their ancestral sequences in chimpanzees. The frequency of C-->T (G-->A) changes was the highest among all mutation types and the frequency of each type of transition was approximately fourfold that of each type of transversion. In intergenic regions, when the GC content increased, the frequency of changes from G or C increased. In exons, the frequency of G:C-->A:T was the highest among the genomic categories and contributed mainly by the frequent mutations at the CpG sites. In contrast, mutations at the CpG sites, or CpG-->TpG/CpA mutations, occurred less frequently in the CpG islands relative to intergenic regions with similar GC content. Our results suggest that the GC content is overall not in equilibrium in the human genome, with a trend toward shifting the human genome to be AT rich and shifting the GC content of a region to approach the genome average. Our results, which differ from previous estimates based on limited loci or on the rodent lineage, provide the first representative and reliable mutational spectrum in the recent human genome and categorized genomic regions.  相似文献   

10.
Cytosine methylation and the fate of CpG dinucleotides in vertebrate genomes   总被引:30,自引:1,他引:29  
Summary The dinucleotide CpG is a hotspot for mutation in the human genome as a result of (1) the modification of the 5 cytosine by cellular DNA methyltransferases and (2) the consequent high frequency of spontaneous deamination of 5-methyl cytosine (5mC) to thymidine. DNA methylation thus contributes significantly, albeit indirectly, to the incidence of human genetic disease. We have attempted to estimate for the first time the in vivo rate of deamination of 5mC from the measured rate of 5mC deamination in vitro and the known error frequency of the cellular G/T mismatch-repair system. The accuracy and utility of this estimate (m d ) was then assessed by comparison with clinical data, and an improved estimate of m d (1.66x10-16 s-1) was derived. Comparison of the CpG mutation rates exibited by globin gene and pseudogene sequences from human, chimpanzee and macaque provided further estimates of m d , all of which were consistent with the first. Use of this value in a mathematical model then permitted the estimation of the length of time required to produce the level of CpG suppression currently found in the bulk DNA of vertebrate genomes. This time span, approximately 450 million years, corresponds closely to the estimated time since the emergence and adaptive radiation of the vertebrates and thus coincides with the probable advent of heavily methylated genomes. An accurate estimate of the 5mC deamination rate is important not only for clinical medicine but also for studies of gene evolution. Our data suggest both that patterns of vertebrate gene methylation may be comparatively stable over relatively long periods of evolutionary time, and that the rate of CpG deamination can, under certain limited conditions, serve as a molecular clock.  相似文献   

11.
12.
Although methylation of DNA at some sites regulates gene expression, 5mC at many sites does not appear to have any effect. We present evidence that hemimethylation at many different sites can act as a discrimination signal in mismatch repair. Deamination of 5mC in a symmetrically methylated doublet CpG yields the mismatched base pair T/G in a hemi-methylated doublet pair. Because both bases in the mismatched pair are normal constituents of DNA, identifying the incorrect base is problematic. The only apparent distinction of the two is the methylation on the strand opposite the deamination event. Using available methylases we have produced hemi-methylated SV40 DNAs that are mismatched at a single T/G or A/C basepair in a sequence that mimics the lesion resulting from the deamination of a 5mCpG. Methylation at the adjacent cytosine results in the replacement of the T much more frequently than when no methylation is present in the heteroduplex. Cytosine methylation at sites farther removed from the mismatch is equally effective in replacing the incorrect T at the mismatch. Although methylation in vertebrates is almost exclusively on cytosine in the doublet CpG, methylation of cytosines in other doublets, as well as methylation of adenosine, also act as strand discrimination signals. Perhaps some of the excess methylation in vertebrate DNAs may serve to direct mismatch repair.  相似文献   

13.
Cohen NM  Kenigsberg E  Tanay A 《Cell》2011,145(5):773-786
Mammalian CpG islands are key epigenomic elements that were first characterized experimentally as genomic fractions with low levels of DNA methylation. Currently, CpG islands are defined based on their genomic sequences alone. Here, we develop evolutionary models to show that several distinct evolutionary processes generate and maintain CpG islands. One central evolutionary regime resulting in enriched CpG content is driven by low levels of DNA methylation and consequentially low rates of CpG deamination. Another major force forming CpG islands is biased gene conversion that stabilizes constitutively methylated CpG islands by balancing rapid deamination with CpG fixation. Importantly, evolutionary analysis and population genetics data suggest that selection for high CpG content is not?a significant factor contributing to conservation of CpGs in differentially methylated regions. The heterogeneous, but not selective, origins of CpG islands have direct implications for the understanding of DNA methylation patterns in healthy and diseased cells.  相似文献   

14.
In 4M NaCl solutions (dC-dG)n (n = 3,4; approximately 9 mM) exist as a mixture o +/- B and Z forms. The low and high field components of two 31P NMR resonances originating from internal phosphodiester groups are assigned to the GpC and CpG linkages, respectively. Low temperatures stabilize the Z-forms, which completely disappear above 50 degrees C (n = 3) and 65 degrees C (n = 4). delta H = -44 and -17 kJ/mol for B to Z transition in the hexamer and octamer duplexes, respectively. Temperature dependent changes (0-50 degrees C range) in the spin-lattice relaxation times at 145.7 MHz are distinctly different for the 31P nuclei o +/- GpC and CpG groups. The relaxation data can be explained by assuming that the GpC phosphodiester groups undergo more local internal motion than do the CpG groups.  相似文献   

15.
Transgenic mutation assays utilizing bacterial target genes display a high frequency of spontaneous mutation at CpG sequences. This is believed to result from the fact that: (1) the prokaryotic genes currently being used as transgenic mutation targets have a high CpG content and (2) these sequences are methylated by mammalian cells to produce 5-methylcytosine (5MC), a known promutagenic base. To study the effect of CpG content on the frequency and type of spontaneous mutation, we have synthesized an analogue of the bacterial lacI target gene (mrkII) that contains a reduced number of CpG sequences. This gene was inserted into a lambda vector and used to construct trangenic mice that undergo vector rescue from genomic DNA upon in vitro packaging. Results on spontaneous mutation frequency and spectrum have been collected and compared to those observed at the lacI gene in Big Blue™ transgenic mice. Spontaneous mutations at the mrkII gene occurred at a frequency in the mid-10−5 range and were predominantly base pair substitutions, similar to results seen in Big Blue™. However, mrkII mutations were distributed toward the carboxyl end of the gene instead of the bias toward the amino terminus seen in lacI. Unexpectedly, 23% of the spontaneous mrkII mutations were GC → AT transitions at CpG sequences (compared to 32% in lacI), despite the reduction in CpG number from 95 in lacI to only 13 in mrkII. Nine of the CpG bases undergoing transition mutations in mrkII have not been recorded previously as spontaneous sites in Big Blue™. Therefore, substantial reduction of the number of CpG sequences in the lacI transgene did not significantly reduce the rate of spontaneous mutation or alter the contribution of CpG-related events. This suggests that other factors are also operating to establish frequency and composition of spontaneous mutations in transgenic targets.  相似文献   

16.
Hydroxylation of 5-methylcytosine (5mC) to 5-hydroxymethylcytosine (5hmC) by TET enzymes presents a particular regulatory mechanism in the mammalian brain. However, although methylation and hydroxymethylation of cytosines in non-CpG contexts have been reported, these mechanisms remain poorly understood. Here, we applied TAB-seq and oxBS-seq selectively to detect 5hmC and 5mC at base resolution in olfactory bulb derived from female mice. We found that active turnover of 5mC to 5hmC occurred in both CpG and non-CpG contexts. Strikingly, we identified a different sequence preference for 5mC and 5hmC in a CH context, in which H = A, C, or T, TNCA/TC for 5mC and NNCA/T/CN for 5hmC. More importantly, we found that genes showing 5mC to 5hmC turnover showed only limited overlap in CpG and CH contexts, and that olfactory receptor genes were marked with higher turnover of 5mC to 5hmC in non-CpG context. Collectively, we identified an unexpected sequence preference for non-CpG hydroxymethylation and distinct target genes regulated by the turnover of 5mC to 5hmC in CpG and CH contexts.  相似文献   

17.
DNA methylation is a major epigenetic modification of the genome that affects basic biological functions, such as gene expression and cell development. We used the human genome sequences and the DNA methylation data that are available in order to establish a map of the levels of GC and methylation in isochores. We also looked for the correlations that hold between GC levels and the distribution of the (1) dinucleotide CpG, (2) ratio 5mC/CpG, and (3) CpG islands. Our results show that methylation levels, CpG frequencies, and the density of CpG islands are positively correlated with the GC level of isochores. In contrast, the correlation between the 5mC/CpG ratio and GC is a negative one because the increase in methylation lags behind that of CpG, to reach a plateau in the GC-richest, gene-richest isochore families H2 and H3. In conclusion, there are more CpG targets that remain unmethylated in the GC-richest, gene-richest isochores in comparison with the other isochores. This conclusion supports the idea that the widespread methylation under consideration here has a general inhibitory effect on gene expression.  相似文献   

18.
We have determined the 5-methylcytosine (5mC) content in high molecular weight DNA, from two dicot (tobacco and pea) and two monocot (wheat and maize) plant species, fractionated according to base composition. The results show that the proportion of 5mC in the genomic fractions increases linearly with their guanine + cytosine (G + C) content while the proportion of non-methylated cytosine remains almost constant. This can be interpreted as a consequence of a difference in mutation pressure related to spontaneous deamination of 5mC to thymine between the different compartments of plant genomes.  相似文献   

19.
Summary The extent to which CpG dinucleotides were depleted in a large set of angiosperm genes was, on average, very similar to the extent of CpG depletion in total angiosperm genomic DNA and far less than the extent of CpG depletion in vertebrate genes. Gene sequences from Arabidopsis thaliana, a dicotyledonous species with relatively low levels of total 5-methylcytosine, were just as CpG depleted as the angiosperm genes in general. Furthermore, levels of TpG and CpA, the potential deamination mutation products of methylated CpG, were elevated in A. thaliana genes, supporting a high rate of deamination mutation as the cause of the CpG deficiency. Using a method that takes into account the dinucleotide frequencies within each sequence of interest, we calculated the expected frequencies of CpNpG trinucleotides, which are also highly methylated in angiosperm genomes. CpNpG trinucleotides were not extensively enriched or depleted in the angiosperm genes. Two hypotheses could account for our results. Differential depletion of CpG and CpNpG within angiosperm genes and differential depletion of CpG in angiosperm and vertebrate genes could arise from different efficiencies of mismatch repair or from different levels of cytosine methylation in the cell lineages that contribute to germ cells.Offprint requests to: M. Gardiner-Garden  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号