首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Simmen MW 《Genomics》2008,92(1):33-40
In mammalian genomes CpGs occur at one-fifth their expected frequency. This is accepted as resulting from cytosine methylation and deamination of 5-methylcytosine leading to TpG and CpA dinucleotides. The corollary that a CpG deficit should correlate with TpG excess has not hitherto been systematically tested at a genomic level. I analyzed genome sequences (human, chimpanzee, mouse, pufferfish, zebrafish, sea squirt, fruitfly, mosquito, and nematode) to do this and generally to assess the hypothesis that CpG deficit, TpG excess, and other data are accountable in terms of 5-methylcytosine mutation. In all methylated genomes local CpG deficit decreases with higher G + C content. Local TpG surplus, while positively associated with G + C level in mammalian genomes but negatively associated with G + C in nonmammalian methylated genomes, is always explicable in terms of the CpG trend under the methylation model. Covariance of dinucleotide abundances with G + C demonstrates that correlation analyses should control for G + C. Doing this reveals a strong negative correlation between local CpG and TpG abundances in methylated genomes, in accord with the methylation hypothesis. CpG deficit also correlates with CpT excess in mammals, which may reflect enhanced cytosine mutation in the context 5'-YCG-3'. Analyses with repeat-masked sequences show that the results are not attributable to repetitive elements.  相似文献   

2.
To understand the variation in genomic composition and its effect on codon usage, we performed the comparative analysis of codon usage and nucleotide usage in the genes of three dicots, Glycine max, Arabidopsis thaliana and Medicago truncatula. The dicot genes were found to be A/T rich and have predominantly A-ending and/or T-ending codons. GC3s directly mimic the usage pattern of global GC content. Relative synonymous codon usage analysis suggests that the high usage frequency of A/T over G/C mononucleotide containing codons in AT-rich dicot genome is due to compositional constraint as a factor of codon usage bias. Odds ratio analysis identified the dinucleotides TpG, TpC, GpA, CpA and CpT as over-represented, where, CpG and TpA as under-represented dinucleotides. The results of (NcExp?NcObs)/NcExp plot suggests that selection pressure other than mutation played a significant role in influencing the pattern of codon usage in these dicots. PR2 analysis revealed the significant role of selection pressure on codon usage. Analysis of varience on codon usage at start and stop site showed variation in codon selection in these sites. This study provides evidence that the dicot genes were subjected to compositional selection pressure.  相似文献   

3.
The only natural postsynthetic modification known to occur in mammalian DNA is the methylation in the 5 position of deoxycytidines. Of the four 5'-CpN-3' dinucleotides (ie. CpG, CpC, CpA, and CpT), the dinucleotide which contains the highest proportion of deoxycytidines methylated is CpG, with 40 to 80% methylation in different mammalian genomes. It has also been shown that CpA, CpT, and CpC are methylated as well but to a much lower extent. Here we report the result of a full nearest neighbour analysis (together with quantitation of methylation levels in the 4 CpN dinucleotides) for DNA from human spleen. Using the values we have calculated the overall frequencies for all the methylated dinucleotides in the human genome. Because of the relative underrepresentation (by 7 to 10 fold) of the CpG dinucleotide, only 45.5% of total mC was present in mCpG, with 54.5% in mCpA, mCpT plus mCpC. These calculations have implications for studies into the function and significance of DNA methylation in mammalian cells.  相似文献   

4.
《Epigenetics》2013,8(6):823-828
In mammalian genomes, the methylation of cytosine residues within CpG dinucleotides is crucial to normal development and cell differentiation. However, methylation of cytosines in the contexts of CpA, CpT, and CpC (non-CpG methylation) has been reported for decades, yet remains poorly understood. In recent years, whole genome bisulphite sequencing (WGBS) has confirmed significant levels of non-CpG methylation in specific tissues and cell types. Non-CpG methylation has several properties that distinguish it from CpG methylation. Here we review the literature describing non-CpG methylation in mammalian cells, describe the important characteristics that distinguish it from CpG methylation, and discuss its functional importance.  相似文献   

5.
In mammalian genomes, the methylation of cytosine residues within CpG dinucleotides is crucial to normal development and cell differentiation. However, methylation of cytosines in the contexts of CpA, CpT, and CpC (non-CpG methylation) has been reported for decades, yet remains poorly understood. In recent years, whole genome bisulphite sequencing (WGBS) has confirmed significant levels of non-CpG methylation in specific tissues and cell types. Non-CpG methylation has several properties that distinguish it from CpG methylation. Here we review the literature describing non-CpG methylation in mammalian cells, describe the important characteristics that distinguish it from CpG methylation, and discuss its functional importance.  相似文献   

6.
This study reports the analysis of codon usage in 35 complete Homo sapiens genes. Both codon frequency and inter-codon interference exhibit patterns of evolutionary interest. There is a significant positive correlation between the frequency with which a given codon is used and the frequency with which its complement is used. Since the frequency of appearance of the complementary codon on the coding strand is equal to the frequency of appearance of the original codon on the non-coding strand, in the same phase, the non-coding strand is found to resemble the coding strand in triplet composition. The same effect has been observed in Escherichia coli. This preference for the use of certain complementary triplets as codons suggests that the evolution of the use of the genetic code depended to some extent upon the double-stranded nature of the coding material. In addition, the effect of discrimination against the use of two dinucleotides, CpG and UpA, is observed in codon usage and also in adjacent codon interference. Codons beginning with G, or A, are unlikely to be preceded by codons ending in C, or U, respectively. Consideration of codon assignment in the genetic code together with the observed CpG infrequency suggests that the evolution of the code may have been influenced by conditions in which the use of CpG dinucleotides was unfavorable. The infrequent use of UpA dinucleotides can be explained as the result of frameshift mutation during gene evolution.  相似文献   

7.
Parvoviruses are rapidly evolving viruses that infect a wide range of hosts, including vertebrates and invertebrates. Extensive methylation of the parvovirus genome has been recently demonstrated. A global pattern of methylation of CpG dinucleotides is seen in vertebrate genomes, compared to “fractional” methylation patterns in invertebrate genomes. It remains unknown if the loss of CpG dinucleotides occurs in all viruses of a given DNA virus family that infect host species spanning across vertebrates and invertebrates. We investigated the link between the extent of CpG dinucleotide depletion among autonomous parvoviruses and the evolutionary lineage of the infected host. We demonstrate major differences in the relative abundance of CpG dinucleotides among autonomous parvoviruses which share similar genome organization and common ancestry, depending on the infected host species. Parvoviruses infecting vertebrate hosts had significantly lower relative abundance of CpG dinucleotides than parvoviruses infecting invertebrate hosts. The strong correlation of CpG dinucleotide depletion with the gain in TpG/CpA dinucleotides and the loss of TpA dinucleotides among parvoviruses suggests a major role for CpG methylation in the evolution of parvoviruses. Our data present evidence that links the relative abundance of CpG dinucleotides in parvoviruses to the methylation capabilities of the infected host. In sum, our findings support a novel perspective of host-driven evolution among autonomous parvoviruses.  相似文献   

8.
Understanding the cause of the changes in the amino acid composition of proteins is essential for understanding the evolution of protein functions. Since the early 1970s, it has been known that the frequency of some amino acids in protein sequences is increasing and that of others is decreasing. Recently, it was found that the trends of amino acid changes were similar in 15 taxa representing Bacteria, Archaea, and Eukaryota. However, the cause of this similarity in the trend of the gains and losses of amino acids continued to be debated. Here, we show that this trend of the gain and loss of amino acids can be simply explained by CpG hypermutability. We found that the frequency of amino acids coded by codons with TpG dinucleotides and those with CpA dinucleotides is increasing, while that of amino acids coded by codons with CpG dinucleotides is decreasing. We also found that organisms that lack DNA methyltransferase show different trends of the gain and loss of amino acids. DNA methyltransferase methylates CpG dinucleotides and induces CpG hypermutability. The incorporation of CpG hypermutability into models of protein evolution will improve studies on protein evolution in different organisms. Electronic supplementary material  The online version of this article (doi:) contains supplementary material, which is available to authorized users.  相似文献   

9.

Background  

Neighboring nucleotides exert a striking influence on mutation, with the hypermutability of CpG dinucleotides in many genomes being an exemplar. Among the approaches employed to measure the relative importance of sequence neighbors on molecular evolution have been continuous-time Markov process models for substitutions that treat sequences as a series of independent tuples. The most widely used examples are the codon substitution models. We evaluated the suitability of derivatives of the nucleotide frequency weighted (hereafter NF) and tuple frequency weighted (hereafter TF) models for measuring sequence context dependent substitution. Critical properties we address are their relationships to an independent nucleotide process and the robustness of parameter estimation to changes in sequence composition. We then consider the impact on inference concerning dinucleotide substitution processes from application of these two forms to intron sequence alignments from primates.  相似文献   

10.
Site-specific methylation of cytosines is a key epigenetic mark of vertebrate DNA. While a majority of the methylated residues are in the symmetrical (meC)pG:Gp(meC) configuration, a smaller, but significant fraction is found in the CpA, CpT and CpC asymmetric (non-CpG) dinucleotides. CpG methylation is reproducibly maintained by the activity of the DNA methyltransferase 1 (Dnmt1) on the newly replicated hemimethylated substrates (meC)pG:GpC. On the other hand, establishment and hereditary maintenance of non-CpG methylation patterns have not been analyzed in detail. We previously reported the occurrence of site- and allele-specific methylation at both CpG and non-CpG sites. Here we characterize a hereditary complex of non-CpG methylation, with the transgenerational maintenance of three distinct profiles in a constant ratio, associated with extensive CpG methylation. These observations raised the question of the signal leading to the maintenance of the pattern of asymmetric methylation. The complete non-CpG pattern was reinstated at each generation in spite of the fact that the majority of the sperm genomes contained either none or only one methylated non-CpG site. This observation led us to the hypothesis that the stable CpG patterns might act as blueprints for the maintenance of non-CpG DNA methylation. As predicted, non-CpG DNA methylation profiles were abrogated in a mutant lacking Dnmt1, the enzymes responsible for CpG methylation, but not in mutants defective for either Dnmt3a or Dnmt2.  相似文献   

11.
Repeat-induced point mutation (RIP) is the most intriguing among the known mechanisms of repeated sequences inactivation because of its ability to produce irreversible mutation of repeated DNA. Discovered for the first time in Neurospora crassa, RIP is characterized by C:G to T:A transitions in duplicated sequences. The mechanisms and distribution of RIP are still purely investigated. Mobile elements are a common target for the processes which lead to homology-dependent silencing because of their ability to propagate themselves. We have done comparative analysis of LTR retrotransposons in genomic scale from genomes of two aspergilli fungi--Aspergillus funmigatus and A. nidulans, based on several copies we reconstructed "de-RIP" retroelements. Investigations of frequencies of CpG, CpA and TpG sites, which are potential targets for mutagenesis, showed the much lower frequencies of these sites in mobile elements in comparison with structural genes. LTR retrotransposons from A. fumigatus and A. nidulans have different ratio of types of substitutions. Our analysis indicates that two investigated fungi have or had the RIP-like processes for repeated sequences inactivation, in various modes. Whereas in A. fumigatus the context for mutagenesis consists of both CpG and CpA sites, in A. nidulans inactivation seems to proceed only on CpG dinucleotides. The present investigation gives a theoretical background for planning of experimental studying of RIP inactivation in aspergilli.  相似文献   

12.
In primate genomes more than 40% of CpG islands are found within repetitive elements. With more than one million copies in the human genome, the Alu family of retrotransposons represents the most successful short interspersed element (SINE) in primates and CpG dinucleotides make up about 20% of Alu sequences. It is generally thought that CpG dinucleotides mutate approximately ten times faster than other dinucleotides due to cytosine methylation and the subsequent deamination and conversion of C-->T. However, the disparity of Alu subfamily age estimations based upon CpG or non-CpG substitution density indicates a more complex relationship between CpG and non-CpG substitutions within the Alu elements. Here we report an analysis of the mutation patterns for 5296 Alu elements comprising 20 subfamilies. Our results indicate a relatively constant CpG versus non-CpG substitution ratio of approximately 6 for the young (AluY) and intermediate (AluS) Alu subfamilies. However, a more complex non-linear relationship between CpG and non-CpG substitutions was observed when old (AluJ) subfamilies were included in the analysis. These patterns may be the result of the slowdown of the neutral mutation rate during primate evolution and/or an increase in the CpG mutation rate as the consequence of increased DNA methylation in response to a burst of retrotransposition activity approximately 35 million years ago.  相似文献   

13.
Summary In species where actin genes exist as single copies, analysis of their synonymous codon usage and of the substitutions occurring between the genes of closely related species shows that there is a positive selection for codons that do not have highly mutable CpG dinucleotides in codon positions 2 and 3 when the GC content of these genes is less than 57%.  相似文献   

14.
CpG and TpA dinucleotides are underrepresented in the human genome. The CpG deficiency is due to the high mutation rate from C to T in methylated CpG's. The TpA suppression was thought to reflect a counterselection against TpA's destabilizing effect in RNA. Unexpectedly, the TpA and CpG deficiencies vary according to the G+C contents of sequences. It has been proposed that the variation in CpG suppression was correlated with a particular chromatin organization in G+C-rich isochores. Here, we present an improved model of dinucleotide evolution accounting for the overlap between successive dinucleotides. We show that an increased mutation rate from CpG to TpG or CpA induces both an apparent TpA deficiency and a correlation between CpG and TpA deficiencies and G+C content. Moreover, this model shows that the ratio of observed over expected CpG frequency underestimates the real CpG deficiency in G+C-rich sequences. The predictions of our model fit well with observed frequencies in human genomic data. This study suggests that previously published selectionist interpretations of patterns of dinucleotide frequencies should be taken with caution. Moreover, we propose new criteria to identify unmethylated CpG islands taking into account this bias in the measure of CpG depletion.  相似文献   

15.
An in vitro DNA synthesizing system from mouse fibroblasts has been used to study DNA methylation. DNA methylation occurs in two phases, one at the replication fork and the other farther behind it. Although 4% of the dCMP residues in mouse cell DNA are mdCMP, only 1.7% of the total [alpha 32P]dCMP in newly replicated DNA is methylated in vitro. No methylation of Okazaki fragments was detected. Nearest neighbor analysis of the newly replicated DNA revealed that, although 40% of the CpG dinucleotides were methylated, significant amounts of cytosine methylation were also found in CpC, CpT, and CpA dinucleotides.  相似文献   

16.
17.
One of the key objectives of comparative genomics is the characterization of the forces that shape genomes over the course of evolution. In the last decades, evidence has been accumulated that for vertebrate genomes also epigenetic modifications have to be considered in this context. Especially, the elevated mutation frequency of 5-methylcytosine (5mC) is assumed to facilitate the depletion of CpG dinucleotides in species that exhibit global DNA methylation. For instance, the underrepresentation of CpG dinucleotides in many mammalian genomes is attributed to this effect, which is only neutralized in so-called CpG islands (CGIs) that are preferentially unmethylated and thus partially protected from rapid CpG decay. For primate-specific CpG-rich transposable elements from the ALU family, it is unclear whether their elevated CpG frequency is caused by their small age or by the absence of DNA methylation. In consequence, these elements are often misclassified in CGI annotations. We present a method for the estimation of germ line methylation from pairwise ancestral-descendant alignments. The approach is validated in a simulation study and tested on DNA repeats from the AluSx family. We conclude that a predicted unmethylated state in the germ line is highly correlated with epigenetic activity of the respective genomic region. Thus, CpG-rich repeats can be facilitated as in silico probes for the epigenetic potential of their genomic neighborhood.  相似文献   

18.
19.
Differences in the relative abundance of dinucleotides, if any may provide important clues on host-driven evolution of viruses. We studied dinucleotide frequencies of large DNA viruses infecting vertebrates (n = 105; viruses infecting mammals = 99; viruses infecting aves = 6; viruses infecting reptiles = 1) and invertebrates (n = 88; viruses infecting insects = 84; viruses infecting crustaceans = 4). We have identified systematic depletion of CpT(ApG) dinucleotides and over-representation of CpG dinucleotides as the unique genomic signature of large DNA viruses infecting invertebrates. Detailed investigation of this unique genomic signature suggests the existence of invertebrate host-induced pressures specifically targeting CpT(ApG) and CpG dinucleotides. The depletion of CpT dinucleotides among large DNA viruses infecting invertebrates is at least in part, explained by non-canonical DNA methylation by the infected host. Our findings highlight the role of invertebrate host-related factors in shaping virus evolution and they also provide the necessary framework for future studies on evolution, epigenetics and molecular biology of viruses infecting this group of hosts.  相似文献   

20.
Sabia and Chapare viruses in the Arenavirus family cause viral hemorrhagic fever among humans with a fatality rate of 30% with no treatment models. Therefore, it is of interest to document the codon usage, amino acid patterns and associated factors influencing the observed variations in Sabia and Chapare viruses for host adaptation. Multivariate statistical analysis revealed compositional constraint and host selection pressure influencing the viral codon usage patterns. These data suggests the codon usage signatures in Sabia and Chapare viruses for host adaptation in the human host implying its role in the rapid progression of the infection. Dinucleotides UpG and CpA were noted to be over-represented among the Sabia, Chapare viruses and human genomes. Strong restraint from the usage of CpG dinucleotides among viruses is linked with the molecular mimicry of the human immune system. Thus, the data reported from this study help in understanding the mechanism of viral adaptation inside the host genome for further consideration in drug discovery.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号