首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 20 毫秒
1.
2.
Vertebrate genomes are characterized with CpG deficiency, particularly for GCpoor regions. The GC content-related CpG deficiency is probably caused by context-dependent deamination of methylated CpG sites. This hypothesis was examined in this study by comparing nucleotide frequencies at CpG flanking positions among invertebrate and vertebrate genomes. The finding is a transition of nucleotide preference of 5' T to 5' A at the invertebrate-vertebrate boundary, indicating that a large number of CpG sites with 5' Ts were depleted because of global DNA methylation developed in vertebrates. At genome level, we investigated CpG observed/expected (obs/exp) values in 500 bp fragments, and found that higher CpG obs/exp value is shown in GC-poor regions of invertebrate genomes (except sea urchin) but in GC-rich sequences of vertebrate genomes. We next compared GC content at CpG flanking positions with genomic average, showing that the GC content is lower than the average in invertebrate genomes, but higher than that in vertebrate genomes. These results indicate that although 5' T and 5' A are different in inducing deamination of methylated CpG sites, GC content is even more important in affecting the deamination rate. In all the tests, the results of sea urchin are similar to vertebrates perhaps due to its fractional DNA methylation. CpG deficiency is therefore suggested to be mainly a result of high mutation rates of methylated CpG sites in GC-poor regions.  相似文献   

3.
It is well known that the dinucleotide CpG is under-represented in the genomic DNA of many vertebrates. This is commonly thought to be due to the methylation of cytosine residues in this dinucleotide and the corresponding high rate of deamination of 5-methycytosine, which lowers the frequency of this dinucleotide in DNA. Surprisingly, many single-stranded RNA viruses that replicate in these vertebrate hosts also have a very low presence of CpG dinucleotides in their genomes. Viruses are obligate intracellular parasites and the evolution of a virus is inexorably linked to the nature and fate of its host. One therefore expects that virus and host genomes should have common features. In this work, we compare evolutionary patterns in the genomes of ssRNA viruses and their hosts. In particular, we have analyzed dinucleotide patterns and found that the same patterns are pervasively over- or under-represented in many RNA viruses and their hosts suggesting that many RNA viruses evolve by mimicking some of the features of their host's genes (DNA) and likely also their corresponding mRNAs. When a virus crosses a species barrier into a different host, the pressure to replicate, survive and adapt, leaves a footprint in dinucleotide frequencies. For instance, since human genes seem to be under higher pressure to eliminate CpG dinucleotide motifs than avian genes, this pressure might be reflected in the genomes of human viruses (DNA and RNA viruses) when compared to those of the same viruses replicating in avian hosts. To test this idea we have analyzed the evolution of the influenza virus since 1918. We find that the influenza A virus, which originated from an avian reservoir and has been replicating in humans over many generations, evolves in a direction strongly selected to reduce the frequency of CpG dinucleotides in its genome. Consistent with this observation, we find that the influenza B virus, which has spent much more time in the human population, has adapted to its human host and exhibits an extremely low CpG dinucleotide content. We believe that these observations directly show that the evolution of RNA viral genomes can be shaped by pressures observed in the host genome. As a possible explanation, we suggest that the strong selection pressures acting on these RNA viruses are most likely related to the innate immune response and to nucleotide motifs in the host DNA and RNAs.  相似文献   

4.
Transitions at CpG dinucleotides, referred to as “CpG substitutions”, are a major mutational input into vertebrate genomes and a leading cause of human genetic disease. The prevalence of CpG substitutions is due to their mutational origin, which is dependent on DNA methylation. In comparison, other single nucleotide substitutions (for example those occurring at GpC dinucleotides) mainly arise from errors during DNA replication. Here we analyzed high quality BAC-based data from human, chimpanzee, and baboon to investigate regional variation of CpG substitution rates.

We show that CpG substitutions occur approximately 15 times more frequently than other single nucleotide substitutions in primate genomes, and that they exhibit substantial regional variation. Patterns of CpG rate variation are consistent with differences in methylation level and susceptibility to subsequent deamination. In particular, we propose a “distance-decaying” hypothesis, positing that due to the molecular mechanism of a CpG substitution, rates are correlated with the stability of double-stranded DNA surrounding each CpG dinucleotide, and the effect of local DNA stability may decrease with distance from the CpG dinucleotide.

Consistent with our “distance-decaying” hypothesis, rates of CpG substitution are strongly (negatively) correlated with regional G+C content. The influence of G+C content decays as the distance from the target CpG site increases. We estimate that the influence of local G+C content extends up to 1,500~2,000 bps centered on each CpG site. We also show that the distance-decaying relationship persisted when we controlled for the effect of long-range homogeneity of nucleotide composition. GpC sites, in contrast, do not exhibit such “distance-decaying” relationship. Our results highlight an example of the distinctive properties of methylation-dependent substitutions versus substitutions mostly arising from errors during DNA replication. Furthermore, the negative relationship between G+C content and CpG rates may provide an explanation for the observation that GC-rich SINEs show lower CpG rates than other repetitive elements.

  相似文献   

5.
Based on 152 mitochondrial genomes and 36 bacterial chromosomes that have been completely sequenced, as well as three long contigs for human chromosomes 6, 21, and 22, we examined skews of mononucleotide frequencies and the relative abundance of dinucleotides in one DNA strand. Each group of these genomes has its own characteristics. Regarding mitochondrial genomes, both CpG and GpT are underrepresented, while either GpG or CpC or both are overrepresented. The relative frequency of nucleotide T vs A and of nucleotide G vs C is strongly skewed, due presumably to strand asymmetry in replication errors and unidirectional DNA replication from single origins. Exceptions are found in the plant and yeast mitochondrial genomes, each of which may replicate from multiple origins. Regarding bacterial genomes, the ``universal' rule of CpG deficiency is restricted to archaebacteria and some eubacteria. In other eubacteria, the most underrepresented dinucleotide is either TpA or GpT. In general, there are significant T vs A and G vs C skews in each half of the bacterial genome, although these are almost exactly canceled out over the whole genome. Regarding human chromosomes 6, 21, and 22, dinucleotide CpG tends to be avoided. The relative frequency of mononucleotides exhibits conspicuous local skews, suggesting that each of these chromosomal segments contains more than one DNA replication origin. It is concluded that, when there are several replicons in a genomic region, not only the number of DNA replication origins but also the directionality is important and that the observed patterns of nucleotide frequencies in the genome strongly support the hypothesis of strand asymmetry in replication errors. Received: 1 November 2000 / Accepted: 12 March 2001  相似文献   

6.
Some bacterial genomes are known to have low CpG dinucleotide frequencies. While their causes are not clearly understood, the frequency of CpG is suppressed significantly in the genome of Mycoplasma genitalium, but not in that of Mycoplasma pneumoniae. We compared orthologous gene pairs of the two closely related species to analyze CpG substitution patterns between these two genomes. We also divided genome sequences into three regions: protein-coding, noncoding, and RNA-coding, and obtained the CpG frequencies for each region for each organism. It was found that the observed/expected ratio of CpG dinucleotides is low in both the protein-coding and noncoding regions; while that ratio is in the normal range in the RNA-coding region. Our results indicate that CpG suppression of the Mycoplasma genome is not caused by (1) biased usage amino acid; (2) biased usage of synonymous codon; or (3) methylation effects by the CpG methyltransferase in the genomes of their hosts. Instead, we consider it likely that a certain global pressure, such as genome-wide pressure for the advantages of DNA stability or replication, has the effect of decreasing CpG over the entire genome, which, in turn, resulted in the biased codon usage.  相似文献   

7.
8.
Echinomycin binding to alternating AT.   总被引:1,自引:1,他引:0       下载免费PDF全文
K R Fox  J N Marks    K Waterloh 《Nucleic acids research》1991,19(24):6725-6730
We have studied the binding of echinomycin to DNA fragments containing GC-rich regions flanked by blocks of alternating AT by DNase I footprinting and diethylpyrocarbonate modification. Regions of alternating AT flanking the sequences CCCG, CCGC, CGGC and GG show a four base pair DNase I cleavage pattern and reaction of alternate adenines with diethylpyrocarbonate. This pattern is strongest when the AT-block is immediately adjacent to the CpG ligand binding site. We explain these phenomena by suggesting that echinomycin binds to the dinucleotide step ApT in a cooperative fashion. The cooperative effects can be transmitted through the dinucleotide step GC but not CC or AA. No such repetitive patterns are seen with surrounding regions of (ATT).(AAT). Evidence is presented for secondary drug binding sites at CpC and TpG with weaker interaction at the CpG site within the hexanucleotide TTCGAA.  相似文献   

9.
No simple model exists that accurately describes the melting behavior and breathing dynamics of double-stranded DNA as a function of nucleotide sequence. This is especially true for homogenous and periodic DNA sequences, which exhibit large deviations in melting temperature from predictions made by additive thermodynamic contributions. Currently, no method exists for analysis of the DNA breathing dynamics of repeats and of highly G/C- or A/T-rich regions, even though such sequences are widespread in vertebrate genomes. Here, we extend the nonlinear Peyrard–Bishop–Dauxois (PBD) model of DNA to include a sequence-dependent stacking term, resulting in a model that can accurately describe the melting behavior of homogenous and periodic sequences. We collect melting data for several DNA oligos, and apply Monte Carlo simulations to establish force constants for the 10 dinucleotide steps (CG, CA, GC, AT, AG, AA, AC, TA, GG, TC). The experiments and numerical simulations confirm that the GG/CC dinucleotide stacking is remarkably unstable, compared with the stacking in GC/CG and CG/GC dinucleotide steps. The extended PBD model will facilitate thermodynamic and dynamic simulations of important genomic regions such as CpG islands and disease-related repeats.  相似文献   

10.
It has been known since the development of nearest neighbor analysis that the frequency of the dinucleotide CpG is markedly suppressed in vertebrate DNA (i.e. less than %C x %G). This suppression appears to be heterogeneous since it was shown some years ago that three vertebrate tRNA genes did not exhibit CpG suppression. We have analyzed 13 different human tRNA genes and found that they also do not exhibit CpG suppression. Because CpG suppression has been linked, to some extent at least, to the methylation-deamination process by which a methylated CpG is mutated to TpG, we investigated whether the lack of suppression of CpG in tRNAs could originate from an absence of methylation. Three human tRNA genes were selected from Genbank (lysine, Proline, and Phenylalanine) and examined for methylation at HpaII sites by polymerase chain reaction (PCR) and Southern blot analysis. The observed patterns were consistent with the absence of methylation at the seven HpaII sites analyzed in and around the tRNA genes, and we predict that the remaining CpGs in these genes will be unmethylated. Since GC-rich promoter regions also escape CpG suppression and since they are generally unmethylated, avoidance of methylation may be a general explanation for the absence of CpG suppression in selected regions of vertebrate genomes.  相似文献   

11.
Interspersed repeats have emerged as a valuable tool for studying neutral patterns of molecular evolution. Here we analyze variation in the rate and pattern of nucleotide substitution across all autosomes in the chicken genome by comparing the present-day CR1 repeat sequences with their ancestral copies and reconstructing nucleotide substitutions with a maximum likelihood model. The results shed light on the origin and evolution of large-scale heterogeneity in GC content found in the genomes of birds and mammals--the isochore structure. In contrast to mammals, where GC content is becoming homogenized, heterogeneity in GC content is being reinforced in the chicken genome. This is also supported by patterns of substitution inferred from alignments of introns in chicken, turkey, and quail. Analysis of individual substitution frequencies is consistent with the biased gene conversion (BGC) model of isochore evolution, and it is likely that patterns of evolution in the chicken genome closely resemble those in the ancestral amniote genome, when it is inferred that isochores originated. Microchromosomes and distal regions of macrochromosomes are found to have elevated substitution rates and a more GC-biased pattern of nucleotide substitution. This can largely be accounted for by a strong correlation between GC content and the rate and pattern of substitution. The results suggest that an interaction between increased mutability at CpG motifs and fixation biases due to BGC could explain increased levels of divergence in GC-rich regions.  相似文献   

12.
DNA methylation and the frequency of CpG in animal DNA.   总被引:120,自引:35,他引:85       下载免费PDF全文
An analysis of nearest neighbour dinucleotide frequencies and the level of DNA methylation in animals strongly supports the suggestion that 5-methylcytosine (5mC) tends to mutate abnormally frequently to T. This tendency is the likely cause of the CpG deficiency in heavily methylated genomes.  相似文献   

13.
14.
Presence and role of cytosine methylation in DNA viruses of animals   总被引:1,自引:0,他引:1       下载免费PDF全文
Nucleotide composition varies greatly among DNA viruses of animals, yet the evolutionary pressures and biological mechanisms driving these patterns are unclear. One of the most striking discrepancies lies in the frequency of CpG (the dinucleotide CG, linked by a phosphate group), which is underrepresented in most small DNA viruses (those with genomes below 10 kb) but not in larger DNA viruses. Cytosine methylation might be partially responsible, but research on this topic has focused on a few virus groups. For several viruses that integrate their genome into the host genome, the methylation status during this stage has been studied extensively, and the relationship between methylation and viral-induced tumor formation has been examined carefully. However, for actively replicating viruses—particularly small DNA viruses—the methylation status of CpG motifs is rarely known and the effects on the viral life cycle are obscure. In vertebrate host genomes, most cytosines at CpG sites are methylated, which in vertebrates acts to regulate gene expression and facilitates the recognition of unmethylated, potentially pathogen-associated DNA. Here we briefly introduce cytosine methylation before reviewing what is currently known about CpG methylation in DNA viruses.  相似文献   

15.
Methylation pattern of mouse mitochondrial DNA.   总被引:3,自引:0,他引:3       下载免费PDF全文
The pattern of methylation of mouse mitochondrial DNA (mtDNA) was studied using several techniques. By employing a sensitive analytical procedure it was possible to show that this DNA contains the modified base 5-methylcytosine (m5Cyt). This residue occurred exclusively at the dinucleotide sequence CpG at a frequency of 3 to 5%. The pattern of methylation was further investigated by determining the state of methylation of several MspI (HpaII) sites. Different sites were found to be methylated to a different extent, implying that methylation of mtDNA is nonrandom. Based on the known base composition and nucleotide sequence of mouse mtDNA, the dinucleotide sequence CpG was found to be underrepresented in this DNA. The features of mtDNA methylation (CpG methylation, partial methylation of specific sites and CpG underrepresentation) are also characteristic of vertebrate nuclear DNA. This resemblance may reflect functional relationship between the mitochondrial and nuclear genomes.  相似文献   

16.
Human promoters divide into 2 classes, the low CpG (LCG) and the high CpG (HCG), based on their CpG dinucleotide content. The LCG class of promoters is hypermethylated and is associated with tissue-specific genes, whereas the HCG class is hypomethylated and associated with broadly expressed genes. By analyzing several chordate genomes separated for hundreds of millions of years, here we show that the divide between low CpG and high CpG promoters is conserved in several distantly related vertebrate taxa (including human, chicken, frog, lizard, and fish) but not in close invertebrate outgroups (sea squirts). Furthermore, LCG and HCG promoters are distinctively associated with tissue-specific and broadly expressed genes in these distantly related vertebrate taxa. Our results indicate that the function of DNA methylation on gene expression is conserved across these vertebrate taxa and suggest that the 2 classes of promoters have evolved early in vertebrate evolution, as a consequence of the advent of global DNA methylation.  相似文献   

17.
DNA methylation is one of the many hypotheses proposed to explain the observed deficiency in CpG dinucleotides in a variety of genomes covering a wide taxonomic distribution. Recent studies challenged the methylation hypothesis on empirical grounds. First, it cannot explain why the Mycoplasma genitalium genome exhibits strong CpG deficiency without DNA methylation. Second, it cannot explain the great variation in CpG deficiency between M. genitalium and M. pneumoniae that also does not have CpG-specific methyltransferase genes. I analyzed the genomic sequences of these Mycoplasma species together with the recently sequenced genomes of M. pulmonis, Ureaplasma urealyticum, and Staphylococcus aureus, and found the results fully compatible with the methylation hypothesis. In particular, I present compelling empirical evidence to support the following scenario. The common ancestor of the three Mycoplasma species has CpG-specific methyltransferases, and has evolved strong CpG deficiency as a result of the specific DNA methylation. Subsequently, this ancestral genome diverged into M. pulmonis and the common ancestor of M. pneumoniae and M. genitalium. M. pulmonis has retained methyltransferases and exhibits the strongest CpG deficiency. The common ancestor lost the methyltransferase gene and then diverged into M. genitalium and M. pneumoniae. M. genitalium and M. pneumoniae, after losing methylation activities, began to regain CpG dinucleotides through random mutation. M. genitalium evolved more slowly than M. pneumoniae, gained relatively fewer CpG dinucleotides, and is more CpG-deficient.  相似文献   

18.
Unmethylated CpG dinucleotide motifs in bacterial DNA, as well as oligodeoxynucleotides (ODN) containing these motifs, are potent stimuli for many host immunological responses. These CpG motifs may enhance host responses to bacterial infection and are being examined as immune activators for therapeutic applications in cancer, allergy/asthma, and infectious diseases. However, little attention has been given to processes that down-modulate this response. The iron-binding protein lactoferrin is present at mucosal surfaces and at sites of infection. Since lactoferrin is known to bind DNA, we tested the hypothesis that lactoferrin will bind CpG-containing ODN and modulate their biological activity. Physiological concentrations of lactoferrin (regardless of iron content) rapidly bound CpG ODN. The related iron-binding protein transferrin lacked this capacity. ODN binding by lactoferrin did not require the presence of CpG motifs and was calcium independent. The process was inhibited by high salt, and the highly cationic N-terminal sequence of lactoferrin (lactoferricin B) was equivalent to lactoferrin in its ODN-binding ability, suggesting that ODN binding by lactoferrin occurs via charge-charge interaction. Heparin and bacterial LPS, known to bind to the lactoferricin component of lactoferrin, also inhibited ODN binding. Lactoferrin and lactoferricin B, but not transferrin, inhibited CpG ODN stimulation of CD86 expression in the human Ramos B cell line and decreased cellular uptake of ODN, a process required for CpG bioactivity. Lactoferrin binding of CpG-containing ODN may serve to modulate and terminate host response to these potent immunostimulatory molecules at mucosal surfaces and sites of bacterial infection.  相似文献   

19.
Different DNA motifs are required for optimal stimulation of mouse and human immune cells by CpG oligodeoxynucleotides (ODN). These species differences presumably reflect sequence differences in TLR9, the CpG DNA receptor. In this study, we show that this sequence specificity is restricted to phosphorothioate (PS)-modified ODN and is not observed when a natural phosphodiester backbone is used. Thus, human and mouse cells have not evolved to recognize different CpG motifs in natural DNA. Nonoptimal PS-ODN (i.e., mouse CpG motif on human cells and vice versa) gave delayed and less sustained phosphorylation of p38 MAPK than optimal motifs. When the CpG dinucleotide was inverted to GC in each ODN, some residual activity of the PS-ODN was retained in a species-specific, TLR-9-dependent manner. Thus, TLR9 may be responsible for mediating many published CpG-independent responses to PS-ODN.  相似文献   

20.
Eukaryotic DNA methylation   总被引:24,自引:0,他引:24  
  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号