Similar literature
 20 similar articles found
1.
Substitutions occurring in noncoding sequences of the plant chloroplast genome violate the independence of sites that is assumed by substitution models in molecular evolution. The probability that a substitution at a site is a transversion, as opposed to a transition, increases significantly with increasing A + T content of the two adjacent nucleotides. In the present study, this dependency of substitutions on local context is examined further in a number of noncoding regions from the chloroplast genome of members of the grass family (Poaceae). Two features were examined: the influence of specific neighboring bases, as opposed to the general A + T content, on transversion proportion, and an influence on substitutions by nucleotides other than the two immediately adjacent to the site of substitution. In both cases, a significant effect was found. In the case of specific nucleotides, transversion proportion is significantly higher at sites with a pyrimidine immediately 5′ on either strand. Substitutions at sites of the type YNR, where N is the site of substitution, have the highest rate of transversion. This specific effect is secondary to the A + T content effect such that, in terms of proportion of substitutions that are transversions, the nucleotides are ranked T > A > C > G as to their effect when they are immediately 5′ to the site of substitution. In the case of nucleotides other than the immediate neighbors, a significant influence on substitution dynamics is observed in the case where the two neighboring bases are both A and/or T. Thus, substitutions are primarily, but not exclusively, influenced by the composition of the two nucleotides that are immediately adjacent. These results indicate that the pattern of molecular evolution of the plant chloroplast genome is extremely complex as a result of a variety of inter-site dependencies. Received: 18 October 1996 / Accepted: 12 April 1997

2.
Zhao Z  Zhang F 《Genomics》2006,87(1):68-74
A genome-wide view of sequence mutability in mice is still limited, although biologists usually assume the same scenario for mice as for humans. In this study, we examined the sequence context in the local environment of 482,528 mouse single nucleotide polymorphisms (SNPs). We found that CpG-containing short sequences, in general, had more representation in the local sequences of SNPs compared to the genome sequences. The extent of this overrepresentation was stronger in mice than in humans, which is inconsistent with previous observations of the weaker neighboring-nucleotide biases on mouse SNPs. To exclude the CpG effect, we compared the distribution patterns of short sequences among the six categories of SNPs. The results revealed an even stronger pattern in the CpG-containing group for C/G substitutions than for A/G or C/T substitutions. We next performed the first genome-wide sequence context analysis of SNPs in the mouse CpG islands. SNPs occurring at CpG sites were 3.14-fold less prevalent than expected, suggesting the suppression of methylation-dependent deamination in the CpG islands. The extent of this suppression was less in mice than in humans. Finally, compared with humans, the observations of a greater deficit of CpG dinucleotides, a stronger overrepresentation of CpG-containing n-mers surrounding the polymorphic sites, and a higher SNP/genome ratio of CpG dinucleotides in the mouse genome support the "loss of CpG islands" model in the mouse lineage.

3.
DNA composition dynamics across genomes of diverse taxonomy is a major subject of genome analyses. DNA composition changes are characteristics of both replication and repair machineries. We investigated 3,611,007 single nucleotide polymorphisms (SNPs) generated by comparing two sequenced rice genomes from distant inbred lines (subspecies), including those from 242,811 introns and 45,462 protein-coding sequences (CDSs). Neighboring-nucleotide effects (NNEs) of these SNPs are diverse, depending on structural content-based classifications (genomewide, intronic, and CDS) and sequence context-based categories (A/C, A/G, A/T, C/G, C/T, and G/T substitutions) of the analyzed SNPs. Strong and evident NNEs and nucleotide proportion biases surrounding the analyzed SNPs were observed in 1-3 bp sequences on both sides of an SNP. Strong biases were observed around neighboring nucleotides of protein-coding SNPs, which exhibit a periodicity of three in nucleotide content, constrained by a combined effect of codon-related rules and DNA repair mechanisms. Unlike a previous finding in the human genome, we found negative correlation between GC contents of chromosomes and the magnitude of corresponding bias of nucleotide C at -1 site and G at +1 site. These results will further our understanding of the mutation mechanism in rice as well as its evolutionary implications.

4.
5.
Regional biases in substitution pattern are likely to be responsible for the large-scale variation in base composition observed in vertebrate genomes. However, the evolutionary forces responsible for these biases are still not clearly defined. In order to study the processes of mutation and fixation across the entire human genome, we analyzed patterns of substitution in Alu repeats since their insertion. We also studied patterns of human polymorphism within the repeats. There is a highly significant effect of recombination rate on the pattern of substitution, whereas no such effect is seen on the pattern of polymorphism. These results suggest that regional biases in substitution are caused by biased gene conversion, a process that increases the probability of fixation of mutations that increase GC content. Furthermore, the strongest correlate of substitution patterns is found to be male recombination rates rather than female or sex-averaged recombination rates. This indicates that in addition to sexual dimorphism in recombination rates, the sexes also differ in the relative rates of crossover and gene conversion.

6.
Models of amino acid substitution were developed and compared using maximum likelihood. Two kinds of models are considered. "Empirical" models do not explicitly consider factors that shape protein evolution, but attempt to summarize the substitution pattern from large quantities of real data. "Mechanistic" models are formulated at the codon level and separate mutational biases at the nucleotide level from selective constraints at the amino acid level. They account for features of sequence evolution, such as transition-transversion bias and base or codon frequency biases, and make use of physicochemical distances between amino acids to specify nonsynonymous substitution rates. A general approach is presented that transforms a Markov model of codon substitution into a model of amino acid replacement. Protein sequences from the entire mitochondrial genomes of 20 mammalian species were analyzed using different models. The mechanistic models were found to fit the data better than empirical models derived from large databases. Both the mutational distance between amino acids (determined by the genetic code and mutational biases such as the transition-transversion bias) and the physicochemical distance are found to have strong effects on amino acid substitution rates. A significant proportion of amino acid substitutions appeared to have involved more than one codon position, indicating that nucleotide substitutions at neighboring sites may be correlated. Rates of amino acid substitution were found to be highly variable among sites.

7.
It is well known that, owing to the degeneracy of the genetic code, most silent substitutions appear in the third codon position, so the mutation frequency of the third codon position is much higher than that of the first two positions. However, it remains unknown whether the directionality of point mutation is similar across the three codon positions. In this paper, through analyzing 15 sets of orthologous genes, it is revealed that most of the substitution types are significantly different between any two codon positions, especially between the 2nd and the 3rd phases. Furthermore, the average frequencies of each type of substitution calculated from the fifteen sets of orthologous genes are similar to those identified in single nucleotide polymorphisms (SNPs) of the human and mouse genomes. The present analyses suggest that nucleotide substitution in protein-coding sequences is not only context-dependent (the so-called neighboring-nucleotide effects), but also phase-dependent, which is of significance for improving the prevalent nucleotide-evolution models.
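
The phase dependence described in this abstract can be illustrated with a minimal sketch. It is not the authors' method: it assumes two aligned, ungapped, in-frame coding sequences, tallies unordered substitution types (for example `A/G`) rather than directional ones, and the function name `substitutions_by_codon_position` is hypothetical.

```python
from collections import Counter


def substitutions_by_codon_position(seq_a, seq_b):
    """Tally unordered substitution types per codon position (1, 2, 3).

    Assumes seq_a and seq_b are aligned, ungapped, and start in frame;
    sites containing anything other than A, C, G, T are skipped.
    """
    counts = {1: Counter(), 2: Counter(), 3: Counter()}
    for i, (a, b) in enumerate(zip(seq_a.upper(), seq_b.upper())):
        if a == b or a not in "ACGT" or b not in "ACGT":
            continue
        position = i % 3 + 1  # codon position (phase) of this site
        counts[position]["/".join(sorted((a, b)))] += 1
    return counts


if __name__ == "__main__":
    result = substitutions_by_codon_position("ATGGCC", "ATAGTC")
    for position, tally in result.items():
        print(position, dict(tally))
```

Comparing the three tallies, for instance with a contingency-table test, would be one way to ask whether the substitution spectrum differs between codon positions. Inferring directionality would additionally require an outgroup or a reconstructed ancestral sequence.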

8.
A 3.1-kb intergenic DNA fragment located between the psi beta-globin and delta-globin genes in the beta-globin gene cluster was cloned from gorilla, orangutan, rhesus monkey, and spider monkey, and the nucleotide sequence of each fragment was determined. The phylogeny of these four sequences, together with two previously published allelic sequences from humans and one from chimpanzee, was constructed, and the accumulation of mutations in the region was analyzed. The sites of base substitutions are not evenly distributed within the region: two Alu repeats have accumulated 0.21 ± 0.02 substitutions/site, compared with 0.15 ± 0.008 substitutions/site in the remainder of the fragment. The occurrence of substitutions at neighboring sites is more frequent than would be expected if they were independent. The observed excesses disappear when ancestral -CG- dinucleotide sites are excluded. The phylogenetic relationships of the sequences indicate that the human sequence shares a most recent coancestor with the chimpanzee sequence. The data also show that great apes have accumulated fewer mutations in this part of the genome than has the rhesus monkey. The relative rates of accumulation of 12 kinds of nucleotide substitution in the region during primate evolution are asymmetric in the DNA strands. From these rates of accumulation, the origin of a simple stretch of sequence near the 3' end of the 3.1-kb fragment was deduced to be a sequence comprising 50% T and 50% C on one strand. The two oppositely oriented Alu sequences in the 3.1-kb region were inserted at their present positions before the divergence of the New-World monkeys from other lineages. Our analysis shows that the nucleotide sequences of the two Alu repeats in spider monkey are unexpectedly similar both to each other and to the deduced ancestral sequence of Alu repeats. The data suggest that there has been some type of recombinational event between the spider monkey Alu repeats but that it was not a simple gene conversion.

9.
Zea ribosomal repeat evolution and substitution patterns   Total citations: 2, self-citations: 1, citations by others: 1
Zea and Tripsacum nuclear ribosomal internal transcribed spacer (ITS) sequences were used to evaluate patterns of concerted evolution, rates of substitutions, patterns of methylation-induced deamination, and structural constraints of the ITS. ITS pseudogenes were identified by their phylogenetic position, differences in nucleotide composition, extensive deamination at ancestral methylation sites, and substitutions resulting in low-stability secondary RNA structures. Selection was important in shaping the kinds of polymorphisms and substitutions observed in the ITS. ITS substitution rates were significantly different among the Zea taxa. Deamination of cytosines at methylation sites was a potent mutation source, but selection appeared to maintain high methylation site density throughout the ribosomal repeat except for the gene promoter. Nucleotide divergence statistics identified selectively constrained regions at the 5' ends of the ITS1 and ITS2.

10.
11.
It has been known that in noncoding regions of the chloroplast genome, the pattern of nucleotide substitution is influenced by the two nucleotides flanking the substitution site. In a GC-rich environment, a bias toward transition was observed, whereas in an AT-rich environment, a bias toward transversion was observed. In this study, the influence of the two adjacent neighbors on the substitution pattern was observed in the first intron of the mitochondrial nad4 gene, although the AT content of this intron is only 48%. The proportion of transversions increases from 0.32 to 0.75 as the A + T content (number of A's + T's) of the two nearest neighbors increases from 0 to 2. This trend was also observed in another mitochondrial group I intron with an AT content of 64%. In addition, a similar, though weaker, effect was observed in vertebrate pseudogenes. So this effect is present in all three types of genomes. Furthermore, in contrast to the situation in the noncoding regions of chloroplast DNA, where most nucleotide substitutions occurred in the categories with an A + T content of either 1 or 2, nucleotide substitutions in the mitochondrial first nad4 intron occurred more evenly in three categories of different A + T contents. This might be due largely to the difference in the AT content (0.48 vs. 0.72) between the mitochondrial first nad4 intron and the chloroplast DNA regions studied.
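
The tabulation this abstract describes, the proportion of transversions grouped by the A + T content (0, 1, or 2) of the two flanking nucleotides, can be sketched as follows. This is an illustrative sketch only, not the study's pipeline: it assumes a pair of aligned, ungapped sequences in which one is treated as ancestral, takes the flanking bases from that ancestral sequence, and uses hypothetical function names throughout.

```python
from collections import defaultdict

PURINES = {"A", "G"}


def is_transversion(a, b):
    """A purine <-> pyrimidine change is a transversion; otherwise a transition."""
    return (a in PURINES) != (b in PURINES)


def flank_at_content(left, right):
    """Number of A's + T's among the two nearest neighbours (0, 1, or 2)."""
    return sum(base in "AT" for base in (left, right))


def transversion_proportion_by_context(ancestral, derived):
    """Proportion of substitutions that are transversions, per flank A+T class.

    Assumes both sequences are aligned and ungapped; the flanking bases are
    read from `ancestral`. Sites with non-ACGT characters are skipped.
    """
    counts = defaultdict(lambda: [0, 0])  # context -> [transversions, total]
    for i in range(1, min(len(ancestral), len(derived)) - 1):
        a, d = ancestral[i], derived[i]
        left, right = ancestral[i - 1], ancestral[i + 1]
        if a == d or any(base not in "ACGT" for base in (a, d, left, right)):
            continue
        context = flank_at_content(left, right)
        counts[context][1] += 1
        if is_transversion(a, d):
            counts[context][0] += 1
    return {ctx: tv / total for ctx, (tv, total) in sorted(counts.items())}


if __name__ == "__main__":
    print(transversion_proportion_by_context("GACGTATAC", "GGCGTTTAC"))
```

With real data, the per-class proportions would be the quantities compared against figures such as the 0.32 to 0.75 range reported above. The first and last alignment columns are ignored here because they lack one neighbour.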

12.
13.
14.
M D Been  T R Cech 《Cell》1987,50(6):951-961
Circularization and reverse circularization of the Tetrahymena thermophila rRNA intervening sequence resemble the first and second steps in splicing, respectively. However, site-specific base substitutions show that different nucleotides are involved in selection of the 5' splice site and the circularization sites. Furthermore, a substitution at the major circularization site that prevents circularization can be suppressed by second substitutions at two different nucleotide positions. A model is proposed in which adjacent and overlapping sequences can function as a binding site, forming a short duplex with the sequence at the circularization site and thus directing circularization and reverse circularization. Because the 5' exon-binding site and three potential circularization binding sites fall within a contiguous eight nucleotide region, this sequence may translocate relative to the catalytic core of the ribozyme in a template-like manner.

15.
Interspersed repeats have emerged as a valuable tool for studying neutral patterns of molecular evolution. Here we analyze variation in the rate and pattern of nucleotide substitution across all autosomes in the chicken genome by comparing the present-day CR1 repeat sequences with their ancestral copies and reconstructing nucleotide substitutions with a maximum likelihood model. The results shed light on the origin and evolution of large-scale heterogeneity in GC content found in the genomes of birds and mammals (the isochore structure). In contrast to mammals, where GC content is becoming homogenized, heterogeneity in GC content is being reinforced in the chicken genome. This is also supported by patterns of substitution inferred from alignments of introns in chicken, turkey, and quail. Analysis of individual substitution frequencies is consistent with the biased gene conversion (BGC) model of isochore evolution, and it is likely that patterns of evolution in the chicken genome closely resemble those in the ancestral amniote genome, when it is inferred that isochores originated. Microchromosomes and distal regions of macrochromosomes are found to have elevated substitution rates and a more GC-biased pattern of nucleotide substitution. This can largely be accounted for by a strong correlation between GC content and the rate and pattern of substitution. The results suggest that an interaction between increased mutability at CpG motifs and fixation biases due to BGC could explain increased levels of divergence in GC-rich regions.

16.
Castle JC 《PloS one》2011,6(6):e20660
Rates of SNPs (single nucleotide polymorphisms) and cross-species genomic sequence conservation reflect intra- and inter-species variation, respectively. Here, I report SNP rates and genomic sequence conservation adjacent to mRNA processing regions and show that, as expected, more SNPs occur in less conserved regions and that functional regions have fewer SNPs. Results are confirmed using both mouse and human data. Regions include protein start codons, 3' splice sites, 5' splice sites, protein stop codons, predicted miRNA binding sites, and polyadenylation sites. Throughout, SNP rates are lower and conservation is higher at regulatory sites. Within coding regions, SNP rates are highest and conservation is lowest at codon position three and the fewest SNPs are found at codon position two, reflecting codon degeneracy for amino acid encoding. Exon splice sites show high conservation and very low SNP rates, reflecting both splicing signals and protein coding. Relaxed constraint on the codon third position is dramatically seen when separating exonic SNP rates based on intron phase. At polyadenylation sites, a peak of conservation and low SNP rate occurs from 30 to 17 nt preceding the site. This region is highly enriched for the sequence AAUAAA, reflecting the location of the conserved polyA signal. miRNA 3' UTR target sites are predicted incorporating interspecies genomic sequence conservation; SNP rates are low in these sites, again showing fewer SNPs in conserved regions. Together, these results confirm that SNPs, reflecting recent genetic variation, occur more frequently in regions with less evolutionary conservation.

17.
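Tang Ping  Wang Qiang  Chen Jianqun 《遗传》 (Hereditas (Beijing)) 2008,30(11):1506-1512
Abstract: Nucleotide substitutions and indels (insertions and deletions, collectively) are important driving forces of evolution. Taking plants of the Solanaceae as the study system, we examined how indels and nucleotide substitutions arose in the chloroplast genome during the divergence of tomato and potato in the genus Solanum, and of Nicotiana tomentosiformis and common tobacco in the genus Nicotiana, as well as the effects of these two kinds of mutation on the genome. The results show that neither indels nor nucleotide substitutions occur at random. Indels occur in A+T-rich regions; 1-bp indels account for more than 30% of the total, and most indels are short fragments of less than 10 bp. Nucleotide substitutions show a Ts (transition)/Tv (transversion) bias, yet the frequencies of T→G and A→C transversions are markedly elevated. The Ts/Tv ratio is genus-specific: the Ts/Tv ratio of substitutions in the tomato and potato comparison is lower than that in the comparison of Nicotiana tomentosiformis and common tobacco. The (A+T)/(G+C) ratio of substitutions differs somewhat among species and thereby affects the (G+C)% of the genome; differences in this ratio are related to some extent to the growth habits of the species concerned.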

18.
Mononucleotide repeats (MNRs) are abundant in eukaryotic genomes and exhibit a high degree of length variability due to insertion and deletion events. However, the relationship between these repeats and mutation rates in surrounding sequences has not been systematically investigated. We have analyzed the frequency of single nucleotide polymorphisms (SNPs) at positions close to and within MNRs in the human genome. Overall, we find a 2- to 4-fold increase in the SNP frequency at positions immediately adjacent to the boundaries of MNRs, relative to that at more distant bases. This relationship exhibits a strong asymmetry between 3' and 5' ends of repeat tracts and is dependent upon the repeat motif, length and orientation of surrounding repeats. Our analysis suggests that the incorporation or exclusion of bases adjacent to the boundary of the repeat through substitutions, in which these nucleotides mutate towards or away from the base present within the repeat, respectively, may be another mechanism by which MNRs expand and contract in the human genome.

19.
20.
Genomic parasites have evolved distinctive lifestyles to optimize replication in the context of the genomes they inhabit. Here, we introduced new DNA into eukaryotic cells using bacteriophage Mu DNA transposition complexes, termed ‘transpososomes’. Following electroporation of transpososomes and selection for marker gene expression, efficient integration was verified in yeast, mouse and human genomes. Although Mu has evolved in prokaryotes, strong biases were seen in the target site distributions in eukaryotic genomes, and these biases differed between yeast and mammals. In Saccharomyces cerevisiae transposons accumulated outside of genes, consistent with selection against gene disruption. In mouse and human cells, transposons accumulated within genes, which previous work suggests is a favorable location for efficient expression of selectable markers. Naturally occurring transposons and viruses in yeast and mammals show related, but more extreme, targeting biases, suggesting that they are responding to the same pressures. These data help clarify the constraints exerted by genome structure on genomic parasites, and illustrate the wide utility of the Mu transpososome technology for gene transfer in eukaryotic cells.
