首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 78 毫秒
1.
2.
Lobry JR  Sueoka N 《Genome biology》2002,3(10):research0058.1-research005814

Background

When there are no strand-specific biases in mutation and selection rates (that is, in the substitution rates) between the two strands of DNA, the average nucleotide composition is theoretically expected to be A = T and G = C within each strand. Deviations from these equalities are therefore evidence for an asymmetry in selection and/or mutation between the two strands. By focusing on weakly selected regions that could be oriented with respect to replication in 43 out of 51 completely sequenced bacterial chromosomes, we have been able to detect asymmetric directional mutation pressures.

Results

Most of the 43 chromosomes were found to be relatively enriched in G over C and T over A, and slightly depleted in G+C, in their weakly selected positions (intergenic regions and third codon positions) in the leading strand compared with the lagging strand. Deviations from A = T and G = C were highly correlated between third codon positions and intergenic regions, with a lower degree of deviation in intergenic regions, and were not correlated with overall genomic G+C content.

Conclusions

During the course of bacterial chromosome evolution, the effects of asymmetric directional mutation pressures are commonly observed in weakly selected positions. The degree of deviation from equality is highly variable among species, and within species is higher in third codon positions than in intergenic regions. The orientation of these effects is almost universal and is compatible in most cases with the hypothesis of an excess of cytosine deamination in the single-stranded state during DNA replication. However, the variation in G+C content between species is influenced by factors other than asymmetric mutation pressure.
  相似文献   

3.
4.
From investigation of eight Flavobacterium sp. genes encoding enzyme proteins, it was found that six genes had nonstop frames (NSFs) on the antisense strands, and base sequences of the genes are mainly composed of repeating triplet sequence(s), 5'-GNC-3' (where G and C are guanine and cytosine, and N is either of the four bases), in the reading frames. Thus, we concluded that the biased nucleotide sequences on the sense strands produce NSFs on the corresponding antisense strands. Furthermore, from the precise alignments of both nucleotide and amino acid sequences of two related Flavobacterium sp. genes, nyIB and nyIB', it was found that base replacements might have occurred symmetrically in the codons. That is, transversions between G and C were observed at high frequencies at the first and third positions of codons, but not at the second positions. At the first position, AG base transitions were observed much more than similar CT transitions, whereas CT transitions were found at the third positions at a relatively high frequency. These suggest that symmetrical base replacements in codons might be the main contribution to evolution in Flavobacterium sp. genes.  相似文献   

5.
The GC contents of 2670 prokaryotic genomes that belong to diverse phylogenetic lineages were analyzed in this paper. These genomes had GC contents that ranged from 13.5% to 74.9%. We analyzed the distance of base frequencies at the three codon positions, codon frequencies, and amino acid compositions across genomes with respect to the differences in the GC content of these prokaryotic species. We found that although the phylogenetic lineages were remote among some species, a similar genomic GC content forced them to adopt similar base usage patterns at the three codon positions, codon usage patterns, and amino acid usage patterns. Our work demonstrates that in prokaryotic genomes: a) base usage, codon usage, and amino acid usage change with GC content with a linear correlation; b) the distance of each usage has a linear correlation with the GC content difference; and c) GC content is more essential than phylogenetic lineage in determining base usage, codon usage, and amino acid usage. This work is exceptional in that we adopted intuitively graphic methods for all analyses, and we used these analyses to examine as many as 2670 prokaryotes. We hope that this work is helpful for understanding common features in the organization of microbial genomes.  相似文献   

6.
Summary Unrelated organisms with DNA of extreme G + C content (25% or 70%) are found to share very specific patterns of nearest neighbour base doublet frequency in their DNAs. This is shown to be a result of restrictions on the extremity of amino acid composition in their proteins, combined with a maximisation of the use of one type of base pair in redundant codon positions. Inferences are made about the universal nature of the genetic code and the proportion of DNA used for specifying protein in different species. The composition of coding DNA strands in these organisms is also discussed.  相似文献   

7.
I have examined potential determinants of the asymmetric distribution of nucleotide sequences in the genome of Escherichia coli as cataloged in GenBank release 44. I have used the frequency of occurrence of all possible tetranucleotides in a given sequence catalog or derivative as a comparative measure of asymmetry. The GenBank-cataloged strand and its complement show statistically similar (not complementary) distributions. The distribution is statistically similar in comparisons between the protein coding subset and the total genome, the coding subset and selected non-coding genes, the coding subset and the remainder of the DNA, and the coding subset and stable RNA sequences. I have compared the distribution in the genome of E. coli with the distributions found in the cataloged genomes of Salmonella typhimurium, Bacillus subtilis, and of coliphages lambda and T7. The distribution summed in both strands of the cataloged DNA differs statistically only in comparisons with lytic bacteriophage T7 because only the two strands of T7 show statistically dissimilar distributions. Despite similarities in tetranucleotide distribution, the pattern of codon complementarity in B. subtilis is different than that documented for E. coli. Thus, sequence asymmetry does not seem related to specific DNA function or to documented similarities or differences in codon bias. The sequence asymmetry of the E. coli genome may thus reflect a hitherto unsuspected pattern impressed on both strands of DNA which is or can be packaged into bacterial genomes.  相似文献   

8.
The extent to which base composition and codon usage vary among RNA viruses, and the possible causes of this bias, is undetermined in most cases. A maximum-likelihood statistical method was used to test whether base composition and codon usage bias covary with arthropod association in the genus Flavivirus, a major source of disease in humans and animals. Flaviviruses are transmitted by mosquitoes, by ticks, or directly between vertebrate hosts. Those viruses associated with ticks were found to have a significantly lower G+C content than non-vector-borne flaviviruses and this difference was present throughout the genome at all amino acids and codon positions. In contrast, mosquito-borne viruses had an intermediate G+C content which was not significantly different from those of the other two groups. In addition, biases in dinucleotide and codon usage that were independent of base composition were detected in all flaviviruses, but these did not covary with arthropod association. However, the overall effect of these biases was slight, suggesting only weak selection at synonymous sites. A preliminary analysis of base composition, codon usage, and vector specificity in other RNA virus families also revealed a possible association between base composition and vector specificity, although with biases different from those seen in the Flavivirus genus. Received: 29 August 2000 / Accepted: 19 December 2000  相似文献   

9.
Q. Liu 《Plant biosystems》2013,147(1):100-106
Abstract

A comprehensive analysis of sequence patterns around the stop codons was performed, by using more than 26,000 rice full-length cDNA sequences. Here it is shown that the bias was most outstanding at the position immediately before the stop codons (?1 codon), where the AAC codon was strongly preferred among ANC codons. Compared with other positions, the codon immediately after the stop codons (+1 codon) also displayed an apparent difference, and had a strong consensus for base A at the first, C at the second, and A at the third letters, respectively. Notably, the base biases at the positions directly downstream of the stop codons, such as the +4, +5 and +6 positions, were much stronger than other positions in the 3′-UTR region, suggesting that those base positions might act as an extended stop signal in the process of protein synthesis. Examination of the relationship between sequence pattern and gene expression level, assessed by CAI values and EST counting, revealed a tendency towards bigger base biases for highly expressed genes. It could be inferred that the translation stop signal is possibly involved in many sequence recognition elements other than the stop codons; highly expressed genes should hold strong sequence consensus around the stop codons for efficient translation termination.  相似文献   

10.
11.
Codon contexts in enterobacterial and coliphage genes   总被引:6,自引:0,他引:6  
This investigation of the codon context of enterobacteria, plasmid, and phage protein genes was based on a search for correlations between the presence of one base type at codon position III and the presence of another base type at some other position in adjacent codons. Enterobacterial genes were compared with eukaryotic sequences for codon context effects. In enterobacterial genes, base usage at codon position III is correlated with the third position of the upstream adjacent codon and with all three positions of the downstream codon. Plasmid genes are free of context biases. Phage genes are heterogeneous: MS2 codons have no biased context, whereas lambda genes partly follow the trends of the host bacterium, and T7 genes have biased codon contexts that differ from those of the host. It has been reported that two successive third-codon positions tend to be occupied by two purines or two pyrimidines in Escherichia coli genes of low expression level. Here, the extent to which highly expressed protein genes can modulate base usage at two successive codon positions III, given the constraints on codon usage and protein sequence that act on them, was quantified. This demonstrates that the above-mentioned favored patterns are not a characteristic of weakly expressed genes but occur in all genes in which codon context can vary appreciably. The correlation between successive third-codon positions is a distinct feature of enterobacteria and of some phages, one that may result from adaptation of gene structure to translational efficiency. Conversely, codon context in yeast and human genes is biased--but for reasons unrelated to translation.   相似文献   

12.
Compositional distributions in the three codon positions of the coding sequences of 12 fully sequenced prokaryotic genomes, which are publicly available, were investigated. A universal compositional correlation was observed in most of the genomes under investigation irrespective of their overall genomic GC contents. In all the genomes, the GC contents at the first codon positions are always greater than the overall GC contents of the genomes whereas the reverse is true in the case of second codon positions. GC contents at the third codon positions are higher than the overall genomic GC contents in high GC containing genomes, and the opposite situation was found in case of low GC genomes except for Helicobacter pylori. In high-GC rich genomes, the GC contents at the first + second codon positions are less than the GC contents at the third codon positions, and they are low in low-GC genomes except for Helicobacter pylori. The distributions of four bases at the three different positions were also investigated for all 12 organisms. It was observed that in high-GC genomes G is the most dominant base and in low-GC genomes A is the most dominant base in the first codon positions. But purine bases, i.e., (A + G), predominantly occur in the first codon position. In the second codon position, A is the most dominant base in most of the organisms and G is the least dominant base in all the organisms. There is no unique regular pattern of individual bases at the third codon positions; however, there are significant differences in the occurrences of (G + C) contents in the third codon positions among the different organisms. Calculations of dinucleotide frequencies in 12 different organisms indicate that in GC-rich genomes GG, GC, CC, and CG dinucleotides are the most dominant whereas the reverse is true in case of low-GC genomes. Biological implications of these results are discussed in this paper.  相似文献   

13.
Synonymous codon usage of 53 protein coding genes in chloroplast genome of Coffea arabica was analyzed for the first time to find out the possible factors contributing codon bias. All preferred synonymous codons were found to use A/T ending codons as chloroplast genomes are rich in AT. No difference in preference for preferred codons was observed in any of the two strands, viz., leading and lagging strands. Complex correlations between total base compositions (A, T, G, C, GC) and silent base contents (A3, T3, G3, C3, GC3) revealed that compositional constraints played crucial role in shaping the codon usage pattern of C. arabica chloroplast genome. ENC Vs GC3 plot grouped majority of the analyzed genes on or just below the left side of the expected GC3 curve indicating the influence of base compositional constraints in regulating codon usage. But some of the genes lie distantly below the continuous curve confirmed the influence of some other factors on the codon usage across those genes. Influence of compositional constraints was further confirmed by correspondence analysis as axis 1 and 3 had significant correlations with silent base contents. Correlation of ENC with axis 1, 4 and CAI with 1, 2 prognosticated the minor influence of selection in nature but exact separation of highly and lowly expressed genes could not be seen. From the present study, we concluded that mutational pressure combined with weak selection influenced the pattern of synonymous codon usage across the genes in the chloroplast genomes of C. arabica.  相似文献   

14.
Codon usage in selected AT-rich bacteria   总被引:8,自引:0,他引:8  
H H Winkler  D O Wood 《Biochimie》1988,70(8):977-986
The relationship between DNA base composition and codon bias in very AT-rich bacteria was analyzed. Five clostridial genes, five mycoplasmal genes and three rickettsial genes constituted the data base. In the genes of these three organisms, the rule for codon bias was very simple: use U or A in the first and third positions of the codon when possible. This was contrasted with the bias found in Bacillus subtilis and Escherichia coli. The rule for Bacillus subtilis was equally straightforward: use all codons without bias. Only in E. coli, amongst the species examined, did the codon bias appear to be a complicated codon 'choice'.  相似文献   

15.
One of the main causes of bacterial chromosome asymmetry is replication-associated mutational pressure. Different rates of nucleotide substitution accumulation on leading and lagging strands implicate qualitative and quantitative differences in the accumulation of mutations in protein coding sequences lying on different DNA strands. We show that the divergence rate of orthologs situated on leading strands is lower than the divergence rate of those situated on lagging strands. The ratio of the mutation accumulation rate for sequences lying on lagging strands to that of sequences lying on leading strands is rather stable and time-independent. The divergence rate of sequences which changed their positions, with respect to the direction of replication fork movement, is not stable—sequences which have recently changed their positions are the most prone to mutation accumulation. This effect may influence estimations of evolutionary distances between species and the topology of phylogenetic trees. Received: 24 July 2000 / Accepted: 16 January 2001  相似文献   

16.
Adenine nucleotides have been found to appear preferentially in the regions after the initiation codons or before the termination codons of bacterial genes. Our previous experiments showed that AAA and AAT, the two most frequent second codons in Escherichia coli, significantly enhance translation efficiency. To determine whether such a characteristic feature of base frequencies exists in eukaryote genes, we performed a comparative analysis of the base biases at the gene terminal portions using the proteomes of seven eukaryotes. Here we show that the base appearance at the codon third positions of gene terminal regions is highly biased in eukaryote genomes, although the codon third positions are almost free from amino acid preference. The bias changes depending on its position in a gene, and is characteristic of each species. We also found that bias is most outstanding at the second codon, the codon after the initiation codon. NCN is preferred in every genome; in particular, GCG is strongly favored in human and plant genes. The presence of the bias implies that the base sequences at the second codon affect translation efficiency in eukaryotes as well as bacteria.  相似文献   

17.
Saccone C  Gissi C  Reyes A  Larizza A  Sbisà E  Pesole G 《Gene》2002,286(1):3-12
The mitochondrial genome (mtDNA), due to its peculiar features such as exclusive presence of orthologous genes, uniparental inheritance, lack of recombination, small size and constant gene content, certainly represents a major model system in studies on evolutionary genomics in metazoan. In 800 million years of evolution the gene content of metazoan mitochondrial genomes has remained practically frozen but several evolutionary processes have taken place. These processes, reviewed here, include rearrangements of gene order, changes in base composition and arising of compositional asymmetry between the two strands, variations in the genetic code and evolution of codon usage, lineage-specific nucleotide substitution rates and evolutionary patterns of mtDNA control regions.  相似文献   

18.
19.
The variation in base composition at the three codon sites in relation to gene expressivity, the latter estimated by the Codon Adaptation Index, has been studied in a sample of 1371 Escherichia coli genes. Correlation and regression analyses show that increasing expression levels are accompanied by higher frequencies of base G at first, of base A at second and of base C at third codon positions. However, correlation between expressivity and base compositional biases at each codon site was only significant and positive at first codon position. The preference for G-starting codons as gene expression level increases is discussed in terms of translational optimization.  相似文献   

20.
Chloroplast DNAs (ctDNA) from pea and corn plants were examined in the electron microscope for the presence of replicative intermediates. Pea and corn ctDNAs were each found to contain two displacement loops (D-loops). The D-loops were 820 (+/- 90) base pairs long in pea ctDNA and 860 (+/- 125) base pairs long in corn ctDNA. In each ctDNA, the two D-loops were located at positions that were 7100 +/- 240) base pairs apart. The displacing strands of the two D-loops were located on opposite strands of the parental DNA molecule and they were seen to expand toward each other. The D-loops in the ctDNA from pea and corn exhibited branch migration and thus were easily distinguished from the denatured regions that were also present in these closed circular ctDNAs. In addition, the positions of the D-loops were found to be distinct from the positions of the denaturation loops (Den-loops). The Den-loops were also shown to be located at AT-rich regions in these ctDNA molecules. D-loops and Den-loops were also found in the circular and catenated ctDNA oligomers from pea and corn plants. Mapping the positions of the D-loops relative to the positions of the Den-loops showed that the structure of the D-loop-containing region in the pea and corn ctDNAs has been conserved to a greater extent than the structure of the rest of the two ctDNA molecules.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号