首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
2.
Despite the degeneracy of the genetic code, whereby different codons encode the same amino acid, alternative codons and amino acids are utilized nonrandomly within and between genomes. Such biases in codon and amino acid usage have been demonstrated extensively in prokaryote genomes and likely reflect a balance between the action of mutation, selection, and genetic drift. Here, we quantify the effects of selection and mutation drift as causes of codon and amino acid-usage bias in a large collection of nematode partial genomes from 37 species spanning approximately 700 Myr of evolution, as inferred from expressed sequence tag (EST) measures of gene expression and from base composition variation. Average G + C content at silent sites among these taxa ranges from 10% to 63%, and EST counts range more than 100-fold, underlying marked differences between the identities of major codons and optimal codons for a given species as well as influencing patterns of amino acid abundance among taxa. Few species in our sample demonstrate a dominant role of selection in shaping intragenomic codon-usage biases, and these are principally free living rather than parasitic nematodes. This suggests that deviations in effective population size among species, with small effective sizes among parasites, are partly responsible for species differences in the extent to which selection shapes patterns of codon usage. Nevertheless, a consensus set of optimal codons emerges that is common to most taxa, indicating that, with some notable exceptions, selection for translational efficiency and accuracy favors similar sets of codons regardless of the major codon-usage trends defined by base compositional properties of individual nematode genomes.  相似文献   

3.
A significant problem in biological motif analysis arises when the background symbol distribution is biased (e.g. high/low GC content in the case of DNA sequences). This can lead to overestimation of the amount of information encoded in a motif. A motif can be depicted as a signal using information theory (IT). We apply two concepts from IT, distortion and patterned interference (a type of noise), to model genomic and codon bias respectively. This modeling approach allows us to correct a raw signal to recover signals that are weakened by compositional bias. The corrected signal is more likely to be discriminated from a biased background by a macromolecule. We apply this correction technique to recover ribosome-binding site (RBS) signals from available sequenced and annotated prokaryotic genomes having diverse compositional biases. We observed that linear correction was sufficient for recovering signals even at the extremes of these biases. Further comparative genomics studies were made possible upon correction of these signals. We find that the average Euclidian distance between RBS signal frequency matrices of different genomes can be significantly reduced by using the correction technique. Within this reduced average distance, we can find examples of class-specific RBS signals. Our results have implications for motif-based prediction, particularly with regards to the estimation of reliable inter-genomic model parameters.  相似文献   

4.
The evolution of isochores: evidence from SNP frequency distributions   总被引:4,自引:0,他引:4  
Lercher MJ  Smith NG  Eyre-Walker A  Hurst LD 《Genetics》2002,162(4):1805-1810
The large-scale systematic variation in nucleotide composition along mammalian and avian genomes has been a focus of the debate between neutralist and selectionist views of molecular evolution. Here we test whether the compositional variation is due to mutation bias using two new tests, which do not assume compositional equilibrium. In the first test we assume a standard population genetics model, but in the second we make no assumptions about the underlying population genetics. We apply the tests to single-nucleotide polymorphism data from noncoding regions of the human genome. Both models of neutral mutation bias fit the frequency distributions of SNPs segregating in low- and medium-GC-content regions of the genome adequately, although both suggest compositional nonequilibrium. However, neither model fits the frequency distribution of SNPs from the high-GC-content regions. In contrast, a simple population genetics model that incorporates selection or biased gene conversion cannot be rejected. The results suggest that mutation biases are not solely responsible for the compositional biases found in noncoding regions.  相似文献   

5.
Membrane spanning regions can be used as markers for studying the robustness of biologically important units of proteins against evolutionary change (R. Sawada and S. Mitaku, Genes to Cells, 2010). We carried out computational experiments of extensive DNA mutations on the assumption of constant GC content or constant codon positional nucleotide biases. Randomized sequences were evaluated by membrane protein prediction systems SOSUI and SOSUIsignal. When all amino acid sequences from the total real genomes of 538 prokaryotes were analysed, ratios of membrane proteins to all genes in the total genomes were almost constant around a ratio of 22% with a standard deviation of 1.56. When the nucleotide sequences were randomized, keeping only the GC contents constant, the ratios of membrane proteins became highly diverse with a standard deviation of 10.1. When the codon positional nucleotide biases were taken into account; however, the diverse ratios of membrane proteins converged to a value of ~25% with a standard deviation of 3.55. These results suggest that codon compositional biases play an important role in the evolution of prokaryotes for maintaining a constant ratio of membrane proteins. Further detailed analysis suggested that non-uniform nucleotide compositional biases at the terminal regions are the reason for the small but significant deviation.  相似文献   

6.
The vertebrate genome: isochores and evolution   总被引:18,自引:6,他引:12  
  相似文献   

7.
Minimal absent words have been computed in genomes of organisms from all domains of life. Here, we explore different sets of minimal absent words in the genomes of 22 organisms (one archaeota, thirteen bacteria and eight eukaryotes). We investigate if the mutational biases that may explain the deficit of the shortest absent words in vertebrates are also pervasive in other absent words, namely in minimal absent words, as well as to other organisms. We find that the compositional biases observed for the shortest absent words in vertebrates are not uniform throughout different sets of minimal absent words. We further investigate the hypothesis of the inheritance of minimal absent words through common ancestry from the similarity in dinucleotide relative abundances of different sets of minimal absent words, and find that this inheritance may be exclusive to vertebrates.  相似文献   

8.
Most prokaryotic genomes display strand compositional asymmetries, but the reasons for these biases remain unclear. When the distribution of gene orientation is biased, as it often is, this may induce a bias in composition, as codon frequencies are not identical. We show here that this effect can be estimated and removed, and that the residual base skews are the highest at third base codon positions and lower at first and second positions. This strongly suggests that compositional asymmetries result from 1) a replication-related mutational bias that is filtered through selective pressure and/or from 2) an uneven distribution of gene orientation. In most cases, the mutational bias alters the codon usage and amino acid frequencies of the leading and the lagging strand. However, these features are not ubiquitous amongst prokaryotes, and the biological reasons for them remain to be found.  相似文献   

9.
Summary The evolution of vertebrate genomes can be investigated by analyzing their regional compositional patterns, namely the compositional distributions of large DNA fragments (in the 30–100-kb size range), of coding sequences, and of their different codon positions. This approach has shown the existence of two evolutionary modes. In the conservative mode, compositional patterns are maintained over long times (many million years), in spite of the accumulation of enormous numbers of base substitutions. In the transitional, or shifting, mode, compositional patterns change into new ones over much shorter times.The conservation of compositional patterns, which has been investigated in mammalian genomes, appears to be due in part to some measure of compositional conservation in the base substitution process, and in part to negative selection acting at regional (isochore) levels in the genome and eliminating deviations from a narrow range of values, presumably corresponding to optimal functional properties. On the other hand, shifts of compositional patterns, such as those that occurred between cold-blooded and warm-blooded vertebrates, appear to be due essentially to both negative and positive selection again operating at the isochore level, largely under the influence of changes in environmental conditions, and possibly taking advantage of mutational biases in the replication/repair enzymes and/or in the enzyme make-up of nucleotide precursor pools. Other events (like translocations and changes in chromosomal structure) also play a role in the transitional mode of genome evolution.The present findings (1) indicate that isochores, which correspond to the DNA segments of individual or contiguous chromatin domains, represent selection units in the vertebrate genome; and (2) shed new light on the selectionist-neutralist controversy.This work was presented at the EMBO Workshop on Evolution (Cambridge, UK, 4–6 July 1988) and at the 16th International Congress of Genetics (Toronto, Canada, 20–27 August 1988)  相似文献   

10.
Genome-scale phylogeny and the detection of systematic biases   总被引:17,自引:0,他引:17  
Phylogenetic inference from sequences can be misled by both sampling (stochastic) error and systematic error (nonhistorical signals where reality differs from our simplified models). A recent study of eight yeast species using 106 concatenated genes from complete genomes showed that even small internal edges of a tree received 100% bootstrap support. This effective negation of stochastic error from large data sets is important, but longer sequences exacerbate the potential for biases (systematic error) to be positively misleading. Indeed, when we analyzed the same data set using minimum evolution optimality criteria, an alternative tree received 100% bootstrap support. We identified a compositional bias as responsible for this inconsistency and showed that it is reduced effectively by coding the nucleotides as purines and pyrimidines (RY-coding), reinforcing the original tree. Thus, a comprehensive exploration of potential systematic biases is still required, even though genome-scale data sets greatly reduce sampling error.  相似文献   

11.
The genomes of the ancestors of mammals and birds underwent a compositional change in which the gene-richest regions increased their GC levels. Here we investigated this compositional transition by analyzing the levels of G and C in third codon positions, as well as the codon frequencies of orthologous genes from human, chicken and Xenopus. The results may be summed up as follows: (i) GC-poor genes, that did not undergo the compositional transition, showed only minor differences in orthologous sets from Xenopus, human and chicken; this is remarkable in view of the very many nucleotide substitutions that occurred over the long evolutionary times separating these species; (ii) GC-rich genes, that underwent the compositional transition, showed large differences between Xenopus and warm-blooded vertebrates, but not between chicken and human. In other words, the independent changes that occurred in avian and mammalian genes, on the average, were the same.  相似文献   

12.
Costantini M  Bernardi G 《Gene》2008,410(2):241-248
Many years ago compositional correlations were found to hold between coding and contiguous non-coding sequences. These correlations were essentially studied in whole genomes of mammals, which are characterized by strong compositional heterogeneities. Here we investigated whether these correlations also hold within the much more homogeneous isochore families. This point was checked not only in the case of mammals, but also in that of phylogenetically distant vertebrates, which are characterized by very different compositional patterns. Indeed, these are remarkably different in cold- and warm-blooded vertebrates. Fish genomes, for instance, are much more homogeneous than those of mammals and birds. The compositional correlations between coding sequences and the corresponding introns, or their 5′ and 3′ flanking regions, were studied in the isochore families of the fully sequenced genomes from four fishes (Brachydanio rerio, Oryzias latipes, Gasterosteus aculeatus and Tetraodon nigroviridis), human and chicken.  相似文献   

13.
Many bacterial genomes are under asymmetric mutational pressure which introduces compositional asymmetry into DNA molecule resulting in many biases in coding structure of chromosomes. One of the processes affected by the asymmetry is translocation changing the position of the coding sequence on chromosome in respect to the orientation on the leading and lagging DNA strand. When analysing sets of paralogs in 50 genomes, we found that the number of observed genes which switched their positions on DNA strand is lowest for genomes with the highest DNA asymmetry. However, the number of orthologs which changed DNA strand increases with the phylogenetic distance between the compared genomes. Nevertheless, there is a fraction of coding sequences that stay on the leading strand in all analysed genomes, whereas there are no sequences that stay always on the lagging strand. Since sequences diverge very fast after switching the DNA strand, this bias in mobility of sequences is responsible, in part, for higher divergence rates among some of coding sequences located on the lagging DNA strand.  相似文献   

14.
The compositional distributions of large (main-band) DNA fragments from eight birds belonging to eight different orders (including both paleognathous and neognathous species) are very broad and extremely close to each other. These findings, which are paralleled by the compositional similarity of homologous coding sequences and their codon positions, support the idea that birds are a monophyletic group.The compositional distribution of third-codon positions of genes from chicken, the only avian species for which a relatively large number of coding sequences is known, is very broad and bimodal, the minor GC-richer peak reaching 100% GC. The very high compositional heterogeneity of avian genomes is accompanied (as in the case of mammalian genomes) by a very high speciation rate compared to cold-blooded vertebrates which are characterized by genomes that are much less heterogeneous. The higher GC levels attained by avian compared to mammalian genomes might be correlated with the higher body temperature (41–43°C) of birds compared to mammals (37°C).A comparison of GC levels of coding sequences and codon positions from man and chicken revealed very close average GC levels and standard deviations. Homologous coding sequences and codon positions from man and chicken showed a surprisingly high degree of compositional similarity which was, however, higher for GC-poor than for GC-rich sequences. This indicates that GC-poor isochores of warm-blooded vertebrates reflect the composition of the isochores of the genome of the common reptilian ancestor of mammals and birds, which underwent only a small compositional change at the transition from cold- to warm-blooded vertebrates. In contrast, the GC-rich isochores of birds and mammals are the result of large compositional changes at the same evolutionary transition, where were in part different in the two classes of warm-blooded vertebrates.Correspondence to: G. Bernaadi  相似文献   

15.
The compositional distributions of large DNA fragments reflect those of the isochores that make up vertebrate genomes and can provide novel phylogenetic insights in the case of mammalian genomes (see Sabeur et al. 1993). This approach has been complemented here by an analysis of the compositional patterns of coding sequences and their codon positions (which also reflect the isochore pattern) and by a comparison of the base compositions of codon positions from homologous genes in a number of pairs of species. The results obtained using these two approaches support the existence of a general compositional pattern for mammalian genomes and of a distinct pattern for Myomorpha. The other two “special” patterns identified in a megachiropteran and in pangolin could not be tested here. Presented at the NATO Advanced Research Workshop onGenome Organization and Evolution, Spetsai, Greece, 16–22 September 1992  相似文献   

16.
17.
We describe a powerful new approach for discovering globally conserved regulatory elements between two genomes. The method is fast, simple and comprehensive, without requiring alignments. Its application to pairs of yeasts, worms, flies and mammals yields a large number of known and novel putative regulatory elements. Many of these are validated by independent biological observations, have spatial and/or orientation biases, are co-conserved with other elements and show surprising conservation across large phylogenetic distances.  相似文献   

18.
We describe a powerful new approach for discovering globally conserved regulatory elements between two genomes. The method is fast, simple and comprehensive, without requiring alignments. Its application to pairs of yeasts, worms, flies and mammals yields a large number of known and novel putative regulatory elements. Many of these are validated by independent biological observations, have spatial and/or orientation biases, are co-conserved with other elements and show surprising conservation across large phylogenetic distances.  相似文献   

19.
S Zoubak  A Rynditch  G Bernardi 《Gene》1992,119(2):207-213
The compositional distributions of genomes, genes (and their third codon positions) and long terminal repeats from retroviruses of warm-blooded vertebrates are characterized by a striking bimodality which is accompanied by a remarkable compositional homogeneity within each retroviral genome. A first, major class of retroviral genomes is GC-rich, whereas a second, minor class is GC-poor. Representative expressed viral genomes from the two classes integrate in GC-rich and GC-poor isochores, respectively, of host genomes. The first class comprises all oncoviruses (except B-types and some D-types), the second, lentiviruses, spumaviruses, as well as B-type and some D-type oncoviruses (e.g., mouse mammary tumor virus and simian retroviruses type D, respectively). The compositional bimodal distribution of retroviral genomes and the accompanying compositional homogeneity within each retroviral genome appear to be the result of the compositional evolution of retroviral genomes in their integrated form.  相似文献   

20.

Background  

Understanding the compositional dynamics of genomes and their coding sequences is of great significance in gaining clues into molecular evolution and a large number of publically-available genome sequences have allowed us to quantitatively predict deviations of empirical data from their theoretical counterparts. However, the quantification of theoretical compositional variations for a wide diversity of genomes remains a major challenge.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号