Similar articles
20 similar articles found
1.
Most previous studies of the evolution of codon usage bias (CUB) and intronic GC content (iGC) in Drosophila melanogaster were based on between-species comparisons, reflecting long-term evolutionary events. However, a complete picture of the evolution of CUB and iGC cannot be drawn without knowledge of their more recent evolutionary history. Here, we used a polymorphism dataset collected from Zimbabwe to study patterns of the recent evolution of CUB and iGC. Analyzing coding and intronic data jointly with a model which can simultaneously estimate selection, mutational, and demographic parameters, we have found that: (1) natural selection is probably acting on synonymous codons; (2) a constant population size model seems to be sufficient to explain most of the observed synonymous polymorphism patterns; (3) GC is favored over AT in introns. In agreement with the long-term evolutionary patterns, ongoing selection acting on X-linked synonymous codons is stronger than that acting on autosomal codons. The selective differences between preferred and unpreferred codons tend to be greater than the differences between GC and AT in introns, suggesting that natural selection, not just biased gene conversion, may have influenced the evolution of CUB. Interestingly, evidence for non-equilibrium evolution comes exclusively from the intronic data. However, three different models, an equilibrium model with two classes of selected sites and two non-equilibrium models with changes in either population size or mutational parameters, fit the intronic data equally well. These results show that using inadequate selection (or demographic) models can result in incorrect estimates of demographic (or selection) parameters.

2.
We considered genome‐wide four‐fold degenerate sites from an African Drosophila melanogaster population and compared them to short introns. To include divergence and to polarize the data, we used its close relatives Drosophila simulans, Drosophila sechellia, Drosophila erecta and Drosophila yakuba as outgroups. In D. melanogaster, the GC content at four‐fold degenerate sites is higher than in short introns; compared to its relatives, more AT than GC is fixed. The former has been explained by codon usage bias (CUB) favouring GC; the latter by decreased intensity of directional selection or by increased mutation bias towards AT. With a biallelic equilibrium model, evidence for directional selection comes mostly from the GC‐rich ancestral base composition. Together with a slight mutation bias, it leads to an asymmetry of the unpolarized allele frequency spectrum, from which directional selection is inferred. Using a quasi‐equilibrium model and polarized spectra, however, only purifying and no directional selection is detected. Furthermore, polarized spectra are proportional to those of the presumably unselected short introns. As we have no evidence for a decrease in effective population size, relaxed CUB must be due to a reduction in the selection coefficient. Going beyond the biallelic model and considering all four bases, signs of directional selection are stronger. In contrast to short introns, complementary bases show strand specificity and allele frequency spectra depend on mutation directions. Hence, the traditional biallelic model to describe the evolution of four‐fold degenerate sites should be replaced by more complex models assuming only quasi‐equilibrium and accounting for all four bases.

3.
Galtier N  Bazin E  Bierne N 《Genetics》2006,172(1):221-228
The study of base composition evolution in Drosophila has been achieved mostly through the analysis of coding sequences. Third codon position GC content, however, is influenced by both neutral forces (e.g., mutation bias) and natural selection for codon usage optimization. In this article, large data sets of noncoding DNA sequence polymorphism in D. melanogaster and D. simulans were gathered from public databases to try to disentangle these two factors, since noncoding sequences are not affected by selection for codon usage. Allele frequency analyses revealed an asymmetric pattern of AT vs. GC noncoding polymorphisms: AT --> GC mutations are less numerous, and tend to segregate at a higher frequency, than GC --> AT ones, especially at GC-rich loci. This is indicative of nonstationary evolution of base composition and/or of GC-biased allele transmission. Fitting population genetics models to the allele frequency spectra confirmed this result and favored the hypothesis of a biased transmission. These results, together with previous reports, suggest that GC-biased gene conversion has influenced base composition evolution in Drosophila and explain the correlation between intron and exon GC content.

4.
The aim of this study was to analyze patterns of nucleotide composition and codon usage in the pea aphid genome (Acyrthosiphon pisum). A collection of 60,000 expressed sequence tags (ESTs) in the pea aphid has been used to automatically reconstruct 5809 coding sequences (CDSs), based on similarity with known proteins and on coding style recognition. Reconstructions were manually checked for ribosomal proteins, leading to a tentative reconstruction of the near-complete set of this category. Pea aphid coding sequences showed a shift toward AT (especially at the third codon position) compared to Drosophila homologues. Genes with a putative high level of expression (ribosomal and other genes with high EST support) remained more GC3-rich and had a distinct codon usage from bulk sequences: they exhibited a preference for C-ending codons and CGT (for arginine), which thus appeared optimal for translation. However, the discrimination was not as strong as in Drosophila, suggesting a reduced degree of translational selection. The space of variation in codon usage for A. pisum appeared to be larger than in Drosophila, with a substantial fraction of genes that remained GC3-rich. Some of those (in particular some structural proteins) also showed high levels of codon bias and a very strong preference for C-ending codons, which could be explained either by strong translational selection or by other mechanisms. Finally, genomic traces were analyzed to build 206 fragments containing a full CDS, which allowed studying the correlations between GC contents of coding and those of noncoding (flanking and introns) sequences.

5.
We estimated the intensity of selection on preferred codons in Drosophila pseudoobscura and D. miranda at X-linked and autosomal loci, using a published data set on sequence variability at 67 loci, by means of an improved method that takes account of demographic effects. We found evidence for stronger selection at X-linked loci, consistent with their higher levels of codon usage bias. The estimates of the strength of selection and mutational bias in favor of unpreferred codons were similar to those found in other species, after taking into account the fact that D. pseudoobscura showed evidence for a recent expansion in population size. We examined correlates of synonymous and nonsynonymous diversity in these species and found no evidence for effects of recurrent selective sweeps on nonsynonymous mutations, which is probably because this set of genes has much higher than average levels of selective constraints. There was evidence for correlated effects of levels of selective constraints on protein sequences and on codon usage, as expected under models of selection for translational accuracy. Our analysis of a published data set on D. melanogaster provided evidence for the effects of selective sweeps of nonsynonymous mutations on linked synonymous diversity, but only in the subset of loci that experienced the highest rates of nonsynonymous substitutions (about one-quarter of the total) and not at more slowly evolving loci. Our correlational analysis of this data set suggested that both selective constraints on protein sequences and recurrent selective sweeps affect the overall level of codon usage.

6.
Population, evolutionary and genomic consequences of interference selection   (cited 3 times; self-citations: 0; citations by others: 3)
Comeron JM  Kreitman M 《Genetics》2002,161(1):389-410
Weakly selected mutations are most likely to be physically clustered across genomes and, when sufficiently linked, they alter each other's fixation probability, a process we call interference selection (IS). Here we study population genetics and evolutionary consequences of IS on the selected mutations themselves and on adjacent selectively neutral variation. We show that IS reduces levels of polymorphism and increases low-frequency variants and linkage disequilibrium, in both selected and adjacent neutral mutations. IS can account for several well-documented patterns of variation and composition in genomic regions with low rates of crossing over in Drosophila. IS cannot be described simply as a reduction in the efficacy of selection and effective population size in standard models of selection and drift. Rather, IS can be better understood with models that incorporate a constant "traffic" of competing alleles. Our simulations also allow us to make genome-wide predictions that are specific to IS. We show that IS will be more severe at sites in the center of a region containing weakly selected mutations than at sites located close to the edge of the region. Drosophila melanogaster genomic data strongly support this prediction, with genes without introns showing significantly reduced codon bias in the center of coding regions. As expected, if introns relieve IS, genes with centrally located introns do not show reduced codon bias in the center of the coding region. We also show that reasonably small differences in the length of intermediate "neutral" sequences embedded in a region under selection increase the effectiveness of selection on the adjacent selected sequences. Hence, the presence and length of sequences such as introns or intergenic regions can be a trait subject to selection in recombining genomes. In support of this prediction, intron presence is positively correlated with a gene's codon bias in D. melanogaster. Finally, the study of temporal dynamics of IS after a change of recombination rate shows that nonequilibrium codon usage may be the norm rather than the exception.

7.
Loewe L  Charlesworth B 《Genetics》2007,175(3):1381-1393
Background selection involves the reduction in effective population size caused by the removal of recurrent deleterious mutations from a population. Previous work has examined this process for large genomic regions. Here we focus on the level of a single gene or small group of genes and investigate how the effects of background selection caused by nonsynonymous mutations are influenced by the lengths of coding sequences, the number and length of introns, intergenic distances, neighboring genes, mutation rate, and recombination rate. We generate our predictions from estimates of the distribution of the fitness effects of nonsynonymous mutations, obtained from DNA sequence diversity data in Drosophila. Results for genes in regions with typical frequencies of crossing over in Drosophila melanogaster suggest that background selection may influence the effective population sizes of different regions of the same gene, consistent with observed differences in codon usage bias along genes. It may also help to cause the observed effects of gene length and introns on codon usage. Gene conversion plays a crucial role in determining the sizes of these effects. The model overpredicts the effects of background selection with large groups of nonrecombining genes, because it ignores Hill-Robertson interference among the mutations involved.

8.
The molecular evolution of the histone multigene family was studied by cloning and determining the nucleotide sequences of the histone 3 genes in seven Drosophila species, D. takahashii, D. lutescens, D. ficusphila, D. persimilis, D. pseudoobscura, D. americana and D. immigrans. CT repeats, a TATA box and an AGTG motif in the 5' region, and a hairpin loop and purine-rich motifs (CAA(T/G)GAGA) in the 3' region were conserved even in distantly related species. In D. hydei and D. americana, the GC content at the third codon position in the protein coding region was relatively low (49% and 45%), while in D. takahashii and D. lutescens it was relatively high (64% and 65%). The non-significant correlation between the GC contents in the 3' region and at the third codon position as well as the evidence of less constraint in the 3' region suggested that mutational bias may not be the major mechanism responsible for the biased nucleotide change at the third codon position or for codon usage bias.

9.
The nature of the forces affecting base composition is a key question in genome evolution. There is uncertainty as to whether differences in the GC contents of non-coding sequences reflect differences in mutational bias, or in the intensity of selection or biased gene conversion. We have used a polymorphism dataset for non-coding sequences on the X chromosome of Drosophila simulans to examine this question. The proportion of GC-->AT versus AT-->GC polymorphic mutations in a locus is correlated with its GC content. This implies the action of forces that favour GC over AT base pairs, which are apparently strongest in GC-rich sequences.

10.
Codon usage in Clonorchis sinensis was analyzed using 12,515 codons from 38 coding sequences. Total GC content was 49.83%, and GC1, GC2 and GC3 contents were 56.32%, 43.15% and 50.00%, respectively. The effective number of codons converged at 51-53 codons. When plotted against total GC content or GC3, codon usage was distributed in relation to GC3 biases. Relative synonymous codon usage for each codon revealed a single major trend, which was highly correlated with GC content at the third position when codons began with A or U at the first two positions. In codons beginning with G or C base at the first two positions, the G or C base rarely occurred at the third position. These results suggest that codon usage is shaped by a bias towards G or C at the third base, and that this is affected by the first and second bases.
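
The position-specific GC values quoted in this abstract (GC1, GC2, GC3) can be illustrated with a minimal Python sketch. It assumes an in-frame coding sequence and is not the authors' pipeline; the function name `gc_by_position` and the toy sequence are hypothetical, and whether the original study pooled codons across genes or averaged per gene is not stated in the abstract.

```python
def gc_by_position(cds: str) -> dict:
    """Return GC percentages overall and at codon positions 1, 2 and 3.

    Assumes `cds` is in frame (starts at codon position 1). RNA input
    is accepted by mapping U to T.
    """
    seq = cds.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    seq = seq[:usable]
    if not seq:
        raise ValueError("sequence must contain at least one full codon")

    def gc_percent(bases: str) -> float:
        return 100.0 * sum(b in "GC" for b in bases) / len(bases)

    return {
        "GC": gc_percent(seq),         # all positions
        "GC1": gc_percent(seq[0::3]),  # first codon positions
        "GC2": gc_percent(seq[1::3]),  # second codon positions
        "GC3": gc_percent(seq[2::3]),  # third codon positions
    }


# Toy example (hypothetical sequence): codons ATG GCC AAG TTT
example = gc_by_position("ATGGCCAAGTTT")
print(example)
```

Slicing with a step of 3 keeps the three codon positions separate without building an explicit codon list, which is enough for composition statistics of this kind.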

11.
Summary: We have analyzed the correlation that exists between the GC levels of third and first or second codon position for about 1400 human coding sequences. The linear relationship that was found indicates that the large differences in GC level of third codon positions of human genes are paralleled by smaller differences in GC levels of first and second codon positions. Whereas third codon position differences correspond to very large differences in codon usage within the human genome, the first and second codon position differences correspond to smaller, yet very remarkable, differences in the amino acid composition of encoded proteins. Because GC levels of codon positions are linearly correlated with the GC levels of the isochores harboring the corresponding genes, both codon usage and amino acid composition are different for proteins encoded by genes located in isochores of different GC levels. Furthermore, we have also shown that a linear relationship with a unity slope and a correlation coefficient of 0.77 exists between GC levels of introns and exons from the 238 human genes currently available for this analysis. Introns are, however, about 5% lower in GC, on average, than exons from the same genes.

12.
Background: The mitochondrial ND gene encodes NADH dehydrogenase, the first enzyme of the mitochondrial electron transport chain. Leigh syndrome, a neurodegenerative disease caused by mutation in the ND2 gene (T4681C), is associated with bilateral symmetric lesions in basal ganglia and subcortical brain regions. Therefore, it is of interest to analyze mitochondrial DNA to glean information on evolutionary relationships. This study focuses on the analysis of compositional dynamics and selection pressure in shaping the codon usage patterns in the coding sequence of the MT-ND2 gene across pisces, aves and mammals, using bioinformatics measures such as the effective number of codons (ENC), codon adaptation index (CAI) and relative synonymous codon usage (RSCU). Results: We observed a low codon usage bias, as reflected by high ENC values, in the MT-ND2 gene among pisces, aves and mammals. The most frequently used codons ended with A/C at the third codon position, and the gene was AT rich in all three classes. The codons TCA, CTA, CGA and TGA were over-represented in all three classes. The F1 axis showed significant positive correlation with G, T3 and CAI, while the F2 axis showed significant negative correlation with A and T but significant positive correlation with G, C, G3, C3, ENC, GC, GC1, GC2 and GC3. Conclusions: The codon usage bias in the MT-ND2 gene is not associated with expression level. Mutation pressure and natural selection affect the codon usage pattern in the MT-ND2 gene.

13.
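Analysis of codon usage bias in the chloroplast genome of alfalfa (Medicago sativa)   (cited 1 time; self-citations: 0; citations by others: 1)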
Yu Feng (喻凤)  Han Ming (韩明) 《Guihaia (广西植物)》2021,41(12):2069-2076
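To analyze the codon usage pattern of the alfalfa chloroplast genome, this study examined 49 protein-coding sequences screened from the alfalfa chloroplast genome, using CodonW, CUSP, CHIPS, SPSS and other software to investigate codon usage patterns and bias. The results showed that: (1) the average GC content at the third codon position of alfalfa chloroplast genes was 26.44%, and the effective number of codons (ENC) ranged from 40.6 to 51.41, indicating weak bias for most codons. (2) Relative synonymous codon usage (RSCU) analysis found 30 codons with RSCU > 1, of which 29 end in A or U, indicating that A or U occurs at high frequency in the alfalfa chloroplast genome. (3) Neutrality analysis found that the correlation between GC3 and GC12 was not significant, indicating that codon bias is mainly influenced by natural selection; ENC-plot analysis found that some genes fell below and around the expected curve, indicating that mutation also influenced the formation of part of the codon bias. In addition, 17 codons were identified as optimal codons of the alfalfa chloroplast genome. Codon bias in the alfalfa chloroplast genome is probably shaped jointly by natural selection and mutation. This study lays a foundation for chloroplast genetic engineering in alfalfa and for the genetic improvement of target traits.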
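
The RSCU statistic used in this abstract is conventionally defined as the observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally, so RSCU > 1 marks a codon used more often than expected. The following Python sketch illustrates that definition under stated assumptions: it uses the standard genetic code, pools counts over all input sequences, and is not a reimplementation of CodonW or CUSP. The function name `rscu` and the toy sequence are hypothetical.

```python
from collections import Counter
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# the standard code is appropriate for the sequences being analyzed).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def rscu(cds_list):
    """Return {codon: RSCU} pooled over a list of in-frame coding sequences.

    Stop codons and single-codon amino acids (Met, Trp) are skipped, as are
    amino acids that never occur in the input.
    """
    counts = Counter()
    for cds in cds_list:
        seq = cds.upper().replace("U", "T")
        for i in range(0, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if codon in CODON_TABLE:  # ignores codons with ambiguous bases
                counts[codon] += 1

    # Group codons into synonymous families by amino acid.
    families = {}
    for codon, aa in CODON_TABLE.items():
        families.setdefault(aa, []).append(codon)

    values = {}
    for aa, codons in families.items():
        if aa == "*" or len(codons) == 1:
            continue
        total = sum(counts[c] for c in codons)
        if total == 0:
            continue
        for c in codons:
            # observed count / (family total / family size)
            values[c] = counts[c] * len(codons) / total
    return values


# Toy example (hypothetical sequence): GCT x3, GCC x1, AAA x1, AAG x1
values = rscu(["GCTGCTGCTGCCAAAAAG"])
preferred = sorted(c for c, v in values.items() if v > 1)
print(preferred)
```

Counting how many codons with RSCU > 1 end in A or T (U) would then reproduce the kind of tally reported in point (2) of the abstract.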

14.
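To analyze differences in codon usage between the mitochondrial genomes of cultivated and wild soybean, this study examined their mitochondrial coding sequences and compared the factors influencing the formation of codon bias and its evolution. The results showed that: (1) the GC content of the mitochondrial coding regions was 44.56% in cultivated soybean and 44.58% in wild soybean, indicating that the mitochondrial coding genes of both are rich in A/T bases. (2) In both cultivated and wild soybean, the mean GC content at the first and second codon positions was very significantly correlated with the GC content at the third position, indicating that the role of mutation in shaping codon bias cannot be ignored; PR2-plot analysis showed that, at the third position of synonymous codons, purines were used less frequently than pyrimidines; in the Nc-plot analysis, genes with an Nc ratio between -0.1 and 0.2 accounted for more than 95% of all genes; mutation, selection and other factors jointly shaped codon usage bias in soybean mitochondrial coding sequences. (3) 20 and 21 codons were identified as optimal codons for the mitochondrial coding sequences of cultivated and wild soybean, respectively, all of which end in A or T except the serine codon TCC. Taken together, these results suggest that selection has influenced the formation of mitochondrial codon bias more strongly in cultivated soybean than in wild soybean, which may be a result of the long-term artificial cultivation and domestication of cultivated soybean from wild soybean.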
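
The Nc-plot analysis mentioned in this abstract compares each gene's observed effective number of codons with the value expected from GC3 alone. A minimal Python sketch follows, assuming Wright's (1990) expected curve and assuming that the "Nc ratio" is defined as (expected − observed) / expected; the abstract does not state the formula, so both the definition and the function names `expected_nc` and `nc_ratio` are illustrative assumptions.

```python
def expected_nc(gc3: float) -> float:
    """Expected Nc when codon usage depends only on GC3 (Wright's curve).

    `gc3` is the GC fraction at synonymous third positions, in [0, 1].
    """
    if not 0.0 <= gc3 <= 1.0:
        raise ValueError("gc3 must be a fraction between 0 and 1")
    return 2.0 + gc3 + 29.0 / (gc3 ** 2 + (1.0 - gc3) ** 2)


def nc_ratio(nc_observed: float, gc3: float) -> float:
    """Relative distance of a gene below (positive) or above (negative)
    the expected curve; the definition is an assumption, see lead-in."""
    nc_exp = expected_nc(gc3)
    return (nc_exp - nc_observed) / nc_exp


# Toy values (hypothetical gene): observed Nc of 48.4 at GC3 = 0.5
ratio = nc_ratio(48.4, 0.5)
print(round(expected_nc(0.5), 2), round(ratio, 3))
```

Under this definition, a ratio near zero means the gene lies close to the curve, which is usually read as codon usage explained largely by composition, while larger positive values point to additional constraints such as selection.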

15.
A correlation was shown between the length of introns and the codon usage of the coding sequences of the corresponding genes, which in some cases can be related to the level of gene expression. The link is positive in unicellular organisms, i.e., genes with longer introns show higher codon usage bias. It is most pronounced in baker's yeast, where it is definitely related to the level of gene expression: genes with a higher level of expression have longer introns. The correlation is inverted in multicellular organisms as compared to unicellular ones. Some organisms, however, do not show the link. The presence or absence of the link does not seem to be related to the GC percent of the coding sequences. Received: 7 December 1999 / Accepted: 10 May 2000

16.
Adaptive codon usage provides evidence of natural selection in one of its most subtle forms: a fitness benefit of one synonymous codon relative to another. Codon usage bias is evident in the coding sequences of a broad array of taxa, reflecting selection for translational efficiency and/or accuracy as well as mutational biases. Here, we quantify the magnitude of selection acting on alternative codons in genes of the nematode Caenorhabditis remanei, an outcrossing relative of the model organism C. elegans, by fitting the expected mutation-selection-drift equilibrium frequency distribution of preferred and unpreferred codon variants to the empirical distribution. This method estimates the intensity of selection on synonymous codons in genes with high codon bias as N(e)s = 0.17, a value significantly greater than zero. In addition, we demonstrate for the first time that estimates of ongoing selection on codon usage among genes, inferred from nucleotide polymorphism data, correlate strongly with long-term patterns of codon usage bias, as measured by the frequency of optimal codons in a gene. From the pattern of polymorphisms in introns, we also infer that these findings do not result from the operation of biased gene conversion toward G or C nucleotides. We therefore conclude that coincident patterns of current and ancient selection are responsible for shaping biased codon usage in the C. remanei genome.

17.
Maside X  Charlesworth B 《Genetics》2007,176(4):2293-2305
We present the results of a survey of DNA sequence variability at X-linked and autosomal loci in Drosophila americana and of patterns of DNA sequence evolution among D. americana and four other related species in the virilis group of Drosophila. D. americana shows a typical level of silent polymorphism for a Drosophila species, but has an unusually low ratio of nonsynonymous to silent variation. Both D. virilis and D. americana also show a low ratio of nonsynonymous to synonymous substitutions along their respective lineages since the split from their common ancestor. The proportion of amino acid substitutions between D. americana and its relatives that are caused by positive selection, as estimated by extensions of the McDonald-Kreitman test, appears to be unusually high. We cannot, however, exclude the possibility that this reflects a recent increase in the intensity of selection on nonsynonymous mutations in D. americana and D. virilis. We also find that base composition at neutral sites appears to be in overall equilibrium among these species, but there is evidence for departure from equilibrium for codon usage in some lineages.

18.
19.
Fortes GG  Bouza C  Martínez P  Sánchez L 《Genetica》2007,129(3):281-289
To revisit the general view that the genomes of warm- and cold-blooded vertebrates differ in compositional structure, we made use of the increasing number of genetic sequences, including coding (exon) and non-coding (intron) regions, that have been deposited in the databases in recent years. The nucleotide distributions of the third codon positions (GC3) have been analyzed in 1510 coding sequences (CDS) of fish, 1414 CDS of amphibians and 320 CDS of reptiles. Also, the relationship between the GC content of 74, 56 and 25 CDS of fish, amphibians and reptiles, respectively, and that of their corresponding introns (GCI) has been considered. In accordance with recent data, sequence analysis showed the presence of very GC3-rich CDS in these poikilothermic vertebrates. However, very high diversity in compositional patterns among different orders of fish, amphibians and reptiles was found. A significant positive correlation between GC3 and GCI was also confirmed for the genes analyzed. Nevertheless, introns proved to be poorer in GC than their corresponding CDS, this difference being larger than in the human genome. Because of the limited number of available sequences including both exons and introns, we must be cautious about the results derived from them. However, the indication that coding sequences are richer in GC than their corresponding introns could help explain the discrepancy between sequence analysis and the ultracentrifugation studies in cold-blooded vertebrates, which did not predict the existence of GC-rich isochores.

20.
Synonymous codons are widely selected for various biological mechanisms in both prokaryotes and eukaryotes. Recent evidence suggests that microRNA (miRNA) function may affect synonymous codon choices near miRNA target sites. To better understand this, we performed a genome-wide analysis of synonymous codon usage around miRNA target sites in four plant genomes. We observed a general trend of increased site accessibility around miRNA target sites in plants. Guanine-cytosine (GC)-poor codons are preferred in the flanking regions of miRNA target sites. Within-genome analyses show significant variation among miRNA targets within species. GC content of the target gene can partly explain the variation of site accessibility among miRNA targets. miRNA targets in GC-rich genes show stronger selection signals than those in GC-poor genes. A gene's codon usage bias and the conservation level of the miRNA and its target also have some effect on site accessibility, but the expression level of the miRNA or its target and the mechanism of miRNA activity do not contribute to site accessibility differences among miRNA targets. We suggest that synonymous codons near miRNA targets are selected for efficient miRNA binding and proper miRNA function. Our results present a new dimension of natural selection on synonymous codons near miRNA target sites in plants, which will have important implications for coding sequence evolution.
