首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 312 毫秒
1.
Biased codon usage in many species results from a balance among mutation, weak selection, and genetic drift. Here I show that selection to maintain biased codon usage is reduced in Drosophila miranda relative to its ancestor. Analyses of mutation patterns in noncoding DNA suggest that the extent of this reduction cannot be explained by changes in mutation bias or by biased gene conversion. Low levels of variability in D. miranda relative to its sibling species, D. pseudoobscura, suggest that it has a much smaller effective population size. Reduced codon usage bias in D. miranda may thus result from the reduced efficacy of selection against newly arising mutations to unpreferred codons. [Reviewing Editor: Dr. Richard Kliman]  相似文献   

2.
Recent work has shown that Drosophila melanogaster genes with fast-evolving nonsynonymous sites have lower codon usage bias. This pattern has been attributed to interference between positive selection at nonsynonymous sites and weak selection on codon usage. Here we have looked for this correlation in a much larger and less biased dataset, comprising 630 gene pairs from D. melanogaster and D. yakuba. We confirmed that there is a negative correlation between the rate of nonsynonymous substitutions (dN) and codon bias in D. melanogaster. We then tested the interference hypothesis and other alternative explanations, including one involving gene expression. We found that dN indeed correlates with the level of gene expression. Given that gene expression is a strong determinant of codon bias, the relationship between dN and codon bias might be a by-product of gene expression. However, our tests show that none of the hypotheses we consider seem to explain the data fully.This article contains online supplementary material.Reviewing Editor: Dr. John Huelsenbeck  相似文献   

3.
Abstract To investigate the phylogenetic relationships and molecular evolution of α-amylase (Amy) genes in the Drosophila montium species subgroup, we constructed the phylogenetic tree of the Amy genes from 40 species from the montium subgroup. On our tree the sequences of the auraria, kikkawai, and jambulina complexes formed distinct tight clusters. However, there were a few inconsistencies between the clustering pattern of the sequences and taxonomic classification in the kikkawai and jambulina complexes. Sequences of species from other complexes (bocqueti, bakoue, nikananu, and serrata) often did not cluster with their respective taxonomic groups. This suggests that relationships among the Amy genes may be different from those among species due to their particular evolution. Alternatively, the current taxonomy of the investigated species is unreliable. Two types of divergent paralogous Amy genes, the so-called Amy1- and Amy3-type genes, previously identified in the D. kikkawai complex, were common in the montium subgroup, suggesting that the duplication event from which these genes originate is as ancient as the subgroup or it could even predate its differentiation. Thc Amy1-type genes were closer to the Amy genes of D. melanogaster and D. pseudoobscura than to the Amy3-type genes. In the Amy1-type genes, the loss of the ancestral intron occurred independently in the auraria complex and in several Afrotropical species. The GC content at synonymous third codon positions (GC3s) of the Amy1-type genes was higher than that of the Amy3-type genes. Furthermore, the Amy1-type genes had more biased codon usage than the Amy3-type genes. The correlations between GC3s and GC content in the introns (GCi) differed between these two Amy-type genes. These findings suggest that the evolutionary forces that have affected silent sites of the two Amy-type genes in the montium species subgroup may differ.  相似文献   

4.

Background  

Codon usage bias (CUB), the uneven use of synonymous codons, is a ubiquitous observation in virtually all organisms examined. The pattern of codon usage is generally similar among closely related species, but differs significantly among distantly related organisms, e.g., bacteria, yeast, and Drosophila. Several explanations for CUB have been offered and some have been supported by observations and experiments, although a thorough understanding of the evolutionary forces (random drift, mutation bias, and selection) and their relative importance remains to be determined. The recently available complete genome DNA sequences of twelve phylogenetically defined species of Drosophila offer a hitherto unprecedented opportunity to examine these problems. We report here the patterns of codon usage in the twelve species and offer insights on possible evolutionary forces involved.  相似文献   

5.
In this paper we investigate the relationships among intron density (number of introns per kilobase of coding sequence), gene expression level, and strength of splicing signals in two species: Drosophila melanogaster and Caenorhabditis elegans. We report a negative correlation between intron density and gene expression levels, opposite to the effect previously observed in human. An increase in splice site strength has been observed in long introns in D. melanogaster. We show this is also true of C. elegans. We also examine the relationship between intron density and splice site strength. There is an increase in splice site strength as the intron structure becomes less dense. This could suggest that introns are not recognized in isolation but could function in a cooperative manner to ensure proper splicing. This effect remains if we control for the effects of alternative splicing on splice site strength. Reviewing Editor: Dr. Nicolas Galtier  相似文献   

6.
Codon usage patterns in 16 chromosomes coincided with each other in Saccharomyces cerevisiae, and the same result was obtained from Encephalitozoon cuniculi consisting of 11 chromosomes, although each chromosome function differs. In addition, preferential codon usage in the regenerated coding systems for Leu and Lys differed between Saccharomyces cerevisiae and Encephalitozoon cuniculi. These results cannot be explained by Darwins natural selection theory or by the neutral theory proposed against Darwins. Furthermore, the codon usage patterns were examined in both prokaryotes and eukaryotes. The use of G or C at the third codon position was much lower than T or A in Ureaplasma urealyticum, whereas inversely the use of G or C at the third codon position was much higher than T or A in Mycobacterium tuberculosis. Additionally, Candida albicans and Plasmodium falciparum also showed a very low usage of G or C at the third codon position. It is a difficult leap to speculate that the inverse codon usage change occurred over the genome during biological evolution. Thus, the present results strongly suggest that organisms were derived from different origins, indicating that the origin of life was plural, based on genomic structures.  相似文献   

7.
Enterogenic Escherichia coli (ETEC) F18 strains are the main pathogenic bacteria causing severe diarrhea in humans and domestic animals. However, the information about synonymous codon usage pattern of ETEC F18 genome remains unclear. We conducted a genome-wide analysis of synonymous codon usage patterns in the ETEC F18 strain SRA: SAMN02471895. After filtering of the complete genome sequence, 4327 coding sequences were analyzed using multivariate statistical methods to calculate synonymous codon usage patterns and to evaluate the influence of various factors in shaping the codon usage. The mean GC content was 51.38%, with a slight preference for G/C-ending codons. Twenty-two codons were determined as ‘‘optimal codons”. ENC plots showed some of the genes were on or close to the expected curve, while only points with low-ENC values were below the curve. PR2 analysis showed that GC and AT were not used proportionally, suggesting major roles for mutational pressure and natural selection in shaping usage. Neutrality plots showed a significant correlation between GC12 and GC3, suggesting that mutational pressure is responsible for nucleotide composition in shaping the strength of codon usage. Translational selection was the main factor shaping the codon usage pattern of ETEC F18 genome, while other factors such as protein length, GRAVY and ARO values also influenced codon usage to some extent. We analyzed the codon usage pattern systematically and identified the factors shaping codon usage bias in the ETEC F18 genome. Such information further elucidates the mechanisms of synonymous codon usage bias and provides the basis of molecular genetic engineering and evolutionary studies.  相似文献   

8.
The human cancer susceptibility gene, BRCA2, functions in double-strand break repair by homologous recombination, and it appears to function via interaction of a repetitive region (“BRC repeats”) with RAD-51. A putatively simpler homolog, dmbrca2, was identified in Drosophila melanogaster recently and also affects mitotic and meiotic double-strand break repair. In this study, we examined patterns of repeat variation both within Drosophila pseudoobscura and among available Drosophila genome sequences. We identified extensive variation within and among closely related Drosophila species in BRC repeat number, to the extent that variation within this genus recapitulates the extent of variation found across the entire animal kingdom. We describe patterns of evolution across species by documenting recent repeat expansions (sometimes in tandem arrays) and homogenizations within available genome sequences. Overall, we have documented patterns and modes of evolution in a new model system of a gene which is important to human health.  相似文献   

9.
Broad-scale differences in crossover rate across the genome have been characterized in most genomes studied. Fine-scale differences, however, have only been examined in a few taxa, such as Arabidopsis, yeast, humans, and mice. No prior studies have directly looked for fine-scale recombination rate heterogeneity in Drosophila. We produced 370 Drosophila pseudoobscura containing a crossover event within the 2-megabase (MB) region between the genes yellow and white. We then examined 19 intervals within this region and determined where the crossovers occurred. We found that recombination events occur nonrandomly on a small scale and that mild “hotspots“ of a few kilobases exist in Drosophila. Among the regions studied, recombination rates varied from 1.4 to 52 cM/MB. We also observed a trend toward high codon bias in regions of high recombination. Finally, we identified a significantly positive correlation between recombination rate and simple repeats, as well as the motif CACAC. These sequence features may contribute to broad-scale variation in crossover rate and, thus, shed light on features associated with crossover rate heterogeneity at a genome-wide scale. Electronic Supplementary Material Electronic Supplementary material is available for this article at and accessible for authorised users. [Reviewing Editor: Dr. Dmitri Petrov]  相似文献   

10.
Hao da C  Yang L  Huang B 《Genetica》2009,135(2):123-135
Evolutionary patterns of sequence divergence were analyzed in genes from the conifer genus Taxus (yew), encoding paclitaxel biosynthetic enzymes taxadiene synthase (TS) and 10-deacetylbaccatin III-10β-O-acetyltransferase (DBAT). N-terminal fragments of TS, full-length DBAT and internal transcribed spacer (ITS) were amplified from 15 closely related Taxus species and sequenced. Premature stop codons were not found in TS and DBAT sequences. Codon usage bias was not found, suggesting that synonymous mutations are selectively neutral. TS and DBAT gene trees are not consistent with the ITS tree, where species formed monophyletic clades. In fact, for both genes, alleles were sometimes shared across species and parallel amino acid substitutions were identified. While both TS and DBAT are, overall, under purifying selection, we identified a number of amino acids of TS under positive selection based on inference using maximum likelihood models. Positively selected amino acids in the N-terminal region of TS suggest that this region might be more important for enzyme function than previously thought. Moreover, we identify lineages with significantly elevated rates of amino acid substitution using a genetic algorithm. These findings demonstrate that the pattern of adaptive paclitaxel biosynthetic enzyme evolution can be documented between closely related Taxus species, where species-specific taxane metabolism has evolved recently.  相似文献   

11.
The helicase gene of Autographa californica multiple nucleopolyhedrovirus (AcMNPV) is not only involved in viral DNA replication, but also plays a role in viral host range. To identify the codon usage bias of helicase of AcMNPV, the codon usage bias of helicase was especially studies in AcMNPV and 41 reference strains of baculoviruses by calculating the codon adaptation index (CAI), effective number of codon (ENc), relative synonymous codon usage (RSCU), and other indices. The helicase of baculovirus is less biased (mean ENc?=?50.539?>?40; mean CAI?=?0.246). AcMNPV helicase has a strong bias toward the synonymous codons with G and C at the third codon position (GC3s?=?53.6%). The plot of GC3s against ENc values revealed that GC compositional constraints are the main factor that determines the codon usage bias of major of helicase. Several indicators supported that the codon usage pattern of helicase is mainly subject to mutation pressure. Analysis of variation in codon usage and amino acid composition indicated AcMNPV helicase shows the significant preference for one or more postulated codons for each amino acid. A cluster analysis based on RSCU values suggested that AcMNPV is evolutionarily closer to members of group I alphabaculovirus. Comparison of the codon usage pattern among E. coli, yeast, mouse, human and AcMNPV showed that yeast is a suitable expression system for AcMNPV helicase. AcMNPV helicase shows weak codon usage bias. This study may help in elucidating the functional mechanism of AcMNPV helicase and the evolution of baculovirus helicases.  相似文献   

12.

Background  

Coding sequence (CDS) length, gene size, and intron length vary within a genome and among genomes. Previous studies in diverse organisms, including human, D. Melanogaster, C. elegans, S. cerevisiae, and Arabidopsis thaliana, indicated that there are negative relationships between expression level and gene size, CDS length as well as intron length. Different models such as selection for economy model, genomic design model, and mutational bias hypotheses have been proposed to explain such observation. The debate of which model is a superior one to explain the observation has not been settled down. The chicken (Gallus gallus) is an important model organism that bridges the evolutionary gap between mammals and other vertebrates. As D. Melanogaster, chicken has a larger effective population size, selection for chicken genome is expected to be more effective in increasing protein synthesis efficiency. Therefore, in this study the chicken was used as a model organism to elucidate the interaction between gene features and expression pattern upon selection pressure.  相似文献   

13.
The aim of this study was to analyze patterns of nucleotidic composition and codon usage in the pea aphid genome (Acyrthosiphon pisum). A collection of 60,000 expressed sequence tags (ESTs) in the pea aphid has been used to automatically reconstruct 5809 coding sequences (CDSs), based on similarity with known proteins and on coding style recognition. Reconstructions were manually checked for ribosomal proteins, leading to tentatively reconstruct the nea-complete set of this category. Pea aphid coding sequences showed a shift toward AT (especially at the third codon position) compared to drosophila homologues. Genes with a putative high level of expression (ribosomal and other genes with high EST support) remained more GC3-rich and had a distinct codon usage from bulk sequences: they exhibited a preference for C-ending codons and CGT (for arginine), which thus appeared optimal for translation. However, the discrimination was not as strong as in drosophila, suggesting a reduced degree of translational selection. The space of variation in codon usage for A. pisum appeared to be larger than in drosophila, with a substantial fraction of genes that remained GC3-rich. Some of those (in particular some structural proteins) also showed high levels of codon bias and a very strong preference for C-ending codons, which could be explained either by strong translational selection or by other mechanisms. Finally, genomic traces were analyzed to build 206 fragments containing a full CDS, which allowed studying the correlations between GC contents of coding and those of noncoding (flanking and introns) sequences.  相似文献   

14.
We report a significant negative correlation between nonsynonymous polymorphism and intron length in Drosophila melanogaster. This correlation is similar to that between protein divergence and intron length previously reported in Drosophila. We show that the relationship can be explained by the content of conserved noncoding sequences (CNS) within introns. In addition, genes with a high regulatory complexity and many genetic interactions also exhibit larger amounts of CNS within their introns and lower values of nonsynonymous polymorphism. The present study provides relevant evidence on the importance of intron content and expression patterns on the levels of coding polymorphism. Electronic Supplementary Material The online version of this article (doi:) contains supplementary material, which is available to authorized users. [Reviewing Editor: Dr. Dmitri Petrov]  相似文献   

15.
A two-parameter statistical model was used to predict the solubility of 96 putative virulence-associated proteins of Flavobacterium psychrophilum (CSF259-93) upon over expression in Escherichia coli. This analysis indicated that 88.5% of the F. psychrophilum proteins would be expressed as insoluble aggregates (inclusion bodies). These solubility predictions were verified experimentally by colony filtration blot for six different F. psychrophilum proteins. A comprehensive analysis of codon usage identified over a dozen codons that are used frequently in F. psychrophilum, but that are rarely used in E. coli. Expression of F. psychrophilum proteins in E. coli was often associated with production of minor molecular weight products, presumably because of the codon usage bias between these two organisms. Expression of recombinant protein in the presence of rare tRNA genes resulted in marginal improvements in the expressed products. Consequently, Vibrio parahaemolyticus was developed as an alternative expression host because its codon usage is similar to F. psychrophilum. A full-length recombinant F. psychrophilum hemolysin was successfully expressed and purified from V. parahaemolyticus in soluble form, whereas this protein was insoluble upon expression in E. coli. We show that V. parahaemolyticus can be used as an alternate heterologous expression system that can remedy challenges associated with expression and production of F. psychrophilum recombinant proteins.  相似文献   

16.
17.
Arabidopsis ACT2 represents an ancient class of vegetative plant actins and is strongly and constitutively expressed in almost all Arabidopsis sporophyte vegetative tissues. Using the beta glucuronidase report system, the studies showed that ACT2 5′ regulatory region was significantly more active than CaMV 35S promoter in Arabidopsis seedlings and gametophyte vegetative tissues of Physcomitrella patens. Its activity was also observed in rice and maize seedlings. Thus, the ACT2 5′ regulatory region could potentially serve as a strong regulator to express a transgene in divergent plant species. ACT2 5′ regulatory region contained 15 conserved sequence elements, an ancient intron in its 5′ un-translated region (5′ UTR), and a purine-rich stretch followed by a pyrimidine-rich stretch (PuPy). Mutagenesis and deletion analysis illustrated that some of the conserved sequence elements and the region containing PuPy sequences played regulatory roles in Arabidopsis. Interestingly, mutation of the conserved elements did not lead a dramatic change in the activity of ACT2 5′ regulatory region. The ancient intron in ACT2 5′ UTR was required for its strong expression in both Arabidopsis and P. patens, but did not fully function as a canonical intron. Thus, it was likely that some of the conserved sequence elements and gene structures had been preserved in ACT2 5′ regulatory region over the course of land plant evolution partly due to their functional importance. The studies provided additional evidences that identification of evolutionarily conserved features in non-coding region might be used as an efficient strategy to predict gene regulatory elements.  相似文献   

18.
Chromosomal inversion polymorphism was characterized in Finnish Drosophila montana populations. A total of 14 polymorphic inversions were observed in Finnish D. montana of which nine had not been described before. The number of polymorphic inversions in each chromosome was not significantly different from that expected, assuming equal chance of occurrence in the euchromatic genome. There was, however, no correlation between the number of polymorphic inversions and that of fixed inversions in each chromosome. Therefore, a simple neutral model does not explain the evolutionary dynamics of inversions. Furthermore, in contrast to results obtained by others, no significant correlation was found between the two transposable elements (TEs) Penelope and Ulysses and inversion breakpoints in D. montana. This result suggests that these TEs were not involved in the creation of the polymorphic inversions seen in D. montana. A comparative analysis of D. montana and Drosophila virilis polytene chromosomes 4 and 5 was performed with D. virilis bacteriophage P1 clones, thus completing the comparative studies of the two species.  相似文献   

19.
20.
It is important and meaningful to understand the codon usage pattern and the factors that shape codon usage of maize. In this study, trends in synonymous codon usage in maize have been firstly examined through the multivariate statistical analysis on 7402 cDNA sequences. The results showed that the genes positions on the primary axis were strongly negatively correlated with GC3s, GC content of individual gene and gene expression level assessed by the codon adaptation index (CAI) values, which indicated that nucleotide composition and gene expression level were the main factors in shaping the codon usage of maize, and the variation in codon usage among genes may be due to mutational bias at the DNA level and natural selection acting at the level of mRNA translation. At the same time, CDS length and the hydrophobicity of each protein were, respectively, significantly correlated with the genes locations on the primary axis, GC3s and CAI values. We infer that genes length and the hydrophobicity of the encoded protein may play minor role in shaping codon usage bias. Additional 28 codons ending with a G or C base have been defined as “optimal codons”, which may provide useful information for maize gene-transformation and gene prediction.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号