首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 156 毫秒
1.

Background

The analysis of codon usage is a good way to understand the genetic and evolutionary characteristics of an organism. However, there are only a few reports related with the codon usage of the domesticated silkworm, Bombyx mori (B. mori). Hence, the codon usage of B. mori was analyzed here to reveal the constraint factors and it could be helpful to improve the bioreactor based on B. mori.

Results

A total of 1,097 annotated mRNA sequences from B. mori were analyzed, revealing there is only a weak codon bias. It also shows that the gene expression level is related to the GC content, and the amino acids with higher general average hydropathicity (GRAVY) and aromaticity (Aromo). And the genes on the primary axis are strongly positively correlated with the GC content, and GC3s. Meanwhile, the effective number of codons (ENc) is strongly correlated with codon adaptation index (CAI), gene length, and Aromo values. However, the ENc values are correlated with the second axis, which indicates that the codon usage in B. mori is affected by not only mutation pressure and natural selection, but also nucleotide composition and the gene expression level. It is also associated with Aromo values, and gene length. Additionally, B. mori has a greater relative discrepancy in codon preferences with Drosophila melanogaster (D. melanogaster) or Saccharomyces cerevisiae (S. cerevisiae) than with Arabidopsis thaliana (A. thaliana), Escherichia coli (E. coli), or Caenorhabditis elegans (C. elegans).

Conclusions

The codon usage bias in B. mori is relatively weak, and many influence factors are found here, such as nucleotide composition, mutation pressure, natural selection, and expression level. Additionally, it is also associated with Aromo values, and gene length. Among them, natural selection might play a major role. Moreover, the “optimal codons” of B. mori are all encoded by G and C, which provides useful information for enhancing the gene expression in B. mori through codon optimization.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1596-z) contains supplementary material, which is available to authorized users.  相似文献   

2.

Background

Xanthophyllomyces dendrorhous is a basal agaricomycete with uncertain taxonomic placement, known for its unique ability to produce astaxanthin, a carotenoid with antioxidant properties. It was the aim of this study to elucidate the organization of its CoA-derived pathways and to use the genomic information of X. dendrorhous for a phylogenomic investigation of the Basidiomycota.

Results

The genome assembly of a haploid strain of Xanthophyllomyces dendrorhous revealed a genome of 19.50 Megabases with 6385 protein coding genes. Phylogenetic analyses were conducted including 48 fungal genomes. These revealed Ustilaginomycotina and Agaricomycotina as sister groups. In the latter a well-supported sister-group relationship of two major orders, Polyporales and Russulales, was inferred. Wallemia occupies a basal position within the Agaricomycotina and X. dendrorhous represents the basal lineage of the Tremellomycetes, highlighting that the typical tremelloid parenthesomes have either convergently evolved in Wallemia and the Tremellomycetes, or were lost in the Cystofilobasidiales lineage. A detailed characterization of the CoA-related pathways was done and all genes for fatty acid, sterol and carotenoid synthesis have been assigned.

Conclusions

The current study ascertains that Wallemia with tremelloid parenthesomes is the most basal agaricomycotinous lineage and that Cystofilobasidiales without tremelloid parenthesomes are deeply rooted within Tremellomycetes, suggesting that parenthesomes at septal pores might be the core synapomorphy for the Agaricomycotina. Apart from evolutionary insights the genome sequence of X. dendrorhous will facilitate genetic pathway engineering for optimized astaxanthin or oxidative alcohol production.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1380-0) contains supplementary material, which is available to authorized users.  相似文献   

3.

Background

There is a significant difference between synonymous codon usage in many organisms, and it is known that codons used more frequently generally showed efficient decoding rate. At the gene level, however, there are conflicting reports on the existence of a correlation between codon adaptation and translation efficiency, even in the same organism.

Results

To resolve this issue, we cultured Escherichia coli under conditions designed to maintain constant levels of mRNA and protein and subjected the cells to ribosome profiling (RP) and mRNA-seq analyses. We showed that the RP results correlated more closely with protein levels generated under similar culture conditions than with the mRNA abundance from the mRNA-seq. Our result indicated that RP/mRNA ratio could be used as a measure of translation efficiency at gene level. On the other hand, the RP data showed that codon-specific ribosome density at the decoding site negatively correlated with codon usage, consistent with the hypothesis that preferred codons display lower ribosome densities due to their faster decoding rate. However, highly codon-adapted genes showed higher ribosome densities at the gene level, indicating that the efficiency of translation initiation, rather than higher elongation efficiency of preferred codons, exerted a greater effect on ribosome density and thus translation efficiency.

Conclusions

These findings indicate that evolutionary pressure on highly expressed genes influenced both codon bias and translation initiation efficiency and therefore explains contradictory findings that codon usage bias correlates with translation efficiency of native genes, but not with the artificially created gene pool, which was not subjected to evolution pressure.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-1115) contains supplementary material, which is available to authorized users.  相似文献   

4.
SK Behura  DW Severson 《PloS one》2012,7(8):e43111

Background

Codon bias is a phenomenon of non-uniform usage of codons whereas codon context generally refers to sequential pair of codons in a gene. Although genome sequencing of multiple species of dipteran and hymenopteran insects have been completed only a few of these species have been analyzed for codon usage bias.

Methods and Principal Findings

Here, we use bioinformatics approaches to analyze codon usage bias and codon context patterns in a genome-wide manner among 15 dipteran and 7 hymenopteran insect species. Results show that GAA is the most frequent codon in the dipteran species whereas GAG is the most frequent codon in the hymenopteran species. Data reveals that codons ending with C or G are frequently used in the dipteran genomes whereas codons ending with A or T are frequently used in the hymenopteran genomes. Synonymous codon usage orders (SCUO) vary within genomes in a pattern that seems to be distinct for each species. Based on comparison of 30 one-to-one orthologous genes among 17 species, the fruit fly Drosophila willistoni shows the least codon usage bias whereas the honey bee (Apis mellifera) shows the highest bias. Analysis of codon context patterns of these insects shows that specific codons are frequently used as the 3′- and 5′-context of start and stop codons, respectively.

Conclusions

Codon bias pattern is distinct between dipteran and hymenopteran insects. While codon bias is favored by high GC content of dipteran genomes, high AT content of genes favors biased usage of synonymous codons in the hymenopteran insects. Also, codon context patterns vary among these species largely according to their phylogeny.  相似文献   

5.
6.

Background

The frequency of synonymous codon usage varies widely between organisms. Suboptimal codon content limits expression of viral, experimental or therapeutic heterologous proteins due to limiting cognate tRNAs. Codon content is therefore often adjusted to match codon bias of the host organism. Codon content also varies between genes within individual mammalian species. However, little attention has been paid to the consequences of codon content upon translation of host proteins.

Methodology/Principal Findings

In comparing the splicing repressor activities of transfected human PTB and its two tissue-restricted paralogs–nPTB and ROD1–we found that the three proteins were expressed at widely varying levels. nPTB was expressed at 1–3% the level of PTB despite similar levels of mRNA expression and 74% amino acid identity. The low nPTB expression was due to the high proportion of codons with A or U at the third codon position, which are suboptimal in human mRNAs. Optimization of the nPTB codon content, akin to the “humanization” of foreign ORFs, allowed efficient translation in vivo and in vitro to levels comparable with PTB. We were then able to demonstrate that all three proteins act as splicing repressors.

Conclusions/Significance

Our results provide a striking illustration of the importance of mRNA codon content in determining levels of protein expression, even within cells of the natural host species.  相似文献   

7.

Background

Codon usage plays a crucial role when recombinant proteins are expressed in different organisms. This is especially the case if the codon usage frequency of the organism of origin and the target host organism differ significantly, for example when a human gene is expressed in E. coli. Therefore, to enable or enhance efficient gene expression it is of great importance to identify rare codons in any given DNA sequence and subsequently mutate these to codons which are more frequently used in the expression host.

Results

We describe an open-source web-based application, ATGme, which can in a first step identify rare and highly rare codons from most organisms, and secondly gives the user the possibility to optimize the sequence.

Conclusions

This application provides a simple user-friendly interface utilizing three optimization strategies: 1. one-click optimization, 2. bulk optimization (by codon-type), 3. individualized custom (codon-by-codon) optimization. ATGme is an open-source application which is freely available at: http://atgme.org  相似文献   

8.

Background

The selection of variable sites for inclusion in genomic analyses can influence results, especially when exemplar populations are used to determine polymorphic sites. We tested the impact of ascertainment bias on the inference of population genetic parameters using empirical and simulated data representing the three major continental groups of cattle: European, African, and Indian. We simulated data under three demographic models. Each simulated data set was subjected to three ascertainment schemes: (I) random selection; (II) geographically biased selection; and (III) selection biased toward loci polymorphic in multiple groups. Empirical data comprised samples of 25 individuals representing each continental group. These cattle were genotyped for 47,506 loci from the bovine 50 K SNP panel. We compared the inference of population histories for the empirical and simulated data sets across different ascertainment conditions using FST and principal components analysis (PCA).

Results

Bias toward shared polymorphism across continental groups is apparent in the empirical SNP data. Bias toward uneven levels of within-group polymorphism decreases estimates of FST between groups. Subpopulation-biased selection of SNPs changes the weighting of principal component axes and can affect inferences about proportions of admixture and population histories using PCA. PCA-based inferences of population relationships are largely congruent across types of ascertainment bias, even when ascertainment bias is strong.

Conclusions

Analyses of ascertainment bias in genomic data have largely been conducted on human data. As genomic analyses are being applied to non-model organisms, and across taxa with deeper divergences, care must be taken to consider the potential for bias in ascertainment of variation to affect inferences. Estimates of FST, time of separation, and population divergence as estimated by principal components analysis can be misleading if this bias is not taken into account.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1469-5) contains supplementary material, which is available to authorized users.  相似文献   

9.
10.

Background

Single copy genes are common across angiosperm genomes. With the sufficiently high quality sequenced genomes, the identification of large-scale single copy genes among multiple species is possible. Although some characteristics have been reported, our study provides novel insights into single copy genes.

Results

We identified single copy genes across 29 angiosperm genomes. A significant negative correlation was found between the number of duplicate blocks and the number of single copy genes. We found that a considerable number of single copy genes are located in organelles, showing a preference for binding and catalytic activity. The analysis of effective number of codons (Nc) illustrates that single copy genes have a stronger codon bias than non-single copy genes in eudicots. The relative high expression level of single copy genes was partially confirmed by the RNA-seq data, rather than the Codon Adaptation Index (CAI). Unlike in most other species, a strongly negatively correlation occurs between Nc and GC3 among single copy genes in grass genomes. When compared to all non-single copy genes, single copy genes indicate more conservation (as indicated by Ka and Ks values). But our alternative splicing (AS) results reveal that selective constraints are weaker in single copy genes than in low copy family genes (1–10 in-paralogs) and stronger than high copy family genes (>10 in-paralogs). Using concatenated shared single copy genes, we obtained a well-resolved phylogenetic tree. With the addition of intron sequences, the branch support is improved, but striking incongruences are also evident. Therefore, it is noteworthy that inclusion of intron sequences seems more appropriate for the phylogenetic reconstruction at lower taxonomic levels.

Conclusions

Our analysis provides insight into the evolutionary characteristics of single copy genes across 29 angiosperm genomes. The results suggest that there are key differences in evolutionary constraints between single copy genes and non-single copy genes. And to some extent, these evolutionary constraints show some species-specific differences, especially between eudicots and monocots. Our preliminary evidence also suggests that the concatenated shared single copy genes are well suited for use in resolving phylogenetic relationships.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-504) contains supplementary material, which is available to authorized users.  相似文献   

11.
12.
13.

Background

Astaxanthin is a potent antioxidant with increasing biotechnological interest. In Xanthophyllomyces dendrorhous, a natural source of this pigment, carotenogenesis is a complex process regulated through several mechanisms, including the carbon source. X. dendrorhous produces more astaxanthin when grown on a non-fermentable carbon source, while decreased astaxanthin production is observed in the presence of high glucose concentrations. In the present study, we used a comparative proteomic and metabolomic analysis to characterize the yeast response when cultured in minimal medium supplemented with glucose (fermentable) or succinate (non-fermentable).

Results

A total of 329 proteins were identified from the proteomic profiles, and most of these proteins were associated with carotenogenesis, lipid and carbohydrate metabolism, and redox and stress responses. The metabolite profiles revealed 92 metabolites primarily associated with glycolysis, the tricarboxylic acid cycle, amino acids, organic acids, sugars and phosphates. We determined the abundance of proteins and metabolites of the central pathways of yeast metabolism and examined the influence of these molecules on carotenogenesis.Similar to previous proteomic-stress response studies, we observed modulation of abundance from several redox, stress response, carbohydrate and lipid enzymes. Additionally, the accumulation of trehalose, absence of key ROS response enzymes, an increased abundance of the metabolites of the pentose phosphate pathway and tricarboxylic acid cycle suggested an association between the accumulation of astaxanthin and oxidative stress in the yeast. Moreover, we observed the increased abundance of late carotenogenesis enzymes during astaxanthin accumulation under succinate growth conditions.

Conclusions

The use of succinate as a carbon source in X. dendrorhous cultures increases the availability of acetyl-CoA for the astaxanthin production compared with glucose, likely reflecting the positive regulation of metabolic enzymes of the tricarboxylic acid and glyoxylate cycles. The high metabolite level generated in this pathway could increase the cellular respiration rate, producing reactive oxygen species, which induces carotenogenesis.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1484-6) contains supplementary material, which is available to authorized users.  相似文献   

14.

Background

The application of phages is a promising tool to reduce the number of Campylobacter along the food chain. Besides the efficacy against a broad range of strains, phages have to be safe in terms of their genomes. Thus far, no genes with pathogenic potential (e.g., genes encoding virulence factors) have been detected in Campylobacter phages. However, preliminary studies suggested that the genomes of group II phages may be diverse and prone to genomic rearrangements.

Results

We determined and analysed the genomic sequence (182,761 bp) of group II phage CP21 that is closely related to the already characterized group II phages CP220 and CPt10. The genomes of these phages are comprised of four modules separated by very similar repeat regions, some of which harbouring open reading frames (ORFs). Though, the arrangement of the modules and the location of some ORFs on the genomes are different in CP21 and in CP220/CPt10. In this work, a PCR system was established to study the modular genome organization of other group II phages demonstrating that they belong to different subgroups of the CP220-like virus genus, the prototypes of which are CP21 and CP220. The subgroups revealed different restriction patterns and, interestingly enough, also distinct host specificities, tail fiber proteins and tRNA genes. We additionally analysed the genome of group II phage vB_CcoM-IBB_35 (IBB_35) for which to date only five individual contigs could be determined. We show that the contigs represent modules linked by long repeat regions enclosing some yet not identified ORFs (e.g., for a head completion protein). The data suggest that IBB_35 is a member of the CP220 subgroup.

Conclusion

Campylobacter group II phages are diverse regarding their genome organization. Since all hitherto characterized group II phages contain numerous genes for transposases and homing endonucleases as well as similar repeat regions, it cannot be excluded that these phages are genetically unstable. To answer this question, further experiments and sequencing of more group II phages should be performed.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1837-1) contains supplementary material, which is available to authorized users.  相似文献   

15.

Background

Although mitochondrial (mt) gene order is highly conserved among vertebrates, widespread gene rearrangements occur in anurans, especially in neobatrachians. Protein coding genes in the mitogenome experience adaptive or purifying selection, yet the role that selection plays on genomic reorganization remains unclear. We sequence the mitogenomes of three species of Glandirana and hot spots of gene rearrangements of 20 frog species to investigate the diversity of mitogenomic reorganization in the Neobatrachia. By combing these data with other mitogenomes in GenBank, we evaluate if selective pressures or functional constraints act on mitogenomic reorganization in the Neobatrachia. We also look for correlations between tRNA positions and codon usage.

Results

Gene organization in Glandirana was typical of neobatrachian mitogenomes except for the presence of pseudogene trnS (AGY). Surveyed ranids largely exhibited gene arrangements typical of neobatrachian mtDNA although some gene rearrangements occurred. The correlation between codon usage and tRNA positions in neobatrachians was weak, and did not increase after identifying recurrent rearrangements as revealed by basal neobatrachians. Codon usage and tRNA positions were not significantly correlated when considering tRNA gene duplications or losses. Change in number of tRNA gene copies, which was driven by genomic reorganization, did not influence codon usage bias. Nucleotide substitution rates and dN/dS ratios were higher in neobatrachian mitogenomes than in archaeobatrachians, but the rates of mitogenomic reorganization and mt nucleotide diversity were not significantly correlated.

Conclusions

No evidence suggests that adaptive selection drove the reorganization of neobatrachian mitogenomes. In contrast, protein-coding genes that function in metabolism showed evidence for purifying selection, and some functional constraints appear to act on the organization of rRNA and tRNA genes. As important nonadaptive forces, genetic drift and mutation pressure may drive the fixation and evolution of mitogenomic reorganizations.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-691) contains supplementary material, which is available to authorized users.  相似文献   

16.

Background

Despite having predominately deleterious fitness effects, transposable elements (TEs) are major constituents of eukaryote genomes in general and of plant genomes in particular. Although the proportion of the genome made up of TEs varies at least four-fold across plants, the relative importance of the evolutionary forces shaping variation in TE abundance and distributions across taxa remains unclear. Under several theoretical models, mating system plays an important role in governing the evolutionary dynamics of TEs. Here, we use the recently sequenced Capsella rubella reference genome and short-read whole genome sequencing of multiple individuals to quantify abundance, genome distributions, and population frequencies of TEs in three recently diverged species of differing mating system, two self-compatible species (C. rubella and C. orientalis) and their self-incompatible outcrossing relative, C. grandiflora.

Results

We detect different dynamics of TE evolution in our two self-compatible species; C. rubella shows a small increase in transposon copy number, while C. orientalis shows a substantial decrease relative to C. grandiflora. The direction of this change in copy number is genome wide and consistent across transposon classes. For insertions near genes, however, we detect the highest abundances in C. grandiflora. Finally, we also find differences in the population frequency distributions across the three species.

Conclusion

Overall, our results suggest that the evolution of selfing may have different effects on TE evolution on a short and on a long timescale. Moreover, cross-species comparisons of transposon abundance are sensitive to reference genome bias, and efforts to control for this bias are key when making comparisons across species.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-602) contains supplementary material, which is available to authorized users.  相似文献   

17.

Background

Previous genome-wide association analyses identified QTL regions in the X chromosome for percentage of normal sperm and scrotal circumference in Brahman and Tropical Composite cattle. These traits are important to be studied because they are indicators of male fertility and are correlated with female sexual precocity and reproductive longevity. The aim was to investigate candidate genes in these regions and to identify putative causative mutations that influence these traits. In addition, we tested the identified mutations for female fertility and growth traits.

Results

Using a combination of bioinformatics and molecular assay technology, twelve non-synonymous SNPs in eleven genes were genotyped in a cattle population. Three and nine SNPs explained more than 1% of the additive genetic variance for percentage of normal sperm and scrotal circumference, respectively. The SNPs that had a major influence in percentage of normal sperm were mapped to LOC100138021 and TAF7L genes; and in TEX11 and AR genes for scrotal circumference. One SNP in TEX11 was explained ~13% of the additive genetic variance for scrotal circumference at 12 months. The tested SNP were also associated with weight measurements, but not with female fertility traits.

Conclusions

The strong association of SNPs located in X chromosome genes with male fertility traits validates the QTL. The implicated genes became good candidates to be used for genetic evaluation, without detrimentally influencing female fertility traits.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1595-0) contains supplementary material, which is available to authorized users.  相似文献   

18.
Palidwor GA  Perkins TJ  Xia X 《PloS one》2010,5(10):e13431

Background

In spite of extensive research on the effect of mutation and selection on codon usage, a general model of codon usage bias due to mutational bias has been lacking. Because most amino acids allow synonymous GC content changing substitutions in the third codon position, the overall GC bias of a genome or genomic region is highly correlated with GC3, a measure of third position GC content. For individual amino acids as well, G/C ending codons usage generally increases with increasing GC bias and decreases with increasing AT bias. Arginine and leucine, amino acids that allow GC-changing synonymous substitutions in the first and third codon positions, have codons which may be expected to show different usage patterns.

Principal Findings

In analyzing codon usage bias in hundreds of prokaryotic and plant genomes and in human genes, we find that two G-ending codons, AGG (arginine) and TTG (leucine), unlike all other G/C-ending codons, show overall usage that decreases with increasing GC bias, contrary to the usual expectation that G/C-ending codon usage should increase with increasing genomic GC bias. Moreover, the usage of some codons appears nonlinear, even nonmonotone, as a function of GC bias. To explain these observations, we propose a continuous-time Markov chain model of GC-biased synonymous substitution. This model correctly predicts the qualitative usage patterns of all codons, including nonlinear codon usage in isoleucine, arginine and leucine. The model accounts for 72%, 64% and 52% of the observed variability of codon usage in prokaryotes, plants and human respectively. When codons are grouped based on common GC content, 87%, 80% and 68% of the variation in usage is explained for prokaryotes, plants and human respectively.

Conclusions

The model clarifies the sometimes-counterintuitive effects that GC mutational bias can have on codon usage, quantifies the influence of GC mutational bias and provides a natural null model relative to which other influences on codon bias may be measured.  相似文献   

19.
20.

Background

Next-generation sequencing technologies are rapidly generating whole-genome datasets for an increasing number of organisms. However, phylogenetic reconstruction of genomic data remains difficult because de novo assembly for non-model genomes and multi-genome alignment are challenging.

Results

To greatly simplify the analysis, we present an Assembly and Alignment-Free (AAF) method (https://sourceforge.net/projects/aaf-phylogeny) that constructs phylogenies directly from unassembled genome sequence data, bypassing both genome assembly and alignment. Using mathematical calculations, models of sequence evolution, and simulated sequencing of published genomes, we address both evolutionary and sampling issues caused by direct reconstruction, including homoplasy, sequencing errors, and incomplete sequencing coverage. From these results, we calculate the statistical properties of the pairwise distances between genomes, allowing us to optimize parameter selection and perform bootstrapping. As a test case with real data, we successfully reconstructed the phylogeny of 12 mammals using raw sequencing reads. We also applied AAF to 21 tropical tree genome datasets with low coverage to demonstrate its effectiveness on non-model organisms.

Conclusion

Our AAF method opens up phylogenomics for species without an appropriate reference genome or high sequence coverage, and rapidly creates a phylogenetic framework for further analysis of genome structure and diversity among non-model organisms.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1647-5) contains supplementary material, which is available to authorized users.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号