期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Regulation of the nuclear genes encoding the cytoplasmic and mitochondrial leucyl-tRNA synthetases of Neurospora crassa. 总被引：7，自引：3，他引：4

下载免费PDF全文

C M Chow U L Rajbhandary 《Molecular and cellular biology》1989,9(11):4645-4652

相似文献

2.

Reduced Dosage of Genes Encoding Ribosomal Protein S18 Suppresses a Mitochondrial Initiation Codon Mutation in Saccharomyces Cerevisiae 总被引：4，自引：0，他引：4

下载免费PDF全文

L. S. Folley T. D. Fox 《Genetics》1994,137(2):369-379

A yeast mitochondrial translation initiation codon mutation affecting the gene for cytochrome oxidase subunit III (COX3) was partially suppressed by a spontaneous nuclear mutation. The suppressor mutation also caused cold-sensitive fermentative growth on glucose medium. Suppression and cold sensitivity resulted from inactivation of the gene product of RPS18A, one of two unlinked genes that code the essential cytoplasmic small subunit ribosomal protein termed S18 in yeast. The two S18 genes differ only by 21 silent substitutions in their exons; both are interrupted by a single intron after the 15th codon. Yeast S18 is homologous to the human S11 (70% identical) and the Escherichia coli S17 (35% identical) ribosomal proteins. This highly conserved family of ribosomal proteins has been implicated in maintenance of translational accuracy and is essential for assembly of the small ribosomal subunit. Characterization of the original rps18a-1 missense mutant and rps18aΔ and rps18bΔ null mutants revealed that levels of suppression, cold sensitivity and paromomycin sensitivity all varied directly with a limitation of small ribosomal subunits. The rps18a-1 mutant was most affected, followed by rps18aΔ then rps18bΔ. Mitochondrial mutations that decreased COX3 expression without altering the initiation codon were not suppressed. This allele specificity implicates mitochondrial translation in the mechanism of suppression. We could not detect an epitope-tagged variant of S18 in mitochondria. Thus, it appears that suppression of the mitochondrial translation initiation defect is caused indirectly by reduced levels of cytoplasmic small ribosomal subunits, leading to changes in either cytoplasmic translational accuracy or the relative levels of cytoplasmic translation products. 相似文献

3.

Structure of the yeast isoleucyl-tRNA synthetase gene (ILS1). DNA-sequence, amino-acid sequence of proteolytic peptides of the enzyme and comparison of the structure to those of other known aminoacyl-tRNA synthetases 总被引：2，自引：0，他引：2

U Englisch S Englisch P Markmeyer J Schischkoff H Sternbach H Kratzin F Cramer 《Biological chemistry Hoppe-Seyler》1987,368(8):971-979

The ILS1 gene encoding for cytoplasmic isoleucyl-tRNA synthetase from Saccharomyces cerevisiae was subcloned from a 5.4-kb insert of the shuttle vector YEp13 to M13mp8 and M13mp9. Nucleotide sequence analysis of a 4.3-kb BamHI-HpaI fragment revealed a single open reading frame from which we deduced the amino-acid sequence of the enzyme. Independently obtained amino-acid sequence information from ten tryptic peptides of the purified enzyme confirmed the gene-derived structure. The enzyme is comprised of 1073 amino-acids consistent with earlier determinations of its molecular mass. The codon usage of ILS1 is typical of abundant yeast proteins. A significant homology to E. coli isoleucyl- and valyl-tRNA synthetases as well as to yeast valyl-tRNA synthetase was detected. The characteristic amino-acid residues of the aminoacyl-adenylate site and of the potential binding site of the 3'-end of tRNA found in other synthetases are present in the structure. 相似文献

4.

The tRNA aminoacylation co-factor Arc1p is excluded from the nucleus by an Xpo1p-dependent mechanism

Galani K Hurt E Simos G 《FEBS letters》2005,579(5):969-975

Arc1p, a yeast tRNA-binding protein, forms a complex with the aminoacyl-tRNA synthetases, methionyl tRNA synthetase (MetRS) and glutamyl tRNA synthetase (GluRS). Although this complex localizes normally in the cytoplasm, in the absence of Arc1p the two free synthetases are also found inside the nucleus. In this work, in order to localize free Arc1 we abolished complex assembly by deleting the appended domains from both MetRS and GluRS. Surprisingly, free Arc1p remained cytoplasmic even when fitted with a strong nuclear localization signal (NLS). However, NLS-Arc1p accumulated in the nucleus when Xpo1/Crm1, the export receptor for NES-containing cargo proteins, was mutated. Thus, the cytoplasmic location of Arc1p is maintained by Xpo1p-dependent nuclear export and Arc1p could act as an adapter in the nucleocytoplasmic trafficking of tRNA and/or the tRNA-aminoacylation machinery. 相似文献

5.

Quantifying codon usage in signal peptides: Gene expression and amino acid usage explain apparent selection for inefficient codons

《生物化学与生物物理学报:生物膜》2018,1860(12):2479-2485

The Sec secretion pathway is found across all domains of life. A critical feature of Sec secreted proteins is the signal peptide, a short peptide with distinct physicochemical properties located at the N-terminus of the protein. Previous work indicates signal peptides are biased towards translationally inefficient codons, which is hypothesized to be an adaptation driven by selection to improve the efficacy and efficiency of the protein secretion mechanisms. We investigate codon usage in the signal peptides of E. coli using the Codon Adaptation Index (CAI), the tRNA Adaptation Index (tAI), and the ribosomal overhead cost formulation of the stochastic evolutionary model of protein production rates (ROC-SEMPPR). Comparisons between signal peptides and 5′-end of cytoplasmic proteins using CAI and tAI are consistent with a preference for inefficient codons in signal peptides. Simulations reveal these differences are due to amino acid usage and gene expression – we find these differences disappear when accounting for both factors. In contrast, ROC-SEMPPR, a mechanistic population genetics model capable of separating the effects of selection and mutation bias, shows codon usage bias (CUB) of the signal peptides is indistinguishable from the 5′-ends of cytoplasmic proteins. Additionally, we find CUB at the 5′-ends is weaker than later segments of the gene. Results illustrate the value in using models grounded in population genetics to interpret genetic data. We show failure to account for mutation bias and the effects of gene expression on the efficacy of selection against translation inefficiency can lead to a misinterpretation of codon usage patterns. 相似文献

6.

Codon usage in Xanthophyllomyces dendrorhous (formerly Phaffia rhodozyma) 总被引：1，自引：0，他引：1

Jan C. Verdoes Albert J.J. van Ooyen 《Biotechnology letters》2000,22(1):9-13

By sequence analysis of 96 randomly selected clones in a cDNA library of Xanthophyllomyces dendrorhous, ten novel, full-length clones encoding cytoplasmic ribosomal proteins (rp) were found. The deduced amino acid sequences showed significant homology to their counterparts from eukaryotic origin including mammals, fungi and plants. Some ribosmal protein encoding cDNAs appeared several times, but by Southern blot analysis it was shown they are encoded by a single copy gene. The nucleotide sequences of ten full length cDNAs were used to investigate the codon usage in X. dendrorhous. 相似文献

7.

Codon usage in higher plants, green algae, and cyanobacteria 总被引：3，自引：1，他引：2

Campbell WH Gowri G 《Plant physiology》1990,92(1):1-11

Codon usage is the selective and nonrandom use of synonymous codons by an organism to encode the amino acids in the genes for its proteins. During the last few years, a large number of plant genes have been cloned and sequenced, which now permits a meaningful comparison of codon usage in higher plants, algae, and cyanobacteria. For the nuclear and organellar genes of these organisms, a small set of preferred codons are used for encoding proteins. Codon usage is different for each genome type with the variation mainly occurring in choices between codons ending in cytidine (C) or guanosine (G) versus those ending in adenosine (A) or uridine (U). For organellar genomes, chloroplastic and mitochrondrial proteins are encoded mainly with codons ending in A or U. In most cyanobacteria and the nuclei of green algae, proteins are encoded preferentially with codons ending in C or G. Although only a few nuclear genes of higher plants have been sequenced, a clear distinction between Magnoliopsida (dicot) and Liliopsida (monocot) codon usage is evident. Dicot genes use a set of 44 preferred codons with a slight preference for codons ending in A or U. Monocot codon usage is more restricted with an average of 38 codons preferred, which are predominantly those ending in C or G. But two classes of genes can be recognized in monocots. One set of monocot genes uses codons similar to those in dicots, while the other genes are highly biased toward codons ending in C or G with a pattern similar to nuclear genes of green algae. Codon usage is discussed in relation to evolution of plants and prospects for intergenic transfer of particular genes. 相似文献

8.

Codon usages in different gene classes of the Escherichia coli genome 总被引：3，自引：0，他引：3

Samuel Karlin Jan Mrázek & Allan M. Campbell 《Molecular microbiology》1998,29(6):1341-1355

A new measure for assessing codon bias of one group of genes with respect to a second group of genes is introduced. In this formulation, codon bias correlations for Escherichia coli genes are evaluated for level of expression, for contrasts along genes, for genes in different 200 kb (or longer) contigs around the genome, for effects of gene size, for variation over different function classes, for codon bias in relation to possible lateral transfer and for dicodon bias for some gene classes. Among the function classes, codon biases of ribosomal proteins are the most deviant from the codon frequencies of the average E. coli gene. Other classes of ‘highly expressed genes’ (e.g. amino acyl tRNA synthetases, chaperonins, modification genes essential to translation activities) show less extreme codon biases. Consistently for genes with experimentally determined expression rates in the exponential growth phase, those of highest molar abundances are more deviant from the average gene codon frequencies and are more similar in codon frequencies to the average ribosomal protein gene. Independent of gene size, the codon biases in the 5′ third of genes deviate by more than a factor of two from those in the middle and 3′ thirds. In this context, there appear to be conflicting selection pressures imposed by the constraints of ribosomal binding, or more generally the early phase of protein synthesis (about the first 50 codons) may be more biased than the complete nascent polypeptide. In partitioning the E. coli genome into 10 equal lengths, pronounced differences in codon site 3 G+C frequencies accumulate. Genes near to oriC have 5% greater codon site 3 G+C frequencies than do genes from the ter region. This difference also is observed between small (100–300 codons) and large (>800 codons) genes. This result contrasts with that for eukaryotic genomes (including human, Caenorhabditis elegans and yeast) where long genes tend to have site 3 more AT rich than short genes. Many of the above results are special for E. coli genes and do not apply to genes of most bacterial genomes. A gene is defined as alien (possibly horizontally transferred) if its codon bias relative to the average gene exceeds a high threshold and the codon bias relative to ribosomal proteins is also appropriately high. These are identified, including four clusters (operons). The bulk of these genes have no known function. 相似文献

9.

Translational Selection Is Ubiquitous in Prokaryotes

Fran Supek Nives ?kunca Jelena Repar Kristian Vlahovi?ek Tomislav ?muc 《PLoS genetics》2010,6(6)

Codon usage bias in prokaryotic genomes is largely a consequence of background substitution patterns in DNA, but highly expressed genes may show a preference towards codons that enable more efficient and/or accurate translation. We introduce a novel approach based on supervised machine learning that detects effects of translational selection on genes, while controlling for local variation in nucleotide substitution patterns represented as sequence composition of intergenic DNA. A cornerstone of our method is a Random Forest classifier that outperformed previous distance measure-based approaches, such as the codon adaptation index, in the task of discerning the (highly expressed) ribosomal protein genes by their codon frequencies. Unlike previous reports, we show evidence that translational selection in prokaryotes is practically universal: in 460 of 461 examined microbial genomes, we find that a subset of genes shows a higher codon usage similarity to the ribosomal proteins than would be expected from the local sequence composition. These genes constitute a substantial part of the genome—between 5% and 33%, depending on genome size—while also exhibiting higher experimentally measured mRNA abundances and tending toward codons that match tRNA anticodons by canonical base pairing. Certain gene functional categories are generally enriched with, or depleted of codon-optimized genes, the trends of enrichment/depletion being conserved between Archaea and Bacteria. Prominent exceptions from these trends might indicate genes with alternative physiological roles; we speculate on specific examples related to detoxication of oxygen radicals and ammonia and to possible misannotations of asparaginyl–tRNA synthetases. Since the presence of codon optimizations on genes is a valid proxy for expression levels in fully sequenced genomes, we provide an example of an “adaptome” by highlighting gene functions with expression levels elevated specifically in thermophilic Bacteria and Archaea. 相似文献

10.

1 New biology from ancient enzymes

Paul Schimmel 《Journal of biomolecular structure & dynamics》2013,31(1)

The aminoacyl tRNA synthetases arose early in evolution to establish the genetic code during translation. Long thought of as cytoplasmic enzymes with a single defined function, new studies have demonstrated their roles in nuclear and extracellular signaling pathways, where they regulate angiogenesis, inflammation, mTor signaling, tumorigenesis, and more. These novel functions are typically associated with novel domains added to higher eukaryote tRNA synthetases, and specific resected forms that are generated by alternative splicing and natural proteolysis. The tRNA synthetases are now seen as central “nodes” that use their novel domains to connect with multiple-cell signaling pathways through a variety of interacting partners. These partners include nuclear proteins, extracellular receptors, cytoplasmic proteins, and cellular RNAs. This new biology from tRNA synthetases is an endless frontier. 相似文献

11.

A comparative proteomic strategy for subcellular proteome research: ICAT approach coupled with bioinformatics prediction to ascertain rat liver mitochondrial proteins and indication of mitochondrial localization for catalase

Jiang XS Dai J Sheng QH Zhang L Xia QC Wu JR Zeng R 《Molecular & cellular proteomics : MCP》2005,4(1):12-34

Subcellular proteomics, as an important step to functional proteomics, has been a focus in proteomic research. However, the co-purification of "contaminating" proteins has been the major problem in all the subcellular proteomic research including all kinds of mitochondrial proteome research. It is often difficult to conclude whether these "contaminants" represent true endogenous partners or artificial associations induced by cell disruption or incomplete purification. To solve such a problem, we applied a high-throughput comparative proteome experimental strategy, ICAT approach performed with two-dimensional LC-MS/MS analysis, coupled with combinational usage of different bioinformatics tools, to study the proteome of rat liver mitochondria prepared with traditional centrifugation (CM) or further purified with a Nycodenz gradient (PM). A total of 169 proteins were identified and quantified convincingly in the ICAT analysis, in which 90 proteins have an ICAT ratio of PM:CM>1.0, while another 79 proteins have an ICAT ratio of PM:CM<1.0. Almost all the proteins annotated as mitochondrial according to Swiss-Prot annotation, bioinformatics prediction, and literature reports have a ratio of PM:CM>1.0, while proteins annotated as extracellular or secreted, cytoplasmic, endoplasmic reticulum, ribosomal, and so on have a ratio of PM:CM<1.0. Catalase and AP endonuclease 1, which have been known as peroxisomal and nuclear, respectively, have shown a ratio of PM:CM>1.0, confirming the reports about their mitochondrial location. Moreover, the 125 proteins with subcellular location annotation have been used as a testing dataset to evaluate the efficiency for ascertaining mitochondrial proteins by ICAT analysis and the bioinformatics tools such as PSORT, TargetP, SubLoc, MitoProt, and Predotar. The results indicated that ICAT analysis coupled with combinational usage of different bioinformatics tools could effectively ascertain mitochondrial proteins and distinguish contaminant proteins and even multilocation proteins. Using such a strategy, many novel proteins, known proteins without subcellular location annotation, and even known proteins that have been annotated as other locations have been strongly indicated for their mitochondrial location. 相似文献

12.

Strong Heterogeneity in Nucleotidic Composition and Codon Bias in the Pea Aphid (<Emphasis Type="Italic">Acyrthosiphon pisum)</Emphasis> Shown by EST-Based Coding Genome Reconstruction

Rispe C Legeai F Gauthier JP Tagu D 《Journal of molecular evolution》2007,65(4):413-424

The aim of this study was to analyze patterns of nucleotidic composition and codon usage in the pea aphid genome (Acyrthosiphon pisum). A collection of 60,000 expressed sequence tags (ESTs) in the pea aphid has been used to automatically reconstruct 5809 coding sequences (CDSs), based on similarity with known proteins and on coding style recognition. Reconstructions were manually checked for ribosomal proteins, leading to tentatively reconstruct the nea-complete set of this category. Pea aphid coding sequences showed a shift toward AT (especially at the third codon position) compared to drosophila homologues. Genes with a putative high level of expression (ribosomal and other genes with high EST support) remained more GC3-rich and had a distinct codon usage from bulk sequences: they exhibited a preference for C-ending codons and CGT (for arginine), which thus appeared optimal for translation. However, the discrimination was not as strong as in drosophila, suggesting a reduced degree of translational selection. The space of variation in codon usage for A. pisum appeared to be larger than in drosophila, with a substantial fraction of genes that remained GC3-rich. Some of those (in particular some structural proteins) also showed high levels of codon bias and a very strong preference for C-ending codons, which could be explained either by strong translational selection or by other mechanisms. Finally, genomic traces were analyzed to build 206 fragments containing a full CDS, which allowed studying the correlations between GC contents of coding and those of noncoding (flanking and introns) sequences. 相似文献

13.

Cloning and characterization of the gene encoding the highly expressed ribosomal protein l3 of the ciliated protozoan Tetrahymena thermophila. Evidence for differential codon usage in highly expressed genes

Larsen LK Andreasen PH Dreisig H Palm L Nielsen H Engberg J Kristiansen K 《Cell biology international》1999,23(8):551-560

We have cloned and characterized the cDNA and the macronuclear genomic copy of the highly conserved ribosomal protein (r-protein) L3 of Tetrahymena thermophila. The r-protein L3 is encoded by a single copy gene interrupted by one intron. The organization of the promoter region exhibits features characteristic of ribosomal protein genes in Tetrahymena. The codon usage of the L3 gene is highly biased. A thorough analysis of codon usage in Tetrahymena genes revealed that genes could be categorized into two classes according to codon usage bias. Class A comprises r-protein genes and a number of other highly expressed genes. Class B comprises weakly expressed genes such as the conjugation induced CnjB and CnjC genes, but surprisingly, this class also contains abundantly expressed genes such as the genes encoding the surface antigens SerH3 and SerH1. Codon usage is slightly more restricted in class A than in class B, but both classes exhibit distinct and different codon usage biases. Class A genes preferentially use C and U in the silent third codon positions, whereas class B genes preferentially use A and U in the silent third codon positions. The analysis suggests that two different strategies have been employed for optimization of codon usage in the A+T-rich genome of Tetrahymena. 相似文献

14.

Simultaneous horizontal gene transfer of a gene coding for ribosomal protein l27 and operational genes in Arthrobacter sp

Garcia-Vallvé S Simó FX Montero MA Arola L Romeu A 《Journal of molecular evolution》2002,55(6):632-637

Phylogenetic analysis of bacterial L27 ribosomal proteins showed that, against taxonomy, the L27 protein from the Actinobacteria Arthrobacter sp. clusters with protein sequences from the Bacillus group. The L27 gene clusters in the Arthrobacter sp. genome with six genes responsible for creatinine and sarcosine degradation. Phylogenetic analyses of orthologue proteins encoded by three of these genes also showed a phylogenetic relationship with Bacillus species. Comparisons between the synonymous codon usage of the Arthrobacter sp. genes and those from complete genomes showed that Arthrobacter genes encoding the L27 ribosomal protein and the proteins responsible for the degradation of creatinine and sarcosine have a codon usage that is more similar to that of Bacillus species than that of Arthrobacter. We suggest that the Arthrobacter sp. genes encoding the L27 ribosomal protein and the proteins responsible for the degradation of creatinine and sarcosine were acquired simultaneously through horizontal gene transfer from an unknown Bacillus species. 相似文献

15.

Factors affecting mito-nuclear codon usage interactions in the OXPHOS system of Drosophila melanogaster

Zheng Sun Liang Ma Robert W.Muohy Xiansheng Zhang Dawei Huang 《遗传学报》2008,35(12):729-735

Codon usage bias varies considerably among genomes and even within the genes of the same genome.In eukaryotic organisms,energy production in the form of oxidative phosphorylation(OXPHOS)is the only process under control of both nuclear and mitochondrial genomes.Although factors affecting codon usage in a single genome have been studied,this has not occurred when both interactional genomes are involved.Consequently, we investigated whether or not other factors influence codon usage of coevolved genes.We used Drosophila melanogaster as a model organism.Our χ2 test on the number of codons of nuclear and mitochondrial genes involved in the OXPHOS system was significantly different (χ2=7945.16,P<0.01).A plot of effective number of codons against GC3s content of nuclear genes showed that few genes lie on the expected curve,indicating that codon usage was random.Correspondence analysis indicated a significant correlation between axis 1 and codon adaptation index(R=0.947,P<0.01)in every nuclear gene sequence.Thus,codon usage bias of nuclear genes appeared to be affected by translational selection.Correlation between axis 1 coordinates and GC content(R=0.814.P<0.01)indicated that the codon usage of nuclear genes was also affected by GC composition.Analysis of mitochondrial genes did not reveal a significant correlation between axis 1 and any parameter.Statistical analyses indicated that codon usages of both nDNA and mtDNA were subjected to context-dependent mutations. 相似文献

16.

Predicted highly expressed genes of diverse prokaryotic genomes 总被引：13，自引：0，他引：13

下载免费PDF全文

Karlin S Mrázek J 《Journal of bacteriology》2000,182(18):5238-5250

相似文献

17.

Highly expressed and alien genes of the Synechocystis genome

Mrázek J Bhaya D Grossman AR Karlin S 《Nucleic acids research》2001,29(7):1590-1601

Comparisons of codon frequencies of genes to several gene classes are used to characterize highly expressed and alien genes on the SYNECHOCYSTIS: PCC6803 genome. The primary gene classes include the ensemble of all genes (average gene), ribosomal protein (RP) genes, translation processing factors (TF) and genes encoding chaperone/degradation proteins (CH). A gene is predicted highly expressed (PHX) if its codon usage is close to that of the RP/TF/CH standards but strongly deviant from the average gene. Putative alien (PA) genes are those for which codon usage is significantly different from all four classes of gene standards. In SYNECHOCYSTIS:, 380 genes were identified as PHX. The genes with the highest predicted expression levels include many that encode proteins vital for photosynthesis. Nearly all of the genes of the RP/TF/CH gene classes are PHX. The principal glycolysis enzymes, which may also function in CO(2) fixation, are PHX, while none of the genes encoding TCA cycle enzymes are PHX. The PA genes are mostly of unknown function or encode transposases. Several PA genes encode polypeptides that function in lipopolysaccharide biosynthesis. Both PHX and PA genes often form significant clusters (operons). The proteins encoded by PHX and PA genes are described with respect to functional classifications, their organization in the genome and their stoichiometry in multi-subunit complexes. 相似文献

18.

Comparative studies on codon usage pattern of chloroplasts and their host nuclear genes in four plant species 总被引：1，自引：0，他引：1

Liu Q Xue Q 《Journal of genetics》2005,84(1):55-62

A detailed comparison was made of codon usage of chloroplast genes with their host (nuclear) genes in the four angiosperm speciesOryza sativa, Zea mays, Triticum aestivum andArabidopsis thaliana. The average GC content of the entire genes, and at the three codon positions individually, was higher in nuclear than in chloroplast genes, suggesting different genomic organization and mutation pressures in nuclear and chloroplast genes. The results of Nc-plots and neutrality plots suggested that nucleotide compositional constraint had a large contribution to codon usage bias of nuclear genes inO. sativa, Z. mays, andT. aestivum, whereas natural selection was likely to be playing a large role in codon usage bias in chloroplast genomes. Correspondence analysis and chi-test showed that regardless of the genomic environment (species) of the host, the codon usage pattern of chloroplast genes differed from nuclear genes of their host species by their AU-richness. All the chloroplast genomes have predominantly A- and/or U-ending codons, whereas nuclear genomes have G-, C- or U-ending codons as their optimal codons. These findings suggest that the chloroplast genome might display particular characteristics of codon usage that are different from its host nuclear genome. However, one feature common to both chloroplast and nuclear genomes in this study was that pyrimidines were found more frequently than purines at the synonymous codon position of optimal codons. 相似文献

19.

Conserved codon composition of ribosomal protein coding genes in Escherichia coli,Mycobacterium tuberculosis and Saccharomyces cerevisiae: lessons from supervised machine learning in functional genomics

Lin K Kuang Y Joseph JS Kolatkar PR 《Nucleic acids research》2002,30(11):2599-2607

Genomics projects have resulted in a flood of sequence data. Functional annotation currently relies almost exclusively on inter-species sequence comparison and is restricted in cases of limited data from related species and widely divergent sequences with no known homologs. Here, we demonstrate that codon composition, a fusion of codon usage bias and amino acid composition signals, can accurately discriminate, in the absence of sequence homology information, cytoplasmic ribosomal protein genes from all other genes of known function in Saccharomyces cerevisiae, Escherichia coli and Mycobacterium tuberculosis using an implementation of support vector machines, SVM(light). Analysis of these codon composition signals is instructive in determining features that confer individuality to ribosomal protein genes. Each of the sets of positively charged, negatively charged and small hydrophobic residues, as well as codon bias, contribute to their distinctive codon composition profile. The representation of all these signals is sensitively detected, combined and augmented by the SVMs to perform an accurate classification. Of special mention is an obvious outlier, yeast gene RPL22B, highly homologous to RPL22A but employing very different codon usage, perhaps indicating a non-ribosomal function. Finally, we propose that codon composition be used in combination with other attributes in gene/protein classification by supervised machine learning algorithms. 相似文献

20.

Genes adopt non‐optimal codon usage to generate cell cycle‐dependent oscillations in protein levels

Tamar Danon Thomas Christian Takao Igarashi Lydia Cohen Ya‐Ming Hou Lars Juhl Jensen 《Molecular systems biology》2012,8(1)

The cell cycle is a temporal program that regulates DNA synthesis and cell division. When we compared the codon usage of cell cycle‐regulated genes with that of other genes, we discovered that there is a significant preference for non‐optimal codons. Moreover, genes encoding proteins that cycle at the protein level exhibit non‐optimal codon preferences. Remarkably, cell cycle‐regulated genes expressed in different phases display different codon preferences. Here, we show empirically that transfer RNA (tRNA) expression is indeed highest in the G2 phase of the cell cycle, consistent with the non‐optimal codon usage of genes expressed at this time, and lowest toward the end of G1, reflecting the optimal codon usage of G1 genes. Accordingly, protein levels of human glycyl‐, threonyl‐, and glutamyl‐prolyl tRNA synthetases were found to oscillate, peaking in G2/M phase. In light of our findings, we propose that non‐optimal (wobbly) matching codons influence protein synthesis during the cell cycle. We describe a new mathematical model that shows how codon usage can give rise to cell‐cycle regulation. In summary, our data indicate that cells exploit wobbling to generate cell cycle‐dependent dynamics of proteins. 相似文献