首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.

Background  

Backtranslation is the process of decoding a sequence of amino acids into the corresponding codons. All synthetic gene design systems include a backtranslation module. The degeneracy of the genetic code makes backtranslation potentially ambiguous since most amino acids are encoded by multiple codons. The common approach to overcome this difficulty is based on imitation of codon usage within the target species.  相似文献   

2.

Background  

Accurate amino acid insertion during peptide elongation requires tRNAs loaded by cognate amino acids and that anticodons match codons. However, tRNA misloading does not necessarily cause misinsertions: misinsertion is avoided when anticodons mismatch codons coding for misloaded amino acids.  相似文献   

3.
A direct comparison of experimentally determined protein structures and their corresponding protein coding mRNA sequences has been performed. We examine whether real world data support the hypothesis that clusters of rare codons correlate with the location of structural units in the resulting protein. The degeneracy of the genetic code allows for a biased selection of codons which may control the translational rate of the ribosome, and may thus in vivo have a catalyzing effect on the folding of the polypeptide chain. A complete search for GenBank nucleotide sequences coding for structural entries in the Brookhaven Protein Data Bank produced 719 protein chains with matching mRNA sequence, amino acid sequence, and secondary structure assignment. By neural network analysis, we found strong signals in mRNA sequence regions surrounding helices and sheets. These signals do not originate from the clustering of rare codons, but from the similarity of codons coding for very abundant amino acid residues at the N- and C-termini of helices and sheets. No correlation between the positioning of rare codons and the location of structural units was found. The mRNA signals were also compared with conserved nucleotide features of 16S-like ribosomal RNA sequences and related to mechanisms for maintaining the correct reading frame by the ribosome. © 1996 Wiley-Liss, Inc.  相似文献   

4.
Shao ZQ  Zhang YM  Feng XY  Wang B  Chen JQ 《PloS one》2012,7(3):e33547

Background

In yeast coding sequences, once a particular codon has been used, subsequent occurrence of the same amino acid tends to use codons sharing the same tRNA. Such a phenomenon of co-tRNA codons pairing bias (CTCPB) is also found in some other eukaryotes but it is not known whether it occurs in prokaryotes.

Methodology/Principal Findings

In this study, we focused on a total of 773 bacterial genomes to investigate their synonymous codon pairing preferences. After calculating the actual frequencies of synonymous codon pairs and comparing them with their expected values, we detected an obvious pairing bias towards identical codon pairs. This seems consistent with the previously reported CTCPB phenomenon, since identical codons are certainly read by the same tRNA. However, among co-tRNA but non-identical codon pairs, only 22 were often found overrepresented, suggesting that many co-tRNA codons actually do not preferentially pair together in prokaryotes. Therefore, the previously reported co-tRNA codons pairing rule needs to be more rigorously defined. The affinity differences between a tRNA anticodon and its readable codons should be taken into account. Moreover, both within-gene-shuffling tests and phylogenetic analyses support the idea that translational selection played an important role in shaping the observed synonymous codon pairing pattern in prokaryotes.

Conclusions

Overall, a high level of synonymous codon pairing bias was detected in 73% investigated bacterial species, suggesting the synonymous codon ordering strategy has been prevalently adopted by prokaryotes to improve their translational efficiencies. The findings in this study also provide important clues to better understand the complex dynamics of translational process.  相似文献   

5.

Background

Little is known about the role of amino acids in cellular signaling pathways, especially as it pertains to pathways that regulate the rate of aging. However, it has been shown that methionine or tryptophan restriction extends lifespan in higher eukaryotes and increased proline or tryptophan levels increase longevity in C. elegans. In addition, leucine strongly activates the TOR signaling pathway, which when inhibited increases lifespan.

Results

Therefore each of the 20 proteogenic amino acids was individually supplemented to C. elegans and the effects on lifespan were determined. All amino acids except phenylalanine and aspartate extended lifespan at least to a small extent at one or more of the 3 concentrations tested with serine and proline showing the largest effects. 11 of the amino acids were less potent at higher doses, while 5 even decreased lifespan. Serine, proline, or histidine-mediated lifespan extension was greatly inhibited in eat-2 worms, a model of dietary restriction, in daf-16/FOXO, sir-2.1, rsks-1 (ribosomal S6 kinase), gcn-2, and aak-2 (AMPK) longevity pathway mutants, and in bec-1 autophagy-defective knockdown worms. 8 of 10 longevity-promoting amino acids tested activated a SKN-1/Nrf2 reporter strain, while serine and histidine were the only amino acids from those to activate a hypoxia-inducible factor (HIF-1) reporter strain. Thermotolerance was increased by proline or tryptophan supplementation, while tryptophan-mediated lifespan extension was independent of DAF-16/FOXO and SKN-1/Nrf2 signaling, but tryptophan and several related pyridine-containing compounds induced the mitochondrial unfolded protein response and an ER stress response. High glucose levels or mutations affecting electron transport chain (ETC) function inhibited amino acid-mediated lifespan extension suggesting that metabolism plays an important role. Providing many other cellular metabolites to C. elegans also increased longevity suggesting that anaplerosis of tricarboxylic acid (TCA) cycle substrates likely plays a role in lifespan extension.

Conclusions

Supplementation of C. elegans with 18 of the 20 individual amino acids extended lifespan, but lifespan often decreased with increasing concentration suggesting hormesis. Lifespan extension appears to be caused by altered mitochondrial TCA cycle metabolism and respiratory substrate utilization resulting in the activation of the DAF-16/FOXO and SKN-1/Nrf2 stress response pathways.

Electronic supplementary material

The online version of this article (doi:10.1186/s12863-015-0167-2) contains supplementary material, which is available to authorized users.  相似文献   

6.

Background

Cyanobacteria are well known for the production of a range of secondary metabolites. Whilst recent genome sequencing projects has led to an increase in the number of publically available cyanobacterial genomes, the secondary metabolite potential of many of these organisms remains elusive. Our study focused on the 11 publically available Subsection V cyanobacterial genomes, together with the draft genomes of Westiella intricata UH strain HT-29-1 and Hapalosiphon welwitschii UH strain IC-52-3, for their genetic potential to produce secondary metabolites. The Subsection V cyanobacterial genomes analysed in this study are reported to produce a diverse range of natural products, including the hapalindole-family of compounds, microcystin, hapalosin, mycosporine-like amino acids and hydrocarbons.

Results

A putative gene cluster for the cyclic depsipeptide hapalosin, known to reverse P-glycoprotein multiple drug resistance, was identified within three Subsection V cyanobacterial genomes, including the producing cyanobacterium H. welwitschii UH strain IC-52-3. A number of orphan NRPS/PKS gene clusters and ribosomally-synthesised and post translationally-modified peptide gene clusters (including cyanobactin, microviridin and bacteriocin gene clusters) were identified. Furthermore, gene clusters encoding the biosynthesis of mycosporine-like amino acids, scytonemin, hydrocarbons and terpenes were also identified and compared.

Conclusions

Genome mining has revealed the diversity, abundance and complex nature of the secondary metabolite potential of the Subsection V cyanobacteria. This bioinformatic study has identified novel biosynthetic enzymes which have not been associated with gene clusters of known classes of natural products, suggesting that these cyanobacteria potentially produce structurally novel secondary metabolites.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1855-z) contains supplementary material, which is available to authorized users.  相似文献   

7.
8.

Background

Production of proteins as therapeutic agents, research reagents and molecular tools frequently depends on expression in heterologous hosts. Synthetic genes are increasingly used for protein production because sequence information is easier to obtain than the corresponding physical DNA. Protein-coding sequences are commonly re-designed to enhance expression, but there are no experimentally supported design principles.

Principal Findings

To identify sequence features that affect protein expression we synthesized and expressed in E. coli two sets of 40 genes encoding two commercially valuable proteins, a DNA polymerase and a single chain antibody. Genes differing only in synonymous codon usage expressed protein at levels ranging from undetectable to 30% of cellular protein. Using partial least squares regression we tested the correlation of protein production levels with parameters that have been reported to affect expression. We found that the amount of protein produced in E. coli was strongly dependent on the codons used to encode a subset of amino acids. Favorable codons were predominantly those read by tRNAs that are most highly charged during amino acid starvation, not codons that are most abundant in highly expressed E. coli proteins. Finally we confirmed the validity of our models by designing, synthesizing and testing new genes using codon biases predicted to perform well.

Conclusion

The systematic analysis of gene design parameters shown in this study has allowed us to identify codon usage within a gene as a critical determinant of achievable protein expression levels in E. coli. We propose a biochemical basis for this, as well as design algorithms to ensure high protein production from synthetic genes. Replication of this methodology should allow similar design algorithms to be empirically derived for any expression system.  相似文献   

9.
Knight RD  Freeland SJ  Landweber LF 《Genome biology》2001,2(4):research0010.1-research001013

Background  

Correlations between genome composition (in terms of GC content) and usage of particular codons and amino acids have been widely reported, but poorly explained. We show here that a simple model of processes acting at the nucleotide level explains codon usage across a large sample of species (311 bacteria, 28 archaea and 257 eukaryotes). The model quantitatively predicts responses (slope and intercept of the regression line on genome GC content) of individual codons and amino acids to genome composition.  相似文献   

10.

Background

NADPH-cytochrome P450 reductase (CPR) plays a central role in cytochrome P450 action. The genes coding for P450s are not yet fully identified in the bed bug, Cimex lectularius. Hence, we decided to clone cDNA and knockdown the expression of the gene coding for CPR which is suggested to be required for the function of all P450s to determine whether or not P450s are involved in resistance of bed bugs to insecticides.

Methodology/Principal Findings

The full length Cimex lectularius CPR (ClCPR) cDNA was isolated from a deltamethrin resistant bed bug population (CIN-1) using a combined PCR strategy. Bioinformatics and in silico modeling were employed to identify three conserved binding domains (FMN, FAD, NADP), a FAD binding motif, and the catalytic residues. The critical amino acids involved in FMN, FAD, NADP binding and their putative functions were also analyzed. No signal peptide but a membrane anchor domain with 21 amino acids which facilitates the localization of ClCPR on the endoplasmic reticulum was identified in ClCPR protein. Phylogenetic analysis showed that ClCPR is closer to the CPR from the body louse, Pediculus humanus corporis than to the CPRs from the other insect species studied. The ClCPR gene was ubiquitously expressed in all tissues tested but showed an increase in expression as immature stages develop into adults. We exploited the traumatic insemination mechanism of bed bugs to inject dsRNA and successfully knockdown the expression of the gene coding for ClCPR. Suppression of the ClCPR expression increased susceptibility to deltamethrin in resistant populations but not in the susceptible population of bed bugs.

Conclusions/Significance

These data suggest that P450-mediated metabolic detoxification may serve as one of the resistance mechanisms in bed bugs.  相似文献   

11.
Summary AGA and AGG (AGR) are arginine codons in the universal genetic code. These codons are read as serine or are used as stop codons in metazoan mitochondria. The arginine residues coded by AGR in yeast orTrypanosoma are coded by arginine CGN throughout metazoan mitochondria. AGR serine sites in metazoan mitochondria are occupied mainly in corresponding sites in yeast orTrypanosoma mitochondria by UCN serine, AGY serine, or codons for amino acids other than serine or arginine. Based on these observations, we propose the following evolutionary events. AGR codons became unassigned because of deletion of tRNA Arg (UCU) and elimination of AGR codons by conversion to CGN arginine codons. Upon acquisition by serine tRNA of pairing ability with AGR codons, some codons for amino acids other than arginine mutated to AGR, and were caputed by anticodon GCU in serine tRNA. During vertebrate mitochondrial evolution, AGR stop codons presumably were created from UAG stop by deletion of the first nucleotide U and by use of R as the third nucleotide that had existed next to the ancestral UAG stop.  相似文献   

12.

Background

Polymorphism in genes of regulating enzymes, transporters and receptors of the neurotransmitters of the central nervous system have been associated with altered behaviour, and single nucleotide polymorphisms (SNPs) represent the most frequent type of genetic variation. The serotonin and dopamine signalling systems have a central influence on different behavioural phenotypes, both of invertebrates and vertebrates, and this study was undertaken in order to explore genetic variation that may be associated with variation in behaviour.

Results

Single nucleotide polymorphisms in canine genes related to behaviour were identified by individually sequencing eight dogs (Canis familiaris) of different breeds. Eighteen genes from the dopamine and the serotonin systems were screened, revealing 34 SNPs distributed in 14 of the 18 selected genes. A total of 24,895 bp coding sequence was sequenced yielding an average frequency of one SNP per 732 bp (1/732). A total of 11 non-synonymous SNPs (nsSNPs), which may be involved in alteration of protein function, were detected. Of these 11 nsSNPs, six resulted in a substitution of amino acid residue with concomitant change in structural parameters.

Conclusion

We have identified a number of coding SNPs in behaviour-related genes, several of which change the amino acids of the proteins. Some of the canine SNPs exist in codons that are evolutionary conserved between five compared species, and predictions indicate that they may have a functional effect on the protein. The reported coding SNP frequency of the studied genes falls within the range of SNP frequencies reported earlier in the dog and other mammalian species. Novel SNPs are presented and the results show a significant genetic variation in expressed sequences in this group of genes. The results can contribute to an improved understanding of the genetics of behaviour.  相似文献   

13.

Background

The human kinome containing 478 eukaryotic protein kinases has over 100 uncharacterized kinases with unknown substrates and biological functions. The Ser/Thr kinase 35 (STK35, Clik1) is a member of the NKF 4 (New Kinase Family 4 ) in the kinome with unknown substrates and biological functions. Various high throughput studies indicate that STK35 could be involved in various human diseases such as colorectal cancer and malaria.

Methodology/Principal Findings

In this study, we found that the previously published coding sequence of the STK35 gene is incomplete. The newly identified sequence of the STK35 gene codes for a protein of 534 amino acids with a N-terminal elongation of 133 amino acids. It has been designated as STK35L (STK35 long). Since it is the first of further homologous kinases we termed it as STK35L1. The STK35L1 protein (58 kDa on SDS-PAGE), but not STK35 (44 kDa), was found to be expressed in all human cells studied (endothelial cells, HeLa, and HEK cells) and was down-regulated after silencing with specific siRNA. EGFP-STK35L1 was localized in the nucleus and the nucleolus. By combining syntenic and gene structure pattern data and homology searches, two further STK35L1 homologs, STK35L2 (previously known as PDIK1L) and STK35L3, were found. All these protein kinase homologs were conserved throughout the vertebrates. The STK35L3 gene was specifically lost during placental mammalian evolution. Using comparative genomics, we have identified orthologous sets of these three protein kinases genes and their possible ancestor gene in two sea squirt genomes.

Conclusions/Significance

We found the full-length coding sequence of the STK35 gene and termed it as STK35L1. We identified a new third STK35-like gene, STK35L3, in vertebrates and a possible ancestor gene in sea squirt genome. This study will provide a comprehensive platform to explore the role of STK35L kinases in cell functions and human diseases.  相似文献   

14.

Background

Microsatellites have been used extensively in the field of comparative genomics. By studying microsatellites in coding regions we have a simple model of how genotypic changes undergo selection as they are directly expressed in the phenotype as altered proteins. The simplest of these tandem repeats in coding regions are the tri-nucleotide repeats which produce a repeat of a single amino acid when translated into proteins. Tri-nucleotide repeats are often disease associated, and are also known to be unstable to both expansion and contraction. This makes them sensitive markers for studying proteome evolution, in closely related species.

Results

The evolutionary history of the family of malarial causing parasites Plasmodia is complex because of the life-cycle of the organism, where it interacts with a number of different hosts and goes through a series of tissue specific stages. This study shows that the divergence between the primate and rodent malarial parasites has resulted in a lineage specific change in the simple amino acid repeat distribution that is correlated to A–T content. The paper also shows that this altered use of amino acids in SAARs is consistent with the repeat distributions being under selective pressure.

Conclusions

The study shows that simple amino acid repeat distributions can be used to group related species and to examine their phylogenetic relationships. This study also shows that an outgroup species with a similar A–T content can be distinguished based only on the amino acid usage in repeats, and suggest that this might be a useful feature for proteome clustering. The lineage specific use of amino acids in repeat regions suggests that comparative studies of SAAR distributions between proteomes gives an insight into the mechanisms of expansion and the selective pressures acting on the organism.  相似文献   

15.
16.

Background

GBV-C infection is associated with prolonged survival in HIV-infected people and GBV-C inhibits HIV replication in co-infection models. Expression of the GBV-C nonstructural phosphoprotein 5A (NS5A) decreases surface levels of the HIV co-receptor CXCR4, induces the release of SDF-1 and inhibits HIV replication in Jurkat CD4+ T cell lines.

Methodology/Principal Findings

Jurkat cell lines stably expressing NS5A protein and peptides were generated and HIV replication in these cell lines assessed. HIV replication was significantly inhibited in all cell lines expressing NS5A amino acids 152–165. Substitution of an either alanine or glycine for the serine at position 158 (S158A or S158G) resulted in a significant decrease in the HIV inhibitory effect. In contrast, substituting a phosphomimetic amino acid (glutamic acid; S158E) inhibited HIV as well as the parent peptide. HIV inhibition was associated with lower levels of surface expression of the HIV co-receptor CXCR4 and increased release of the CXCR4 ligand, SDF-1 compared to control cells. Incubation of CD4+ T cell lines with synthetic peptides containing amino acids 152–167 or the S158E mutant peptide prior to HIV infection resulted in HIV replication inhibition compared to control peptides.

Conclusions/Significance

Expression of GBV-C NS5A amino acids 152–165 are sufficient to inhibit HIV replication in vitro, and the serine at position 158 appears important for this effect through either phosphorylation or structural changes in this peptide. The addition of synthetic peptides containing 152–167 or the S158E substitution to Jurkat cells resulted in HIV replication inhibition in vitro. These data suggest that GBV-C peptides or a peptide mimetic may offer a novel, cellular-based approach to antiretroviral therapy.  相似文献   

17.

Background

KaiC, a central clock protein in cyanobacteria, undergoes circadian oscillations between hypophosphorylated and hyperphosphorylated forms in vivo and in vitro. Structural analyses of KaiC crystals have identified threonine and serine residues in KaiC at three residues (T426, S431, and T432) as potential sites at which KaiC is phosphorylated; mutation of any of these three sites to alanine abolishes rhythmicity, revealing an essential clock role for each residue separately and for KaiC phosphorylation in general. Mass spectrometry studies confirmed that the S431 and T432 residues are key phosphorylation sites, however, the role of the threonine residue at position 426 was not clear from the mass spectrometry measurements.

Methodology and Principal Findings

Mutational approaches and biochemical analyses of KaiC support a key role for T426 in control of the KaiC phosphorylation status in vivo and in vitro and demonstrates that alternative amino acids at residue 426 dramatically affect KaiC''s properties in vivo and in vitro, especially genetic dominance/recessive relationships, KaiC dephosphorylation, and the formation of complexes of KaiC with KaiA and KaiB. These mutations alter key circadian properties, including period, amplitude, robustness, and temperature compensation. Crystallographic analyses indicate that the T426 site is phosphorylatible under some conditions, and in vitro phosphorylation assays of KaiC demonstrate labile phosphorylation of KaiC when the primary S431 and T432 sites are blocked.

Conclusions and Significance

T426 is a crucial site that regulates KaiC phosphorylation status in vivo and in vitro and these studies underscore the importance of KaiC phosphorylation status in the essential cyanobacterial circadian functions. The regulatory roles of these phosphorylation sites–including T426–within KaiC enhance our understanding of the molecular mechanism underlying circadian rhythm generation in cyanobacteria.  相似文献   

18.
Palidwor GA  Perkins TJ  Xia X 《PloS one》2010,5(10):e13431

Background

In spite of extensive research on the effect of mutation and selection on codon usage, a general model of codon usage bias due to mutational bias has been lacking. Because most amino acids allow synonymous GC content changing substitutions in the third codon position, the overall GC bias of a genome or genomic region is highly correlated with GC3, a measure of third position GC content. For individual amino acids as well, G/C ending codons usage generally increases with increasing GC bias and decreases with increasing AT bias. Arginine and leucine, amino acids that allow GC-changing synonymous substitutions in the first and third codon positions, have codons which may be expected to show different usage patterns.

Principal Findings

In analyzing codon usage bias in hundreds of prokaryotic and plant genomes and in human genes, we find that two G-ending codons, AGG (arginine) and TTG (leucine), unlike all other G/C-ending codons, show overall usage that decreases with increasing GC bias, contrary to the usual expectation that G/C-ending codon usage should increase with increasing genomic GC bias. Moreover, the usage of some codons appears nonlinear, even nonmonotone, as a function of GC bias. To explain these observations, we propose a continuous-time Markov chain model of GC-biased synonymous substitution. This model correctly predicts the qualitative usage patterns of all codons, including nonlinear codon usage in isoleucine, arginine and leucine. The model accounts for 72%, 64% and 52% of the observed variability of codon usage in prokaryotes, plants and human respectively. When codons are grouped based on common GC content, 87%, 80% and 68% of the variation in usage is explained for prokaryotes, plants and human respectively.

Conclusions

The model clarifies the sometimes-counterintuitive effects that GC mutational bias can have on codon usage, quantifies the influence of GC mutational bias and provides a natural null model relative to which other influences on codon bias may be measured.  相似文献   

19.

Background

Cysticercosis and hydatidosis seriously affect human health and are responsible for considerable economic loss in animal husbandry in non-developed and developed countries. S3Pvac and EG95 are the only field trial-tested vaccine candidates against cysticercosis and hydatidosis, respectively. S3Pvac is composed of three peptides (KETc1, GK1 and KETc12), originally identified in a Taenia crassiceps cDNA library. S3Pvac synthetically and recombinantly expressed is effective against experimentally and naturally acquired cysticercosis.

Methodology/Principal Findings

In this study, the homologous sequences of two of the S3Pvac peptides, GK1 and KETc1, were identified and further characterized in Taenia crassiceps WFU, Taenia solium, Taenia saginata, Echinococcus granulosus and Echinococcus multilocularis. Comparisons of the nucleotide and amino acid sequences coding for KETc1 and GK1 revealed significant homologies in these species. The predicted secondary structure of GK1 is almost identical between the species, while some differences were observed in the C terminal region of KETc1 according to 3D modeling. A KETc1 variant with a deletion of three C-terminal amino acids protected to the same extent against experimental murine cysticercosis as the entire peptide. On the contrary, immunization with the truncated GK1 failed to induce protection. Immunolocalization studies revealed the non stage-specificity of the two S3Pvac epitopes and their persistence in the larval tegument of all species and in Taenia adult tapeworms.

Conclusions/Significance

These results indicate that GK1 and KETc1 may be considered candidates to be included in the formulation of a multivalent and multistage vaccine against these cestodiases because of their enhancing effects on other available vaccine candidates.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号