首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Over the past decade, evidence has accumulated that new protein‐coding genes can emerge de novo from previously non‐coding DNA. Most studies have focused on large scale computational predictions of de novo protein‐coding genes across a wide range of organisms. In contrast, experimental data concerning the folding and function of de novo proteins are scarce. This might be due to difficulties in handling de novo proteins in vitro, as most are short and predicted to be disordered. Here, we propose a guideline for the effective expression of eukaryotic de novo proteins in Escherichia coli. We used 11 sequences from Drosophila melanogaster and 10 from Homo sapiens, that are predicted de novo proteins from former studies, for heterologous expression. The candidate de novo proteins have varying secondary structure and disorder content. Using multiple combinations of purification tags, E. coli expression strains, and chaperone systems, we were able to increase the number of solubly expressed putative de novo proteins from 30% to 62%. Our findings indicate that the best combination for expressing putative de novo proteins in E. coli is a GST‐tag with T7 Express cells and co‐expressed chaperones. We found that, overall, proteins with higher predicted disorder were easier to express.StatementToday, we know that proteins do not only evolve by duplication and divergence of existing proteins but also arise from previously non‐coding DNA. These proteins are called de novo proteins. Their properties are still poorly understood and their experimental analysis faces major obstacles. Here, we aim to present a starting point for soluble expression of de novo proteins with the help of chaperones and thereby enable further characterization.  相似文献   

2.
An in situ nuclear magnetic resonance (NMR) bioreactor was developed and employed to monitor microbial metabolism under batch growth conditions in real time. We selected Moorella thermoacetica ATCC 49707 as a test case. M. thermoacetica (formerly Clostridium thermoaceticum) is a strictly anaerobic, thermophilic, acetogenic, gram-positive bacterium with potential for industrial production of chemicals. The metabolic profiles of M. thermoacetica were characterized during growth in batch mode on xylose (a component of lignocellulosic biomass) using the new generation NMR bioreactor in combination with high-resolution NMR (HR-NMR) spectroscopy. In situ NMR measurements were performed using water-suppressed H-1 NMR spectroscopy at 500 MHz, and aliquots of the bioreactor contents were taken for 600-MHz HR-NMR spectroscopy at specific intervals to confirm metabolite identifications and expand metabolite coverage. M. thermoacetica demonstrated the metabolic potential to produce formate, ethanol, and methanol from xylose, in addition to its known capability of producing acetic acid. Real-time monitoring of bioreactor conditions showed a temporary pH decrease, with a concomitant increase in formic acid during exponential growth. Fermentation experiments performed outside of the magnet showed that the strong magnetic field employed for NMR detection did not significantly affect cell metabolism. Use of the in situ NMR bioreactor facilitated monitoring of the fermentation process, enabling identification of intermediate and endpoint metabolites and their correlation with pH and biomass produced during culture growth. Real-time monitoring of culture metabolism using the NMR bioreactor in combination with HR-NMR spectroscopy will allow optimization of the metabolism of microorganisms producing valuable bioproducts.  相似文献   

3.
4.
To understand whether any human-specific new genes may be associated with human brain functions, we computationally screened the genetic vulnerable factors identified through Genome-Wide Association Studies and linkage analyses of nicotine addiction and found one human-specific de novo protein-coding gene, FLJ33706 (alternative gene symbol C20orf203). Cross-species analysis revealed interesting evolutionary paths of how this gene had originated from noncoding DNA sequences: insertion of repeat elements especially Alu contributed to the formation of the first coding exon and six standard splice junctions on the branch leading to humans and chimpanzees, and two subsequent substitutions in the human lineage escaped two stop codons and created an open reading frame of 194 amino acids. We experimentally verified FLJ33706''s mRNA and protein expression in the brain. Real-Time PCR in multiple tissues demonstrated that FLJ33706 was most abundantly expressed in brain. Human polymorphism data suggested that FLJ33706 encodes a protein under purifying selection. A specifically designed antibody detected its protein expression across human cortex, cerebellum and midbrain. Immunohistochemistry study in normal human brain cortex revealed the localization of FLJ33706 protein in neurons. Elevated expressions of FLJ33706 were detected in Alzheimer''s brain samples, suggesting the role of this novel gene in human-specific pathogenesis of Alzheimer''s disease. FLJ33706 provided the strongest evidence so far that human-specific de novo genes can have protein-coding potential and differential protein expression, and be involved in human brain functions.  相似文献   

5.
Novel sequences are DNA sequences present in an individual''s genome but absent in the human reference assembly. They are predicted to be biologically important, both individual and population specific, and consistent with the known human migration paths. Recent works have shown that an average person harbors 2–5 Mb of such sequences and estimated that the human pan-genome contains as high as 19–40 Mb of novel sequences. To identify them in a de novo genome assembly, some existing sequence aligners have been used but no computational method has been specifically proposed for this task. In this work, we developed NSIT (Novel Sequence Identification Tool), a software that can accurately and efficiently identify novel sequences in an individual''s de novo whole genome assembly. We identified and characterized 1.1 Mb, 1.2 Mb, and 1.0 Mb of novel sequences in NA18507 (African), YH (Asian), and NA12878 (European) de novo genome assemblies, respectively. Our results show very high concordance with the previous work using the respective reference assembly. In addition, our results using the latest human reference assembly suggest that the amount of novel sequences per individual may not be as high as previously reported. We additionally developed a graphical viewer for comparisons of novel sequence contents. The viewer also helped in identifying sequence contamination; we found 130 kb of Epstein-Barr virus sequence in the previously published NA18507 novel sequences as well as 287 kb of zebrafish repeats in NA12878 de novo assembly. NSIT requires 2GB of RAM and 1.5–2 hrs on a commodity desktop. The program is applicable to input assemblies with varying contig/scaffold sizes, ranging from 100 bp to as high as 50 Mb. It works in both 32-bit and 64-bit systems and outperforms, by large margins, other fast sequence aligners previously applied to this task. To our knowledge, NSIT is the first software designed specifically for novel sequence identification in a de novo human genome assembly.  相似文献   

6.
Constraint-based methods provide powerful computational techniques to allow understanding and prediction of cellular behavior. These methods rely on physiochemical constraints to eliminate infeasible behaviors from the space of available behaviors. One such constraint is thermodynamic feasibility, the requirement that intracellular flux distributions obey the laws of thermodynamics. The past decade has seen several constraint-based methods that interpret this constraint in different ways, including those that are limited to small networks, rely on predefined reaction directions, and/or neglect the relationship between reaction free energies and metabolite concentrations. In this work, we utilize one such approach, thermodynamics-based metabolic flux analysis (TMFA), to make genome-scale, quantitative predictions about metabolite concentrations and reaction free energies in the absence of prior knowledge of reaction directions, while accounting for uncertainties in thermodynamic estimates. We applied TMFA to a genome-scale network reconstruction of Escherichia coli and examined the effect of thermodynamic constraints on the flux space. We also assessed the predictive performance of TMFA against gene essentiality and quantitative metabolomics data, under both aerobic and anaerobic, and optimal and suboptimal growth conditions. Based on these results, we propose that TMFA is a useful tool for validating phenotypes and generating hypotheses, and that additional types of data and constraints can improve predictions of metabolite concentrations.  相似文献   

7.
De novo mutations affect risk for many diseases and disorders, especially those with early-onset. An example is autism spectrum disorders (ASD). Four recent whole-exome sequencing (WES) studies of ASD families revealed a handful of novel risk genes, based on independent de novo loss-of-function (LoF) mutations falling in the same gene, and found that de novo LoF mutations occurred at a twofold higher rate than expected by chance. However successful these studies were, they used only a small fraction of the data, excluding other types of de novo mutations and inherited rare variants. Moreover, such analyses cannot readily incorporate data from case-control studies. An important research challenge in gene discovery, therefore, is to develop statistical methods that accommodate a broader class of rare variation. We develop methods that can incorporate WES data regarding de novo mutations, inherited variants present, and variants identified within cases and controls. TADA, for Transmission And De novo Association, integrates these data by a gene-based likelihood model involving parameters for allele frequencies and gene-specific penetrances. Inference is based on a Hierarchical Bayes strategy that borrows information across all genes to infer parameters that would be difficult to estimate for individual genes. In addition to theoretical development we validated TADA using realistic simulations mimicking rare, large-effect mutations affecting risk for ASD and show it has dramatically better power than other common methods of analysis. Thus TADA''s integration of various kinds of WES data can be a highly effective means of identifying novel risk genes. Indeed, application of TADA to WES data from subjects with ASD and their families, as well as from a study of ASD subjects and controls, revealed several novel and promising ASD candidate genes with strong statistical support.  相似文献   

8.
Development of genome-scale metabolic models and various constraints-based flux analyses have enabled more sophisticated examination of metabolism. Recently reported metabolite essentiality studies are also based on the constraints-based modeling, but approaches metabolism from a metabolite-centric perspective, providing synthetic lethal combination of reactions and clues for the rational discovery of antibacterials. In this study, metabolite essentiality analysis was applied to the genome-scale metabolic models of four microorganisms: Escherichia coli, Helicobacter pylori, Mycobacterium tuberculosis and Staphylococcus aureus. Furthermore, chokepoints, metabolites surrounded by enzymes that uniquely consume and/or produce them, were also calculated based on the network properties of the above organisms. A systematic drug targeting strategy was developed by combining information from these two methods. Final drug target metabolites are presented and examined with knowledge from the literature.  相似文献   

9.
The de novo synthesis of PAL is demonstrated to occur sometime between imbibition and the end of a 4-hr white light treatment. H2OD2O transfer experiments indicate that PAL synthesis may occur during the light period whilst D2O-H2O transfer experiments indicate that synthesis of inactive PAL may occur during dark growth followed by activation by light. Neither of these observations is conclusive. De novo synthesis of PAL occurs in excised hypocotyls of gherkin and tuber discs of potato either in darkness or in light. It is concluded that there is as yet no evidence which definitively shows that light controls PAL levels by regulating the rate of de novo synthesis.  相似文献   

10.
11.
12.
As a greater number and diversity of high-quality vertebrate reference genomes become available, it is increasingly feasible to use these references to guide new draft assemblies for related species. Reference-guided assembly approaches may substantially increase the contiguity and completeness of a new genome using only low levels of genome coverage that might otherwise be insufficient for de novo genome assembly. We used low-coverage (∼3.5–5.5x) Illumina paired-end sequencing to assemble draft genomes of two bird species (the Gunnison Sage-Grouse, Centrocercus minimus, and the Clark''s Nutcracker, Nucifraga columbiana). We used these data to estimate de novo genome assemblies and reference-guided assemblies, and compared the information content and completeness of these assemblies by comparing CEGMA gene set representation, repeat element content, simple sequence repeat content, and GC isochore structure among assemblies. Our results demonstrate that even lower-coverage genome sequencing projects are capable of producing informative and useful genomic resources, particularly through the use of reference-guided assemblies.  相似文献   

13.
Synthesis of dipicolinic acid inPenicillium citreoviride showed typical kinetics of a secondary metabolite. Its synthesis resumed during idiophase and continued through stationary phase of growth. Total duration of synthesis was 100 h at the end of which its synthesis was arrested. Production of dipicolinic acid by the cells was subject to catabolite repression by glucose and was not subject to end product inhibition by exogenously added dipicolinic acid. Unlike the bacteria, dipicolinic acid synthesis in this mold was highly sensitive to inhibition by calcium ions in the growth medium. Calcium promoted sporulation but dipicolinic acid was not found to be present in detectable amounts in mold spores. Addition of dipicolinic acid and Ca2+ completely inhibited itsde novo synthesis, an effect not observed when calcium was replaced by Mg2+ When the mold was grown in the presence of calcium alone, its inhibitory effects onde novo synthesis of dipicolinic acid were expressed only after some of this metabolite was first synthesised by the producer cells suggesting that the active feedback inhibitor is probably a Ca: dipicolinic acid complex. It is suggested that over-production of this metabolite is very important to the mold in increasing its survival potential in nature by retrieving the essential minerals from the environment through ligand: metal complex at a time when cells are in the process of dying, so that a proper mineral balance is maintained within the cells  相似文献   

14.
Based on growth or nitrogen balance, amino acids (AA) had traditionally been classified as nutritionally essential (indispensable) or non-essential (dispensable) for animals and humans. Nutritionally essential AA (EAA) are defined as either those AA whose carbon skeletons cannot be synthesized de novo in animal cells or those that normally are insufficiently synthesized de novo by the animal organism relative to its needs for maintenance, growth, development, and health and which must be provided in the diet to meet requirements. In contrast, nutritionally non-essential AA (NEAA) are those AA which can be synthesized de novo in adequate amounts by the animal organism to meet requirements for maintenance, growth, development, and health and, therefore, need not be provided in the diet. Although EAA and NEAA had been described for over a century, there are no compelling data to substantiate the assumption that NEAA are synthesized sufficiently in animals and humans to meet the needs for maximal growth and optimal health. NEAA play important roles in regulating gene expression, cell signaling pathways, digestion and absorption of dietary nutrients, DNA and protein synthesis, proteolysis, metabolism of glucose and lipids, endocrine status, men and women fertility, acid–base balance, antioxidative responses, detoxification of xenobiotics and endogenous metabolites, neurotransmission, and immunity. Emerging evidence indicates dietary essentiality of “nutritionally non-essential amino acids” for animals and humans to achieve their full genetic potential for growth, development, reproduction, lactation, and resistance to metabolic and infectious diseases. This concept represents a new paradigm shift in protein nutrition to guide the feeding of mammals (including livestock), poultry, and fish.  相似文献   

15.
Malaria parasites generate vast quantities of heme during blood stage infection via hemoglobin digestion and limited de novo biosynthesis, but it remains unclear if parasites metabolize heme for utilization or disposal. Recent in vitro experiments with a heme oxygenase (HO)-like protein from Plasmodium falciparum suggested that parasites may enzymatically degrade some heme to the canonical HO product, biliverdin (BV), or its downstream metabolite, bilirubin (BR). To directly test for BV and BR production by P. falciparum parasites, we DMSO-extracted equal numbers of infected and uninfected erythrocytes and developed a sensitive LC-MS/MS assay to quantify these tetrapyrroles. We found comparable low levels of BV and BR in both samples, suggesting the absence of HO activity in parasites. We further tested live parasites by targeted expression of a fluorescent BV-binding protein within the parasite cytosol, mitochondrion, and plant-like plastid. This probe could detect exogenously added BV but gave no signal indicative of endogenous BV production within parasites. Finally, we recombinantly expressed and tested the proposed heme degrading activity of the HO-like protein, PfHO. Although PfHO bound heme and protoporphyrin IX with modest affinity, it did not catalyze heme degradation in vivo within bacteria or in vitro in UV absorbance and HPLC assays. These observations are consistent with PfHO''s lack of a heme-coordinating His residue and suggest an alternative function within parasites. We conclude that P. falciparum parasites lack a canonical HO pathway for heme degradation and thus rely fully on alternative mechanisms for heme detoxification and iron acquisition during blood stage infection.  相似文献   

16.
Recent advances in de novo protein evolution have made it possible to create synthetic proteins from unbiased libraries that fold into stable tertiary structures with predefined functions. However, it is not known whether such proteins will be functional when expressed inside living cells or how a host organism would respond to an encounter with a non-biological protein. Here, we examine the physiology and morphology of Escherichia coli cells engineered to express a synthetic ATP-binding protein evolved entirely from non-biological origins. We show that this man-made protein disrupts the normal energetic balance of the cell by altering the levels of intracellular ATP. This disruption cascades into a series of events that ultimately limit reproductive competency by inhibiting cell division. We now describe a detailed investigation into the synthetic biology of this man-made protein in a living bacterial organism, and the effect that this protein has on normal cell physiology.  相似文献   

17.
Although α-linolenic acid is nearly absent from Cyanidium caldarium cultured at 53 °C, it is the most abundant unsaturated fatty acid in 20 °C-grown cells. A sudden growth temperature shift of 55 to 25 °C does not stimulate the immediate biosynthesis of α-linolenic acid. However, after an induction period of 48 h, synthesis of α-linolenic acid from acetate can be detected, and the fatty acid accumulates in phosphatidyl choline and sulfolipid. The newly synthesized α-linolenic acid appears to be formed primarily by de novo synthesis and to a much lesser extent from the elongation of a previously formed hexadecatrienoic acid precursor. On the other hand, when a cell-free algal preparation was presented with a hexadecatrienoic acid precursor in the presence of [14C] malonyl-CoA, the α-linolenic acid formed demonstrated a synthesis by elongation of the precursor. While the cell appears enzymatically capable of α-linolenic acid biosynthesis by both the de novo and elongation processes, de novo synthesis of α-linolenic acid appears to be the more significant mode of synthesis.  相似文献   

18.
19.
Acetate is a primary inhibitory metabolite in Escherichia coli cultivation which is detrimental to bacterial growth and the formation of desired products. It can be derived from acetyl coenzyme A by the phosphotransacetylase (Pta)–acetate kinase (AckA) pathway. In this study, the fermentation characteristics of Pta mutant strain E. coli TRTHΔpta were compared with those of the control strain E. coli TRTH in a 30-L fermentor. The effects of glucose concentration and dissolved oxygen (DO) level were investigated, and the results suggest that DO and glucose concentration are vital influencing parameters for the production of L-tryptophan. Based on our experimental results, we then tested a DO-stat fed-batch fermentation strategy. When DO was controlled at about 20 % during L-tryptophan fermentation in the DO-stat fed-batch system, the pta mutant was able to maintain a higher growth rate at the exponential phase, and the final biomass and L-tryptophan production were increased to 55.3 g/L and 35.2 g/L, respectively. Concomitantly, as the concentration of acetate decreased to 0.7 g/L, the accumulation of pyruvate and lactate increased in the mutant strain as compared with the control strain. This characterization of the recombinant mutant strain provides useful information for the rational modification of metabolic fluxes to improve tryptophan production.  相似文献   

20.
Naturalized soil Escherichia coli populations need to resist common soil desiccation stress in order to inhabit soil environments. In this study, four representative soil E. coli strains and one lab strain, MG1655, were tested for desiccation resistance via die-off experiments in sterile quartz sand under a potassium acetate-induced desiccation condition. The desiccation stress caused significantly lower die-off rates of the four soil strains (0.17 to 0.40 day−1) than that of MG1655 (0.85 day−1). Cellular responses, including extracellular polymeric substance (EPS) production, exogenous glycine betaine (GB) uptake, and intracellular compatible organic solute synthesis, were quantified and compared under the desiccation and hydrated control conditions. GB uptake appeared not to be a specific desiccation response, while EPS production showed considerable variability among the E. coli strains. All E. coli strains produced more intracellular trehalose, proline, and glutamine under the desiccation condition than the hydrated control, and only the trehalose concentration exhibited a significant correlation with the desiccation-contributed die-off coefficients (Spearman''s ρ = −1.0; P = 0.02). De novo trehalose synthesis was further determined for 15 E. coli strains from both soil and nonsoil sources to determine its prevalence as a specific desiccation response. Most E. coli strains (14/15) synthesized significantly more trehalose under the desiccation condition, and the soil E. coli strains produced more trehalose (106.5 ± 44.9 μmol/mg of protein [mean ± standard deviation]) than the nonsoil reference strains (32.5 ± 10.5 μmol/mg of protein).  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号