Similar literature
20 similar articles found.
1.
Analyses of microbial genome sequences reveal numerous examples of gene clusters encoding proteins typically involved in complex natural product biosynthesis but not associated with the production of known natural products. In Streptomyces coelicolor M145 there are several gene clusters encoding new nonribosomal peptide synthetase (NRPS) systems not associated with known metabolites. Application of structure-based models for substrate recognition by NRPS adenylation domains predicts the amino acids incorporated into the putative peptide products of these systems, but the accuracy of these predictions is untested. Here we report the isolation and structure determination of the new tris-hydroxamate tetrapeptide iron chelator coelichelin from S. coelicolor using a genome mining approach guided by substrate predictions for the trimodular NRPS CchH, and we show that this enzyme, which lacks a C-terminal thioesterase domain, together with a homolog of enterobactin esterase (CchJ), are required for coelichelin biosynthesis. These results demonstrate that accurate prediction of adenylation domain substrate selectivity is possible and raise intriguing mechanistic questions regarding the assembly of a tetrapeptide by a trimodular NRPS.

2.
Nonribosomal peptide synthetases (NRPSs) synthesize a diverse array of bioactive small peptides, many of which are used in medicine. There is considerable interest in predicting NRPS substrate specificity in order to facilitate investigation of the many “cryptic” NRPS genes that have not been linked to any known product. However, the current sequence similarity-based methods are unable to produce reliable predictions when there is a lack of prior specificity data, which is a particular problem for fungal NRPSs. We conducted virtual screening on the specificity-determining domain of NRPSs, the adenylation domain, and found that virtual screening using experimentally determined structures results in good enrichment of the cognate substrate. Our results indicate that the conformation of the adenylation domain and in particular the conformation of a key conserved aromatic residue is important in determining the success of the virtual screening. When homology models of NRPS adenylation domains of known specificity, rather than experimentally determined structures, were built and used for virtual screening, good enrichment of the cognate substrate was also achieved in many cases. However, the accuracy of the models was key to the reliability of the predictions and there was a large variation in the results when different models of the same domain were used. This virtual screening approach is promising and is able to produce enrichment of the cognate substrates in many cases, but improvements in building and assessing homology models are required before the approach can be reliably applied to these models. Proteins 2015; 83:2052–2066. © 2015 Wiley Periodicals, Inc.

3.
Nonribosomal peptide synthetases (NRPSs) are large, multidomain proteins that are involved in the biosynthesis of an array of secondary metabolites. We report the structure of the third adenylation domain from the siderophore-synthesizing NRPS, SidN, from the endophytic fungus Neotyphodium lolii. This is the first structure of a eukaryotic NRPS domain, and it reveals a large binding pocket required to accommodate the unusual amino acid substrate, Nδ-cis-anhydromevalonyl-Nδ-hydroxy-l-ornithine (cis-AMHO). The specific activation of cis-AMHO was confirmed biochemically, and an AMHO moiety was unambiguously identified as a component of the fungal siderophore using mass spectroscopy. The protein structure shows that the substrate binding pocket is defined by 17 amino acid residues, in contrast to both prokaryotic adenylation domains and to previous predictions based on modeling. Existing substrate prediction methods for NRPS adenylation domains fail for domains from eukaryotes due to the divergence of their signature sequences from those of prokaryotes. Thus, this new structure will provide a basis for improving prediction methods for eukaryotic NRPS enzymes that play important and diverse roles in the biology of fungi.

4.
Syringopeptin is a necrosis-inducing phytotoxin, composed of 22 amino acids attached to a 3-hydroxy fatty acid tail. Syringopeptin, produced by Pseudomonas syringae pv. syringae, functions as a virulence determinant in the plant-pathogen interaction. A 73,800-bp DNA region was sequenced, and analysis identified three large open reading frames, sypA, sypB, and sypC, that are 16.1, 16.3, and 40.6 kb in size. Sequence analysis of the putative SypA, SypB, and SypC sequences determined that they are homologous to peptide synthetases, containing five, five, and twelve amino acid activation modules, respectively. Each module exhibited characteristic domains for condensation, aminoacyl adenylation, and thiolation. Within the aminoacyl adenylation domain is a region responsible for substrate specificity. Phylogenetic analysis of the substrate-binding pockets resulted in clustering of the 22 syringopeptin modules into nine groups. This clustering reflects the substrate amino acids predicted to be recognized by each of the respective modules based on placement of the syringopeptin NRPS (nonribosomal peptide synthetase) system in the linear (type A) group. Finally, SypC contains two C-terminal thioesterase domains predicted to catalyze the release of syringopeptin from the synthetase and peptide cyclization to form the lactone ring. The syringopeptin synthetases, which carry 22 NRPS modules, represent the largest linear NRPS system described for a prokaryote.

5.
Nonribosomal peptide synthetases (NRPSs) are multimodular proteins capable of producing important peptide natural products. Using an assembly line process, the amino acid substrate and peptide intermediates are passed between the active sites of different catalytic domains of the NRPS while bound covalently to a peptidyl carrier protein (PCP) domain. Examination of the linker sequences that join the NRPS adenylation and PCP domains identified several conserved proline residues that are not found in standalone adenylation domains. We examined the roles of these proline residues and neighboring conserved sequences through mutagenesis and biochemical analysis of the reaction catalyzed by the adenylation domain and the fully reconstituted NRPS pathway. In particular, we identified a conserved LPxP motif at the start of the adenylation-PCP linker. The LPxP motif interacts with a region on the adenylation domain to stabilize a critical catalytic lysine residue belonging to the A10 motif that immediately precedes the linker. Further, this interaction with the C-terminal subdomain of the adenylation domain may coordinate movement of the PCP with the conformational change of the adenylation domain. Through this work, we extend the conserved A10 motif of the adenylation domain and identify residues that enable proper adenylation domain function. Proteins 2014; 82:2691–2702. © 2014 Wiley Periodicals, Inc.

6.
Mitchell CA, Shi C, Aldrich CC, Gulick AM. Biochemistry 2012, 51(15):3252-3263.
Many bacteria use large modular enzymes for the synthesis of polyketide and peptide natural products. These multidomain enzymes contain integrated carrier domains that deliver bound substrates to multiple catalytic domains, requiring coordination of these chemical steps. Nonribosomal peptide synthetases (NRPSs) load amino acids onto carrier domains through the activity of an upstream adenylation domain. Our lab recently determined the structure of an engineered two-domain NRPS containing fused adenylation and carrier domains. This structure adopted a domain-swapped dimer that illustrated the interface between these two domains. To continue our investigation, we now examine PA1221, a natural two-domain protein from Pseudomonas aeruginosa. We have determined the amino acid specificity of this new enzyme and used domain specific mutations to demonstrate that loading the downstream carrier domain within a single protein molecule occurs more quickly than loading of a nonfused carrier domain intermolecularly. Finally, we have determined crystal structures of both apo- and holo-PA1221 proteins, the latter using a valine-adenosine vinylsulfonamide inhibitor that traps the adenylation domain-carrier domain interaction. The protein adopts an interface similar to that seen with the prior adenylation domain-carrier protein construct. A comparison of these structures with previous structures of multidomain NRPSs suggests that a large conformational change within the NRPS adenylation domains guides the carrier domain into the active site for thioester formation.

7.
To engineer the substrate specificities of nonribosomal peptide synthetases (NRPS), we developed a method to display NRPS modules on M13 phages and select catalytically active adenylation (A) domains that would load azide functionalized substrate analogs to the neighboring peptidyl carrier protein (PCP) domains. Biotin conjugated difluorinated cyclooctyne was used for copper free cycloaddition with an azide substituted substrate attached to PCP. Biotin-labeled phages were selected by binding to streptavidin.

8.
Nonribosomal peptide synthetases (NRPSs) are multi-modular enzymes involved in the biosynthesis of natural products. Bacillamide C is synthesized by Bacillus atrophaeus C89. A nonribosomal peptide synthetase (NRPS) cluster found in the genome of B. atrophaeus C89 was hypothesized to be responsible for the biosynthesis of bacillamide C using alanine and cysteine as substrates. Here, structural analysis of the adenylation domains based on homologous proteins with known crystal structures indicated the locations of the substrate-binding pockets. Molecular docking suggested alanine and cysteine as the potential substrates for the two adenylation domains in the NRPS cluster. Furthermore, biochemical characterization of the purified recombinant adenylation domains showed that alanine and cysteine were the optimum substrates for the two adenylation domains. The results provide in vitro evidence for the hypothesis that the two adenylation domains in the NRPS of B. atrophaeus C89 preferentially select alanine and cysteine, respectively, as substrates to synthesize bacillamide C. Furthermore, this study of the substrate selectivity of adenylation domains provides a basis for the rational design of bacillamide analogs.

9.
ATPase activity of non-ribosomal peptide synthetases (total citations: 1; self-citations: 0; citations by others: 1)
Adenylation domains of non-ribosomal peptide synthetases (NRPS) catalyse the formation of aminoacyl adenylates, and in addition synthesize mono- and dinucleoside polyphosphates. Here, we show that NRPS systems furthermore contain an ATPase activity in the range of up to 2 P(i)/min. The hydrolysis rate by apo-tyrocidine synthetase 1 (apo-TY1) is enhanced in the presence of non-cognate amino acid substrates, correlating well with their structural features and the diminishing adenylation efficiency. A comparative analysis of the functional relevance of an analogous sequence motif in P-type ATPases and adenylate kinases (AK) allowed a putative assignment of the invariant aspartate residue from the TGDLA(V)R(K) core sequence in NRPS as the Mg(2+) binding site. Less pronounced variations in ATPase activity are observed in domains with relaxed amino acid specificity of gramicidin S synthetase 2 (GS2) and delta-(L-aminoadipyl)-L-cysteinyl-D-valine synthetase (ACVS), known to produce a set of substitutional variants of the respective peptide product. These results disclose new perspectives about the mode of substrate selection by NRPS.

10.
Nonribosomal peptide synthetases (NRPS) are multifunctional proteins that catalyze the synthesis of peptide products with enormous biological potential. The process of biosynthesis starts with the adenylation (A) domain, which during the catalytic cycle undergoes extensive structural rearrangements. In this paper, we present the first study of the tyrocidine synthetase 1 A-domain (TycA-A) fluorescence properties. The TycA-A protein contains five potentially fluorescent Trp residues at positions 227, 301, 323, 376 and 406. The contribution of each Trp to the TycA-A emission was determined using protein variants bearing single Trp to Phe substitutions. The accessibility of the Trp side chains during adenylation showed that only W227 is affected by substrate binding. A protein variant containing W227 as its sole fluorescent residue was constructed and further used as a probe to explore the binding effect of different non-cognate amino acid substrates. The results indicate a different accessibility of the W227 residue in the presence of non-cognate amino acids, which might offer an explanation for the higher aminoacyl-adenylate leakage. Overall, our results suggest that intrinsic tryptophan fluorescence could be used as a method to probe the effect of substrate binding on the local structure in NRPS adenylation domains.

11.
Molecular screening using degenerate PCR to determine the presence of secondary metabolite genes in cyanobacteria was performed. This revealed 18 NRPS and 19 PKS genes in the 21 new cyanobacterial strains examined, representing three families of cyanobacteria (Nostocales, Chroococcales and Oscillatoriales). A BLAST analysis shows that these genes have similarities to known cyanobacterial natural products. Analysis of the NRPS adenylation domain indicates the presence of novel features previously ascribed to both proteobacteria and cyanobacteria. Furthermore, binding-pocket predictions reveal diversity in the amino acids used during the biosynthesis of compounds. A similar analysis of the PKS ketosynthase domain shows significant structural diversity and their presence in both mixed modules with NRPS domains and individually as part of a PKS module. We have been able to classify the NRPS genes on the basis of their binding pockets. Further, we show how these data can be used to begin to link structure to function by an analysis of the compounds Scyptolin A and Hofmannolin from Scytonema sp. PCC 7110.

12.
Successful genome mining is dependent on accurate prediction of protein function from sequence. This often involves dividing protein families into functional subtypes (e.g., with different substrates). In many cases, there are only a small number of known functional subtypes, but in the case of the adenylation domains of nonribosomal peptide synthetases (NRPS), there are >500 known substrates. Latent semantic indexing (LSI) was originally developed for text processing but has also been used to assign proteins to families. Proteins are treated as “documents” and it is necessary to encode properties of the amino acid sequence as “terms” in order to construct a term-document matrix, which counts the terms in each document. This matrix is then processed to produce a document-concept matrix, where each protein is represented as a row vector. A standard measure of the closeness of vectors to each other (cosines of the angle between them) provides a measure of protein similarity. Previous work encoded proteins as oligopeptide terms, i.e. counted oligopeptides, but used no information regarding location of oligopeptides in the proteins. A novel tokenization method was developed to analyze information from multiple alignments. LSI successfully distinguished between two functional subtypes in five well-characterized families. Visualization of different “concept” dimensions allows exploration of the structure of protein families. LSI was also used to predict the amino acid substrate of adenylation domains of NRPS. Better results were obtained when selected residues from multiple alignments were used rather than the total sequence of the adenylation domains. Using ten residues from the substrate binding pocket performed better than using 34 residues within 8 Å of the active site. Prediction efficiency was somewhat better than that of the best published method using a support vector machine.
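The term-document idea in this abstract can be illustrated with a minimal sketch. It assumes that each selected alignment column is turned into a position-tagged term such as "3:E"; the abstract does not spell out the actual tokenization, so this encoding is an assumption. The sketch also skips the dimensionality-reduction step that LSI applies to the term-document matrix and compares raw count vectors by cosine only. All sequences and names are hypothetical.

```python
import math
from collections import Counter

def tokenize(aligned_seq, columns):
    """Encode selected alignment columns as position-tagged terms, e.g. '3:E'."""
    return [f"{col}:{aligned_seq[col]}" for col in columns]

def cosine(a, b):
    """Cosine of the angle between two sparse term-count vectors."""
    dot = sum(a[term] * b[term] for term in a)  # Counter returns 0 for missing terms
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

# Hypothetical aligned fragments; not taken from any real adenylation domain.
alignment = {
    "dom1": "ACDEFGHK",
    "dom2": "ACDEFGHR",
    "dom3": "AYWTSGLR",
}
columns = range(8)  # assumed set of selected alignment columns

# One "document" (term-count vector) per protein.
vectors = {name: Counter(tokenize(seq, columns)) for name, seq in alignment.items()}

def nearest(query):
    """Return the other domain whose vector is closest to the query by cosine."""
    others = (name for name in vectors if name != query)
    return max(others, key=lambda name: cosine(vectors[query], vectors[name]))

print(nearest("dom1"))
```

In a fuller version, the count matrix would first be reduced (for example by a truncated singular value decomposition) and the cosine would be taken between the reduced row vectors.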

13.
ABSTRACT: BACKGROUND: Pelgipeptin, a potent antibacterial and antifungal agent, is a non-ribosomally synthesised lipopeptide antibiotic. This compound consists of a beta-hydroxy fatty acid and nine amino acids. To date, there is no information about its biosynthetic pathway. RESULTS: A potential pelgipeptin synthetase gene cluster (plp) was identified from Paenibacillus elgii B69 through genome analysis. The gene cluster spans 40.8 kb with eight open reading frames. Among the genes in this cluster, three large genes, plpD, plpE, and plpF, were shown to encode non-ribosomal peptide synthetases (NRPS), with one, seven, and one module(s), respectively. Bioinformatic analysis of the substrate specificity of all nine adenylation domains indicated that the sequence of the NRPS modules is collinear with the order of amino acids in pelgipeptin. Additional biochemical analysis of four recombinant adenylation domains (PlpD A1, PlpE A1, PlpE A3, and PlpF A1) provided further evidence that the plp gene cluster is involved in pelgipeptin biosynthesis. CONCLUSIONS: In this study, a gene cluster (plp) responsible for the biosynthesis of pelgipeptin was identified from the genome sequence of Paenibacillus elgii B69. The identification of the plp gene cluster provides an opportunity to develop novel lipopeptide antibiotics by genetic engineering.

14.
Recently, the solved crystal structure of a phenylalanine-activating adenylation (A) domain shed light on the structural basis for the specific recognition of the cognate substrate amino acid in nonribosomal peptide synthetases (NRPSs). By adding sequence comparisons and homology modeling, we successfully used this information to decipher the selectivity-conferring code of NRPSs. Each codon combines the 10 amino acid residues of an NRPS A domain that are presumed to build up the substrate-binding pocket. In this study, the deciphered code was exploited for the first time to rationally alter the substrate specificity of whole NRPS modules in vitro and in vivo. First, the single residue Lys239 of the L-Glu-activating initiation module C-A(Glu)-PCP of the surfactin synthetase A was mutated to Gln239 to achieve a perfect match to the postulated L-Gln-activating binding pocket. Biochemical characterization of the mutant protein C-A(Glu)-PCP(Lys239 → Gln) revealed the postulated alteration in substrate specificity from L-Glu to L-Gln without a decrease in catalytic efficiency. Second, according to the selectivity-conferring code, the binding pockets of L-Asp- and L-Asn-activating A domains differ in three positions: Val299 versus Ile, His322 versus Glu, and Ile330 versus Val, respectively. Thus, the binding pocket of the recombinant A domain AspA, derived from the second module of the surfactin synthetases B, was adapted stepwise for the recognition of L-Asn. Biochemical characterization of single, double, and triple mutants revealed that His322 represents a key position, whose mutation was sufficient to give rise to the intended selectivity switch. Subsequently, the gene fragment encoding the single mutant AspA(His322 → Glu) was introduced back into the surfactin biosynthetic gene cluster. The resulting Bacillus subtilis strain was found to produce the expected, previously unknown lipoheptapeptide [Asn(5)]surfactin. This indicates that site-directed mutagenesis, guided by the selectivity-conferring code of NRPS A domains, represents a powerful alternative for the genetic manipulation of NRPS biosynthetic templates and the rational design of novel peptide antibiotics.
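The comparison of binding-pocket codes described in this abstract amounts to simple bookkeeping, sketched below. Only the three positions that the abstract names for the L-Asp and L-Asn pockets are encoded; the remaining positions of the 10-residue code are not given here and are deliberately left out. The dictionary layout and function names are illustrative assumptions, not the authors' method.

```python
# Residues (one-letter codes) at the three positions the abstract lists as
# differing between the L-Asp- and L-Asn-activating pockets.
ASP_POCKET = {299: "V", 322: "H", 330: "I"}
ASN_POCKET = {299: "I", 322: "E", 330: "V"}

def differing_positions(pocket_a, pocket_b):
    """Map each position where the pockets differ to (residue_a, residue_b)."""
    return {
        pos: (pocket_a[pos], pocket_b.get(pos))
        for pos in sorted(pocket_a)
        if pocket_a[pos] != pocket_b.get(pos)
    }

def mutate(pocket, position, new_residue):
    """Return a copy of the pocket with one residue replaced."""
    mutant = dict(pocket)
    mutant[position] = new_residue
    return mutant

# The single mutant described in the abstract: AspA(His322 -> Glu).
single_mutant = mutate(ASP_POCKET, 322, "E")

# Positions still separating the single mutant from the L-Asn pocket.
remaining = differing_positions(single_mutant, ASN_POCKET)
print(remaining)
```

This bookkeeping says nothing about enzymatic activity. According to the abstract, the His322 mutation alone was sufficient for the selectivity switch even though two code positions still differ.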

15.
Lysobacter lactamgenus produces cephabacins, a class of beta-lactam antibiotics which have an oligopeptide moiety attached to the cephem ring at the C-3 position. Given the chemical structure of these antibiotics, a nonribosomal peptide synthetase (NRPS) system comprising four distinct modules is required for the biosynthesis of this short oligopeptide. The cpbI gene, which has been identified in a region upstream of the pcbAB gene, encodes the NRPS-polyketide synthase hybrid complex, where the NRPS is composed of three modules, while the cpbK gene, which has been reported as being upstream of cpbI, comprises a single NRPS module. An in silico protein analysis was able to partially reveal the specificity of each module. The four recombinant adenylation (A) domains from each NRPS module were heterologously expressed in Escherichia coli and purified. Biochemical data from ATP-PPi exchange assays indicated that L-arginine was an effective substrate for the A1 domain, while the A2, A3 and A4 domains activated L-alanine. These findings are in agreement with the known chemical structure of cephabacins, as well as with the anticipated substrate specificity of the NRPS modules in CpbI and CpbK, which are involved in the assembly of the tetrapeptide at the C-3 position.

16.
The most common sequences of peptaibiotics are 11-residue peptaibols found widely distributed in the genus Trichoderma/Hypocrea. Frequently associated are 14-residue peptaibols sharing partial sequence identity. Genome sequencing projects of three Trichoderma strains of the major clades reveal the presence of up to three types of nonribosomal peptide synthetases with 7, 14, or 18-20 amino acid-adding modules. Here, we provide evidence that the 14-module NRPS type found in T. virens, T. reesei (teleomorph Hypocrea jecorina), and T. atroviride produces both 11- and 14-residue peptaibols based on the disruption of the respective NRPS gene of T. reesei, and bioinformatic analysis of their amino acid-activating domains and modules. The sequences of these peptides may be predicted from the gene sequences and have been confirmed by analysis of families of 11- and 14-residue peptaibols from the strain 618, termed hypojecorins A (23 sequences determined, 4 new) and B (3 sequences determined, 2 new), and the recently established trichovirins A from T. virens. The distribution of 11- and 14-residue products is strain-specific and depends on growth conditions as well. Possible mechanisms of module skipping are discussed.

17.
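Many nonribosomal peptide secondary metabolites of microorganisms are synthesized mainly by nonribosomal peptide synthetases (NRPSs). Specific primers for amplifying NRPS adenylation domain gene sequences were designed with reference to the universal NRPS primers published by Gontang, and a 715 bp NRPS gene sequence was amplified from the genomic DNA of marine Streptomyces strain L1. Sequencing and alignment analysis showed that the fragment is a partial sequence of an NRPS adenylation domain. Analysis of the composition and physicochemical properties of the deduced amino acid sequence showed that it contains the core binding region of the AFD class I superfamily, which is the region where the NRPS adenylation domain (A domain) is located. Secondary structure prediction and tertiary structure modeling of the amino acid sequence revealed structural similarity to the enterobactin synthase component F in the database. These results provide a reference for subsequent studies of A domain specificity and for cloning the complete NRPS gene cluster.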

18.
Albicidin is a pathotoxin produced by Xanthomonas albilineans, a xylem-invading pathogen that causes leaf scald disease of sugarcane. Albicidin is synthesized by a nonribosomal pathway via modular polyketide synthase and nonribosomal peptide synthetase (NRPS) megasynthases, and NRPS adenylation (A) domains are responsible for the recognition and activation of specific amino acid substrates. DNA fragments (0.5 kb) encoding the regions responsible for the substrate specificities of six albicidin NRPS A domains from 16 strains of X. albilineans representing the known diversity of this pathogen were amplified and sequenced. Polymorphism analysis of these DNA fragments at different levels (DNA, protein, and NRPS signature) showed that these pathogenicity loci were highly conserved. The conservation of these loci most likely reflects purifying selective pressure, as revealed by a comparison with the variability of nucleotide and amino acid sequences of two housekeeping genes (atpD and efp) of X. albilineans. Nevertheless, the 16 strains of X. albilineans were differentiated into several groups by a phylogenetic analysis of the nucleotide sequences corresponding to the NRPS A domains. One of these groups was representative of the genetic diversity previously found within the pathogen by random fragment length polymorphism and amplified fragment length polymorphism analyses. This group, which differed by three single synonymous nucleotide mutations, contained only four strains of X. albilineans that were all involved in outbreaks of sugarcane leaf scald. The amount of albicidin produced in vitro in agar and liquid media varied among the 16 strains of X. albilineans. However, no relationship among the amount of albicidin produced in vitro and the pathotypes and genetic diversity of the pathogen was found. The NRPS loci contributing to the synthesis of the primary structure of albicidin apparently are not involved in the observed pathogenicity differences among strains of X. albilineans.

19.
Modular polyketide synthases (PKSs) and nonribosomal peptide synthetases (NRPSs) have been found to be involved in natural product synthesis in many microorganisms. Studying their diversity in the natural environment may provide important ecological insights, in addition to opportunities for antibacterial drug development. In this study, the PKS and NRPS gene diversity in two coastal sediments near China Zhongshan Station was studied. The phylogenetic analysis of amino acid (AA) sequences indicated that the identified ketosynthase (KS) domains clustered with those from diverse bacterial groups, including Proteobacteria, Firmicutes, Planctomycetes, Cyanobacteria, Actinobacteria, and some uncultured symbiotic bacteria. One new branch belonging to hybrid PKS/NRPS enzyme complexes and five independent clades were found on the phylogenetic tree. The obtained adenylation (A) domains mainly clustered within the Cyanobacteria and Proteobacteria groups. Most of the identified KS and A domains showed below 80 and 60% identity at the AA level to their closest matches in GenBank, respectively. The diversity of both KS and A domains in the natural environmental sample differed from that in the sewage-contaminated sample. These results revealed the great diversity and novelty of both PKS and NRPS genes in Antarctic sediment.

20.
There is a growing interest in the Non-ribosomal peptide synthetases (NRPSs) and polyketide synthases (PKSs) of microbes, fungi and plants because they can produce bioactive peptides such as antibiotics. The ability to identify the substrate specificity of the enzyme's adenylation (A) and acyl-transferase (AT) domains is essential to rationally deduce or engineer new products. We here report on a Hidden Markov Model (HMM)-based ensemble method to predict the substrate specificity at high quality. We collected a new reference set of experimentally validated sequences. An initial classification based on alignment and Neighbor Joining was performed in line with most of the previously published prediction methods. We then created and tested single substrate specific HMMs and found that their use improved the correct identification significantly for A as well as for AT domains. A major advantage of the use of HMMs is that it abolishes the dependency on multiple sequence alignment and residue selection that is hampering the alignment-based clustering methods. Using our models we obtained a high prediction quality for the substrate specificity of the A domains similar to two recently published tools that make use of HMMs or Support Vector Machines (NRPSsp and NRPS predictor2, respectively). Moreover, replacement of the single substrate specific HMMs by ensembles of models caused a clear increase in prediction quality. We argue that the superiority of the ensemble over the single model is caused by the way substrate specificity evolves for the studied systems. It is likely that this also holds true for other protein domains. The ensemble predictor has been implemented in a simple web-based tool that is available at http://www.cmbi.ru.nl/NRPS-PKS-substrate-predictor/.
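The step from a single model per substrate to an ensemble of models per substrate can be sketched as follows. This is a simplified stand-in, not the published tool: position-specific log-odds profiles (match states only, with no insert or delete states) take the place of real HMMs, and the ensemble is combined by averaging scores, which is an assumption because the abstract does not say how the models are combined. All training sequences and substrate labels are hypothetical.

```python
import math
from collections import Counter

ALPHABET = "ACDEFGHIKLMNPQRSTVWY"

def build_profile(seqs, pseudocount=1.0):
    """Build per-column log-odds scores against a uniform background."""
    total = len(seqs) + pseudocount * len(ALPHABET)
    profile = []
    for col in range(len(seqs[0])):
        counts = Counter(seq[col] for seq in seqs)
        profile.append({
            aa: math.log((counts[aa] + pseudocount) / total * len(ALPHABET))
            for aa in ALPHABET
        })
    return profile

def score(profile, seq):
    """Sum the per-column log-odds of a sequence under one profile."""
    return sum(column[aa] for column, aa in zip(profile, seq))

def predict(ensembles, seq):
    """Pick the substrate whose ensemble gives the highest mean score."""
    mean_scores = {
        substrate: sum(score(p, seq) for p in profiles) / len(profiles)
        for substrate, profiles in ensembles.items()
    }
    return max(mean_scores, key=mean_scores.get), mean_scores

# Hypothetical training subsets; each subset yields one member of the ensemble.
training = {
    "substrate_X": [["ACDE", "ACDF"], ["ACDE", "GCDE"]],
    "substrate_Y": [["WYKR", "WYKK"], ["WYHR", "WFKR"]],
}
ensembles = {
    substrate: [build_profile(subset) for subset in subsets]
    for substrate, subsets in training.items()
}

best, scores = predict(ensembles, "ACDE")
print(best)
```

A real predictor would use profile HMMs built with dedicated software and a reference set of experimentally validated sequences, as the abstract describes.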
