首页 | 本学科首页   官方微博 | 高级检索  
   检索      


Comprehensive analysis of distinctive polyketide and nonribosomal peptide structural motifs encoded in microbial genomes
Authors:Minowa Yohsuke  Araki Michihiro  Kanehisa Minoru
Institution:Bioinformatics Center, Institute for Chemical Research, Kyoto University Uji, Kyoto 611-0011, Japan. minowa@pharm.kyoto-u.ac.jp
Abstract:We developed a highly accurate method to predict polyketide (PK) and nonribosomal peptide (NRP) structures encoded in microbial genomes. PKs/NRPs are polymers of carbonyl/peptidyl chains synthesized by polyketide synthases (PKS) and nonribosomal peptide synthetases (NRPS). We analyzed domain sequences corresponding to specific substrates and physical interactions between PKSs/NRPSs in order to predict which substrates (carbonyl/peptidyl units) are selected and assembled into highly ordered chemical structures. The predicted PKs/NRPs were represented as the sequences of carbonyl/peptidyl units to extract the structural motifs efficiently. We applied our method to 4529 PKSs/NRPSs and found 619 PKs/NRPs. We also collected 1449 PKs/NRPs whose chemical structures have been determined experimentally. The structural sequences were compared using the Smith-Waterman algorithm, and clustered into 271 clusters. From the compound clusters, we extracted 33 structural motifs that are significantly related with their bioactivities. We used the structural motifs to infer functions of 13 novel PKs/NRPs clusters produced by Pseudomonas spp. and Burkholderia spp. and found a putative virulence factor. The integrative analysis of genomic and chemical information given here will provide a strategy to predict the chemical structures, the biosynthetic pathways, and the biological activities of PKs/NRPs, which is useful for the rational design of novel PKs/NRPs.
Keywords:PK  polyketide  NRP  nonribosomal peptide  PKS  polyketide synthase  NRPS  nonribosomal peptide synthetase  AT  acyltransferase  ACP  acyl carrier protein  A  adenylation domain  PCP  peptidyl carrier protein  KS  beta-keto synthase  C domain  condensation domain  KR  ketoreductase  DH  dehydratase  TE domain  thioesterase domain  CAL domain  CoA ligase domain  HMM  hidden Markov model
本文献已被 ScienceDirect PubMed 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号