首页 | 本学科首页   官方微博 | 高级检索  
   检索      


A lead discovery strategy driven by a comprehensive analysis of proteases in the peptide substrate space
Authors:Sai Chetan K Sukuru  Florian Nigsch  Jean Quancard  Martin Renatus  Rajiv Chopra  Natasja Brooijmans  Dmitri Mikhailov  Zhan Deng  Allen Cornett  Jeremy L Jenkins  Ulrich Hommel  John W Davies  Meir Glick
Institution:1. Lead Discovery Informatics, Lead Finding Platform, Novartis Institutes for BioMedical Research, Inc., 250 Massachusetts Avenue, Cambridge, Massachusetts 02139;2. Global Discovery Chemistry, Novartis Institutes for BioMedical Research, 4002 Basel, Switzerland;3. Expertise Platform Proteases, Novartis Institutes for BioMedical Research, 4002 Basel, Switzerland;4. Protein Structure Unit, Center for Proteomic Chemistry, Novartis Institutes for BioMedical Research, Inc., 250 Massachusetts Avenue, Cambridge, Massachusetts 02139;5. Structural Biology Platform, Center for Proteomic Chemistry, Novartis Institutes for BioMedical Research, 4002 Basel, Switzerland
Abstract:We present here a comprehensive analysis of proteases in the peptide substrate space and demonstrate its applicability for lead discovery. Aligned octapeptide substrates of 498 proteases taken from the MEROPS peptidase database were used for the in silico analysis. A multiple‐category naïve Bayes model, trained on the two‐dimensional chemical features of the substrates, was able to classify the substrates of 365 (73%) proteases and elucidate statistically significant chemical features for each of their specific substrate positions. The positional awareness of the method allows us to identify the most similar substrate positions between proteases. Our analysis reveals that proteases from different families, based on the traditional classification (aspartic, cysteine, serine, and metallo), could have substrates that differ at the cleavage site (P1–P1′) but are similar away from it. Caspase‐3 (cysteine protease) and granzyme B (serine protease) are previously known examples of cross‐family neighbors identified by this method. To assess whether peptide substrate similarity between unrelated proteases could reliably translate into the discovery of low molecular weight synthetic inhibitors, a lead discovery strategy was tested on two other cross‐family neighbors—namely cathepsin L2 and matrix metallo proteinase 9, and calpain 1 and pepsin A. For both these pairs, a naïve Bayes classifier model trained on inhibitors of one protease could successfully enrich those of its neighbor from a different family and vice versa, indicating that this approach could be prospectively applied to lead discovery for a novel protease target with no known synthetic inhibitors.
Keywords:chemical space  in silico  lead discovery  proteases  substrate space
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号