Identifying interacting residues using Boolean Learning and Support Vector Machines: case study on mRFP and DsRed proteins |
| |
Authors: | Loo Bernard L W Dubey Anshul Realff Matthew J Lee Jay H Bommarius Andreas S |
| |
Affiliation: | School of Chemical and Biomolecular Engineering, Atlanta, GA 30332, USA. |
| |
Abstract: | In a protein, interactions exist between amino acid residues that influence the protein's structural integrity or stability and thus affect its catalytic function. The loss of this interaction due to mutations in these amino acids usually leads to a non-functional protein. Probing the sequence space of a protein through mutations or recombinations, as performed in directed evolution to search for an improved variant, frequently results in such inactive sequences. In this work, we demonstrate the use of machine learning to identify such interacting residues and the use of template engineering strategies to increase the fraction of active variants in a library. We show that using the sequences from recombination of monomeric red fluorescent protein (mRFP) and Discosoma red fluorescent protein (DsRed), we were able to identify a pair of interacting residues using an algorithm based on Boolean Learning and Support Vector Machines. The interaction between the identified residues was verified through point mutations on the mRFP and DsRed genes. We also show that it is possible to use such results to alter the parental genes such that the probability of disrupting the important interactions is minimized. This will result in a larger fraction of active variants in the recombinant library and allow us to access more functional space. We demonstrate this effect by comparing the recombinant library of wild-type (WT) DsRed, mRFP and an altered sequence of DsRed with mRFP WT genes. |
| |
Keywords: | Boolean Learning Support Vector Machines DsRed Fluorescent proteins Interacting position mRFP |
本文献已被 PubMed 等数据库收录! |
|