首页 | 本学科首页   官方微博 | 高级检索  
     


Sequence analysis and rule development of predicting protein stability change upon mutation using decision tree model
Authors:Liang-Tsung Huang  M. Michael Gromiha  Shinn-Ying Ho
Affiliation:(1) Institute of Information Engineering and Computer Science, Feng-Chia University, Taichung, 407, Taiwan;(2) Department of Computer Science and Information Engineering, Ming-Dao University, Changhua, 523, Taiwan;(3) Computational Biology Research Center (CBRC), National Institute of Advanced Industrial Science and Technology (AIST), AIST Tokyo Waterfront Bio-IT Research Building, 2-42 Aomi, Koto-ku, Tokyo 135-0064, Japan;(4) Department of Biological Science and Technology, and Institute of Bioinformatics, National Chiao Tung University, Hsinchu, 300, Taiwan
Abstract:Understanding the mechanism of the protein stability change is one of the most challenging tasks. Recently, the prediction of protein stability change affected by single point mutations has become an interesting topic in molecular biology. However, it is desirable to further acquire knowledge from large databases to provide new insights into the nature of them. This paper presents an interpretable prediction tree method (named iPTREE-2) that can accurately predict changes of protein stability upon mutations from sequence based information and analyze sequence characteristics from the viewpoint of composition and order. Therefore, iPTREE-2 based on a regression tree algorithm exhibits the ability of finding important factors and developing rules for the purpose of data mining. On a dataset of 1859 different single point mutations from thermodynamic database, ProTherm, iPTREE-2 yields a correlation coefficient of 0.70 between predicted and experimental values. In the task of data mining, detailed analysis of sequences reveals the possibility of the compositional specificity of residues in different ranges of stability change and implies the existence of certain patterns. As building rules, we found that the mutation residues in wild type and in mutant protein play an important role. The present study demonstrates that iPTREE-2 can serve the purpose of predicting protein stability change, especially when one requires more understandable knowledge.
Keywords:Bioinformatics  Data mining  Decision trees  Prediction  Protein stability
本文献已被 PubMed SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号