首页 | 本学科首页   官方微博 | 高级检索  
     


AFP-Pred: A random forest approach for predicting antifreeze proteins from sequence-derived properties
Authors:Kandaswamy Krishna Kumar  Chou Kuo-Chen  Martinetz Thomas  Möller Steffen  Suganthan P N  Sridharan S  Pugalenthi Ganesan
Affiliation:a Institute for Neuro- and Bioinformatics, University of Lübeck, 23538 Lübeck, Germany
b Graduate School for Computing in Medicine and Life Sciences, University of Lübeck, 23538 Lübeck, Germany
c Gordon Life Science Institute, San Diego, CA 92130, USA
d School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore 639798, Singapore
e Bharathidasan University, Tiruchirappalli, Tamilnadu 620 024, India
f Laboratory of Structural Biochemistry, Genome Institute of Singapore, 60 Biopolis Street, Singapore 138672, Singapore
Abstract:Some creatures living in extremely low temperatures can produce some special materials called “antifreeze proteins” (AFPs), which can prevent the cell and body fluids from freezing. AFPs are present in vertebrates, invertebrates, plants, bacteria, fungi, etc. Although AFPs have a common function, they show a high degree of diversity in sequences and structures. Therefore, sequence similarity based search methods often fails to predict AFPs from sequence databases. In this work, we report a random forest approach “AFP-Pred” for the prediction of antifreeze proteins from protein sequence. AFP-Pred was trained on the dataset containing 300 AFPs and 300 non-AFPs and tested on the dataset containing 181 AFPs and 9193 non-AFPs. AFP-Pred achieved 81.33% accuracy from training and 83.38% from testing. The performance of AFP-Pred was compared with BLAST and HMM. High prediction accuracy and successful of prediction of hypothetical proteins suggests that AFP-Pred can be a useful approach to identify antifreeze proteins from sequence information, irrespective of their sequence similarity.
Keywords:Thermal hysteresis proteins   Ice binding proteins   Freeze tolerance   Physicochemical properties   Machine learning method
本文献已被 ScienceDirect PubMed 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号