首页 | 本学科首页   官方微博 | 高级检索  
   检索      


Hot spot prediction in protein-protein interactions by an ensemble system
Authors:Quanya Liu  Peng Chen  Bing Wang  Jun Zhang  Jinyan Li
Institution:1.Institute of Physical Science and Information Technology, Anhui University,Hefei, Anhui,China;2.School of Electrical and Information Engineering, Anhui University of Technology,Ma’anshan, Anhui,China;3.School of Electrical and Information Engineering, Anhui University of Technology,Ma’anshan, Anhui,China;4.School of Electrical Engineering and Automation, Anhui University,Hefei, Anhui,China;5.Advanced Analytics Institute and Centre for Health Technologies, University of Technology, Sydney,Sydney,Australia
Abstract:

Background

Hot spot residues are functional sites in protein interaction interfaces. The identification of hot spot residues is time-consuming and laborious using experimental methods. In order to address the issue, many computational methods have been developed to predict hot spot residues. Moreover, most prediction methods are based on structural features, sequence characteristics, and/or other protein features.

Results

This paper proposed an ensemble learning method to predict hot spot residues that only uses sequence features and the relative accessible surface area of amino acid sequences. In this work, a novel feature selection technique was developed, an auto-correlation function combined with a sliding window technique was applied to obtain the characteristics of amino acid residues in protein sequence, and an ensemble classifier with SVM and KNN base classifiers was built to achieve the best classification performance.

Conclusion

The experimental results showed that our model yields the highest F1 score of 0.92 and an MCC value of 0.87 on ASEdb dataset. Compared with other machine learning methods, our model achieves a big improvement in hot spot prediction.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号