首页 | 本学科首页   官方微博 | 高级检索  
     


Prediction of protein structural class for low-similarity sequences using support vector machine and PSI-BLAST profile
Authors:Taigang Liu  Xiaoqi Zheng  Jun Wang
Affiliation:1. School of Mathematical Sciences, Dalian University of Technology, Dalian 116024, China;2. College of Advanced Science and Technology, Dalian University of Technology, Dalian 116024, China;3. Department of Mathematics, Shanghai Normal University, Shanghai 200234, China;4. Scientific Computing Key Laboratory of Shanghai Universities, Shanghai 200234, China
Abstract:
Knowledge of structural class plays an important role in understanding protein folding patterns. In this study, a simple and powerful computational method, which combines support vector machine with PSI-BLAST profile, is proposed to predict protein structural class for low-similarity sequences. The evolution information encoding in the PSI-BLAST profiles is converted into a series of fixed-length feature vectors by extracting amino acid composition and dipeptide composition from the profiles. The resulting vectors are then fed to a support vector machine classifier for the prediction of protein structural class. To evaluate the performance of the proposed method, jackknife cross-validation tests are performed on two widely used benchmark datasets, 1189 (containing 1092 proteins) and 25PDB (containing 1673 proteins) with sequence similarity lower than 40% and 25%, respectively. The overall accuracies attain 70.7% and 72.9% for 1189 and 25PDB datasets, respectively. Comparison of our results with other methods shows that our method is very promising to predict protein structural class particularly for low-similarity datasets and may at least play an important complementary role to existing methods.
Keywords:Protein structural class   Sequence similarity   Support vector machine   Position-specific score matrix
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号