Prediction of protein structural class for low-similarity sequences using support vector machine and PSI-BLAST profile |
| |
Authors: | Taigang Liu Xiaoqi Zheng Jun Wang |
| |
Affiliation: | 1. School of Mathematical Sciences, Dalian University of Technology, Dalian 116024, China;2. College of Advanced Science and Technology, Dalian University of Technology, Dalian 116024, China;3. Department of Mathematics, Shanghai Normal University, Shanghai 200234, China;4. Scientific Computing Key Laboratory of Shanghai Universities, Shanghai 200234, China |
| |
Abstract: | ![]() Knowledge of structural class plays an important role in understanding protein folding patterns. In this study, a simple and powerful computational method, which combines support vector machine with PSI-BLAST profile, is proposed to predict protein structural class for low-similarity sequences. The evolution information encoding in the PSI-BLAST profiles is converted into a series of fixed-length feature vectors by extracting amino acid composition and dipeptide composition from the profiles. The resulting vectors are then fed to a support vector machine classifier for the prediction of protein structural class. To evaluate the performance of the proposed method, jackknife cross-validation tests are performed on two widely used benchmark datasets, 1189 (containing 1092 proteins) and 25PDB (containing 1673 proteins) with sequence similarity lower than 40% and 25%, respectively. The overall accuracies attain 70.7% and 72.9% for 1189 and 25PDB datasets, respectively. Comparison of our results with other methods shows that our method is very promising to predict protein structural class particularly for low-similarity datasets and may at least play an important complementary role to existing methods. |
| |
Keywords: | Protein structural class Sequence similarity Support vector machine Position-specific score matrix |
本文献已被 ScienceDirect 等数据库收录! |
|