Prediction of protein relative solvent accessibility with a two-stage SVM approach |
| |
Authors: | Nguyen Minh N Rajapakse Jagath C |
| |
Institution: | BioInformatics Research Centre, School of Computer Engineering, Nanyang Technological University, Singapore. |
| |
Abstract: | Information on relative solvent accessibility (RSA) of amino acid residues in proteins provides valuable clues to the prediction of protein structure and function. A two-stage approach with support vector machines (SVMs) is proposed, where an SVM predictor is introduced to the output of the single-stage SVM approach to take into account the contextual relationships among solvent accessibilities for the prediction. By using the position-specific scoring matrices (PSSMs) generated by PSI-BLAST, the two-stage SVM approach achieves accuracies up to 90.4% and 90.2% on the Manesh data set of 215 protein structures and the RS126 data set of 126 nonhomologous globular proteins, respectively, which are better than the highest published scores on both data sets to date. A Web server for protein RSA prediction using a two-stage SVM method has been developed and is available (http://birc.ntu.edu.sg/~pas0186457/rsa.html). |
| |
Keywords: | protein structure prediction solvent accessibility support vector machines PSI‐BLAST |
本文献已被 PubMed 等数据库收录! |
|