首页 | 本学科首页   官方微博 | 高级检索  
     

基于支持向量机方法的蛋白可溶性预测
引用本文:王娴,李骜,王明会,冯焕清. 基于支持向量机方法的蛋白可溶性预测[J]. 生物物理学报, 2005, 21(1): 60-64
作者姓名:王娴  李骜  王明会  冯焕清
作者单位:中国科学技术大学电子科学与技术系 合肥230026(王娴,李骜,王明会),中国科学技术大学电子科学与技术系 合肥230026(冯焕清)
基金项目:中国科学技术大学研究生创新基金(KD2004053)
摘    要:按照蛋白质序列中残基的相对可溶性,将其分为两类(表面/内部)和三类(表面/中间/内部)进行预测。选择不同窗宽和参数对数据进行训练和预测,以确保得到最好的分类效果,并同其他已有方法进行比较。对同一数据集不同分类阈值的预测结果显示,支持向量机方法对蛋白质可溶性的整体预测效果优于神经网络和信息论的方法。其中,对两类数据的最优分类结果达到79.0%,对三类数据的最优分类结果达到67.5%,表明支持向量机是蛋白质残基可溶性预测的一种有效方法。

关 键 词:支持向量机 氨基酸残基 可溶性 预测
收稿时间:2004-03-24
修稿时间:2004-03-24

PREDICTION OF PROTEIN SOLVENT ACCESSIBILITY WITH SUPPORT VECTOR MACHINE
WANG Xian,LI Ao,WANG Ming-hui,FENG Huan-qing. PREDICTION OF PROTEIN SOLVENT ACCESSIBILITY WITH SUPPORT VECTOR MACHINE[J]. Acta Biophysica Sinica, 2005, 21(1): 60-64
Authors:WANG Xian  LI Ao  WANG Ming-hui  FENG Huan-qing
Affiliation:Department of Electronic Science and Technology, University of Science and Technology of China, Hefei 230026, China
Abstract:Residues in protein sequences can be divided into two classes (exposed/buried) or three classes (exposed/intermediate/buried) according to their relative solvent accessibility. Several lengths and parameters of window were explored to achieve the best performance. The prediction accuracies of support vector machine (SVM) for different cut-off thresholds were analyzed and compared with other methods, which showed that the SVM was a better method than neural network and information theory when using the same dataset. The best accuracy, in two-class problem, could be as high as 79.0%, and in three-class problem, could be as high as 67.5%. These results show that the support vector machine is an effective method in the prediction of protein solvent accessibility.
Keywords:Support vector machine  Amino acid residue  Solvent accessibility  Bioinformatics
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《生物物理学报》浏览原始摘要信息
点击此处可从《生物物理学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号