首页 | 本学科首页   官方微博 | 高级检索  
     


Classifier ensembles for protein structural class prediction with varying homology
Authors:Kedarisetti Kanaka Durga  Kurgan Lukasz  Dick Scott
Affiliation:Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Alta., Canada.
Abstract:
Structural class characterizes the overall folding type of a protein or its domain. A number of computational methods have been proposed to predict structural class based on primary sequences; however, the accuracy of these methods is strongly affected by sequence homology. This paper proposes, an ensemble classification method and a compact feature-based sequence representation. This method improves prediction accuracy for the four main structural classes compared to competing methods, and provides highly accurate predictions for sequences of widely varying homologies. The experimental evaluation of the proposed method shows superior results across sequences that are characterized by entire homology spectrum, ranging from 25% to 90% homology. The error rates were reduced by over 20% when compared with using individual prediction methods and most commonly used composition vector representation of protein sequences. Comparisons with competing methods on three large benchmark datasets consistently show the superiority of the proposed method.
Keywords:SCOP structural class   Sequence homology   Protein primary sequence   Feature selection   Classifier ensemble
本文献已被 ScienceDirect PubMed 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号