首页 | 本学科首页   官方微博 | 高级检索  
   检索      


A global machine learning based scoring function for protein structure prediction
Authors:Eshel Faraggi  Andrzej Kloczkowski
Institution:1. Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, , Indianapolis, Indiana, 46202;2. Battelle Center for Mathematical Medicine, Nationwide Children's Hospital, , Columbus, Ohio, 43215;3. Physics Division, Research and Information Systems, LLC, , Carmel, Indiana, 46032;4. Department of Pediatrics, The Ohio State University, , Columbus, Ohio, 43215
Abstract:We present a knowledge‐based function to score protein decoys based on their similarity to native structure. A set of features is constructed to describe the structure and sequence of the entire protein chain. Furthermore, a qualitative relationship is established between the calculated features and the underlying electromagnetic interaction that dominates this scale. The features we use are associated with residue–residue distances, residue–solvent distances, pairwise knowledge‐based potentials and a four‐body potential. In addition, we introduce a new target to be predicted, the fitness score, which measures the similarity of a model to the native structure. This new approach enables us to obtain information both from decoys and from native structures. It is also devoid of previous problems associated with knowledge‐based potentials. These features were obtained for a large set of native and decoy structures and a back‐propagating neural network was trained to predict the fitness score. Overall this new scoring potential proved to be superior to the knowledge‐based scoring functions used as its inputs. In particular, in the latest CASP (CASP10) experiment our method was ranked third for all targets, and second for freely modeled hard targets among about 200 groups for top model prediction. Ours was the only method ranked in the top three for all targets and for hard targets. This shows that initial results from the novel approach are able to capture details that were missed by a broad spectrum of protein structure prediction approaches. Source codes and executable from this work are freely available at http://mathmed.org /#Software and http://mamiris.com/ . Proteins 2014; 82:752–759. © 2013 Wiley Periodicals, Inc.
Keywords:tertiary protein structure  protein knowledge potentials  protein potential energy  protein scoring functions  neural network  global features
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号