PONDR-FIT: A meta-predictor of intrinsically disordered amino acids |
| |
Authors: | Bin Xue Roland L. Dunbrack Robert W. Williams A. Keith Dunker Vladimir N. Uversky |
| |
Affiliation: | 1. Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, IN 46202, USA;2. Institute for Intrinsically Disordered Protein Research, Indiana University School of Medicine, Indianapolis, IN 46202, USA;3. Institute for Cancer Research, Fox Chase Cancer Institute, Philadelphia, PA 19111, USA;4. Department of Biomedical Informatics, Uniformed Services University, Bethesda, MD 20814, USA;5. Institute for Biological Instrumentation, Russian Academy of Sciences, 142290 Pushchino, Moscow Region, Russia |
| |
Abstract: | Protein intrinsic disorder is becoming increasingly recognized in proteomics research. While lacking structure, many regions of disorder have been associated with biological function. There are many different experimental methods for characterizing intrinsically disordered proteins and regions; nevertheless, the prediction of intrinsic disorder from amino acid sequence remains a useful strategy especially for many large-scale proteomic investigations. Here we introduced a consensus artificial neural network (ANN) prediction method, which was developed by combining the outputs of several individual disorder predictors. By eight-fold cross-validation, this meta-predictor, called PONDR-FIT, was found to improve the prediction accuracy over a range of 3 to 20% with an average of 11% compared to the single predictors, depending on the datasets being used. Analysis of the errors shows that the worst accuracy still occurs for short disordered regions with less than ten residues, as well as for the residues close to order/disorder boundaries. Increased understanding of the underlying mechanism by which such meta-predictors give improved predictions will likely promote the further development of protein disorder predictors. Access to PONDR-FIT is available at www.disprot.org. |
| |
Keywords: | IDP, Intrinsically Disordered Protein IDR, Intrinsically Disordered Region PDD, Partially Disordered Dataset PDDS, Structured residues in PDD PDDM, Residues with Missing electron density in PDD FOD, Fully Ordered Dataset FDD, Fully Disordered Dataset ANN, Artificial Neural Network ANN-P, ANN trained by PDD ANN-F, ANN trained by FOD/FDD |
本文献已被 ScienceDirect 等数据库收录! |
|