期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Studies of intrinsically disordered proteins that lack a stable tertiary structure but still have important biological functions critically rely on computational methods that predict this property based on sequence information. Although a number of fairly successful models for prediction of protein disorder have been developed over the last decade, the quality of their predictions is limited by available cases of confirmed disorders.

Results

To more reliably estimate protein disorder from protein sequences, an iterative algorithm is proposed that integrates predictions of multiple disorder models without relying on any protein sequences with confirmed disorder annotation. The iterative method alternately provides the maximum a posterior (MAP) estimation of disorder prediction and the maximum-likelihood (ML) estimation of quality of multiple disorder predictors. Experiments on data used at CASP7, CASP8, and CASP9 have shown the effectiveness of the proposed algorithm.

Conclusions

The proposed algorithm can potentially be used to predict protein disorder and provide helpful suggestions on choosing suitable disorder predictors for unknown protein sequences.

相似文献

7.

EGASP: the human ENCODE Genome Annotation Assessment Project

Guigó R Flicek P Abril JF Reymond A Lagarde J Denoeud F Antonarakis S Ashburner M Bajic VB Birney E Castelo R Eyras E Ucla C Gingeras TR Harrow J Hubbard T Lewis SE Reese MG 《Genome biology》2006,7(Z1):S2.1-S231

相似文献

8.

Comparison of computational methods for identifying translation initiation sites in EST data

Afshin?Nadershahi Scott?C?Fahrenkrug Lynda?BM?Ellis Email author 《BMC bioinformatics》2004,5(1):14

Background

Expressed Sequence Tag (EST) sequences are generally single-strand, single-pass sequences, only 200–600 nucleotides long, contain errors resulting in frame shifts, and represent different parts of their parent cDNA. If the cDNAs contain translation initiation sites, they may be suitable for functional genomics studies. We have compared five methods to predict translation initiation sites in EST data: first-ATG, ESTScan, Diogenes, Netstart, and ATGpr.

Results

A dataset of 100 EST sequences, 50 with and 50 without, translation initiation sites, was created. Based on analysis of this dataset, ATGpr is found to be the most accurate for predicting the presence versus absence of translation initiation sites. With a maximum accuracy of 76%, ATGpr more accurately predicts the position or absence of translation initiation sites than NetStart (57%) or Diogenes (50%). ATGpr similarly excels when start sites are known to be present (90%), whereas NetStart achieves only 60% overall accuracy. As a baseline for comparison, choosing the first ATG correctly identifies the translation initiation site in 74% of the sequences. ESTScan and Diogenes, consistent with their intended use, are able to identify open reading frames, but are unable to determine the precise position of translation initiation sites.

Conclusions

ATGpr demonstrates high sensitivity, specificity, and overall accuracy in identifying start sites while also rejecting incomplete sequences. A database of EST sequences suitable for validating programs for translation initiation site prediction is now available. These tools and materials may open an avenue for future improvements in start site prediction and EST analysis.

相似文献

9.

Using several pair-wise informant sequences for <Emphasis Type="Italic">de novo</Emphasis> prediction of alternatively spliced transcripts

Paul Flicek Michael R Brent 《Genome biology》2006,7(Z1):S8

相似文献

10.

Phylogenetically and spatially conserved word pairs associated with gene-expression changes in yeasts 总被引：1，自引：1，他引：0

Chiang DY Moses AM Kellis M Lander ES Eisen MB 《Genome biology》2003,4(7):R43

相似文献

11.

70ProPred: a predictor for discovering sigma70 promoters based on combining multiple features

Wenying He Cangzhi Jia Yucong Duan Quan Zou 《BMC systems biology》2018,12(4):44

相似文献

12.

Target SNP selection in complex disease association studies

Matthias?Wjst Email author 《BMC bioinformatics》2004,5(1):92

相似文献

13.

Assisted transcriptome reconstruction and splicing orthology

Blanquart Samuel Varr&#; Jean-St&#;phane Guertin Paul Perrin Amandine Bergeron Anne Swenson Krister M. 《BMC genomics》2016,17(10):786-164

相似文献

14.

Insights into mammalian transcription control by systematic analysis of ChIP sequencing data

Guillaume Devailly Anagha Joshi 《BMC bioinformatics》2018,19(14):409

相似文献

15.

Incorporating methylation genome information improves prediction accuracy for drug treatment responses

Xiaoxuan Xia Haoyi Weng Ruoting Men Rui Sun Benny Chung Ying Zee Ka Chun Chong Maggie Haitian Wang 《BMC genetics》2018,19(1):78

Background

An accumulation of evidence has revealed the important role of epigenetic factors in explaining the etiopathogenesis of human diseases. Several empirical studies have successfully incorporated methylation data into models for disease prediction. However, it is still a challenge to integrate different types of omics data into prediction models, and the contribution of methylation information to prediction remains to be fully clarified.

Results

A stratified drug-response prediction model was built based on an artificial neural network to predict the change in the circulating triglyceride level after fenofibrate intervention. Associated single-nucleotide polymorphisms (SNPs), methylation of selected cytosine-phosphate-guanine (CpG) sites, age, sex, and smoking status, were included as predictors. The model with selected SNPs achieved a mean 5-fold cross-validation prediction error rate of 43.65%. After adding methylation information into the model, the error rate dropped to 41.92%. The combination of significant SNPs, CpG sites, age, sex, and smoking status, achieved the lowest prediction error rate of 41.54%.

Conclusions

Compared to using SNP data only, adding methylation data in prediction models slightly improved the error rate; further prediction error reduction is achieved by a combination of genome, methylation genome, and environmental factors.

相似文献

16.

Epigenetic regulators sculpt the plastic brain

Ji-Song Guan Hong Xie San-Xiong Liu 《生物学前沿》2017,12(5):317-332

相似文献

17.

AceView: a comprehensive cDNA-supported gene and transcripts annotation

Thierry-Mieg D Thierry-Mieg J 《Genome biology》2006,7(Z1):S12.1-S1214

相似文献

18.

Speckle-type POZ protein functions as a tumor suppressor in non-small cell lung cancer due to DNA methylation

Sumei Yao Xinming Chen Jinliang Chen Yangbo Guan Yifei Liu Jianrong Chen Xuedong Lv 《Cancer cell international》2018,18(1):213