
1.

PHD: an automatic mail server for protein secondary structure prediction (total citations: 30; self-citations: 0; citations by others: 30)

Rost Burkhard; Sander Chris; Schneider Reinhard 《Bioinformatics (Oxford, England)》1994,10(1):53-60

By the middle of 1993, >30 000 protein sequences had been listed. For 1000 of these, the three-dimensional (tertiary) structure has been experimentally solved. Another 7000 can be modelled by homology. For the remaining 21 000 sequences, secondary structure prediction provides a rough estimate of structural features. Predictions in three states range between 35% (random) and 88% (homology modelling) overall accuracy. Using information about evolutionary conservation as contained in multiple sequence alignments, the secondary structure of 4700 protein sequences was predicted by the automatic e-mail server PHD. For proteins with at least one known homologue, the method has an expected overall three-state accuracy of 71.4% (evaluated on 126 unique protein chains).

2.

Progress of 1D protein structure prediction at last

Burkhard Rost Chris Sander 《Proteins》1995,23(3):295-300

Accuracy of predicting protein secondary structure and solvent accessibility from sequence information has been improved significantly by using information contained in multiple sequence alignments as input to a neural network system. For the Asilomar meeting, predictions for 13 proteins were generated automatically using the publicly available prediction method PHD. The results confirm the estimate of 72% three-state prediction accuracy. The fairly accurate predictions of secondary structure segments made the tool useful as a starting point for modeling of higher dimensional aspects of protein structure. © 1995 Wiley-Liss, Inc.

3.

Improving the prediction of secondary structure of 'TIM-barrel' enzymes (total citations: 1; self-citations: 0; citations by others: 1)

T Niermann K Kirschner 《Protein engineering》1991,4(3):359-370

The information contained in aligned sets of homologous protein sequences should improve the score of secondary structure prediction. Seven different enzymes having the (beta/alpha)8 or TIM-barrel fold were used to optimize the prediction with regard to this class of enzymes. The alpha-helix, beta-strand and loop propensities of the Garnier-Osguthorpe-Robson method were averaged at aligned residue positions, leading to a significant improvement over the average score obtained from single sequences. The increased accuracy correlates with the average sequence variability of the aligned set. Further improvements were obtained by using the following averaged properties as weights for the averaged state propensities: amphipathic moment and alpha-helix; hydropathy and beta-strand; chain flexibility and loop. The clustering of conserved residues at the C-terminal ends of the beta-strands was used as an additional positive weight for beta-strand propensity and increased the prediction of otherwise unpredicted beta-strands decisively. The automatic weighted prediction method identifies greater than 95% of the secondary structure elements of the set of seven TIM-barrel enzymes.
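The idea of averaging state propensities at aligned residue positions can be illustrated with a minimal Python sketch. This is not the authors' implementation: the propensity table TOY_PROPENSITY holds made-up values (not the Garnier-Osguthorpe-Robson parameters), the real method scores a window of neighbouring residues rather than a single residue, and the additional weights described in the abstract are omitted.

```python
# Hypothetical (helix, strand, loop) propensities per residue.
# These numbers are illustrative only; they are NOT the GOR parameters.
TOY_PROPENSITY = {
    "A": (1.4, 0.8, 0.7),
    "L": (1.3, 1.2, 0.6),
    "V": (0.9, 1.7, 0.6),
    "G": (0.5, 0.7, 1.6),
    "P": (0.4, 0.5, 1.7),
}
STATES = "HEL"  # helix, strand, loop


def predict_from_alignment(alignment):
    """Predict one state per alignment column from averaged propensities."""
    prediction = []
    for column in zip(*alignment):
        totals = [0.0, 0.0, 0.0]
        counted = 0
        for residue in column:
            # Skip gaps and residues missing from the toy table.
            if residue not in TOY_PROPENSITY:
                continue
            for i, value in enumerate(TOY_PROPENSITY[residue]):
                totals[i] += value
            counted += 1
        if counted == 0:
            prediction.append("L")  # assumption: all-gap columns default to loop
            continue
        averages = [total / counted for total in totals]
        prediction.append(STATES[averages.index(max(averages))])
    return "".join(prediction)


if __name__ == "__main__":
    print(predict_from_alignment(["AVG", "AVP", "LVG"]))
```

Averaging over the column lets a residue that is atypical in one sequence be outvoted by its homologues, which is the effect the abstract attributes to the aligned set.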

4.

A simple and fast approach to prediction of protein secondary structure from multiply aligned sequences with accuracy above 70% (total citations: 3; self-citations: 3; citations by others: 0)

P. K. Mehta J. Heringa P. Argos 《Protein science : a publication of the Protein Society》1995,4(12):2517-2525

To improve secondary structure predictions in protein sequences, the information residing in multiple sequence alignments of substituted but structurally related proteins is exploited. A database comprised of 70 protein families and a total of 2,500 sequences, some of which were aligned by tertiary structural superpositions, was used to calculate residue exchange weight matrices within alpha-helical, beta-strand, and coil substructures, respectively. Secondary structure predictions were made based on the observed residue substitutions in local regions of the multiple alignments and the largest possible associated exchange weights in each of the three matrix types. Comparison of the observed and predicted secondary structure on a per-residue basis yielded a mean accuracy of 72.2%. Individual alpha-helix, beta-strand, and coil states were respectively predicted at 66.7, and 75.8% correctness, representing a well-balanced three-state prediction. The accuracy level, verified by cross-validation through jack-knife tests on all protein families, dropped, on average, to only 70.9%, indicating the rigor of the prediction procedure. On the basis of robustness, conceptual clarity, accuracy, and executable efficiency, the method has considerable advantage, especially with its sole reliance on amino acid substitutions within structurally related proteins.

5.

Generation of deviation parameters for amino acid singlets, doublets and triplets from three-dimensional structures of proteins and its implications for secondary structure prediction from amino acid sequences

S. A. Mugilan K. Veluraja 《Journal of biosciences》2000,25(1):81-90

We present a new method, secondary structure prediction by deviation parameter (SSPDP), for predicting the secondary structure of proteins from amino acid sequence. Deviation parameters (DP) for amino acid singlets, doublets and triplets were computed with respect to secondary structural elements of proteins based on the dictionary of secondary structure prediction (DSSP)-generated secondary structure for 408 selected nonhomologous proteins. Amino acid triplets that are not found in the selected dataset are assigned a DP value of zero with respect to the secondary structural elements of proteins. The total number of parameters generated is 15,432, out of 25,260 possible parameters. The deviation parameter set is complete with respect to amino acid singlets and doublets, and partially complete with respect to amino acid triplets. These generated parameters were used to predict secondary structural elements from amino acid sequence. The secondary structure predicted by our method (SSPDP) was compared with that of single sequence (NNPREDICT) and multiple sequence (PHD) methods. The average prediction accuracy for α-helix by the SSPDP, NNPREDICT and PHD methods was found to be 57%, 44% and 69% respectively for the proteins in the selected dataset. For β-strand the prediction accuracy is found to be 69%, 21% and 53% respectively by the SSPDP, NNPREDICT and PHD methods. This clearly indicates that the secondary structure prediction by our method is as good as the PHD method but much better than the NNPREDICT method.

6.

Prediction of Protein Secondary Structure Using Feature Selection and Analysis Approach

Yonge Feng Hao Lin Liaofu Luo 《Acta biotheoretica》2014,62(1):1-14

The prediction of the secondary structure of a protein from its amino acid sequence is an important step towards the prediction of its three-dimensional structure. However, the accuracy of ab initio secondary structure prediction from sequence is about 80% currently, which is still far from satisfactory. In this study, we proposed a novel method that uses binomial distribution to optimize tetrapeptide structural words and increment of diversity with quadratic discriminant to perform prediction for protein three-state secondary structure. A benchmark dataset including 2,640 proteins with sequence identity of less than 25% was used to train and test the proposed method. The results indicate that overall accuracy of 87.8% was achieved in secondary structure prediction by using ten-fold cross-validation. Moreover, the accuracy of predicted secondary structures ranges from 84 to 89% at the level of residue. These results suggest that the feature selection technique can detect the optimized tetrapeptide structural words which affect the accuracy of predicted secondary structures.

7.

A seqlet-based maximum entropy Markov approach for protein secondary structure prediction

Qiwen Dong Xiaolong Wang Lei Lin Yi Guan 《Science in China Series C (English Edition)》2005,48(4):394-405

A novel method for predicting the secondary structures of proteins from amino acid sequence is presented. Protein secondary structure seqlets, which are analogous to the words in natural language, are extracted. These seqlets capture the relationship between amino acid sequence and the secondary structures of proteins and together form the protein secondary structure dictionary. More specifically, the dictionary is organism-specific. Protein secondary structure prediction is formulated as an integrated word segmentation and part-of-speech tagging problem. The word lattice is used to represent the results of the word segmentation, and the maximum entropy model is used to calculate the probability of a seqlet being tagged as a certain secondary structure type. The method is Markovian in the seqlets, permitting efficient exact calculation of the posterior probability distribution over all possible word segmentations and their tags by the Viterbi algorithm. The optimal segmentations and their tags are computed as the results of protein secondary structure prediction. The method is applied to predict the secondary structures of proteins of four organisms and compared with the PHD method. The results show that the performance of this method is higher than that of PHD by about 3.9% Q₃ accuracy and 4.6% SOV accuracy. Combining the method with locally similar protein sequences obtained by BLAST can give better predictions. The method is also tested on the 50 CASP5 target proteins, with Q₃ accuracy 78.9% and SOV accuracy 77.1%. A web server for protein secondary structure prediction has been constructed and is available at http://www.insun.hit.edu.cn:81/demos/biology/index.html.
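For readers unfamiliar with Viterbi decoding, a minimal Python sketch follows. It is a simplification and not the paper's method: it tags single residues rather than seqlets in a word lattice, and the probabilities in START, TRANS and EMIT are invented toy values rather than maximum entropy model outputs. The two-letter residue alphabet and the two states (H for helix, C for coil) are likewise assumptions made for brevity.

```python
import math

# Toy model; all probabilities below are invented for illustration.
STATES = "HC"
START = {"H": 0.5, "C": 0.5}
TRANS = {"H": {"H": 0.8, "C": 0.2}, "C": {"H": 0.2, "C": 0.8}}
EMIT = {"H": {"A": 0.9, "G": 0.1}, "C": {"A": 0.1, "G": 0.9}}


def viterbi(observations, states, start_p, trans_p, emit_p):
    """Return the most probable state path for a sequence of observations."""
    if not observations:
        return ""
    log = math.log
    # best[t][s]: best log-score of any path that ends in state s at position t.
    best = [{s: log(start_p[s]) + log(emit_p[s][observations[0]]) for s in states}]
    back = []  # back[t - 1][s]: predecessor of s on the best path to position t
    for obs in observations[1:]:
        scores, pointers = {}, {}
        for s in states:
            prev = max(states, key=lambda p: best[-1][p] + log(trans_p[p][s]))
            scores[s] = best[-1][prev] + log(trans_p[prev][s]) + log(emit_p[s][obs])
            pointers[s] = prev
        best.append(scores)
        back.append(pointers)
    # Trace back from the best final state.
    state = max(states, key=lambda s: best[-1][s])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    return "".join(reversed(path))


if __name__ == "__main__":
    print(viterbi("AAGG", STATES, START, TRANS, EMIT))
```

Because each step only needs the best scores from the previous position, the search over all state paths is exact yet linear in sequence length, which is the efficiency property the abstract relies on.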

8.

A seqlet-based maximum entropy Markov approach for protein secondary structure prediction

DONG Qiwen, WANG Xiaolong, LIN Lei & GUAN Yi; School of Computer Science Technology, Harbin Institute of Technology, Harbin, China 《Science China: Life Sciences (English Edition)》2005,48(4):394-405

1 Introduction. The prediction of protein structure and function from amino acid sequences is one of the most important problems in molecular biology. This problem is becoming more pressing as the number of known protein sequences is explored as a result of genome and other sequencing projects, and the protein sequence-structure gap is widening rapidly[1]. Therefore, computational tools to predict protein structures are needed to narrow the widening gap. Although the prediction of three dim…

9.

Combining evolutionary information and neural networks to predict protein secondary structure (total citations: 1; self-citations: 0; citations by others: 1)

Burkhard Rost Chris Sander 《Proteins》1994,19(1):55-72

Using evolutionary information contained in multiple sequence alignments as input to neural networks, secondary structure can be predicted at significantly increased accuracy. Here, we extend our previous three-level system of neural networks by using additional input information derived from multiple alignments. Using a position-specific conservation weight as part of the input increases performance. Using the number of insertions and deletions reduces the tendency for overprediction and increases overall accuracy. Addition of the global amino acid content yields a further improvement, mainly in predicting structural class. The final network system has a sustained overall accuracy of 71.6% in a multiple cross-validation test on 126 unique protein chains. A test on a new set of 124 recently solved protein structures that have no significant sequence similarity to the learning set confirms the high level of accuracy. The average cross-validated accuracy for all 250 sequence-unique chains is above 72%. Using various data sets, the method is compared to alternative prediction methods, some of which also use multiple alignments: the performance advantage of the network system is at least 6 percentage points in three-state accuracy. In addition, the network estimates secondary structure content from multiple sequence alignments about as well as circular dichroism spectroscopy on a single protein and classifies 75% of the 250 proteins correctly into one of four protein structural classes. Of particular practical importance is the definition of a position-specific reliability index. For 40% of all residues the method has a sustained three-state accuracy of 88%, as high as the overall average for homology modelling. A further strength of the method is greatly increased accuracy in predicting the placement of secondary structure segments. © 1994 Wiley-Liss, Inc.

10.

Use of tetrapeptide signals for protein secondary-structure prediction

Feng Y Luo L 《Amino acids》2008,35(3):607-614

This paper develops a novel sequence-based method, tetra-peptide-based increment of diversity with quadratic discriminant analysis (TPIDQD for short), for protein secondary-structure prediction. The proposed TPIDQD method is based on tetra-peptide signals and is used to predict the structure of the central residue of a sequence fragment. The three-state overall per-residue accuracy (Q₃) is about 80% in the threefold cross-validated test for 21-residue fragments in the CB513 dataset. The accuracy can be further improved by taking long-range sequence information (fragments of more than 21 residues) into account in prediction. The results show that tetra-peptide signals can indeed reflect some relationship between an amino acid sequence and its secondary structure, indicating the importance of tetra-peptide signals as the protein folding code in protein structure prediction.
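The three-state per-residue accuracy (Q₃) quoted throughout these abstracts is the percentage of residues whose predicted state matches the observed one. A minimal Python sketch, assuming both structures are given as equal-length strings over three state letters (the function name q3_accuracy and the H/E/C alphabet are illustrative choices, not taken from any of the papers):

```python
def q3_accuracy(observed, predicted):
    """Percentage of positions where the predicted state equals the observed one."""
    if len(observed) != len(predicted):
        raise ValueError("observed and predicted must have the same length")
    if not observed:
        raise ValueError("cannot score an empty sequence")
    correct = sum(o == p for o, p in zip(observed, predicted))
    return 100.0 * correct / len(observed)


if __name__ == "__main__":
    # Two of ten residues are mispredicted here.
    print(q3_accuracy("HHHHEEECCC", "HHHCEEECCH"))
```

Q₃ counts every residue equally, so it says nothing about whether whole segments are placed correctly; that is why some of the abstracts above also report a segment-based score (SOV).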