
1.

Reduced representation model of protein structure prediction: statistical potential and genetic algorithms. Total citations: 5; self-citations: 7; citations by others: 5


S. Sun 《Protein science : a publication of the Protein Society》1993,2(5):762-785

A reduced representation model, which has been described in previous reports, was used to predict the folded structures of proteins from their primary sequences and random starting conformations. The molecular structure of each protein has been reduced to its backbone atoms (with ideal fixed bond lengths and valence angles) and each side chain approximated by a single virtual united-atom. The coordinate variables were the backbone dihedral angles phi and psi. A statistical potential function, which included local and nonlocal interactions and was computed from known protein structures, was used in the structure minimization. A novel approach, employing the concepts of genetic algorithms, has been developed to simultaneously optimize a population of conformations. With the information of primary sequence and the radius of gyration of the crystal structure only, and starting from randomly generated initial conformations, I have been able to fold melittin, a protein of 26 residues, with high computational convergence. The computed structures have a root mean square error of 1.66 Å (distance matrix error = 0.99 Å) on average to the crystal structure. Similar results for avian pancreatic polypeptide inhibitor, a protein of 36 residues, are obtained. Application of the method to apamin, an 18-residue polypeptide with two disulfide bonds, shows that it folds apamin to native-like conformations with the correct disulfide bonds formed.
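The idea of optimizing a population of phi/psi conformations with a genetic algorithm can be illustrated with a minimal sketch. This is not the author's method: the energy function `toy_energy`, the target angles, and all parameters (`pop_size`, `generations`, `mutation_sd`) are hypothetical stand-ins chosen only to show selection, crossover, and mutation acting on dihedral-angle vectors.

```python
import math
import random

def toy_energy(angles, target):
    """Hypothetical stand-in energy: zero when every angle matches the target."""
    return sum(1.0 - math.cos(a - t) for a, t in zip(angles, target))

def evolve(target, pop_size=30, generations=60, mutation_sd=0.3, seed=0):
    """Evolve dihedral-angle vectors; return (initial_best, final_best) energies."""
    rng = random.Random(seed)
    n = len(target)  # must be at least 2 for one-point crossover
    pop = [[rng.uniform(-math.pi, math.pi) for _ in range(n)]
           for _ in range(pop_size)]
    initial_best = min(toy_energy(ind, target) for ind in pop)
    for _ in range(generations):
        pop.sort(key=lambda ind: toy_energy(ind, target))
        parents = pop[: pop_size // 2]  # truncation selection keeps the fitter half
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n)  # one-point crossover
            child = a[:cut] + b[cut:]
            i = rng.randrange(n)  # mutate one angle with Gaussian noise
            child[i] += rng.gauss(0.0, mutation_sd)
            children.append(child)
        pop = parents + children
    final_best = min(toy_energy(ind, target) for ind in pop)
    return initial_best, final_best

# Hypothetical target: helix-like (phi, psi) pairs in radians for five residues.
initial, final = evolve([-1.0, -0.8] * 5)
print(initial, final)
```

Because the fitter half of the population is carried over unchanged each generation, the best energy in this sketch can never get worse. A real application would replace `toy_energy` with a statistical potential evaluated on the rebuilt backbone.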

2.

Evaluation of current techniques for Ab initio protein structure prediction

Tom Defay Fred E. Cohen 《Proteins》1995,23(3):431-445

The results of a protein structure prediction contest are reviewed. Twelve different groups entered predictions on 14 proteins of known sequence whose structures had been determined but not yet disseminated to the scientific community. Thus, these represent true tests of the current state of structure prediction methodologies. From this work, it is clear that accurate tertiary structure prediction is not yet possible. However, protein fold and motif prediction are possible when the motif is recognizably similar to another known structure. Internal symmetry and the information inherent in an aligned family of homologous sequences facilitate predictive efforts. Novel folds remain a major challenge for prediction efforts. © 1995 Wiley-Liss, Inc.

3.

Theoretical methods and stages of protein structure prediction. Total citations: 2; self-citations: 0; citations by others: 2

孙侠殷志祥《生物学杂志》2007,24(1):16-17,15

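Protein structure prediction has long been a focus of research. This paper reviews several theoretical methods for protein structure prediction and its different stages.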

4.

Perfect temperature for protein structure prediction and folding

Alexei V. Finkelstein Alexander M. Gutin Azat Ya. Badretdinov 《Proteins》1995,23(2):151-162

We have investigated the influence of the “noise” of inevitable errors in energetic parameters on protein structure prediction. Because of this noise, only a part of all the interactions operating in a protein chain can be taken into account, and therefore a search for the energy minimum becomes inadequate for protein structure prediction. One can rather rely on statistical mechanics: a calculation carried out at a temperature T_* somewhat below that of protein melting gives the best possible, though always approximate, prediction. The early stages of protein folding also “take into account” only a part of all the interactions; consequently, the same temperature T_* is favorable for the self-organization of native-like intermediates in protein folding. © 1995 Wiley-Liss, Inc.
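The contrast between searching for a single energy minimum and calculating at a finite temperature can be made concrete with Boltzmann weights. The sketch below is only an illustration of that general statistical-mechanics idea, not the authors' procedure: the function name `boltzmann_weights`, the example energies, and the choice of units with `k_b = 1` are all assumptions.

```python
import math

def boltzmann_weights(energies, temperature, k_b=1.0):
    """Return normalized Boltzmann probabilities for a list of energies."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    e_min = min(energies)  # shift by the minimum for numerical stability
    factors = [math.exp(-(e - e_min) / (k_b * temperature)) for e in energies]
    total = sum(factors)
    return [f / total for f in factors]

# Hypothetical energies of three candidate states, in arbitrary units.
energies = [0.0, 1.0, 2.0]
print(boltzmann_weights(energies, temperature=1.0))
print(boltzmann_weights(energies, temperature=0.1))
```

At low temperature nearly all the weight sits on the lowest-energy state, which is only trustworthy if the energies are accurate. At a higher temperature the weight is spread over several states, so errors in individual energies matter less.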

5.

A protocol for computer-based protein structure and function prediction

Roy A Xu D Poisson J Zhang Y 《Journal of visualized experiments : JoVE》2011,(57):e3259

Genome sequencing projects have ciphered millions of protein sequences, which require knowledge of their structure and function to improve the understanding of their biological role. Although experimental methods can provide detailed information for a small fraction of these proteins, computational modeling is needed for the majority of protein molecules which are experimentally uncharacterized. The I-TASSER server is an on-line workbench for high-resolution modeling of protein structure and function. Given a protein sequence, a typical output from the I-TASSER server includes secondary structure prediction, predicted solvent accessibility of each residue, homologous template proteins detected by threading and structure alignments, up to five full-length tertiary structural models, and structure-based functional annotations for enzyme classification, Gene Ontology terms and protein-ligand binding sites. All the predictions are tagged with a confidence score which tells how accurate the predictions are without knowing the experimental data. To facilitate the special requests of end users, the server provides channels to accept user-specified inter-residue distance and contact maps to interactively change the I-TASSER modeling; it also allows users to specify any proteins as template, or to exclude any template proteins during the structure assembly simulations. The structural information could be collected by the users based on experimental evidence or biological insights with the purpose of improving the quality of I-TASSER predictions. The server was evaluated as the best program for protein structure and function predictions in the recent community-wide CASP experiments. There are currently >20,000 registered scientists from over 100 countries who are using the on-line I-TASSER server.

6.

Evaluation of protein secondary structure prediction methods

孟翔燕孟军葛家麒《生物信息学》2010,8(3):206-209

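At present, the evaluation of protein secondary structure prediction methods mainly considers prediction accuracy and does not fully account for the influence of a method's own parameters. This paper proposes a new evaluation approach that combines intrinsic and extrinsic evaluation to judge the merits of a prediction method. Taking a protein secondary structure prediction method based on a hybrid parallel genetic algorithm as an example, intrinsic evaluation is used to select the internal parameters (slice length and number of classes within a group) appropriately, which effectively improves prediction accuracy. Extrinsic evaluation, through comparison with other protein secondary structure prediction algorithms based on stochastic algorithms and with the conclusions provided by CASP, demonstrates the effectiveness and correctness of the method, thereby verifying the objectivity, fairness, and comprehensiveness of the intrinsic and extrinsic evaluations.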

7.

Secondary structure prediction and unrefined tertiary structure prediction for cyclin A, B, and D

Dietlind L. Gerloff Fred E. Cohen 《Proteins》1996,24(1):18-34

We present heuristic-based predictions of the secondary and tertiary structures of the cyclins A, B, and D, representatives of the cyclin superfamily. The list of suggested constraints for tertiary structure assembly was left unrefined in order to submit this report before an announced crystal structure for cyclin A becomes available. To predict these constraints, a master sequence alignment over 270 positions of cyclin types A, B, and D was adjusted based on individual secondary structure predictions for each type. We used new heuristics for predicting aromatic residues at protein-protein interfaces and to identify sequentially distinct regions in the protein chain that cluster in the folded structure. The boundaries of two conjectured domains in the cyclin fold were predicted based on experimental data in the literature. The domain that is important for interaction of the cyclins with cyclin-dependent kinases (CDKs) is predicted to contain six helices; the second domain in the consensus model contains both helices and a β-sheet that is formed by sequentially distant regions in the protein chain. A plausible phosphorylation site is identified. This work represents a blinded test of the method for prediction of secondary and, to a lesser extent, tertiary structure from a set of homologous protein sequences. Evaluation of our predictions will become possible with the publication of the announced crystal structure.

8.

Three-dimensional structure prediction of the SARS virus S protein. Total citations: 1; self-citations: 0; citations by others: 1

岳俊杰汪莉王月兰李北平梁龙俞炜源黄培堂《生物技术通讯》2003,14(5):399-400

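Protein fold recognition methods can detect structural similarity between proteins that have no sequence homology. A protein fold recognition method was used to predict the structure of the N-terminal region of the SARS virus S protein. The modeled N-terminal region of the SARS virus S protein is a fully folded structure.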

9.

Protein structure prediction in genomics. Total citations: 1; self-citations: 0; citations by others: 1

Jones DT 《Briefings in bioinformatics》2001,2(2):111-125

As the number of completely sequenced genomes rapidly increases, including now the complete Human Genome sequence, the post-genomic problems of genome-scale protein structure determination and the issue of gene function identification become ever more pressing. In fact, these problems can be seen as interrelated in that experimentally determining or predicting the structure of proteins encoded by genes of interest is one possible means to glean subtle hints as to the functions of these genes. The applicability of this approach to gene characterisation is reviewed, along with a brief survey of the reliability of large-scale protein structure prediction methods and the prospects for the development of new prediction methods.

10.

SimFold energy function for de novo protein structure prediction: consensus with Rosetta

Fujitsuka Y Chikenji G Takada S 《Proteins》2006,62(2):381-398

Predicting protein tertiary structures by in silico folding is still very difficult for proteins that have new folds. Here, we developed a coarse-grained energy function, SimFold, for de novo structure prediction, performed a benchmark test of prediction with fragment assembly simulations for 38 test proteins, and proposed consensus prediction with Rosetta. The SimFold energy consists of many terms that take into account solvent-induced effects on the basis of physicochemical consideration. In the benchmark test, SimFold succeeded in predicting native structures within 6.5 Å for 12 of 38 proteins; this success rate was the same as that by the publicly available version of Rosetta (ab initio version 1.2) run with default parameters. We investigated which energy terms in SimFold contribute to structure prediction performance, finding that the hydrophobic interaction is the most crucial for the prediction, whereas other sequence-specific terms have weak but positive roles. In the benchmark, well-predicted proteins by SimFold and by Rosetta were not the same for 5 of 12 proteins, which led us to introduce consensus prediction. With combined decoys, we succeeded in prediction for 16 proteins, four more than SimFold or Rosetta separately. For each of 38 proteins, structural ensembles generated by SimFold and by Rosetta were qualitatively compared by mapping sampled structural space onto two dimensions. For proteins of which one of the two methods succeeded and the other failed in prediction, the former had a less scattered ensemble located around the native. For proteins of which both methods succeeded in prediction, often two ensembles were mixed up.

11.

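Application of neural networks to protein secondary structure prediction. Total citations: 3; self-citations: 0; citations by others: 3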

须文波陆克中《生物信息学》2006,4(1):26-29

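This paper introduces the research significance of protein secondary structure prediction, discusses neural network design issues for protein secondary structure prediction, reviews in some detail the main progress made in recent years in applying neural network methods to protein secondary structure prediction, and looks ahead to the prospects for protein structure prediction.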

12.

Protein structure prediction methods for drug design. Total citations: 1; self-citations: 0; citations by others: 1

Lengauer T Zimmer R 《Briefings in bioinformatics》2000,1(3):275-288

Along the long path from genomic data to a new drug, the knowledge of three-dimensional protein structure can be of significant help in several places. This paper points out such places, discusses the virtues of protein structure knowledge and reviews bioinformatics methods for gaining such knowledge on the protein structure.

13.

Empirical limits for template-based protein structure prediction: the CASP5 example

Contreras-Moreira B Ezkurdia I Tress ML Valencia A 《FEBS letters》2005,579(5):1203-1207

Most protein structure prediction methods use templates to assist in the construction of protein models. In this paper, we analyse the current state of template-based modelling approaches and reach an estimate of the empirical limits of these methods. Our analysis shows that current prediction methods are already reaching these empirical accuracy limits in the easier cases, where finding a close homologue to the native target structure is not a problem. However, we find that even in the absence of alignment errors and using optimal templates, template-based methods have intrinsic limitations, suggesting that other methodologies, such as ab initio procedures, must be used if accuracy is ultimately to be improved.

14.

Structure prediction of protein complexes by an NMR-based protein docking algorithm

Oliver Kohlbacher Andreas Burchardt Andreas Moll Andreas Hildebrandt Peter Bayer Hans-Peter Lenhof 《Journal of biomolecular NMR》2001,20(1):15-21

Protein docking algorithms can be used to study the driving forces and reaction mechanisms of docking processes. They are also able to speed up the lengthy process of experimental structure elucidation of protein complexes by proposing potential structures. In this paper, we discuss a variant of the protein-protein docking problem, where the input consists of the tertiary structures of proteins A and B plus an unassigned one-dimensional ¹H-NMR spectrum of the complex AB. We present a new scoring function for evaluating and ranking potential complex structures produced by a docking algorithm. The scoring function computes a 'theoretical' ¹H-NMR spectrum for each tentative complex structure and subtracts the calculated spectrum from the experimental one. The absolute areas of the difference spectra are then used to rank the potential complex structures. In contrast to formerly published approaches (e.g. [Morelli et al. (2000) Biochemistry, 39, 2530–2537]) we do not use distance constraints (intermolecular NOE constraints). We have tested the approach with four protein complexes whose three-dimensional structures are stored in the PDB data bank [Bernstein et al. (1977)] and whose ¹H-NMR shift assignments are available from the BMRB database. The best result was obtained for an example where all standard scoring functions failed completely. Here, our new scoring function achieved an almost perfect separation between good approximations of the true complex structure and false positives.
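The ranking step described in this abstract, scoring each candidate by the absolute area of the difference between its calculated spectrum and the experimental one, can be sketched as follows. This is an illustrative simplification rather than the authors' implementation: the function names, the rectangle-rule integration, the shared evenly spaced grid, and the toy intensity values are all assumptions, and computing a theoretical spectrum from a structure is not shown.

```python
def difference_area(experimental, theoretical, step=1.0):
    """Absolute area of the difference spectrum on a shared, evenly spaced grid."""
    if len(experimental) != len(theoretical):
        raise ValueError("spectra must be sampled on the same grid")
    return step * sum(abs(e - t) for e, t in zip(experimental, theoretical))

def rank_candidates(experimental, candidates, step=1.0):
    """Sort candidate names by increasing difference area (best match first)."""
    return sorted(
        candidates,
        key=lambda name: difference_area(experimental, candidates[name], step),
    )

# Hypothetical intensities sampled at five chemical-shift positions.
experimental = [0.0, 1.0, 3.0, 1.0, 0.0]
candidates = {
    "complex_a": [0.0, 1.0, 3.0, 1.0, 0.0],
    "complex_b": [0.0, 0.0, 3.0, 2.0, 0.0],
    "complex_c": [3.0, 0.0, 0.0, 0.0, 0.0],
}
print(rank_candidates(experimental, candidates))
```

A smaller area means the calculated spectrum is closer to the measured one, so the first name in the returned list is the best-scoring candidate.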

15.

Customised fragments libraries for protein structure prediction based on structural class annotations

Jad Abbass Jean-Christophe Nebel 《BMC bioinformatics》2015,16(1)

Background

Since experimental techniques are time and cost consuming, in silico protein structure prediction is essential to produce conformations of protein targets. When homologous structures are not available, fragment-based protein structure prediction has become the approach of choice. However, it still has many issues including poor performance when targets’ lengths are above 100 residues, excessive running times and sub-optimal energy functions. Taking advantage of the reliable performance of structural class prediction software, we propose to address some of the limitations of fragment-based methods by integrating structural constraints in their fragment selection process.

Results

Using Rosetta, a state-of-the-art fragment-based protein structure prediction package, we evaluated our proposed pipeline on 70 former CASP targets containing up to 150 amino acids. Using either CATH or SCOP-based structural class annotations, enhancement of structure prediction performance is highly significant in terms of both GDT_TS (at least +2.6, p-values < 0.0005) and RMSD (−0.4, p-values < 0.005). Although CATH and SCOP classifications are different, they perform similarly. Moreover, proteins from all structural classes benefit from the proposed methodology. Further analysis also shows that methods relying on class-based fragments produce conformations which are more relevant to the user and converge quicker towards the best model as estimated by GDT_TS (up to 10% on average). This substantiates our hypothesis that usage of structurally relevant templates leads not only to reducing the size of the conformation space to be explored, but also to focusing on a more relevant area.

Conclusions

Since our methodology produces models the quality of which is up to 7% higher on average than those generated by a standard fragment-based predictor, we believe it should be considered before conducting any fragment-based protein structure prediction. Despite such progress, ab initio prediction remains a challenging task, especially for proteins of average and large sizes. Apart from improving search strategies and energy functions, integration of additional constraints seems a promising route, especially if they can be accurately predicted from sequence alone.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-015-0576-2) contains supplementary material, which is available to authorized users.
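The GDT_TS score reported in the Results of this entry averages the fraction of residues that fall within 1, 2, 4 and 8 Å of their native positions. The sketch below is a simplified illustration under one stated assumption: the per-residue C-alpha deviations come from a single, already computed superposition, whereas standard evaluation tools search over many superpositions to maximize each fraction. The function name and the example deviations are hypothetical.

```python
def gdt_ts(distances, cutoffs=(1.0, 2.0, 4.0, 8.0)):
    """Percentage score from per-residue C-alpha deviations (in Å) under one superposition."""
    if not distances:
        raise ValueError("need at least one residue")
    fractions = [
        sum(d <= c for d in distances) / len(distances) for c in cutoffs
    ]
    return 100.0 * sum(fractions) / len(cutoffs)

# Hypothetical deviations for a four-residue model, in Å.
print(gdt_ts([0.5, 1.5, 3.0, 9.0]))  # → 56.25
```

On this scale a gain of +2.6 corresponds to a few more residues in every hundred landing within the distance cutoffs.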

16.

Analysis of anisotropic side-chain packing in proteins and application to high-resolution structure prediction

Misura KM Morozov AV Baker D 《Journal of molecular biology》2004,342(2):651-664

Pi-pi, cation-pi, and hydrophobic packing interactions contribute specificity to protein folding and stability to the native state. As a step towards developing improved models of these interactions in proteins, we compare the side-chain packing arrangements in native proteins to those found in compact decoys produced by the Rosetta de novo structure prediction method. We find enrichments in the native distributions for T-shaped and parallel offset arrangements of aromatic residue pairs, in parallel stacked arrangements of cation-aromatic pairs, in parallel stacked pairs involving proline residues, and in parallel offset arrangements for aliphatic residue pairs. We then investigate the extent to which the distinctive features of native packing can be explained using Lennard-Jones and electrostatics models. Finally, we derive orientation-dependent pi-pi, cation-pi and hydrophobic interaction potentials based on the differences between the native and compact decoy distributions and investigate their efficacy for high-resolution protein structure prediction. Surprisingly, the orientation-dependent potential derived from the packing arrangements of aliphatic side-chain pairs distinguishes the native structure from compact decoys better than the orientation-dependent potentials describing pi-pi and cation-pi interactions.

17.

Protein structure prediction methods based on genetic algorithms

张超张晖李冀新高红《生物信息学》2006,4(3):128-131

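Genetic algorithms (GA) derive from the laws of evolution in nature and are adaptive, heuristic, probabilistic, iterative global search algorithms. This paper introduces the basic principles, procedure, and advantages of GA, and summarizes model building and execution strategies for GA in protein structure prediction, as well as research progress in combining multiple algorithms to predict protein structure.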

18.

Repeated structure and possible gene duplications in high potential iron protein and rubredoxin. Total citations: 1; self-citations: 0; citations by others: 1

Andrew D. McLachlan 《Journal of molecular evolution》1980,15(4):309-315

Summary: The three-dimensional structures of bacterial high potential iron protein (HIPIP) and rubredoxin have been searched for repeats to test whether these molecules evolved by independent tandem gene duplications. HIPIP has no structural repeats in spite of the observed repeated pattern in the amino acid sequence from Rhodopseudomonas gelatinosa. Rubredoxin from Clostridium pasteurianum has repeated hairpin loops of ten alpha-carbon atoms on both sides of the active centre iron-sulphur complex, which can be superposed within a root mean square deviation of 0.84 Å by rotating about a local pseudo-dyad axis. The structural repeat matches a weak repeat in the amino acid sequence. It is concluded that the sequence repeats in HIPIP are probably a coincidence but that rubredoxin may have evolved by gene duplication from a dimer of two primitive hairpin loops.

19.

Research progress in ab initio protein structure prediction methods

周建红艾观华方慧生陈凯先《生物信息学》2011,9(1):1-5

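Ab initio protein structure prediction obtains the native structure from amino acid sequence information alone, without relying on templates. Its key is to define the energy function correctly and to choose a computational search algorithm carefully in order to find the energy minimum. On this basis, this paper systematically introduces energy functions and conformational search strategies and lists several relatively successful ab initio prediction methods. Comparison leads to the conclusion that knowledge-based statistical energy functions have been the main direction of ab initio prediction in recent years, and that existing ab initio conformational searches all use the Monte Carlo method. This indicates that, as research on protein structure prediction deepens, breakthrough progress has been made on key problems such as the construction of energy functions, the choice of conformational search methods, and the ab initio prediction of large protein structures.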
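The Monte Carlo conformational search named in this abstract usually rests on the Metropolis acceptance rule. The sketch below illustrates that rule on a one-dimensional toy problem. It is not taken from any of the reviewed methods: the quadratic energy function, the function names, and the parameters (`steps`, `step_size`, `temperature`) are hypothetical choices for illustration.

```python
import math
import random

def metropolis_accept(delta_e, temperature, rng):
    """Metropolis criterion: accept downhill moves, uphill ones with Boltzmann probability."""
    if delta_e <= 0:
        return True
    return rng.random() < math.exp(-delta_e / temperature)

def monte_carlo_minimize(energy, start, steps=2000, step_size=0.5,
                         temperature=0.1, seed=0):
    """Random-walk search over a 1-D coordinate; returns the lowest-energy point seen."""
    rng = random.Random(seed)
    x, e = start, energy(start)
    best_x, best_e = x, e
    for _ in range(steps):
        trial = x + rng.uniform(-step_size, step_size)  # propose a small move
        trial_e = energy(trial)
        if metropolis_accept(trial_e - e, temperature, rng):
            x, e = trial, trial_e
            if e < best_e:
                best_x, best_e = x, e
    return best_x, best_e

# Hypothetical energy with its minimum at x = 2.
best_x, best_e = monte_carlo_minimize(lambda x: (x - 2.0) ** 2, start=0.0)
print(best_x, best_e)
```

Accepting some uphill moves is what lets the walk escape local minima. In a structure prediction setting the single coordinate would be replaced by the full set of conformational variables and the toy function by the chosen energy function.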

20.

Seventy-five percent accuracy in protein secondary structure prediction

Dmitrij Frishman Patrick Argos 《Proteins》1997,27(3):329-335

In this study we present an accurate secondary structure prediction procedure by using a query and related sequences. The most novel aspect of our approach is its reliance on local pairwise alignment of the sequence to be predicted with each related sequence rather than utilization of a multiple alignment. The residue-by-residue accuracy of the method is 75% in three structural states after jack-knife tests. The gain in prediction accuracy compared with the existing techniques, which are at best 72%, is achieved by secondary structure propensities based on both local and long-range effects, utilization of similar sequence information in the form of carefully selected pairwise alignment fragments, and reliance on a large collection of known protein primary structures. The method is especially appropriate for large-scale sequence analysis efforts such as genome characterization, where precise and significant multiple sequence alignments are not available or achievable. Proteins 27:329–335, 1997. © 1997 Wiley-Liss, Inc.
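The residue-by-residue accuracy in three structural states quoted above is commonly called Q3: the percentage of residues whose predicted state matches the observed one. A minimal sketch follows. The one-letter codes H, E and C for helix, strand and coil, the function name, and the example strings are assumptions made for illustration, not data from the paper.

```python
def q3_accuracy(predicted, observed):
    """Percentage of residues whose predicted state (H, E or C) matches the observed state."""
    if len(predicted) != len(observed) or not observed:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(p == o for p, o in zip(predicted, observed))
    return 100.0 * matches / len(observed)

# Hypothetical eight-residue example: six of eight states agree.
print(q3_accuracy("HHHECCCC", "HHHHCCCE"))  # → 75.0
```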