共查询到18条相似文献,搜索用时 0 毫秒
1.
We present an analysis of 10 blind predictions prepared for a recent conference, “Critical Assessment of Techniques for Protein Structure Prediction.”1 The sequences of these proteins are not detectably similar to those of any protein in the structure database then available, but we attempted, by a threading method, to recognize similarity to known domain folds. Four of the 10 proteins, as we subsequently learned, do indeed show significant similarity to then-known structures. For 2 of these proteins the predictions were accurate, in the sense that a similar structure was at or near the top of the list of threading scores, and the threading alignment agreed well with the corresponding structural alignment. For the best predicted model mean alignment error relative to the optimal structural alignment was 2.7 residues, arising entirely from small “register shifts” of strands or helices. In the analysis we attempt to identify factors responsible for these successes and failures. Since our threading method does not use gap penalties, we may readily distinguish between errors arising from our prior definition of the “cores” of known structures and errors arising from inherent limitations in the threading potential. It would appear from the results that successful substructure recognition depends most critically on accurate definition of the “fold” of a database protein. This definition must correctly delineate substructures that are, and are not, likely to be conserved during protein evolution. © 1995 Wiley-Liss, Inc. 相似文献
2.
Two new sets of scoring matrices are introduced: H2 for the protein sequence comparison and T2 for the protein sequence-structure correlation. Each element of H2 or T2 measures the frequency with which a pair of amino acid types in one protein, k-residues apart in the sequence, is aligned with another pair of residues, of given amino acid types (for H2) or in given structural states (for T2), in other structurally homologous proteins. There are four types, corresponding to the k-values of 1 to 4, for both H2 and T2. These matrices were set up using a large number of structurally homologous protein pairs, with little sequence homology between the pair, that were recently generated using the structure comparison program SHEBA. The two scoring matrices were incorporated into the main body of the sequence alignment program SSEARCH in the FASTA package and tested in a fold recognition setting in which a set of 107 test sequences were aligned to each of a panel of 3,539 domains that represent all known protein structures. Six procedures were tested; the straight Smith-Waterman (SW) and FASTA procedures, which used the Blosum62 single residue type substitution matrix; BLAST and PSI-BLAST procedures, which also used the Blosum62 matrix; PASH, which used Blosum62 and H2 matrices; and PASSC, which used Blosum62, H2, and T2 matrices. All procedures gave similar results when the probe and target sequences had greater than 30% sequence identity. However, when the sequence identity was below 30%, a similar structure could be found for more sequences using PASSC than using any other procedure. PASH and PSI-BLAST gave the next best results. 相似文献
3.
An elaborate knowledge-based energy function is designed for fold recognition. It is a residue-level single-body potential so that highly efficient dynamic programming method can be used for alignment optimization. It contains a backbone torsion term, a buried surface term, and a contact-energy term. The energy score combined with sequence profile and secondary structure information leads to an algorithm called SPARKS (Sequence, secondary structure Profiles and Residue-level Knowledge-based energy Score) for fold recognition. Compared with the popular PSI-BLAST, SPARKS is 21% more accurate in sequence-sequence alignment in ProSup benchmark and 10%, 25%, and 20% more sensitive in detecting the family, superfamily, fold similarities in the Lindahl benchmark, respectively. Moreover, it is one of the best methods for sensitivity (the number of correctly recognized proteins), alignment accuracy (based on the MaxSub score), and specificity (the average number of correctly recognized proteins whose scores are higher than the first false positives) in LiveBench 7 among more than twenty servers of non-consensus methods. The simple algorithm used in SPARKS has the potential for further improvement. This highly efficient method can be used for fold recognition on genomic scales. A web server is established for academic users on http://theory.med.buffalo.edu. 相似文献
4.
On the study of protein inverse folding problem, one goal is to find simple and efficient potential to evaluate the compatibility between structure and a given sequence. We present here a novo empirical mean force potential to address the importance of electrostatic interactions in protein inverse folding study. It is based on protein main chain polar fraction and constructed in a way similar with Sippl's from a database of 64 known independent three-dimensional protein structures. This potential was applied to recognize the protein native conformations among a conformation pool. Calculated results show that this potential is powerful in picking out native conformations, in addition it can also find structure similarity between proteins with low sequence similarity. The success of this new potential clearly shows the importance of electrostatic factors in protein inverse folding studies. © 1995 Wiley-Liss, Inc. 相似文献
5.
We describe the derivation and testing of a knowledge-based atomic environment potential for the modeling of protein structural energetics. An analysis of the probabilities of atomic interactions in a dataset of high-resolution protein structures shows that the probabilities of non-bonded inter-atomic contacts are not statistically independent events, and that the multi-body contact frequencies are poorly predicted from pairwise contact potentials. A pseudo-energy function is defined that measures the preferences for protein atoms to be in a given microenvironment defined by the number of contacting atoms in the environment and its atomic composition. This functional form is tested for its ability to recognize native protein structures amongst an ensemble of decoy structures and a detailed relative performance comparison is made with a number of common functions used in protein structure prediction. 相似文献
6.
Computational protein structure prediction remains a challenging task in protein bioinformatics. In the recent years, the importance of template-based structure prediction is increasing because of the growing number of protein structures solved by the structural genomics projects. To capitalize the significant efforts and investments paid on the structural genomics projects, it is urgent to establish effective ways to use the solved structures as templates by developing methods for exploiting remotely related proteins that cannot be simply identified by homology. In this work, we examine the effect of using suboptimal alignments in template-based protein structure prediction. We showed that suboptimal alignments are often more accurate than the optimal one, and such accurate suboptimal alignments can occur even at a very low rank of the alignment score. Suboptimal alignments contain a significant number of correct amino acid residue contacts. Moreover, suboptimal alignments can improve template-based models when used as input to Modeller. Finally, we use suboptimal alignments for handling a contact potential in a probabilistic way in a threading program, SUPRB. The probabilistic contacts strategy outperforms the partly thawed approach, which only uses the optimal alignment in defining residue contacts, and also the re-ranking strategy, which uses the contact potential in re-ranking alignments. The comparison with existing methods in the template-recognition test shows that SUPRB is very competitive and outperforms existing methods. 相似文献
7.
A protein fold recognition method was tested by the blind prediction of the structures of a set of proteins. The method evaluates the compatibility of an amino acid sequence with a three-dimensional structure using the four evaluation functions: side-chain packing, solvation, hydrogen-bonding, and local conformation functions. The structures of 14 proteins containing 19 sequences were predicted. The predictions were compared with the experimental structures. The experimental results showed that 9 of the 19 target sequences have known folds or portions of known folds. Among them, the folds of Klebsiella aerogenes urease β subunit (KAUB) and pyruvate phosphate dikinase domain 4 (PPDK4) were successfully recognized; our method predicted that KAUB and PPDK4 would adopt the folds of macromomycin (Ig-fold) and phosphoribosylanthra-nilate isomerase:indoleglycerol-phosphate synthase (TIM barrel), respectively, and the experimental structure revealed that they actually adopt the predicted folds. The predictions for the other targets were not successful, but they often gave secondary structural patterns similar to those of the experimental structures. © 1995 Wiley-Liss, Inc. 相似文献
8.
When a protein sequence does not share any significant sequence similarity with a protein of known structure, homology modeling cannot be applied. However, many novel and interesting methods, such as secondary structure prediction, fold recognition, and prediction of long-range interactions, are being developed and have been shown to be reasonably successful in predicting protein structures from sequence data and evolutionary information. The a priori evaluation of the correctness of a prediction obtained by one of these methods is however often problematic. Consequently, it is important to use all available information provided by as many different methods as possible and all the available experimental data about the protein of interest, since the consistency of the results is indicative of the reliability of the prediction. Hence the need has arisen for suitable tools able to compare results provided by different methods and evaluate their consistency. We have therefore constructed GLASS, a general platform to read, visualize, compare, and evaluate prediction results from many different sources and to project these prediction results into three dimensions. In addition, GLASS allows the comparison of selected parameters calculated for a model with the distribution observed in real protein structures, thus providing an easy way to test new methods for evaluating the likelihood of different structural models. GLASS can be considered as a “workbench” for structural predictions useful to both experimentalists and theoreticians. Proteins 30:339–351, 1998. © 1998 Wiley-Liss, Inc. 相似文献
9.
Yao H Kristensen DM Mihalek I Sowa ME Shaw C Kimmel M Kavraki L Lichtarge O 《Journal of molecular biology》2003,326(1):255-261
Functional sites determine the activity and interactions of proteins and as such constitute the targets of most drugs. However, the exponential growth of sequence and structure data far exceeds the ability of experimental techniques to identify their locations and key amino acids. To fill this gap we developed a computational Evolutionary Trace method that ranks the evolutionary importance of amino acids in protein sequences. Studies show that the best-ranked residues form fewer and larger structural clusters than expected by chance and overlap with functional sites, but until now the significance of this overlap has remained qualitative. Here, we use 86 diverse protein structures, including 20 determined by the structural genomics initiative, to show that this overlap is a recurrent and statistically significant feature. An automated ET correctly identifies seven of ten functional sites by the least favorable statistical measure, and nine of ten by the most favorable one. These results quantitatively demonstrate that a large fraction of functional sites in the proteome may be accurately identified from sequence and structure. This should help focus structure-function studies, rational drug design, protein engineering, and functional annotation to the relevant regions of a protein. 相似文献
10.
Bioinformatics methods to predict protein structure and function. A practical approach 总被引:3,自引:0,他引:3
Protein structure prediction by using bioinformatics can involve sequence similarity searches, multiple sequence alignments,
identification and characterization of domains, secondary structure prediction, solvent accessibility prediction, automatic
protein fold recognition, constructing three-dimensional models to atomic detail, and model validation. Not all protein structure
prediction projects involve the use of all these techniques. A central part of a typical protein structure prediction is the
identification of a suitable structural target from which to extrapolate three-dimensional information for a query sequence.
The way in which this is done defines three types of projects. The first involves the use of standard and well-understood
techniques. If a structural template remains elusive, a second approach using nontrivial methods is required. If a target
fold cannot be reliably identified because inconsistent results have been obtained from nontrivial data analyses, the project
falls into the third type of project and will be virtually impossible to complete with any degree of reliability. In this
article, a set of protocols to predict protein structure from sequence is presented and distinctions among the three types
of project are given. These methods, if used appropriately, can provide valuable indicators of protein structure and function. 相似文献
11.
The stay-green phenotype results from a naturally occurring mutation in which senescent leaves retain their chlorophyll and the associated apoprotein, LHCPII. Protection of this protein pool could deliver grass with enhanced protein content and could decrease the extent of protein degradation by plant proteases in the rumen. This would enhance the efficiency of protein utilization in livestock to the benefit of the environment. Field plots of stay-green and wild-type Lolium perenne were defoliated at intervals to simulate grazing. There were variations in foliar protein content and proteolysis throughout the year, but no significant differences between genotypes when material was analysed fresh or after it was cut and dried to simulate hay-making, which possibly induced senescence. In a subsequent experiment with stay-green and wild-type L temulentum, increased protein retention and decreased protein degradability were observed in stay-green leaves that were allowed to senescence naturally and extensively on the plant. That there is no difference between the two L. perenne genotypes suggests that as a field crop in grazed pastures the stay-green genotype would not confer a nutritional advantage in terms of protein degradability. It is possible that grazing promotes a high proportion of non-senescent to senescent leaf material within the sward and thus any advantage conferred by the stay-green phenotype would be effectively masked by an abundance of mature foliage. It is suggested that the stay-green trait would be of benefit in areas where agricultural practice permits extensive natural senescence to occur. 相似文献
12.
13.
Colorectal cancer (CRC) is the third most prevalent cancer and fourth leading cause of cancer-related deaths globally. It has been shown that the nsSNP variants play an important role in diseases, however it remained unclear how these variants are associated with the disease. Recently, several CRC risk associated SNPs have been discovered, however rs961253 (Lys25Arg at 20p12.3) located in the proximity of bone morphogenetic protein 2 (Bmp2) and fermitin family homolog 1 Fermt1 genes have been reported to be highly associated with the CRC risk. Here we provide evidence for the first time in silico biological functional and structural implications of non-synonymous (nsSNPs) CRC disease-associated variant Lys25Arg via molecular dynamic (MD) simulation. Protein structural analysis was performed with a particular variant allele (A/C, Lys25Arg) and compared with the predicted native protein structure. Our results showed that this nsSNP will cause changes in the protein structure and as a result is associated with the disease. In addition to the native and mutant 3D structures of CRC associated risk allele protein domain (CRAPD), they were also analyzed using solvent accessibility models for further protein stability confirmation. Taken together, this study confirmed that this variant has functional effect and structural impact on the CRAPD and may play an important role in CRC disease progression; hence it could be a reasonable approach for studying the effect of other deleterious variants in future studies. 相似文献
14.
A. Orzechowska M. Lipińska J. Fiedor M. Zaj?c K. Matlak J. Korecki L. Fiedor K. Burda 《BBA》2010,1797(10):1696-1704
Non-heme iron is a conservative component of type II photosynthetic reaction centers of unknown function. We found that in the reaction center from Rba. sphaeroides it exists in two forms, high and low spin ferrous states, whereas in Rsp. rubrum mostly in a low spin state, in line with our earlier finding of its low spin state in the algal photosystem II reaction center (Burda et al., 2003). The temperature dependence of the non-heme iron displacement studied by Mössbauer spectroscopy shows that the surrounding of the high spin iron is more flexible (Debye temperature ~ 165 K) than that of the low spin atom (~ 207 K). Nuclear inelastic scattering measurements of the collective motions in the Rba. sphaeroides reaction center show that the density of vibrational states, originating from non-heme iron, has well-separated modes between lower (4-17 meV) and higher (17-25 meV) energies while in the one from Rsp. rubrum its distribution is more uniform with only little contribution of low energy (~ 6 meV) vibrations. It is the first experimental evidence that the fluctuations of the protein matrix in type II reaction center are correlated to the spin state of non-heme iron. We propose a simple mechanism in which the spin state of non-heme iron directly determines the strength of coupling between the two quinone acceptors (QA and QB) and fast collective motions of protein matrix that play a crucial role in activation and regulation of the electron and proton transfer between these two quinones. We suggest that hydrogen bond network on the acceptor side of reaction center is responsible for stabilization of non-heme iron in different spin states. 相似文献
15.
Elass E Coddeville B Guérardel Y Kremer L Maes E Mazurier J Legrand D 《FEBS letters》2007,581(7):1383-1390
The mycobacterial lipoglycans, lipomannan (LM) and lipoarabinomannan (LAM), regulate host defence mechanisms through their interaction with pattern recognition receptors such as Toll-like receptors (TLRs). We have developed a surface plasmon resonance assay to analyse the molecular basis for the recognition of Mycobacterium kansasii LM or LAM, by immobilized CD14 and LPS-binding protein (LBP) both being capable to promote presentation of bacterial glycolipids to TLRs. The affinity of either LM/LAM was higher to CD14 than to LBP. Kinetic and Scatchard analyses were consistent with a model involving a single class of binding sites. These interactions required the lipidic anchor, but not the carbohydrate domains, of LM or LAM. We also provide evidence that addition of recombinant LBP enhanced the stimulatory effect of LM or LAM on matrix metalloproteinase-9 expression and secretion in macrophages, through a TLR1/TLR2-dependent mechanism. 相似文献
16.
17.
Non-specific lipid transfer proteins belonging to LTP1 family represent the most important allergens for non pollen-related allergies to Rosaceae fruits in the Mediterranean area. Peach LTP1 (Pru p 3) is a major allergen and is considered the prototypic allergenic LTP. On the contrary, pear allergy without pollinosis seems to be under-reported when compared to other Rosaceae fruits suggesting that the as-yet-uncharacterized pear LTP1 (Pyr c 3) has in vivo a low allergenicity. We report here on the identification of four cDNAs encoding for LTP1 in pear fruits. The two isoforms exhibiting amino acid sequences most similar to those of peach and apple homologues were obtained as recombinant proteins. Such isoforms exhibited CD spectra and lipid binding ability typical of LTP1 family. Moreover, pear LTP1 mRNA was mainly found in the peel, as previously shown for other Rosaceae fruits. By means of IgE ELISA assays a considerable immunoreactivity of these proteins to LTP-sensitive patient sera was detected, even though allergic reactions after ingestion of pear were not reported in the clinical history of the patients. Finally, the abundance of LTP1 in protein extracts from pear peel, in which LTP1 from Rosaceae fruits is mainly confined, was estimated to be much lower as compared to peach peel. Our data suggest that the two isoforms of pear LTP1 characterized in this study possess biochemical features and IgE-binding ability similar to allergenic LTPs. Their low concentrations in pear might be the cause of the low frequency of LTP-mediated pear allergy. 相似文献
18.
Claudia Cano-Ramírez María Fernanda López Ana K. Cesar-Ayala Verónica Pineda-Martínez Brian T. Sullivan Gerardo Zúñiga 《Gene》2013
Bark beetles oxidize the defensive monoterpenes of their host trees both to detoxify them and convert them into components of their pheromone system. This oxidation is catalyzed by cytochrome P450 enzymes and occurs in different tissues of the insect, including the gut (i.e., the site where the beetle's pheromones are produced and accumulated) and the antennae (i.e., the olfactory organs used for perception of airborne defensive monoterpenes as well as other host-associated compounds and pheromones). We identified ten new CYP genes in the pine beetle Dendroctonus rhizophagus in either antennae or gut tissue after stimulation with the vapors of major host monoterpenes α-pinene, β-pinene and 3-carene. Five genes belong to the CYP4 family, four to the CYP6 family and one to the CYP9 family. Differential expression of almost all of the CYP genes was observed between sexes, and within these significant differences among time, stimuli, anatomical region, and their interactions were found upon exposure to host monoterpenes. Increased expression of cytochrome P450 genes suggests that they play a role in the detoxification of monoterpenes released by this insect's host trees. 相似文献