Similar articles
 20 similar articles found (search took 31 ms)
1.
Atom depth as a descriptor of the protein interior   Cited by: 3 (self-citations: 0, citations by others: 3)

2.
MOTIVATION: A simple and fast algorithm is described that calculates a measure of protrusion (cx) for atoms in protein structures, directly usable with the common molecular graphics programs. RESULTS: A sphere of predetermined radius is centered around each non-hydrogen atom, and the volume occupied by the protein and the free volume within the sphere (internal and external volumes, respectively) are calculated. Atoms in protruding regions have a high ratio (cx) between the external and the internal volume. The program reads a PDB file, and writes the output in the same format, with cx values in the B factor field. Output structure files can be directly displayed with standard molecular graphics programs such as RASMOL, MOLMOL, and Swiss-PDB Viewer, and colored according to cx values. We show the potential use of this program in the analysis of two protein-protein complexes and in the prediction of limited proteolysis sites in native proteins. AVAILABILITY: The algorithm is implemented in a standalone program written in C and its source is freely available at ftp.icgeb.trieste.it/pub/CX or on request from the authors.
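
The protrusion ratio described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' C program. It assumes that the internal volume is approximated by counting the atoms inside the sphere and multiplying by an average atomic volume. The function name `cx_values` and the default values for `radius` and `atom_volume` are illustrative assumptions, not figures taken from the abstract.

```python
import math


def cx_values(coords, radius=10.0, atom_volume=20.1):
    """Return an external/internal volume ratio for each atom.

    coords: list of (x, y, z) tuples for non-hydrogen atoms, in angstroms.
    radius, atom_volume: assumed illustrative defaults (not from the abstract).
    """
    sphere_volume = 4.0 / 3.0 * math.pi * radius ** 3
    r2 = radius * radius
    ratios = []
    for xi, yi, zi in coords:
        # Count atoms inside the sphere; the central atom itself is included,
        # so the internal volume is never zero.
        n_inside = sum(
            1
            for xj, yj, zj in coords
            if (xi - xj) ** 2 + (yi - yj) ** 2 + (zi - zj) ** 2 <= r2
        )
        internal = min(n_inside * atom_volume, sphere_volume)
        external = sphere_volume - internal
        ratios.append(external / internal)
    return ratios
```

Under these assumptions, an atom with few neighbors inside its sphere gets a larger ratio than a crowded one, which matches the statement that atoms in protruding regions have high cx.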

3.
4.
sGAL is a computer program designed to find pairs of sites suitable for introducing chemical cross-links into proteins. sGAL takes a protein structure file in PDB format as input, truncates each residue sequentially to its gamma side chain atom to mimic mutation to Cys, and calculates the exposed surface area of the gamma atom. The user then inputs the minimum and maximum lengths of the cross-linker. sGAL provides as output pairs of residues that would have exposed gamma atom separations that fall within this range. Furthermore, if a line joining the pair of gamma atoms contacts more than a given number of buried atoms, that pair is discarded. In this way, sites for which the protein would sterically interfere with cross-linking are avoided. AVAILABILITY: http://www.chem.utoronto.ca/staff/GAW/links.html (Surface Racer is also required; see http://monte.biochem.wisc.edu/~tsodikov/surface.html).
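
The distance-range step of this procedure can be sketched as follows. This is a simplified illustration, not sGAL itself: it assumes the exposed gamma atoms have already been selected, and it omits both the surface-area calculation and the check for buried atoms along the connecting line. The function name `candidate_pairs` and the input layout are hypothetical.

```python
import math
from itertools import combinations


def candidate_pairs(gamma_atoms, min_len, max_len):
    """Return residue pairs whose gamma-atom separation fits the cross-linker.

    gamma_atoms: dict mapping a residue number to the (x, y, z) position of
    its exposed gamma atom (hypothetical input format, in angstroms).
    """
    pairs = []
    for (res_a, pos_a), (res_b, pos_b) in combinations(sorted(gamma_atoms.items()), 2):
        separation = math.dist(pos_a, pos_b)
        # Keep only pairs within the user-supplied cross-linker length range.
        if min_len <= separation <= max_len:
            pairs.append((res_a, res_b, separation))
    return pairs


atoms = {10: (0.0, 0.0, 0.0), 25: (6.0, 0.0, 0.0), 40: (20.0, 0.0, 0.0)}
print(candidate_pairs(atoms, 5.0, 8.0))  # [(10, 25, 6.0)]
```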

5.
Yuan Z, Bailey TL, Teasdale RD. Proteins 2005, 58(4): 905-912
The polypeptide backbones and side chains of proteins are constantly moving due to thermal motion and the kinetic energy of the atoms. The B-factors of protein crystal structures reflect the fluctuation of atoms about their average positions and provide important information about protein dynamics. Computational approaches to predict thermal motion are useful for analyzing the dynamic properties of proteins with unknown structures. In this article, we utilize a novel support vector regression (SVR) approach to predict the B-factor distribution (B-factor profile) of a protein from its sequence. We explore schemes for encoding sequences and various settings for the parameters used in SVR. Based on a large dataset of high-resolution proteins, our method predicts the B-factor distribution with a Pearson correlation coefficient (CC) of 0.53. In addition, our method predicts the B-factor profile with a CC of at least 0.56 for more than half of the proteins. Our method also performs well for classifying residues (rigid vs. flexible). For almost all predicted B-factor thresholds, prediction accuracies (percent of correctly predicted residues) are greater than 70%. These results exceed the best results of other sequence-based prediction methods.
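
The Pearson correlation coefficient (CC) used here as the performance measure can be computed as in the sketch below. It is a generic implementation of the standard formula, not code from the paper; the function name `pearson_cc` and the per-residue list inputs are assumptions made for illustration.

```python
import math


def pearson_cc(observed, predicted):
    """Pearson correlation between observed and predicted per-residue values."""
    if len(observed) != len(predicted) or len(observed) < 2:
        raise ValueError("need two equal-length profiles with at least 2 values")
    n = len(observed)
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    # Covariance and variance sums (the common 1/n factors cancel).
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    if var_o == 0.0 or var_p == 0.0:
        raise ValueError("correlation is undefined for a constant profile")
    return cov / math.sqrt(var_o * var_p)
```

A value of 1 indicates a perfectly linear agreement between the two profiles, so a CC of 0.53 corresponds to a moderate positive correlation.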

6.
Conformational switches observed in the protein backbone play a key role in a variety of fundamental biological activities. This paper describes a web-server that implements a pattern recognition algorithm trained on the examples from the Database of Macromolecular Movements to predict residue positions involved in conformational switches. Prediction can be performed at an adjustable false positive rate using a user-supplied protein sequence in FASTA format or a structure in a Protein Data Bank (PDB) file. If a protein sequence is submitted, then the web-server uses sequence-derived information only (such as evolutionary conservation of residue positions). If a PDB file is submitted, then the web-server uses sequence-derived information and residue solvent accessibility calculated from this file.

7.
FUGUE, a program for recognizing distant homologues by sequence-structure comparison (http://www-cryst.bioc.cam.ac.uk/fugue/), has three key features. (1) Improved environment-specific substitution tables. Substitutions of an amino acid in a protein structure are constrained by its local structural environment, which can be defined in terms of secondary structure, solvent accessibility, and hydrogen bonding status. The environment-specific substitution tables have been derived from structural alignments in the HOMSTRAD database (http://www-cryst.bioc.cam.ac.uk/homstrad/). (2) Automatic selection of alignment algorithm with detailed structure-dependent gap penalties. FUGUE uses the global-local algorithm to align a sequence-structure pair when they greatly differ in length and uses the global algorithm in other cases. The gap penalty at each position of the structure is determined according to its solvent accessibility, its position relative to the secondary structure elements (SSEs) and the conservation of the SSEs. (3) Combined information from both multiple sequences and multiple structures. FUGUE is designed to align multiple sequences against multiple structures to enrich the conservation/variation information. We demonstrate that the combination of these three key features implemented in FUGUE improves both homology recognition performance and alignment accuracy.

8.
Modeling of the electrostatic potential field of plastocyanin   Cited by: 2 (self-citations: 0, citations by others: 2)
The DelPhi computer program is used to calculate the electrostatic potential field of the photosynthetic electron transport protein plastocyanin. Knowledge of the potential field is important for understanding the mechanisms by which plastocyanin interacts with other charged reagents. The program uses a macroscopic, continuum approach in which the protein and solvent are assigned different dielectric constants, the crystal structure of the protein defines the dielectric boundary, and the ionic strength of the solvent is taken into account. The potential field is determined by numerically solving the Poisson-Boltzmann equation. The field surrounding plastocyanin is characterized by a region of positive potential over the copper center active site, and a region of negative potential over the adjacent association site containing tyrosine 83. The shape and magnitude of the potential field show a strong dependence on the ionic strength and pH of the solvent. The program is able to accurately predict the effect of the copper center oxidation state on the pKa of a tetranitromethane derivative of tyrosine 83 using an intrinsic protein dielectric constant of 2 to 4. Evidence is also presented that the glutamate 68 side chain is exposed to the solvent to a greater extent in the solution structure of plastocyanin than in the crystal structure.

9.
Protein cysteine thiols can be divided into four groups based on their reactivities: those that form permanent structural disulfide bonds, those that coordinate with metals, those that remain in the reduced state, and those that are susceptible to reversible oxidation. Physicochemical parameters of oxidation-susceptible protein thiols were organized into a database named the Balanced Oxidation Susceptible Cysteine Thiol Database (BALOSCTdb). BALOSCTdb contains 161 cysteine thiols that undergo reversible oxidation and 161 cysteine thiols that are not susceptible to oxidation. Each cysteine was represented by a set of 12 parameters, one of which was a label (1/0) to indicate whether its thiol moiety is susceptible to oxidation. A computer program (the C4.5 decision tree classifier re-implemented as the J48 classifier) segregated cysteines into oxidation-susceptible and oxidation-non-susceptible classes. The classifier selected three parameters critical for prediction of thiol oxidation susceptibility: (1) distance to the nearest cysteine sulfur atom, (2) solvent accessibility, and (3) pKa. The classifier was optimized to correctly predict 136 of the 161 cysteine thiols susceptible to oxidation. Leave-one-out cross-validation analysis showed that the percent of correctly classified cysteines was 80.1% and that 16.1% of the oxidation-susceptible cysteine thiols were incorrectly classified. The algorithm developed from these parameters, named the Cysteine Oxidation Prediction Algorithm (COPA), is presented here. COPA prediction of oxidation-susceptible sites can be utilized to locate protein cysteines susceptible to redox-mediated regulation and identify possible enzyme catalytic sites with reactive cysteine thiols.

10.
The B-factor from an X-ray crystal structure is a good measure of protein structural flexibility, which plays an important role in different biological processes, such as catalysis, binding and molecular recognition. Understanding the essence of flexibility can be helpful for the further study of protein function. In this study, we attempted to correlate the flexibility of a residue with its interactions with other residues by representing the protein structure as a residue contact network. Here, several well-established network topological parameters were employed to characterize such interactions. A prediction model was constructed for the B-factor of a residue by using support vector regression (SVR). The Pearson correlation coefficient (CC) was used as the performance measure. CC values were 0.63 and 0.62 for single amino acids and for the whole sequence, respectively. Our results revealed good correlations between B-factors and network topological parameters. This suggests that protein structural flexibility could be well characterized by the inter-amino acid interactions in a protein.

11.
ABSTRACT: BACKGROUND: Multiple structure alignments have received increasing attention in recent years as an alternative to multiple sequence alignments. Although multiple structure alignment algorithms can potentially be applied to a number of problems, they have primarily been used for protein core identification. A method that is capable of solving a variety of problems using structure comparison is still absent. Here we introduce a program msTALI for aligning multiple protein structures. Our algorithm uses several informative features to guide its alignments: torsion angles, backbone Calpha atom positions, secondary structure, residue type, surface accessibility, and properties of nearby atoms. The algorithm allows the user to weight the types of information used to generate the alignment, which expands its utility to a wide variety of problems. RESULTS: msTALI exhibits competitive results on 824 families from the Homstrad and SABmark databases when compared to Matt and Mustang. We also demonstrate success at building a database of protein cores using 341 randomly selected CATH domains and highlight the contribution of msTALI compared to the CATH classifications. Finally, we present an example applying msTALI to the problem of detecting hinges in a protein undergoing rigid-body motion. CONCLUSIONS: msTALI is an effective algorithm for multiple structure alignment. In addition to its performance on standard comparison databases, it utilizes clear, informative features, allowing further customization for domain-specific applications. The C++ source code for msTALI is available for Linux on the web at http://ifestos.cse.sc.edu/mstali.

12.
Yang Q, Sharp KA. Proteins 2009, 74(3): 682-700
We describe a method for efficiently generating ensembles of alternate, all-atom protein structures that (a) differ significantly from the starting structure, (b) have good stereochemistry (bonded geometry), and (c) have good steric properties (absence of atomic overlap). The method uses reconstruction from a series of backbone framework structures that are obtained from a modified elastic network model (ENM) by perturbation along low-frequency normal modes. To ensure good quality backbone frameworks, the single force parameter ENM is modified by introducing two more force parameters to characterize the interaction between the consecutive carbon alphas and those within the same secondary structure domain. The relative stiffness of the three parameters is parameterized to reproduce B-factors, while maintaining good bonded geometry. After parameterization, violations of experimental Calpha-Calpha distances and Calpha-Calpha-Calpha pseudo angles along the backbone are reduced to less than 1%. Simultaneously, the average B-factor correlation coefficient improves to R = 0.77. Two applications illustrate the potential of the approach. (1) 102,051 protein backbones spanning a conformational space of 15 Å root mean square deviation were generated from 148 nonredundant proteins in the PDB database, and all-atom models with minimal bonded and nonbonded violations were produced from this ensemble of backbone structures using the SCWRL side chain building program. (2) Improved backbone templates for homology modeling. Fifteen query sequences were each modeled on two targets. For each of the 30 target frameworks, dozens of improved templates could be produced. In all cases, improved full atom homology models resulted, of which 50% could be identified blind using the D-Fire statistical potential.

13.
14.
Afonnikov D. A., Morozov A. V., Kolchanov N. A. Biophysics 2008, 51(1): 56-60

The profile of contact numbers of amino acid residues in proteins contains important information about the protein structure and is connected with the accessibility of residues to solvent. Here we propose a method for predicting the profile of contact numbers of residues in protein from its amino acid sequence. The method is based on regression using a neural network algorithm. The algorithm predicts two types of profiles, namely, the total number of contacts and the number of close contacts with the neighbors in the chain. The Pearson coefficient of correlation between the actual and predicted values of total contact numbers amounted to 0.526–0.703. As for the number of close contacts, this coefficient was higher (0.662–0.743) for all the considered threshold contact distances (6, 8, 10, and 12 Å). The program for prediction of contact numbers CONNP is available at http://wwwmgs2.bionet.nsc.ru/reloaded.
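
For orientation, an observed (not predicted) contact-number profile can be computed from coordinates as in the sketch below. This is not the CONNP program or its neural network. It assumes that one representative atom per residue is used (for example C-alpha) and that a contact means a separation no greater than the threshold distance; the abstract does not specify either choice, and the function name `contact_numbers` is hypothetical.

```python
import math


def contact_numbers(residue_coords, threshold=8.0):
    """Count, for each residue, the other residues within the threshold.

    residue_coords: list of (x, y, z) tuples, one representative atom per
    residue (assumed here to be C-alpha), in angstroms.
    threshold: contact distance in angstroms (6, 8, 10 and 12 are the values
    mentioned in the abstract).
    """
    profile = []
    for i, pos_i in enumerate(residue_coords):
        count = sum(
            1
            for j, pos_j in enumerate(residue_coords)
            if j != i and math.dist(pos_i, pos_j) <= threshold
        )
        profile.append(count)
    return profile


coords = [(0.0, 0.0, 0.0), (5.0, 0.0, 0.0), (10.0, 0.0, 0.0), (30.0, 0.0, 0.0)]
print(contact_numbers(coords, threshold=8.0))  # [1, 2, 1, 0]
```

A profile of this kind would serve as the target values against which predicted contact numbers are correlated.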


15.
Sometimes crystals cannot diffract X-rays beyond 3.0 Å resolution due to the intrinsic flexibility associated with the protein. Low resolution diffraction data not only pose a challenge to structure determination, but also hamper interpretation of mechanistic details. Crystals of a 25.6 kDa non-Pfam, hypothetical protein, PF2046, diffracted X-rays to 3.38 Å resolution. A combination of Se-Met derived heavy atom positions with multiple cycles of B-factor sharpening, multi-crystal averaging, restrained refinement followed by manual inspection of electron density and model building resulted in a final model with an R value of 23.5 (Rfree=24.7). The asymmetric unit was large and consisted of six molecules arranged as a homodimer of trimers. Analysis of the structure revealed the presence of an RNA binding domain suggesting a role for PF2046 in the processing of nucleic acids.

16.
A simple, static contact mapping algorithm has been developed as a first step toward identifying potential peptide biomimetics from protein interaction partner structure files. This rapid and simple mapping algorithm, “OpenContact,” provides screened or parsed protein interaction files based on specified criteria for interatomic separation distances and interatomic potential interactions. The algorithm, which uses all‐atom Amber03 force field models, was blindly tested on several unrelated cases from the literature where potential peptide mimetics have been experimentally developed to varying degrees of success. In all cases, the screening algorithm efficiently predicted proposed or potential peptide biomimetics, or close variations thereof, and provided complete atom‐atom interaction data necessary for further detailed analysis and drug development. In addition, we used the static parsing/mapping method to develop a peptide mimetic to the cancer protein target, epidermal growth factor receptor. In this case, secondary, loop structure for the peptide was indicated from the intra‐protein mapping, and the peptide was subsequently synthesized and shown to exhibit successful binding to the target protein. The case studies, which all involved experimental peptide drug advancement, illustrate many of the challenges associated with the development of peptide biomimetics, in general. Proteins 2014; 82:2253–2262. © 2014 The Authors. Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc.

17.
SUMMARY: Voro3D is an original easy-to-use tool, which provides a brand new point of view on protein structures through the three-dimensional (3D) Voronoi tessellations. To construct the Voronoi cells associated with each amino acid by a number of different tessellation methods, Voro3D uses a protein structure file in the PDB format as an input. After calculation, different structural properties of interest like secondary structure assignment, environment accessibility and exact contact matrices can be derived without any geometrical cut-off. Voro3D also provides a visualization of these tessellations superimposed on the associated protein structure, from which it is possible to model a polygonal protein surface using a model solvent or to quantify, for instance, the contact areas between a protein and a ligand. AVAILABILITY: The software executable file for PC using Windows 98, 2000, NT, XP can be freely downloaded at http://www.lmcp.jussieu.fr/~mornon/voronoi.html CONTACT: franck.dupuis@sanofi-aventis.com; jean-paul-mornon@imcp.jussieu.fr.

18.
MOTIVATION: The program ESPript (Easy Sequencing in PostScript) allows the rapid visualization, via PostScript output, of sequences aligned with popular programs such as CLUSTAL-W or GCG PILEUP. It can read secondary structure files (such as those created by the program DSSP) to produce a synthesis of both sequence and structural information. RESULTS: ESPript can be run via a command file or a friendly html-based user interface. The program calculates a homology score by columns of residues and can sort this calculation by groups of sequences. It offers a palette of markers to highlight important regions in the alignment. ESPript can also paste information on residue conservation into coordinate files, for subsequent visualization with a graphics program. AVAILABILITY: ESPript can be accessed on its Web site at http://www.ipbs.fr/ESPript. Sources and helpfiles can be downloaded via anonymous ftp from ftp.ipbs.fr. A tar file is held in the directory pub/ESPript.

19.
A reduced representation in the format of a barcode has been developed to provide an overview of the topological nature of a given protein structure from a 3D coordinate file. The molecular structure of a protein coordinate file from the Protein Data Bank is first expressed in terms of an alpha‐numero code and further converted to a barcode image. The barcode representation can be used to compare and contrast different proteins based on their structure. The utility of this method has been exemplified by comparing structural barcodes of proteins that belong to the same fold family, and across different folds. In addition, we have attempted to illustrate (i) the structural changes often seen in a given protein molecule upon interaction with ligands and (ii) modifications in the overall topology of a given protein during evolution. The program is fully downloadable from the website http://www.iitg.ac.in/probar/ .

20.