首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Homology-derived secondary structure of proteins (HSSP) is a well-known database of multiple sequence alignments (MSAs) which merges information of protein sequences and their three-dimensional structures. It is available for all proteins whose structure is deposited in the PDB. It is also used by STING and (Java)Protein Dossier to calculate and present relative entropy as a measure of the degree of conservation for each residue of proteins whose structure has been solved and deposited in the PDB. However, if the STING and (Java)Protein Dossier are to provide support for analysis of protein structures modeled in computers or being experimentally solved but not yet deposited in the PDB, then we need a new method for building alignments having a flavor of HSSP alignments (myMSAr). The present study describes a new method and its corresponding databank (SH2QS--database of sequences homologue to the query [structure-having] sequence). Our main interest in making myMSAr was to measure the degree of residue conservation for a given query sequence, regardless of whether it has a corresponding structure deposited in the PDB. In this study, we compare the measurement of residue conservation provided by corresponding alignments produced by HSSP and SH2QS. As a case study, we also present two biologically relevant examples, the first one highlighting the equivalence of analysis of the degree of residue conservation by using HSSP or SH2QS alignments, and the second one presenting the degree of residue conservation for a structure modeled in a computer, which , as a consequence, does not have an alignment reported by HSSP.  相似文献   

2.
We describe the current status of the Java molecular graphics tool, MolSurfer. MolSurfer has been designed to assist the analysis of the structures and physico-chemical properties of macromolecular interfaces. MolSurfer provides a coupled display of two-dimensional (2D) maps of the interfaces generated with the ADS software and a three-dimensional (3D) view of the macromolecular structure in the Java PDB viewer, WebMol. The interfaces are analytically defined and properties such as electrostatic potential or hydrophobicity are projected on to them. MolSurfer has been applied previously to analyze a set of 39 protein-protein complexes, with structures available from the Protein Data Bank (PDB). A new application, described here, is the visualization of 75 interfaces in structures of protein-DNA and protein-RNA complexes. Another new feature is that the MolSurfer web server is now able to compute and map Poisson-Boltzmann electrostatic potentials of macromolecules onto interfaces. The MolSurfer web server is available at http://projects.villa-bosch.de/mcm/software/molsurfer.  相似文献   

3.
4.
The genomes of more than 100 species have been sequenced, and the biological functions of encoded proteins are now actively being researched. Protein function is based on interactions between proteins and other molecules. One approach to assuming protein function based on genomic sequence is to predict interactions between an encoded protein and other molecules. As a data source for such predictions, knowledge regarding known protein-small molecule interactions needs to be compiled. We have, therefore, surveyed interactions between proteins and other molecules in Protein Data Bank (PDB), the protein three-dimensional (3D) structure database. Among 20,685 entries in PDB (April, 2003), 4,189 types of small molecules were found to interact with proteins. Biologically relevant small molecules most often found in PDB were metal ions, such as calcium, zinc, and magnesium. Sugars and nucleotides were the next most common. These molecules are known to act as cofactors for enzymes and/or stabilizers of proteins. In each case of interactions between a protein and small molecule, we found preferred amino acid residues at the interaction sites. These preferences can be the basis for predicting protein function from genomic sequence and protein 3D structures. The data pertaining to these small molecules were collected in a database named Het-PDB Navi., which is freely available at http://daisy.nagahama-i-bio.ac.jp/golab/hetpdbnavi.html and linked to the official PDB home page.  相似文献   

5.
6.
Electrostatic interactions play a key role in many aspects of protein engineering. Consequently, much effort has been put into the design of software for calculating electrostatic fields around macromolecules. We show that optimization of hydrogen bonding networks can improve both the results of pK(a) calculations and the results of electrostatic calculations performed by commonly used programs such as DelPhi. Further optimization can often be achieved by flipping the side chains of asparagine, histidine and glutamine around their chi2, chi2 and chi3 torsion angles, respectively, when this improves the local hydrogen bonding network. These optimizations are applied to some well characterized proteins: BPTI, hen egg white lysozyme and superoxide dismutase. A search for flipped residues in the PDB revealed that significant improvements in electrostatic calculations in or near the active site of enzymes can be expected for about one quarter of all enzymes in the PDB.  相似文献   

7.
We analyzed structural features of 11,038 direct atomic contacts (either electrostatic, H-bonds, hydrophobic, or other van der Waals interactions) extracted from 139 protein-DNA and 49 protein-RNA nonhomologous complexes from the Protein Data Bank (PDB). Globally, H-bonds are the most frequent interactions (approximately 50%), followed by van der Waals, hydrophobic, and electrostatic interactions. From the protein viewpoint, hydrophilic amino acids are over-represented in the interaction databases: Positively charged amino acids mainly contact nucleic acid phosphate groups but can also interact with base edges. From the nucleotide point of view, DNA and RNA behave differently: Most protein-DNA interactions involve phosphate atoms, while protein-RNA interactions involve more frequently base edge and ribose atoms. The increased participation of DNA phosphate involves H-bonds rather than salt bridges. A statistical analysis was performed to find the occurrence of amino acid-nucleotide pairs most different from chance. These pairs were analyzed individually. Finally, we studied the conformation of DNA in the interaction sites. Despite the prevalence of B-DNA in the database, our results suggest that A-DNA is favored in the interaction sites.  相似文献   

8.
STING Millennium Suite (SMS) is a new web-based suite of programs and databases providing visualization and a complex analysis of molecular sequence and structure for the data deposited at the Protein Data Bank (PDB). SMS operates with a collection of both publicly available data (PDB, HSSP, Prosite) and its own data (contacts, interface contacts, surface accessibility). Biologists find SMS useful because it provides a variety of algorithms and validated data, wrapped-up in a user friendly web interface. Using SMS it is now possible to analyze sequence to structure relationships, the quality of the structure, nature and volume of atomic contacts of intra and inter chain type, relative conservation of amino acids at the specific sequence position based on multiple sequence alignment, indications of folding essential residue (FER) based on the relationship of the residue conservation to the intra-chain contacts and Calpha-Calpha and Cbeta-Cbeta distance geometry. Specific emphasis in SMS is given to interface forming residues (IFR)-amino acids that define the interactive portion of the protein surfaces. SMS may simultaneously display and analyze previously superimposed structures. PDB updates trigger SMS updates in a synchronized fashion. SMS is freely accessible for public data at http://www.cbi.cnptia.embrapa.br, http://mirrors.rcsb.org/SMS and http://trantor.bioc.columbia.edu/SMS.  相似文献   

9.
We predicted gamma-turns from amino acid sequences using the first-order Markov chain theory and enlarged representative data sets corresponding to protein chains selected from the Protein Data Bank (PDB). The following data sets were used for training and deriving the probability values: (1) an initial data set containing 315 protein chains comprising 904 gamma-turns and (2) a later data set in order to include new entries in the PDB, containing 434 protein chains and comprising 1053 gamma-turns. By excluding 93 protein chains that were common to these two training data sets, we generated two mutually exclusive data sets containing 222 and 341 protein chains for testing our predictions. Applying amino acid probability values derived from training data sets on to testing data sets yielded overall prediction accuracies in the range 54-57%. We recommend the use of probability values derived from the data set comprising 315 protein chains that represents more gamma-turns and also provides better predictions.  相似文献   

10.
The chalcogen bond, the noncovalent, electrostatic attraction between covalently bonded atoms in group 16 and Lewis bases, is present in protein?ligand interactions based on X-ray structures deposited in the Protein Data Bank (PDB). Discovering protein?ligand chalcogen bonding in the PDB employed a strategy that focused on searching the database for protein complexes of five-membered, heterocyclic ligands containing endocyclic sulfur with endo electron-withdrawing groups (isothiazoles; thiazoles; 1,2,3-, 1,2.4-, 1,2,5-, 1,3,4-thiadiazoles) and thiophenes with exo electron-withdrawing groups, e.g., 2-chloro, 2-bromo, 2-amino, 2-alkylthio. Out of 930 ligands investigated, 33 or 3.5% have protein?ligand S---O interactions of which 31 are chalcogen bonds and two appear to be S---HO hydrogen bonds. The bond angles for some of the chalcogen bonds found in the PDB are less than 90°, and an electrostatic model is proposed to explain this phenomenon.  相似文献   

11.
We have updated the Protein Sequence-Structure Analysis Relational Database (PSSARD) first published in the Int. J. Biol. Macromol. 36 (2005) 259-262 corresponding to 1573 representative protein chains selected from the Protein Data Bank (PDB). In this, the updated and revised PSSARD (Version 2.0), we have included all proteins in the Protein Data Bank available at the time of developing this database including the NMR PDB entries. The current database corresponds to 22,752 XRAY PDB entries and 3977 NMR PDB entries and is separated accordingly in order to facilitate the appropriate database search. The representative protein chains can also be separately accessed within the current database. We have made a provision to combine more than one field to query the database and the results of any search can be used to carry out further nested searches using a combination of queries. We have provided hyperlinks to the individual PDB entries obtained as the result of any search in PSSARD in order to obtain additional details relevant to the protein structure. Certain applications useful to identify domains and structural motifs are discussed.  相似文献   

12.
Protein docking procedures carry out the task of predicting the structure of a protein–protein complex starting from the known structures of the individual protein components. More often than not, however, the structure of one or both components is not known, but can be derived by homology modeling on the basis of known structures of related proteins deposited in the Protein Data Bank (PDB). Thus, the problem is to develop methods that optimally integrate homology modeling and docking with the goal of predicting the structure of a complex directly from the amino acid sequences of its component proteins. One possibility is to use the best available homology modeling and docking methods. However, the models built for the individual subunits often differ to a significant degree from the bound conformation in the complex, often much more so than the differences observed between free and bound structures of the same protein, and therefore additional conformational adjustments, both at the backbone and side chain levels need to be modeled to achieve an accurate docking prediction. In particular, even homology models of overall good accuracy frequently include localized errors that unfavorably impact docking results. The predicted reliability of the different regions in the model can also serve as a useful input for the docking calculations. Here we present a benchmark dataset that should help to explore and solve combined modeling and docking problems. This dataset comprises a subset of the experimentally solved ‘target’ complexes from the widely used Docking Benchmark from the Weng Lab (excluding antibody–antigen complexes). This subset is extended to include the structures from the PDB related to those of the individual components of each complex, and hence represent potential templates for investigating and benchmarking integrated homology modeling and docking approaches. Template sets can be dynamically customized by specifying ranges in sequence similarity and in PDB release dates, or using other filtering options, such as excluding sets of specific structures from the template list. Multiple sequence alignments, as well as structural alignments of the templates to their corresponding subunits in the target are also provided. The resource is accessible online or can be downloaded at http://cluspro.org/benchmark , and is updated on a weekly basis in synchrony with new PDB releases. Proteins 2016; 85:10–16. © 2016 Wiley Periodicals, Inc.  相似文献   

13.
Determination of pK(a) values of titrating residues in proteins provides a direct means of studying electrostatic coupling as well as pH-dependent stability. The B1 domain of protein G provides an excellent model system for such investigations. In this work, we analyze the observed pK(a) values of all carboxyl groups in a variant of PGB1 (T2Q, N8D, N37D) at low and high ionic strength as determined using (1)H-(13)C heteronuclear NMR in a structural context. The pK(a) values are used to calculate the pH-dependent stability in low and high salt and to investigate electrostatic coupling in the system. The observed pK(a) values can explain the pH dependence of protein stability but require pK(a) shifts relative to model values in the unfolded state, consistent with persistent residual structure in the denatured state. In particular, we find that most of the deviations from the expected random coil values can be explained by a significantly upshifted pK(a) value. We show also that (13)C backbone carbonyl data can be used to study electrostatic coupling in proteins and provide specific information on hydrogen bonding and electrostatic potential at nontitrating sites.  相似文献   

14.
SUMMARY: A web-based application to analyze protein amino acids conservation-Consensus Sequence (ConSSeq) is presented. ConSSeq graphically represents information about amino acid conservation based on sequence alignments reported in homology-derived structures of proteins. Beyond the relative entropy for each position in the alignment, ConSSeq also presents the consensus sequence and information about the amino acids, which are predominant at each position of the alignment. ConSSeq is part of the STING Millennium Suite and is implemented as a Java Applet. AVAILABILITY: http://sms.cbi.cnptia.embrapa.br/SMS/STINGm/consseq/, http://trantor.bioc.columbia.edu/SMS/STINGm/consseq/, http://mirrors.rcsb.org//SMS/STINGm/consseq/, http://www.es.embnet.org/SMS/STINGm/consseq/ and http://www.ar.embnet.org/SMS/STINGm/consseq/  相似文献   

15.
The rate of association of proteins is dictated by diffusion, but can be enhanced by favorable electrostatic forces. Here the relationship between the electrostatic energy of interaction, and the kinetics of protein-complex formation was analyzed for the protein pairs of: hirudin-thrombin, acetylcholinesterase-fasciculin and barnase-barstar, and for a panel of point mutants of these proteins. Electrostatic energies of interaction were calculated as the difference between the electrostatic energy of the complex and the sum of the energies of the two individual proteins, using the computer simulation package DelPhi. Calculated electrostatic energies of interaction were compared to experimentally determined rates of association. One kcal/mol of Coulombic interaction energy increased the rate of association by a factor of 2.8, independent of the protein-complex or mutant analyzed. Electrostatic energies of interaction were also determined from the salt dependence of the association rate constant, using the same basic equation as for the theoretical calculation. A Br?nsted analysis of the electrostatic energies of interactions plotted versus experimentally determined ln(rate)s of association shows a linear relation between the two, with a beta value close to 1. This is interpreted as the energy of the transition state varies according to the electrostatic interaction energy, fitting a two state model for the association reaction. Calculating electrostatic rate enhancement from the electrostatic interaction energy can be used as a powerful tool to design protein complexes with altered rates of association and affinities.  相似文献   

16.
In this paper, we describe a new method to generate a smooth algebraic spline (AS) approximation of the molecular surface (MS) based on an initial coarse triangulation derived from the atomic coordinate information of the biomolecule, resident in the Protein data bank (PDB). Our method first constructs a triangular prism scaffold covering the PDB structure, and then generates a piecewise polynomial F on the Bernstein-Bezier (BB) basis within the scaffold. An ASMS model of the molecular surface is extracted as the zero contours of F, which is nearly C1 and has dual implicit and parametric representations. The dual representations allow us easily do the point sampling on the ASMS model and apply it to the accurate estimation of the integrals involved in the electrostatic solvation energy computations. Meanwhile comparing with the trivial piecewise linear surface model, fewer number of sampling points are needed for the ASMS, which effectively reduces the complexity of the energy estimation.  相似文献   

17.
Protein function is a dynamic property closely related to the conformational mechanisms of protein structure in its physiological environment. To understand and control the function of target proteins, it becomes increasingly important to develop methods and tools for predicting collective motions at the molecular level. In this article, we review computational methods for predicting conformational dynamics and discuss software tools for data analysis. In particular, we discuss a high-throughput, web-based system called iGNM for protein structural dynamics. iGNM contains a database of protein motions for more than 20 000 PDB structures and supports online calculations for newly deposited PDB structures or user-modified structures. iGNM allows dynamics analysis of protein structures ranging from enzymes to large complexes and assemblies, and enables the exploration of protein sequence-structure-dynamics-function relations.  相似文献   

18.
Wang J  Feng JA 《Proteins》2005,58(3):628-637
Sequence alignment has become one of the essential bioinformatics tools in biomedical research. Existing sequence alignment methods can produce reliable alignments for homologous proteins sharing a high percentage of sequence identity. The performance of these methods deteriorates sharply for the sequence pairs sharing less than 25% sequence identity. We report here a new method, NdPASA, for pairwise sequence alignment. This method employs neighbor-dependent propensities of amino acids as a unique parameter for alignment. The values of neighbor-dependent propensity measure the preference of an amino acid pair adopting a particular secondary structure conformation. NdPASA optimizes alignment by evaluating the likelihood of a residue pair in the query sequence matching against a corresponding residue pair adopting a particular secondary structure in the template sequence. Using superpositions of homologous proteins derived from the PSI-BLAST analysis and the Structural Classification of Proteins (SCOP) classification of a nonredundant Protein Data Bank (PDB) database as a gold standard, we show that NdPASA has improved pairwise alignment. Statistical analyses of the performance of NdPASA indicate that the introduction of sequence patterns of secondary structure derived from neighbor-dependent sequence analysis clearly improves alignment performance for sequence pairs sharing less than 20% sequence identity. For sequence pairs sharing 13-21% sequence identity, NdPASA improves the accuracy of alignment over the conventional global alignment (GA) algorithm using the BLOSUM62 by an average of 8.6%. NdPASA is most effective for aligning query sequences with template sequences whose structure is known. NdPASA can be accessed online at http://astro.temple.edu/feng/Servers/BioinformaticServers.htm.  相似文献   

19.
Van der Waals interaction energy in globular proteins is presented by the interaction energies between regions of protein spatial structure with homogenous medium density distribution. We introduce a notion of the local medium permittivity as a function of absorptance of molecular groups with particular conformation. Proposed theory avoids shortcomings which are typical for the calculations on the basis of the pairwise additive approximation. The approach takes into account local peculiarities of protein spatial structure and physical-chemical characteristics of amino acid residues and molecular groups.  相似文献   

20.
Brevicin 27, a bacteriocin produced by Lactobacillus brevis SB27, is inhibitory mainly against closely related Lactobacillus brevis and Lactobacillus büchneri strains. It was purified from the culture supernatant by a four-step purification procedure including ammonium sulfate precipitation, cation exchange, hydrophobic interaction, and reverse-phase, high performance liquid chromatographies. The purified bacteriocin was subjected to mass spectrometry, amino acid composition analysis, and sequencing by Edman degradation. It was shown to be an about 5200-Da basic protein containing a high proportion of lysine and of hydrophobic amino acids. The partial N-terminal amino acid sequence (25 residues) was unique when compared with the Protein Data Bank (PDB), Swiss Prot, and Protein Information Resource (PIR) data banks and to the translated Gen Bank. Received: 24 July 1996 / Accepted: 10 September 1996  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号