期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

PSSARD: protein sequence-structure analysis relational database

Guruprasad K Srikanth K Babu AV 《International journal of biological macromolecules》2005,36(4):259-262

相似文献

2.

PROTMAP2D: visualization, comparison and analysis of 2D maps of protein structure

Pietal MJ Tuszynska I Bujnicki JM 《Bioinformatics (Oxford, England)》2007,23(11):1429-1430

MOTIVATION: Protein structure comparison is a fundamental problem in structural biology and bioinformatics. Two-dimensional maps of distances between residues in the structure contain sufficient information to restore the 3D representation, while maps of contacts reveal characteristic patterns of interactions between secondary and super-secondary structures and are very attractive for visual analysis. The overlap of 2D maps of two structures can be easily calculated, providing a sensitive measure of protein structure similarity. PROTMAP2D is a software tool for calculation of contact and distance maps based on user-defined criteria, quantitative comparison of pairs or series of contact maps (e.g. alternative models of the same protein, model versus native structure, different trajectories from molecular dynamics simulations, etc.) and visualization of the results. AVAILABILITY: PROTMAP2D for Windows / Linux / MacOSX is freely available for academic users from http://genesilico.pl/protmap2d.htm 相似文献

3.

Recognition of super-secondary structure in proteins

William R. Taylor Janet M. Thornton 《Journal of molecular biology》1984,173(4):487-514

A procedure to recognize super-secondary structure in protein sequences is described. An idealized template, derived from known super-secondary structures, is used to locate probable sites by matching with secondary structure probability profiles. We applied the method to the identification of βαβ units in β/α type proteins with 75% accuracy. The location of super-secondary structure was then used to refine the original (Garnier et al., 1978) secondary structure prediction resulting in an 8.8% improvement, which correctly assigned 83% of secondary structure elements in 14 proteins. Slight modifications to the Garnier et al. method arc suggested, producing a more accurate identification of protein class and a better prediction for β/α. type proteins. A method for the incorporation of hydrophobic information into the prediction is also described. 相似文献

4.

Alignment of protein sequences using secondary structure: a modified dynamic programming method

F Fischel-Ghodsian G Mathiowitz T F Smith 《Protein engineering》1990,3(7):577-581

A method for comparison of protein sequences based on their primary and secondary structure is described. Protein sequences are annotated with predicted secondary structures (using a modified Chou and Fasman method). Two lettered code sequences are generated (Xx, where X is the amino acid and x is its annotated secondary structure). Sequences are compared with a dynamic programming method (STRALIGN) that includes a similarity matrix for both the amino acids and secondary structures. The similarity value for each paired two-lettered code is a linear combination of similarity values for the paired amino acids and their annotated secondary structures. The method has been applied to eight globin proteins (28 pairs) for which the X-ray structure is known. For protein pairs with high primary sequence similarity (greater than 45%), STRALIGN alignment is identical to that obtained by a dynamic programming method using only primary sequence information. However, alignment of protein pairs with lower primary sequence similarity improves significantly with the addition of secondary structure annotation. Alignment of the pair with the least primary sequence similarity of 16% was improved from 0 to 37% 'correct' alignment using this method. In addition, STRALIGN was successfully applied to seven pairs of distantly related cytochrome c proteins, and three pairs of distantly related picornavirus proteins. 相似文献

5.

Generalized comparative modeling (GENECOMP): a combination of sequence comparison, threading, and lattice modeling for protein structure prediction and refinement

Kolinski A Betancourt MR Kihara D Rotkiewicz P Skolnick J 《Proteins》2001,44(2):133-149

An improved generalized comparative modeling method, GENECOMP, for the refinement of threading models is developed and validated on the Fischer database of 68 probe-template pairs, a standard benchmark used to evaluate threading approaches. The basic idea is to perform ab initio folding using a lattice protein model, SICHO, near the template provided by the new threading algorithm PROSPECTOR. PROSPECTOR also provides predicted contacts and secondary structure for the template-aligned regions, and possibly for the unaligned regions by garnering additional information from other top-scoring threaded structures. Since the lowest-energy structure generated by the simulations is not necessarily the best structure, we employed two structure-selection protocols: distance geometry and clustering. In general, clustering is found to generate somewhat better quality structures in 38 of 68 cases. When applied to the Fischer database, the protocol does no harm and in a significant number of cases improves upon the initial threading model, sometimes dramatically. The procedure is readily automated and can be implemented on a genomic scale. 相似文献

6.

The HSSP database of protein structure-sequence alignments. 总被引：3，自引：0，他引：3

下载免费PDF全文

C Sander R Schneider 《Nucleic acids research》1994,22(17):3597-3599

HSSP (homology-derived structures of proteins) is a derived database merging structural (2-D and 3-D) and sequence information (1-D). For each protein of known 3D structure from the Protein Data Bank, the database has a file with all sequence homologues, properly aligned to the PDB protein. Homologues are very likely to have the same 3D structure as the PDB protein to which they have been aligned. As a result, the database is not only a database of sequence aligned sequence families, but it is also a database of implied secondary and tertiary structures. 相似文献

7.

PDBsum: summaries and analyses of PDB structures 总被引：10，自引：2，他引：8

Laskowski RA 《Nucleic acids research》2001,29(1):221-222

相似文献

8.

STRALCP--structure alignment-based clustering of proteins

Zemla A Geisbrecht B Smith J Lam M Kirkpatrick B Wagner M Slezak T Zhou CE 《Nucleic acids research》2007,35(22):e150

Protein structural annotation and classification is an important and challenging problem in bioinformatics. Research towards analysis of sequence-structure correspondences is critical for better understanding of a protein's structure, function, and its interaction with other molecules. Clustering of protein domains based on their structural similarities provides valuable information for protein classification schemes. In this article, we attempt to determine whether structure information alone is sufficient to adequately classify protein structures. We present an algorithm that identifies regions of structural similarity within a given set of protein structures, and uses those regions for clustering. In our approach, called STRALCP (STRucture ALignment-based Clustering of Proteins), we generate detailed information about global and local similarities between pairs of protein structures, identify fragments (spans) that are structurally conserved among proteins, and use these spans to group the structures accordingly. We also provide a web server at http://as2ts.llnl.gov/AS2TS/STRALCP/ for selecting protein structures, calculating structurally conserved regions and performing automated clustering. 相似文献

9.

A fast method of comparing protein structures 总被引：1，自引：0，他引：1

M R Murthy 《FEBS letters》1984,168(1):97-102

Comparative studies on protein structures form an integral part of protein crystallography. Here, a fast method of comparing protein structures is presented. Protein structures are represented as a set of secondary structural elements. The method also provides information regarding preferred packing arrangements and evolutionary dynamics of secondary structural elements. This information is not easily obtained from previous methods. In contrast to those methods, the present one can be used only for proteins with some secondary structure. The method is illustrated with globin folds, cytochromes and dehydrogenases as examples. 相似文献

10.

Prediction of the three-dimensional structure of proteins using the electrostatic screening model and hierarchic condensation

Franc Avbelj Ljudmila Fele 《Proteins》1998,31(1):74-96

We describe a method for predicting the three-dimensional (3-D) structure of proteins from their sequence alone. The method is based on the electrostatic screening model for the stability of the protein main-chain conformation. The free energy of a protein as a function of its conformation is obtained from the potentials of mean force analysis of high-resolution x-ray protein structures. The free energy function is simple and contains only 44 fitted coefficients. The minimization of the free energy is performed by the torsion space Monte Carlo procedure using the concept of hierarchic condensation. The Monte Carlo minimization procedure is applied to predict the secondary, super-secondary, and native 3-D structures of 12 proteins with 28–110 amino acids. The 3-D structures of the majority of local secondary and super-secondary structures are predicted accurately. This result suggests that control in forming the native-like local structure is distributed along the entire protein sequence. The native 3-D structure is predicted correctly for 3 of 12 proteins composed mainly from the α-helices. The method fails to predict the native 3-D structure of proteins with a predominantly β secondary structure. We suggest that the hierarchic condensation is not an appropriate procedure for simulating the folding of proteins made up primarily from β-strands. The method has been proved accurate in predicting the local secondary and super-secondary structures in the blind ab initio 3-D prediction experiment. Proteins 31:74–96, 1998. © 1998 Wiley-Liss, Inc. 相似文献

11.

Refolding of Myoglobins of Nonhomologous Sequences by the Island Model

Yukio Kobayashi Nobuhiko Saitô 《Journal of Protein Chemistry》1997,16(2):83-98

The folding process of sea hare myoglobin was simulated by the island model, which does not rely on sequence homologies or statistical inference from database of known structure. Sea hare myoglobin has low sequence homology (28%), but high structural similarity, with sperm whale myoglobin, which was already simulated by the island model. Their structural similarity is shown physiochemically from the distribution of hydrophobic-residue pairs, that is, the key pairs for packing of the secondary structures. Irrelevant to the sequence homology, the secondary structures can be packed into the tertiary structure through the hydrophobic interactions among the amino acid pairs responsible for the local structure formation. The results on the two species of myoglobins indicate that, in contrast to other prediction methods, the island model is applicable to any type of protein without extra information other than the distribution of hydrophobic-residue pairs and the positions of the secondary structures. Consequently the present results provide another verification of the validity of the island model for elucidating the mechanisms of protein folding and predicting protein structures. 相似文献

12.

DRSP：蛋白质单残基替换结构数据库

刘继龙苗智超李雷肖智雄曹洋《生物化学与生物物理进展》2016,43(8):810-816

蛋白质残基替换是基因突变的产物之一,它可能改变蛋白质三维结构,对其生物学功能产生重大影响,因此研究蛋白质残基替换与结构改变的关系具有重要意义．随着实验解析蛋白质结构的数量迅猛增长,越来越多的野生型-突变体被应用于结构生物学的比较研究中．本研究从蛋白质三维结构数据库(PDB)出发,收集和计算了大量结构特征数据,构建了一个目前已知最大的野生型-突变体(单残基差异)的结构对数据库DRSP,展示出氨基酸类型和主链偏好性对结构保守性的相关性．DRSP的开放使用可为高精度的蛋白质结构分析预测提供有用信息,它的数据库网址是http://www.labshare.cn/drsp/index.php．相似文献

13.

Building multiple sequence alignments with a flavor of HSSP alignments

Higa RH Cruz SA Kuser PR Yamagishi ME Fileto R Oliveira SR Mazoni I Santos EH Mancini AL Neshich G 《Genetics and molecular research : GMR》2006,5(1):127-137

Homology-derived secondary structure of proteins (HSSP) is a well-known database of multiple sequence alignments (MSAs) which merges information of protein sequences and their three-dimensional structures. It is available for all proteins whose structure is deposited in the PDB. It is also used by STING and (Java)Protein Dossier to calculate and present relative entropy as a measure of the degree of conservation for each residue of proteins whose structure has been solved and deposited in the PDB. However, if the STING and (Java)Protein Dossier are to provide support for analysis of protein structures modeled in computers or being experimentally solved but not yet deposited in the PDB, then we need a new method for building alignments having a flavor of HSSP alignments (myMSAr). The present study describes a new method and its corresponding databank (SH2QS--database of sequences homologue to the query [structure-having] sequence). Our main interest in making myMSAr was to measure the degree of residue conservation for a given query sequence, regardless of whether it has a corresponding structure deposited in the PDB. In this study, we compare the measurement of residue conservation provided by corresponding alignments produced by HSSP and SH2QS. As a case study, we also present two biologically relevant examples, the first one highlighting the equivalence of analysis of the degree of residue conservation by using HSSP or SH2QS alignments, and the second one presenting the degree of residue conservation for a structure modeled in a computer, which , as a consequence, does not have an alignment reported by HSSP. 相似文献

14.

The HSSP database of protein structure-sequence alignments. 总被引：2，自引：0，他引：2

下载免费PDF全文

R Schneider C Sander 《Nucleic acids research》1996,24(1):201-205

HSSP is a derived database merging structural three dimensional (3-D) and sequence one dimensional(1-D) information. For each protein of known 3-D structure from the Protein Data Bank (PDB), the database has a multiple sequence alignment of all available homologues and a sequence profile characteristic of the family. The list of homologues is the result of a database search in Swissprot using a position-weighted dynamic programming method for sequence profile alignment (MaxHom). The database is updated frequently. The listed homologues are very likely to have the same 3-D structure as the PDB protein to which they have been aligned. As a result, the database is not only a database of aligned sequence families, but also a database of implied secondary and tertiary structures covering 27% of all Swissprot-stored sequences. 相似文献

15.

The impact of AlphaFold Protein Structure Database on the fields of life sciences

Mihaly Varadi Sameer Velankar 《Proteomics》2023,23(17):2200128

Arguably, 2020 was the year of high-accuracy protein structure predictions, with AlphaFold 2.0 achieving previously unseen accuracy in the Critical Assessment of Protein Structure Prediction (CASP). In 2021, DeepMind and EMBL-EBI developed the AlphaFold Protein Structure Database to make an unprecedented number of reliable protein structure predictions easily accessible to the broad scientific community. We provide a brief overview and describe the latest developments in the AlphaFold database. We highlight how the fields of data services, bioinformatics, structural biology, and drug discovery are directly affected by the influx of protein structure data. We also show examples of cutting-edge research that took advantage of the AlphaFold database. It is apparent that connections between various fields through protein structures are now possible, but the amount of data poses new challenges. Finally, we give an outlook regarding the future direction of the database, both in terms of data sets and new functionalities. 相似文献

16.

The HSSP database of protein structure-sequence alignments and family profiles. 总被引：9，自引：0，他引：9

下载免费PDF全文

C Dodge R Schneider C Sander 《Nucleic acids research》1998,26(1):313-315

HSSP (http: //www.sander.embl-ebi.ac.uk/hssp/) is a derived database merging structure (3-D) and sequence (1-D) information. For each protein of known 3D structure from the Protein Data Bank (PDB), we provide a multiple sequence alignment of putative homologues and a sequence profile characteristic of the protein family, centered on the known structure. The list of homologues is the result of an iterative database search in SWISS-PROT using a position-weighted dynamic programming method for sequence profile alignment (MaxHom). The database is updated frequently. The listed putative homologues are very likely to have the same 3D structure as the PDB protein to which they have been aligned. As a result, the database not only provides aligned sequence families, but also implies secondary and tertiary structures covering 33% of all sequences in SWISS-PROT. 相似文献

17.

Predicting protein secondary structure content. A tandem neural network approach.

S M Muskal S H Kim 《Journal of molecular biology》1992,225(3):713-727

A priori knowledge of secondary structure content can be of great use in theoretical and experimental determination of protein structure. We present a method that uses two computer-simulated neural networks placed in "tandem" to predict the secondary structure content of water-soluble, globular proteins. The first of the two networks, NET1, predicts a protein's helix and strand content given information about the protein's amino acid composition, molecular weight and heme presence. Because NET1 contained more adjustable parameters (network weights) than learning examples, this network experienced problems with memorization, which is the inability to generalize onto new, never-seen-before examples. To overcome this problem, we designed a second network, NET2, which learned to determine when NET1 was in a state of generalization. Together, these two networks produce prediction errors as low as 5.0% and 5.6% for helix and strand content, respectively, on a set of protein crystal structures bearing little homology to those used in network training. A comparison between three other methods including a multiple linear regression analysis, a non-hidden-node network analysis and a secondary structure assignment analysis reveals that our tandem neural network scheme is, indeed, the best method for predicting secondary structure content. The results of our analysis suggest that the knowledge of sequence information is not necessary for highly accurate predictions of protein secondary structure content. 相似文献

18.

The HSSP database of protein structure-sequence alignments. 总被引：4，自引：0，他引：4

下载免费PDF全文

R Schneider A de Daruvar C Sander 《Nucleic acids research》1997,25(1):226-230

HSSP is a derived database merging structural (3-D) and sequence (1-D) information. For each protein of known 3-D structure from the Protein Data Bank (PDB), the database has a multiple sequence alignment of all available homologues and a sequence profile characteristic of the family. The list of homologues is the result of a database search in SwissProt using a position-weighted dynamic programming method for sequence profile alignment (MaxHom). The database is updated frequently. The listed homologues are very likely to have the same 3-D structure as the PDB protein to which they have been aligned. As a result, the database is not only a database of aligned sequence families, but also a database of implied secondary and tertiary structures covering 29% of all SwissProt-stored sequences. 相似文献

19.

Protein structure databases with new web services for structural biology and biomedical research

Standley DM Kinjo AR Kinoshita K Nakamura H 《Briefings in bioinformatics》2008,9(4):276-285

The Protein Data Bank Japan (PDBj) curates, edits and distributes protein structural data as a member of the worldwide Protein Data Bank (wwPDB) and currently processes approximately 25-30% of all deposited data in the world. Structural information is enhanced by the addition of biological and biochemical functional data as well as experimental details extracted from the literature and other databases. Several applications have been developed at PDBj for structural biology and biomedical studies: (i) a Java-based molecular graphics viewer, jV; (ii) display of electron density maps for the evaluation of structure quality; (iii) an extensive database of molecular surfaces for functional sites, eF-site, as well as a search service for similar molecular surfaces, eF-seek; (iv) identification of sequence and structural neighbors; (v) a graphical user interface to all known protein folds with links to the above applications, Protein Globe. Recent examples are shown that highlight the utility of these tools in recognizing remote homologies between pairs of protein structures and in assigning putative biochemical functions to newly determined targets from structural genomics projects. 相似文献

20.

A rapid classification protocol for the CATH Domain Database to support structural genomics 总被引：1，自引：0，他引：1

Pearl FM Martin N Bray JE Buchan DW Harrison AP Lee D Reeves GA Shepherd AJ Sillitoe I Todd AE Thornton JM Orengo CA 《Nucleic acids research》2001,29(1):223-227

In order to support the structural genomic initiatives, both by rapidly classifying newly determined structures and by suggesting suitable targets for structure determination, we have recently developed several new protocols for classifying structures in the CATH domain database (http://www.biochem.ucl.ac.uk/bsm/cath). These aim to increase the speed of classification of new structures using fast algorithms for structure comparison (GRATH) and to improve the sensitivity in recognising distant structural relatives by incorporating sequence information from relatives in the genomes (DomainFinder). In order to ensure the integrity of the database given the expected increase in data, the CATH Protein Family Database (CATH-PFDB), which currently includes 25,320 structural domains and a further 160,000 sequence relatives has now been installed in a relational ORACLE database. This was essential for developing more rigorous validation procedures and for allowing efficient querying of the database, particularly for genome analysis. The associated Dictionary of Homologous Superfamilies [Bray,J.E., Todd,A.E., Pearl,F.M.G., Thornton,J.M. and Orengo,C.A. (2000) Protein Eng., 13, 153-165], which provides multiple structural alignments and functional information to assist in assigning new relatives, has also been expanded recently and now includes information for 903 homologous superfamilies. In order to improve coverage of known structures, preliminary classification levels are now provided for new structures at interim stages in the classification protocol. Since a large proportion of new structures can be rapidly classified using profile-based sequence analysis [e.g. PSI-BLAST: Altschul,S.F., Madden,T.L., Schaffer,A.A., Zhang,J., Zhang,Z., Miller,W. and Lipman,D.J. (1997) Nucleic Acids Res., 25, 3389-3402], this provides preliminary classification for easily recognisable homologues, which in the latest release of CATH (version 1.7) represented nearly three-quarters of the non-identical structures. 相似文献