期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

ScanNet: A Web Server for Structure-based Prediction of Protein Binding Sites with Geometric Deep Learning

《Journal of molecular biology》2022,434(19):167758

Predicting the various binding sites of a protein from its structure sheds light on its function and paves the way towards design of interaction inhibitors. Here, we report ScanNet, a freely available web server for prediction of protein–protein, protein - disordered protein and protein - antibody binding sites from structure. ScanNet (Spatio-Chemical Arrangement of Neighbors Network) is an end-to-end, interpretable geometric deep learning model that learns spatio-chemical patterns directly from 3D structures. ScanNet consistently outperforms Machine Learning models based on handcrafted features and comparative modeling approaches. The web server is linked to both the PDB and AlphaFoldDB, and supports user-provided structure files. Predictions can be readily visualized on the website via the Molstar web app and locally via ChimeraX. ScanNet is available at http://bioinfo3d.cs.tau.ac.il/ScanNet/. 相似文献

2.

Learning protein multi-view features in complex space

Dong-Jun Yu Jun Hu Xiao-Wei Wu Hong-Bin Shen Jun Chen Zhen-Min Tang Jian Yang Jing-Yu Yang 《Amino acids》2013,44(5):1365-1379

Protein attribute prediction from primary sequences is an important task and how to extract discriminative features is one of the most crucial aspects. Because single-view feature cannot reflect all the information of a protein, fusing multi-view features is considered as a promising route to improve prediction accuracy. In this paper, we propose a novel framework for protein multi-view feature fusion: first, features from different views are parallely combined to form complex feature vectors; Then, we extend the classic principal component analysis to the generalized principle component analysis for further feature extraction from the parallely combined complex features, which lie in a complex space. Finally, the extracted features are used for prediction. Experimental results on different benchmark datasets and machine learning algorithms demonstrate that parallel strategy outperforms the traditional serial approach and is particularly helpful for extracting the core information buried among multi-view feature sets. A web server for protein structural class prediction based on the proposed method (COMSPA) is freely available for academic use at: http://www.csbio.sjtu.edu.cn/bioinf/COMSPA/. 相似文献

3.

TMpro web server and web service: transmembrane helix prediction through amino acid property analysis

Ganapathiraju M Jursa CJ Karimi HA Klein-Seetharaman J 《Bioinformatics (Oxford, England)》2007,23(20):2795-2796

TMpro is a transmembrane (TM) helix prediction algorithm that uses language processing methodology for TM segment identification. It is primarily based on the analysis of statistical distributions of properties of amino acids in transmembrane segments. This article describes the availability of TMpro on the internet via a web interface. The key features of the interface are: (i) output is generated in multiple formats including a user-interactive graphical chart which allows comparison of TMpro predicted segment locations with other labeled segments input by the user, such as predictions from other methods. (ii) Up to 5000 sequences can be submitted at a time for prediction. (iii) TMpro is available as a web server and is published as a web service so that the method can be accessed by users as well as other services depending on the need for data integration. Availability: http://linzer.blm.cs.cmu.edu/tmpro/ (web server and help), http://blm.sis.pitt.edu:8080/axis/services/TMProFetcherService (web service). 相似文献

4.

Rapid membrane protein topology prediction

Hennerdal A Elofsson A 《Bioinformatics (Oxford, England)》2011,27(9):1322-1323

State-of-the-art methods for topology of α-helical membrane proteins are based on the use of time-consuming multiple sequence alignments obtained from PSI-BLAST or other sources. Here, we examine if it is possible to use the consensus of topology prediction methods that are based on single sequences to obtain a similar accuracy as the more accurate multiple sequence-based methods. Here, we show that TOPCONS-single performs better than any of the other topology prediction methods tested here, but ~6% worse than the best method that is utilizing multiple sequence alignments. AVAILABILITY AND IMPLEMENTATION: TOPCONS-single is available as a web server from http://single.topcons.net/ and is also included for local installation from the web site. In addition, consensus-based topology predictions for the entire international protein index (IPI) is available from the web server and will be updated at regular intervals. 相似文献

5.

InterProSurf: a web server for predicting interacting sites on protein surfaces 总被引：2，自引：0，他引：2

Negi SS Schein CH Oezguen N Power TD Braun W 《Bioinformatics (Oxford, England)》2007,23(24):3397-3399

A new web server, InterProSurf, predicts interacting amino acid residues in proteins that are most likely to interact with other proteins, given the 3D structures of subunits of a protein complex. The prediction method is based on solvent accessible surface area of residues in the isolated subunits, a propensity scale for interface residues and a clustering algorithm to identify surface regions with residues of high interface propensities. Here we illustrate the application of InterProSurf to determine which areas of Bacillus anthracis toxins and measles virus hemagglutinin protein interact with their respective cell surface receptors. The computationally predicted regions overlap with those regions previously identified as interface regions by sequence analysis and mutagenesis experiments. AVAILABILITY: The InterProSurf web server is available at http://curie.utmb.edu/ 相似文献

6.

Tools for comparative protein structure modeling and analysis

下载免费PDF全文

Eswar N John B Mirkovic N Fiser A Ilyin VA Pieper U Stuart AC Marti-Renom MA Madhusudhan MS Yerkovich B Sali A 《Nucleic acids research》2003,31(13):3375-3380

The following resources for comparative protein structure modeling and analysis are described (http://salilab.org): MODELLER, a program for comparative modeling by satisfaction of spatial restraints; MODWEB, a web server for automated comparative modeling that relies on PSI-BLAST, IMPALA and MODELLER; MODLOOP, a web server for automated loop modeling that relies on MODELLER; MOULDER, a CPU intensive protocol of MODWEB for building comparative models based on distant known structures; MODBASE, a comprehensive database of annotated comparative models for all sequences detectably related to a known structure; MODVIEW, a Netscape plugin for Linux that integrates viewing of multiple sequences and structures; and SNPWEB, a web server for structure-based prediction of the functional impact of a single amino acid substitution. 相似文献

7.

TargetDB: a database of peptides targeting proteins to subcellular locations.

T Wei M O'Connell 《Bioinformatics (Oxford, England)》1999,15(9):765-766

SUMMARY: TargetDB is a relational database designed to represent data on protein targeting sequences, mutant signals, subcellular targets and source organisms. AVAILABILITY: TargetDB is accessible at http://molbio.nmsu.edu:81. The web interface supports both direct data authoring and database query functions. CONTACT: moconnel@nmsu. edu, tao_wei@hms.harvard.edu 相似文献

8.

LOC3D: annotate sub-cellular localization for protein structures

下载免费PDF全文

Nair R Rost B 《Nucleic acids research》2003,31(13):3337-3340

LOC3D (http://cubic.bioc.columbia.edu/db/LOC3d/) is both a weekly-updated database and a web server for predictions of sub-cellular localization for eukaryotic proteins of known three-dimensional (3D) structure. Localization is predicted using four different methods: (i) PredictNLS, prediction of nuclear proteins through nuclear localization signals; (ii) LOChom, inferring localization through sequence homology; (iii) LOCkey, inferring localization through automatic text analysis of SWISS-PROT keywords; and (iv) LOC3Dini, ab initio prediction through a system of neural networks and vector support machines. The final prediction is based on the method that predicts localization with the highest confidence. The LOC3D database currently contains predictions for >8700 eukaryotic protein chains taken from the Protein Data Bank (PDB). The web server can be used to predict sub-cellular localization for proteins for which only a predicted structure is available from threading servers. This makes the resource of particular interest to structural genomics initiatives. 相似文献

9.

EVA: Evaluation of protein structure prediction servers

下载免费PDF全文

Koh IY Eyrich VA Marti-Renom MA Przybylski D Madhusudhan MS Eswar N Graña O Pazos F Valencia A Sali A Rost B 《Nucleic acids research》2003,31(13):3311-3315

EVA (http://cubic.bioc.columbia.edu/eva/) is a web server for evaluation of the accuracy of automated protein structure prediction methods. The evaluation is updated automatically each week, to cope with the large number of existing prediction servers and the constant changes in the prediction methods. EVA currently assesses servers for secondary structure prediction, contact prediction, comparative protein structure modelling and threading/fold recognition. Every day, sequences of newly available protein structures in the Protein Data Bank (PDB) are sent to the servers and their predictions are collected. The predictions are then compared to the experimental structures once a week; the results are published on the EVA web pages. Over time, EVA has accumulated prediction results for a large number of proteins, ranging from hundreds to thousands, depending on the prediction method. This large sample assures that methods are compared reliably. As a result, EVA provides useful information to developers as well as users of prediction methods. 相似文献

10.

Predicting experimental properties of integral membrane proteins by a naive Bayes approach

Martin-Galiano AJ Smialowski P Frishman D 《Proteins》2008,70(4):1243-1256

Integral membrane proteins (iMPs) are challenging targets for structure determination because of the substantial experimental difficulties involved in their sample preparation. Accordingly, success rates of large-scale structural genomics consortia are much lower for this class of molecules compared to globular targets, underscoring the pressing need for predictive strategies to identify iMPs that are more likely to overcome laboratory bottlenecks. On the basis of the target status information available in the TargetDB repository, we describe the first large-scale analysis of experimental behavior of iMPs. Using information on recalcitrant and propagating iMP targets as negative and positive sets, respectively, we present naive Bayes classifiers capable of predicting, from sequence alone, those proteins that are more amenable to cloning, expression, and solubilization studies. Protein sequences are represented in the space of 72 features, including amino acid composition, occurrence of amino acid groups, ratios between residue groups, and hydrophobicity measures. Taking into account unequal representation of main taxonomic groups in the TargetDB, sequence database had a beneficial effect on the prediction results. The classifiers achieve accuracies of 70%, 63-70%, and 61% in predicting the amenability of iMPs for cloning, expression, and solubilization, respectively, thus making them useful tools in target selection for structure determination. Our assessment of prediction results clearly demonstrates that classifiers based on single features do not possess acceptable discriminative power and that the experimental behavior of iMPs is imprinted in their primary sequence through relationships between a restricted set of key properties. In most cases, sets of 10-20 protein features were found actually relevant, most notably, the content of isoleucine, valine, and positively-charged residues. 相似文献

11.

TSSub: eukaryotic protein subcellular localization by extracting features from profiles

Guo J Lin Y 《Bioinformatics (Oxford, England)》2006,22(14):1784-1785

This paper introduces a new subcellular localization system (TSSub) for eukaryotic proteins. This system extracts features from both profiles and amino acid sequences. Four different features are extracted from profiles by four probabilistic neural network (PNN) classifiers, respectively (the amino acid composition from whole profiles; the amino acid composition from the N-terminus of profiles; the dipeptide composition from whole profiles and the amino acid composition from fragments of profiles). In addition, a support vector machine (SVM) classifier is added to implement the residue-couple feature extracted from amino acid sequences. The results from the five classifiers are fused by an additional SVM classifier. The overall accuracies of this TSSub reach 93.0 and 77.4% on Reinhardt and Hubbard's eukaryotic protein dataset and Huang and Li's eukaryotic protein dataset, respectively. The comparison with existing methods results shows TSSub provides better prediction performance than existing methods. AVAILABILITY: The web server is available from http://166.111.24.5/webtools/TSSub/index.html. 相似文献

12.

Predicting protein sumoylation sites from sequence features

Teng S Luo H Wang L 《Amino acids》2012,43(1):447-455

Protein sumoylation is a post-translational modification that plays an important role in a wide range of cellular processes. Small ubiquitin-related modifier (SUMO) can be covalently and reversibly conjugated to the sumoylation sites of target proteins, many of which are implicated in various human genetic disorders. The accurate prediction of protein sumoylation sites may help biomedical researchers to design their experiments and understand the molecular mechanism of protein sumoylation. In this study, a new machine learning approach has been developed for predicting sumoylation sites from protein sequence information. Random forests (RFs) and support vector machines (SVMs) were trained with the data collected from the literature. Domain-specific knowledge in terms of relevant biological features was used for input vector encoding. It was shown that RF classifier performance was affected by the sequence context of sumoylation sites, and 20 residues with the core motif ΨKXE in the middle appeared to provide enough context information for sumoylation site prediction. The RF classifiers were also found to outperform SVM models for predicting protein sumoylation sites from sequence features. The results suggest that the machine learning approach gives rise to more accurate prediction of protein sumoylation sites than the other existing methods. The accurate classifiers have been used to develop a new web server, called seeSUMO (http://bioinfo.ggc.org/seesumo/), for sequence-based prediction of protein sumoylation sites. 相似文献

13.

BetaTPred: prediction of beta-TURNS in a protein using statistical algorithms

Kaur H Raghava GP 《Bioinformatics (Oxford, England)》2002,18(3):498-499

MOTIVATION: beta-turns play an important role from a structural and functional point of view. beta-turns are the most common type of non-repetitive structures in proteins and comprise on average, 25% of the residues. In the past numerous methods have been developed to predict beta-turns in a protein. Most of these prediction methods are based on statistical approaches. In order to utilize the full potential of these methods, there is a need to develop a web server. RESULTS: This paper describes a web server called BetaTPred, developed for predicting beta-TURNS in a protein from its amino acid sequence. BetaTPred allows the user to predict turns in a protein using existing statistical algorithms. It also allows to predict different types of beta-TURNS e.g. type I, I', II, II', VI, VIII and non-specific. This server assists the users in predicting the consensus beta-TURNS in a protein. AVAILABILITY: The server is accessible from http://imtech.res.in/raghava/betatpred/ 相似文献

14.

SPOT-Disorder2: Improved Protein Intrinsic Disorder Prediction by Ensembled Deep Learning

《基因组蛋白质组与生物信息学报(英文版)》2019,17(6):645-656

Intrinsically disordered or unstructured proteins (or regions in proteins) have been found to be important in a wide range of biological functions and implicated in many diseases. Due to the high cost and low efficiency of experimental determination of intrinsic disorder and the exponential increase of unannotated protein sequences, developing complementary computational prediction methods has been an active area of research for several decades. Here, we employed an ensemble of deep Squeeze-and-Excitation residual inception and long short-term memory (LSTM) networks for predicting protein intrinsic disorder with input from evolutionary information and predicted one-dimensional structural properties. The method, called SPOT-Disorder2, offers substantial and consistent improvement not only over our previous technique based on LSTM networks alone, but also over other state-of-the-art techniques in three independent tests with different ratios of disordered to ordered amino acid residues, and for sequences with either rich or limited evolutionary information. More importantly, semi-disordered regions predicted in SPOT-Disorder2 are more accurate in identifying molecular recognition features (MoRFs) than methods directly designed for MoRFs prediction. SPOT-Disorder2 is available as a web server and as a standalone program at https://sparks-lab.org/server/spot-disorder2/. 相似文献

15.

pTARGET [corrected] a new method for predicting protein subcellular localization in eukaryotes 总被引：2，自引：0，他引：2

Guda C Subramaniam S 《Bioinformatics (Oxford, England)》2005,21(21):3963-3969

MOTIVATION: There is a scarcity of efficient computational methods for predicting protein subcellular localization in eukaryotes. Currently available methods are inadequate for genome-scale predictions with several limitations. Here, we present a new prediction method, pTARGET that can predict proteins targeted to nine different subcellular locations in the eukaryotic animal species. RESULTS: The nine subcellular locations predicted by pTARGET include cytoplasm, endoplasmic reticulum, extracellular/secretory, golgi, lysosomes, mitochondria, nucleus, plasma membrane and peroxisomes. Predictions are based on the location-specific protein functional domains and the amino acid compositional differences across different subcellular locations. Overall, this method can predict 68-87% of the true positives at accuracy rates of 96-99%. Comparison of the prediction performance against PSORT showed that pTARGET prediction rates are higher by 11-60% in 6 of the 8 locations tested. Besides, the pTARGET method is robust enough for genome-scale prediction of protein subcellular localizations since, it does not rely on the presence of signal or target peptides. AVAILABILITY: A public web server based on the pTARGET method is accessible at the URL http://bioinformatics.albany.edu/~ptarget. Datasets used for developing pTARGET can be downloaded from this web server. Source code will be available on request from the corresponding author. 相似文献

16.

EVA: continuous automatic evaluation of protein structure prediction servers. 总被引：5，自引：0，他引：5

V A Eyrich M A Martí-Renom D Przybylski M S Madhusudhan A Fiser F Pazos A Valencia A Sali B Rost 《Bioinformatics (Oxford, England)》2001,17(12):1242-1243

Evaluation of protein structure prediction methods is difficult and time-consuming. Here, we describe EVA, a web server for assessing protein structure prediction methods, in an automated, continuous and large-scale fashion. Currently, EVA evaluates the performance of a variety of prediction methods available through the internet. Every week, the sequences of the latest experimentally determined protein structures are sent to prediction servers, results are collected, performance is evaluated, and a summary is published on the web. EVA has so far collected data for more than 3000 protein chains. These results may provide valuable insight to both developers and users of prediction methods. AVAILABILITY: http://cubic.bioc.columbia.edu/eva. CONTACT: eva@cubic.bioc.columbia.edu 相似文献

17.

In silico structural and functional modelling of Antifreeze protein (AFP) sequences of Ocean pout (Zoarces americanus, Bloch & Schneider 1801)

Manojit Bhattacharya Arpita Hota Avijit Kar Deep Sankar Chini Ramesh Chandra Malick Bidhan Chandra Patra Basanta Kumar Das 《Journal of Genetic Engineering and Biotechnology》2018,16(2):721-730

Antifreeze proteins (AFPs) are known to polypeptide components formed by certain plants, animals, fungi and bacteria which support to survive in sub-zero temperature. Current study highlighted the seven different antifreeze proteins of fish Ocean pout (Zoarces americanus), in which protein (amino acids sequence) were collected from National Centre for Biotechnology Information and finely characterized using several in silico tools. Such biocomputational techniques applied to figure out the physicochemical, functional and conformational characteristics of targeted AFPs. Multiple physicochemical properties such as Isoelectric Point, Extinction Coefficient and Instability Index, Aliphatic Index, Grand Average Hydropathy were calculated and analysed by ExPASy-ProtParam prediction web server. EMBOSS: pepwheel online tool was used to represent the protein sequences in a helical form. The primary structure analysis shows that most of the AFPs are hydrophobic in nature due to the high content of non-polar residues. The secondary structure of these proteins was calculated using SOPMA tool. SOSUI server and CYS_REC program also run for ideal prediction of transmembrane helices and disulfide bridges of experimental proteins respectively. The modelling of 3D structures of seven desired AFPs were executed by the homology modelling programmes; SWISS MODEL and ProSA web server. UCSF Chimera, Antheprot 3D, PyMOL and RAMPAGE were used to visualize and analysis of the structural variation of the predicted protein model. MEGA7.0.9 software used to know the phylogenetic relationship among these AFPs. These models offered excellent and reliable baseline information for functional characterization of the experimentally derived protein domain composition by using the advanced tools and techniques of Computational Biology. 相似文献

18.

PPO: predictor for prokaryotic operons

Chuang LY Tsai JH Yang CH 《Bioinformatics (Oxford, England)》2010,26(24):3127-3128

SUMMARY: We present an operon predictor for prokaryotic operons (PPO), which can predict operons in the entire prokaryotic genome. The prediction algorithm used in PPO allows the user to select binary particle swarm optimization (BPSO), a genetic algorithm (GA) or some other methods introduced in the literature to predict operons. The operon predictor on our web server and the provided database are easy to access and use. The main features offered are: (i) selection of the prediction algorithm; (ii) adjustable parameter settings of the prediction algorithm; (iii) graphic visualization of results; (iv) integrated database queries; (v) listing of experimentally verified operons; and (vi) related tools. Availability and implementation: PPO is freely available at http://bio.kuas.edu.tw/PPO/. 相似文献

19.

T-Epitope Designer: A HLA-peptide binding prediction server

Kangueane P Sakharkar MK 《Bioinformation》2005,1(1):21-24

The current challenge in synthetic vaccine design is the development of a methodology to identify and test short antigen peptides as potential T-cell epitopes. Recently, we described a HLA-peptide binding model (using structural properties) capable of predicting peptides binding to any HLA allele. Consequently, we have developed a web server named T-EPITOPE DESIGNER to facilitate HLA-peptide binding prediction. The prediction server is based on a model that defines peptide binding pockets using information gleaned from X-ray crystal structures of HLA-peptide complexes, followed by the estimation of peptide binding to binding pockets. Thus, the prediction server enables the calculation of peptide binding to HLA alleles. This model is superior to many existing methods because of its potential application to any given HLA allele whose sequence is clearly defined. The web server finds potential application in T cell epitope vaccine design. AVAILABILITY: http://www.bioinformation.net/ted/ 相似文献

20.

In silico platform for predicting and initiating β‐turns in a protein at desired locations

下载免费PDF全文

Gajendra P. S. Raghava 《Proteins》2015,83(5):910-921

Numerous studies have been performed for analysis and prediction of β‐turns in a protein. This study focuses on analyzing, predicting, and designing of β‐turns to understand the preference of amino acids in β‐turn formation. We analyzed around 20,000 PDB chains to understand the preference of residues or pair of residues at different positions in β‐turns. Based on the results, a propensity‐based method has been developed for predicting β‐turns with an accuracy of 82%. We introduced a new approach entitled “Turn level prediction method,” which predicts the complete β‐turn rather than focusing on the residues in a β‐turn. Finally, we developed BetaTPred3, a Random forest based method for predicting β‐turns by utilizing various features of four residues present in β‐turns. The BetaTPred3 achieved an accuracy of 79% with 0.51 MCC that is comparable or better than existing methods on BT426 dataset. Additionally, models were developed to predict β‐turn types with better performance than other methods available in the literature. In order to improve the quality of prediction of turns, we developed prediction models on a large and latest dataset of 6376 nonredundant protein chains. Based on this study, a web server has been developed for prediction of β‐turns and their types in proteins. This web server also predicts minimum number of mutations required to initiate or break a β‐turn in a protein at specified location of a protein. Proteins 2015; 83:910–921. © 2015 Wiley Periodicals, Inc. 相似文献