首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Transmembrane proteins affect vital cellular functions and pathogenesis, and are a focus of drug design. It is difficult to obtain diffraction quality crystals to study transmembrane protein structure. Computational tools for transmembrane protein topology prediction fill in the gap between the abundance of transmembrane proteins and the scarcity of known membrane protein structures. Their prediction accuracy is still inadequate: TMHMM, the current state-of-the-art method, has less than 52% accuracy in topology prediction on one set of transmembrane proteins of known topology. Based on the observation that there are functional domains that occur preferentially internal or external to the membrane, we have extended the model of TMHMM to incorporate functional domains, using a probabilistic approach originally developed for computational gene finding. Our extension is better than TMHMM in predicting the topology of transmembrane proteins. As prediction of functional domain improves, our system's prediction accuracy will likely improve as well.  相似文献   

2.
We describe and validate a new membrane protein topology prediction method, TMHMM, based on a hidden Markov model. We present a detailed analysis of TMHMM's performance, and show that it correctly predicts 97-98 % of the transmembrane helices. Additionally, TMHMM can discriminate between soluble and membrane proteins with both specificity and sensitivity better than 99 %, although the accuracy drops when signal peptides are present. This high degree of accuracy allowed us to predict reliably integral membrane proteins in a large collection of genomes. Based on these predictions, we estimate that 20-30 % of all genes in most genomes encode membrane proteins, which is in agreement with previous estimates. We further discovered that proteins with N(in)-C(in) topologies are strongly preferred in all examined organisms, except Caenorhabditis elegans, where the large number of 7TM receptors increases the counts for N(out)-C(in) topologies. We discuss the possible relevance of this finding for our understanding of membrane protein assembly mechanisms. A TMHMM prediction service is available at http://www.cbs.dtu.dk/services/TMHMM/.  相似文献   

3.
We have developed reliability scores for five widely used membrane protein topology prediction methods, and have applied them both on a test set of 92 bacterial plasma membrane proteins with experimentally determined topologies and on all predicted helix bundle membrane proteins in three fully sequenced genomes: Escherichia coli, Saccharomyces cerevisiae and Caenorhabditis elegans. We show that the reliability scores work well for the TMHMM and MEMSAT methods, and that they allow the probability that the predicted topology is correct to be estimated for any protein. We further show that the available test set is biased towards high-scoring proteins when compared to the genome-wide data sets, and provide estimates for the expected prediction accuracy of TMHMM across the three genomes. Finally, we show that the performance of TMHMM is considerably better when limited experimental information (such as the in/out location of a protein's C terminus) is available, and estimate that at least ten percentage points in overall accuracy in whole-genome predictions can be gained in this way.  相似文献   

4.
Evaluation of methods for the prediction of membrane spanning regions.   总被引:20,自引:0,他引:20  
MOTIVATION: A variety of tools are available to predict the topology of transmembrane proteins. To date no independent evaluation of the performance of these tools has been published. A better understanding of the strengths and weaknesses of the different tools would guide both the biologist and the bioinformatician to make better predictions of membrane protein topology. RESULTS: Here we present an evaluation of the performance of the currently best known and most widely used methods for the prediction of transmembrane regions in proteins. Our results show that TMHMM is currently the best performing transmembrane prediction program.  相似文献   

5.
MOTIVATION: Many important biological processes such as cell signaling, transport of membrane-impermeable molecules, cell-cell communication, cell recognition and cell adhesion are mediated by membrane proteins. Unfortunately, as these proteins are not water soluble, it is extremely hard to experimentally determine their structure. Therefore, improved methods for predicting the structure of these proteins are vital in biological research. In order to improve transmembrane topology prediction, we evaluate the combined use of both integrated signal peptide prediction and evolutionary information in a single algorithm. RESULTS: A new method (MEMSAT3) for predicting transmembrane protein topology from sequence profiles is described and benchmarked with full cross-validation on a standard data set of 184 transmembrane proteins. The method is found to predict both the correct topology and the locations of transmembrane segments for 80% of the test set. This compares with accuracies of 62-72% for other popular methods on the same benchmark. By using a second neural network specifically to discriminate transmembrane from globular proteins, a very low overall false positive rate (0.5%) can also be achieved in detecting transmembrane proteins. AVAILABILITY: An implementation of the described method is available both as a web server (http://www.psipred.net) and as downloadable source code from http://bioinf.cs.ucl.ac.uk/memsat. Both the server and source code files are free to non-commercial users. Benchmark and training data are also available from http://bioinf.cs.ucl.ac.uk/memsat.  相似文献   

6.
Since membranous proteins play a key role in drug targeting therefore transmembrane proteins prediction is active and challenging area of biological sciences. Location based prediction of transmembrane proteins are significant for functional annotation of protein sequences. Hidden markov model based method was widely applied for transmembrane topology prediction. Here we have presented a revised and a better understanding model than an existing one for transmembrane protein prediction. Scripting on MATLAB was built and compiled for parameter estimation of model and applied this model on amino acid sequence to know the transmembrane and its adjacent locations. Estimated model of transmembrane topology was based on TMHMM model architecture. Only 7 super states are defined in the given dataset, which were converted to 96 states on the basis of their length in sequence. Accuracy of the prediction of model was observed about 74 %, is a good enough in the area of transmembrane topology prediction. Therefore we have concluded the hidden markov model plays crucial role in transmembrane helices prediction on MATLAB platform and it could also be useful for drug discovery strategy. AVAILABILITY: The database is available for free at bioinfonavneet@gmail.comvinaysingh@bhu.ac.in.  相似文献   

7.
Membrane proteins, which constitute approximately 20% of most genomes, form two main classes: alpha helical and beta barrel transmembrane proteins. Using methods based on Bayesian Networks, a powerful approach for statistical inference, we have sought to address beta-barrel topology prediction. The beta-barrel topology predictor reports individual strand accuracies of 88.6%. The method outlined here represents a potentially important advance in the computational determination of membrane protein topology.  相似文献   

8.
Genomics and proteomics have added valuable information to our knowledgebase of the human biological system including the discovery of therapeutic targets and disease biomarkers. However, molecular profiling studies commonly result in the identification of novel proteins of unknown localization. A class of proteins of special interest is membrane proteins, in particular plasma membrane proteins. Despite their biological and medical significance, the 3-dimensional structures of less than 1% of plasma membrane proteins have been determined. In order to aid in identification of membrane proteins, a number of computational methods have been developed. These tools operate by predicting the presence of transmembrane segments. Here, we utilized five topology prediction methods (TMHMM, SOSUI, waveTM, HMMTOP, and TopPred II) in order to estimate the ratio of integral membrane proteins in the human proteome. These methods employ different algorithms and include a newly-developed method (waveTM) that has yet to be tested on a large proteome database. Since these tools are prone for error mainly as a result of falsely predicting signal peptides as transmembrane segments, we have utilized an additional method, SignalP. Based on our analyses, the ratio of human proteins with transmembrane segments is estimated to fall between 15% and 39% with a consensus of 13%. Agreement among the programs is reduced further when both a positive identification of a membrane protein and the number of transmembrane segments per protein are considered. Such a broad range of prediction depends on the selectivity of the individual method in predicting integral membrane proteins. These methods can play a critical role in determining protein structure and, hence, identifying suitable drug targets in humans.  相似文献   

9.
10.
Topology predictions for integral membrane proteins can be substantially improved if parts of the protein can be constrained to a given in/out location relative to the membrane using experimental data or other information. Here, we have identified a set of 367 domains in the SMART database that, when found in soluble proteins, have compartment-specific localization of a kind relevant for membrane protein topology prediction. Using these domains as prediction constraints, we are able to provide high-quality topology models for 11% of the membrane proteins extracted from 38 eukaryotic genomes. Two-thirds of these proteins are single spanning, a group of proteins for which current topology prediction methods perform particularly poorly.  相似文献   

11.
We have developed a method to reliably identify partial membrane protein topologies using the consensus of five topology prediction methods. When evaluated on a test set of experimentally characterized proteins, we find that approximately 90% of the partial consensus topologies are correctly predicted in membrane proteins from prokaryotic as well as eukaryotic organisms. Whole-genome analysis reveals that a reliable partial consensus topology can be predicted for approximately 70% of all membrane proteins in a typical bacterial genome and for approximately 55% of all membrane proteins in a typical eukaryotic genome. The average fraction of sequence length covered by a partial consensus topology is 44% for the prokaryotic proteins and 17% for the eukaryotic proteins in our test set, and similar numbers are found when the algorithm is applied to whole genomes. Reliably predicted partial topologies may simplify experimental determinations of membrane protein topology.  相似文献   

12.
Limited experimental data may be very useful to discriminate between membrane topology models of membrane proteins derived from different methods. A membrane topology screening method is proposed by which the cellular disposition of three positions in a membrane protein are determined, the N- and the C-termini and a position in the middle of the protein. The method involves amplification of the encoding genes or gene fragments by PCR, rapid cloning in dedicated vectors by ligation independent cloning, and determination of the cellular disposition of the three sites using conventional techniques. The N-terminus was determined by labeling with a fluorescent probe, the central position and the C-terminus by the reporter fusion technique using alkaline phosphatase (PhoA) and green fluorescence protein (GFP) as reporters. The method was evaluated using 16 transporter proteins of known function from four different structural classes. For 13 proteins a complete set of three localizations was obtained. The experimental data was used to discriminate between membrane topology models predicted by TMHMM, a widely used predictor using the amino acid sequence as input and by MemGen that uses hydropathy profile alignment and known 3D structures or existing models. It follows that in those cases where the models from the two methods were similar, the models were consistent with the experimental data. In those cases where the models differed, the MemGen model agreed with the experimental data. Three more recent predictors, MEMSAT3, OCTOPUS and TOPCONS showed a significantly higher consistency with the experimental data than observed with TMHMM.  相似文献   

13.
A combined transmembrane topology and signal peptide prediction method   总被引:31,自引:0,他引:31  
An inherent problem in transmembrane protein topology prediction and signal peptide prediction is the high similarity between the hydrophobic regions of a transmembrane helix and that of a signal peptide, leading to cross-reaction between the two types of predictions. To improve predictions further, it is therefore important to make a predictor that aims to discriminate between the two classes. In addition, topology information can be gained when successfully predicting a signal peptide leading a transmembrane protein since it dictates that the N terminus of the mature protein must be on the non-cytoplasmic side of the membrane. Here, we present Phobius, a combined transmembrane protein topology and signal peptide predictor. The predictor is based on a hidden Markov model (HMM) that models the different sequence regions of a signal peptide and the different regions of a transmembrane protein in a series of interconnected states. Training was done on a newly assembled and curated dataset. Compared to TMHMM and SignalP, errors coming from cross-prediction between transmembrane segments and signal peptides were reduced substantially by Phobius. False classifications of signal peptides were reduced from 26.1% to 3.9% and false classifications of transmembrane helices were reduced from 19.0% to 7.7%. Phobius was applied to the proteomes of Homo sapiens and Escherichia coli. Here we also noted a drastic reduction of false classifications compared to TMHMM/SignalP, suggesting that Phobius is well suited for whole-genome annotation of signal peptides and transmembrane regions. The method is available at as well as at  相似文献   

14.
Membrane proteins represent a significant fraction of all genomes and play key roles in many aspects of biology, but their structural analysis has been hampered by difficulties in large-scale production and crystallisation. To overcome the first of these hurdles, we present here a systematic approach for expression and affinity-tagging which takes into account transmembrane topology. Using a set of bacterial transporters with known topologies, we tested the efficacy of a panel of conventional and Gateway recombinational cloning vectors designed for protein expression under the control of the tac promoter, and for the addition of differing N- and C-terminal affinity tags. For transporters in which both termini are cytoplasmic, C-terminal oligohistidine tagging by recombinational cloning typically yielded functional protein at levels equivalent to or greater than those achieved by conventional cloning. In contrast, it was not effective for examples of the substantial minority of proteins that have one or both termini located on the periplasmic side of the membrane, possibly because of impairment of membrane insertion by the tag and/or att-site-encoded sequences. However, fusion either of an oligohistidine tag to cytoplasmic (but not periplasmic) termini, or of a Strep-tag II peptide to periplasmic termini using conventional cloning vectors did not interfere with membrane insertion, enabling high-level expression of such proteins. In conjunction with use of a C-terminal Lumio fluorescence tag, which we found to be compatible with both periplasmic and cytoplasmic locations, these findings offer a system for strategic planning of construct design for high throughput expression of membrane proteins for structural genomics projects.  相似文献   

15.
Zhang N  Chen R  Young N  Wishart D  Winter P  Weiner JH  Li L 《Proteomics》2007,7(4):484-493
Both organic solvent and surfactant have been used for dissolving membrane proteins for shotgun proteomics. In this work, two methods of protein solubilization, namely using 60% methanol or 1% SDS, to dissolve and analyze the inner membrane fraction of an Escherichia coli K12 cell lysate were compared. A total of 358 proteins (1417 unique peptides) from the methanol-solubilized protein mixture and 299 proteins (892 peptides) from the SDS-solubilized sample-were identified by using trypsin digestion and 2-D LC-ESI MS/MS. It was found that the methanol method detected more hydrophobic peptides, resulting in a greater number of proteins identified, than the SDS method. We found that 159 out of 358 proteins (44%) and 120 out of 299 proteins (40%) detected from the methanol- and SDS-solubilized samples, respectively, are integral membrane proteins. Among the 190 integral membrane proteins 70 were identified exclusively in the methanol-solubilized sample, 89 were identified by both methods, and only 31 proteins were exclusively identified by the SDS method. It is shown that the integral membrane proteins reflected the theoretical proteome for number of transmembrane helices, length, functional class, and topology, indicating there was no bias in the proteins identified.  相似文献   

16.
The prediction of a protein's structure from its amino acid sequence has been a long-standing goal of molecular biology. In this work, a new set of conformational parameters for membrane spanning alpha helices was developed using the information from the topology of 70 membrane proteins. Based on these conformational parameters, a simple algorithm has been formulated to predict the transmembrane alpha helices in membrane proteins. A FORTRAN program has been developed which takes the amino acid sequence as input and gives the predicted transmembrane alpha-helices as output. The present method correctly identifies 295 transmembrane helical segments in 70 membrane proteins with only two overpredictions. Furthermore, this method predicts all 45 transmembrane helices in the photosynthetic reaction center, bacteriorhodopsin and cytochrome c oxidase to an 86% level of accuracy and so is better than all other methods published to date.  相似文献   

17.
The prediction of transmembrane (TM) helix and topology provides important information about the structure and function of a membrane protein. Due to the experimental difficulties in obtaining a high-resolution model, computational methods are highly desirable. In this paper, we present a hierarchical classification method using support vector machines (SVMs) that integrates selected features by capturing the sequence-to-structure relationship and developing a new scoring function based on membrane protein folding. The proposed approach is evaluated on low- and high-resolution data sets with cross-validation, and the topology (sidedness) prediction accuracy reaches as high as 90%. Our method is also found to correctly predict both the location of TM helices and the topology for 69% of the low-resolution benchmark set. We also test our method for discrimination between soluble and membrane proteins and achieve very low overall false positive (0.5%) and false negative rates (0 to approximately 1.2%). Lastly, the analysis of the scoring function suggests that the topogeneses of single-spanning and multispanning TM proteins have different levels of complexity, and the consideration of interloop topogenic interactions for the latter is the key to achieving better predictions. This method can facilitate the annotation of membrane proteomes to extract useful structural and functional information. It is publicly available at http://bio-cluster.iis.sinica.edu.tw/~bioapp/SVMtop.  相似文献   

18.
Membrane proteins represent a significant fraction of all genomes and play key roles in many aspects of biology, but their structural analysis has been hampered by difficulties in large-scale production and crystallisation. To overcome the first of these hurdles, we present here a systematic approach for expression and affinity-tagging which takes into account transmembrane topology. Using a set of bacterial transporters with known topologies, we tested the efficacy of a panel of conventional and Gateway? recombinational cloning vectors designed for protein expression under the control of the tac promoter, and for the addition of differing N- and C-terminal affinity tags. For transporters in which both termini are cytoplasmic, C-terminal oligohistidine tagging by recombinational cloning typically yielded functional protein at levels equivalent to or greater than those achieved by conventional cloning. In contrast, it was not effective for examples of the substantial minority of proteins that have one or both termini located on the periplasmic side of the membrane, possibly because of impairment of membrane insertion by the tag and/or att-site-encoded sequences. However, fusion either of an oligohistidine tag to cytoplasmic (but not periplasmic) termini, or of a Strep-tag II peptide to periplasmic termini using conventional cloning vectors did not interfere with membrane insertion, enabling high-level expression of such proteins. In conjunction with use of a C-terminal Lumio? fluorescence tag, which we found to be compatible with both periplasmic and cytoplasmic locations, these findings offer a system for strategic planning of construct design for high throughput expression of membrane proteins for structural genomics projects.  相似文献   

19.
Käll L  Sonnhammer EL 《FEBS letters》2002,532(3):415-418
Transmembrane prediction methods are generally benchmarked on a set of proteins with experimentally verified topology. We have investigated if the accuracy measured on such datasets can be expected in an unbiased genomic analysis, or if there is a bias towards 'easily predictable' proteins in the benchmark datasets. As a measurement of accuracy, the concordance of the results from five different prediction methods was used (TMHMM, PHD, HMMTOP, MEMSAT, and TOPPRED). The benchmark dataset showed significantly higher levels (up to five times) of agreement between different methods than in 10 tested genomes. We have also analyzed which programs are most prone to make mispredictions by measuring the frequency of one-out-of-five disagreeing predictions.  相似文献   

20.
在基因组数据中,有20%~30%的产物被预测为跨膜蛋白,本文通过对膜蛋白拓扑结构预测方法进行分析,并评价其结果,为选择更合适的拓扑结构预测方法预测膜蛋白结构。通过对目前已有的拓扑结构预测方法的评价分析,可以为我们在实际工作中提供重要的参考。比如对一个未知拓扑结构的跨膜蛋白序列,我们可以先进行是否含有信号肽的预测,参考Polyphobius和SignalP两种方法,若两种方法预测结果不一致,综合上述对两种方法的评价,Polyphobius预测的综合能力较好,可取其预测的结果,一旦确定含有信号肽,则N端必然位于膜外侧。然后结合序列的长度,判断蛋白是单跨膜还是多重跨膜,即可参照上述评价结果,选择合适的拓扑结构预测方法进行预测。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号