首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 618 毫秒
1.
We present BTMX (Beta barrel TransMembrane eXposure), a computational method to predict the exposure status (i.e. exposed to the bilayer or hidden in the protein structure) of transmembrane residues in transmembrane beta barrel proteins (TMBs). BTMX predicts the exposure status of known TM residues with an accuracy of 84.2% over 2,225 residues and provides a confidence score for all predictions. Predictions made are in concert with the fact that hydrophobic residues tend to be more exposed to the bilayer. The biological relevance of the input parameters is also discussed. The highest prediction accuracy is obtained when a sliding window comprising three residues with similar C(α)-C(β) vector orientations is employed. The prediction accuracy of the BTMX method on a separate unseen non-redundant test dataset is 78.1%. By employing out-pointing residues that are exposed to the bilayer, we have identified various physico-chemical properties that show statistically significant differences between the beta strands located at the oligomeric interfaces compared to the non-oligomeric strands. The BTMX web server generates colored, annotated snake-plots as part of the prediction results and is available under the BTMX tab at http://service.bioinformatik.uni-saarland.de/tmx-site/. Exposure status prediction of TMB residues may be useful in 3D structure prediction of TMBs.  相似文献   

2.
Transmembrane beta-barrel (TMB) proteins are embedded in the outer membrane of Gram-negative bacteria, mitochondria, and chloroplasts. The cellular location and functional diversity of beta-barrel outer membrane proteins (omps) makes them an important protein class. At the present time, very few nonhomologous TMB structures have been determined by X-ray diffraction because of the experimental difficulty encountered in crystallizing transmembrane proteins. A novel method using pairwise interstrand residue statistical potentials derived from globular (nonouter membrane) proteins is introduced to predict the supersecondary structure of transmembrane beta-barrel proteins. The algorithm transFold employs a generalized hidden Markov model (i.e., multitape S-attribute grammar) to describe potential beta-barrel supersecondary structures and then computes by dynamic programming the minimum free energy beta-barrel structure. Hence, the approach can be viewed as a "wrapping" component that may capture folding processes with an initiation stage followed by progressive interaction of the sequence with the already-formed motifs. This approach differs significantly from others, which use traditional machine learning to solve this problem, because it does not require a training phase on known TMB structures and is the first to explicitly capture and predict long-range interactions. TransFold outperforms previous programs for predicting TMBs on smaller (相似文献   

3.
MOTIVATION: Membrane proteins are an abundant and functionally relevant subset of proteins that putatively include from about 15 up to 30% of the proteome of organisms fully sequenced. These estimates are mainly computed on the basis of sequence comparison and membrane protein prediction. It is therefore urgent to develop methods capable of selecting membrane proteins especially in the case of outer membrane proteins, barely taken into consideration when proteome wide analysis is performed. This will also help protein annotation when no homologous sequence is found in the database. Outer membrane proteins solved so far at atomic resolution interact with the external membrane of bacteria with a characteristic beta barrel structure comprising different even numbers of beta strands (beta barrel membrane proteins). In this they differ from the membrane proteins of the cytoplasmic membrane endowed with alpha helix bundles (all alpha membrane proteins) and need specialised predictors. RESULTS: We develop a HMM model, which can predict the topology of beta barrel membrane proteins using, as input, evolutionary information. The model is cyclic with 6 types of states: two for the beta strand transmembrane core, one for the beta strand cap on either side of the membrane, one for the inner loop, one for the outer loop and one for the globular domain state in the middle of each loop. The development of a specific input for HMM based on multiple sequence alignment is novel. The accuracy per residue of the model is 83% when a jack knife procedure is adopted. With a model optimisation method using a dynamic programming algorithm seven topological models out of the twelve proteins included in the testing set are also correctly predicted. When used as a discriminator, the model is rather selective. At a fixed probability value, it retains 84% of a non-redundant set comprising 145 sequences of well-annotated outer membrane proteins. Concomitantly, it correctly rejects 90% of a set of globular proteins including about 1200 chains with low sequence identity (<30%) and 90% of a set of all alpha membrane proteins, including 188 chains.  相似文献   

4.
A method based on neural networks is trained and tested on a nonredundant set of beta-barrel membrane proteins known at atomic resolution with a jackknife procedure. The method predicts the topography of transmembrane beta strands with residue accuracy as high as 78% when evolutionary information is used as input to the network. Of the transmembrane beta-strands included in the training set, 93% are correctly assigned. The predictor includes an algorithm of model optimization, based on dynamic programming, that correctly models eight out of the 11 proteins present in the training/testing set. In addition, protein topology is assigned on the basis of the location of the longest loops in the models. We propose this as a general method to fill the gap of the prediction of beta-barrel membrane proteins.  相似文献   

5.
6.
Predicting transmembrane beta-barrels in proteomes   总被引:1,自引:0,他引:1  
Very few methods address the problem of predicting beta-barrel membrane proteins directly from sequence. One reason is that only very few high-resolution structures for transmembrane beta-barrel (TMB) proteins have been determined thus far. Here we introduced the design, statistics and results of a novel profile-based hidden Markov model for the prediction and discrimination of TMBs. The method carefully attempts to avoid over-fitting the sparse experimental data. While our model training and scoring procedures were very similar to a recently published work, the architecture and structure-based labelling were significantly different. In particular, we introduced a new definition of beta- hairpin motifs, explicit state modelling of transmembrane strands, and a log-odds whole-protein discrimination score. The resulting method reached an overall four-state (up-, down-strand, periplasmic-, outer-loop) accuracy as high as 86%. Furthermore, accurately discriminated TMB from non-TMB proteins (45% coverage at 100% accuracy). This high precision enabled the application to 72 entirely sequenced Gram-negative bacteria. We found over 164 previously uncharacterized TMB proteins at high confidence. Database searches did not implicate any of these proteins with membranes. We challenge that the vast majority of our 164 predictions will eventually be verified experimentally. All proteome predictions and the PROFtmb prediction method are available at http://www.rostlab.org/ services/PROFtmb/.  相似文献   

7.
In contrast to water-soluble proteins, membrane proteins reside in a heterogeneous environment, and their surfaces must interact with both polar and apolar membrane regions. As a consequence, the composition of membrane proteins' residues varies substantially between the membrane core and the interfacial regions. The amino acid compositions of helical membrane proteins are also known to be different on the cytoplasmic and extracellular sides of the membrane. Here we report that in the 16 transmembrane beta-barrel structures, the amino acid compositions of lipid-facing residues are different near the N and C termini of the individual strands. Polar amino acids are more prevalent near the C termini than near the N termini, and hydrophobic amino acids show the opposite trend. We suggest that this difference arises because it is easier for polar atoms to escape from the apolar regions of the bilayer at the C terminus of a beta-strand. This new characteristic of beta-barrel membrane proteins enhances our understanding of how a sequence encodes a membrane protein structure and should prove useful in identifying and predicting the structures of trans-membrane beta-barrels.  相似文献   

8.
MOTIVATION: Many important biological processes such as cell signaling, transport of membrane-impermeable molecules, cell-cell communication, cell recognition and cell adhesion are mediated by membrane proteins. Unfortunately, as these proteins are not water soluble, it is extremely hard to experimentally determine their structure. Therefore, improved methods for predicting the structure of these proteins are vital in biological research. In order to improve transmembrane topology prediction, we evaluate the combined use of both integrated signal peptide prediction and evolutionary information in a single algorithm. RESULTS: A new method (MEMSAT3) for predicting transmembrane protein topology from sequence profiles is described and benchmarked with full cross-validation on a standard data set of 184 transmembrane proteins. The method is found to predict both the correct topology and the locations of transmembrane segments for 80% of the test set. This compares with accuracies of 62-72% for other popular methods on the same benchmark. By using a second neural network specifically to discriminate transmembrane from globular proteins, a very low overall false positive rate (0.5%) can also be achieved in detecting transmembrane proteins. AVAILABILITY: An implementation of the described method is available both as a web server (http://www.psipred.net) and as downloadable source code from http://bioinf.cs.ucl.ac.uk/memsat. Both the server and source code files are free to non-commercial users. Benchmark and training data are also available from http://bioinf.cs.ucl.ac.uk/memsat.  相似文献   

9.
The unusually stable and multifunctional, thin aggregative fimbriae common to all Salmonella spp. are principally polymers of the fimbrin subunit, AgfA. AgfA of Salmonella enteritidis consists of two domains: a protease-sensitive, 22 amino acid residue N-terminal region and a protease-resistant, 109 residue C-terminal core. The unusual amino acid sequence of the AgfA core region comprises two-, five- and tenfold internal sequence homology patterns reflected in five conserved, 18-residue tandem repeats. These repeats have the consensus sequence, Sx5QxGx2NxAx3Q and are linked together by four or five residues, (x)xAx2. The predicted secondary structure for this unusual arrangement of tandem repeats in AgfA indicates mainly extended conformation with the beta strands linked by four to six residues. Candidate proteins of known structure with motifs of alternating beta strands and short loops were selected from folds described in SCOP as a source of coordinates for AgfA model construction. Three all-beta class motifs selected from the Serratia marcescens metalloprotease, myelin P2 protein or vitelline membrane outer protein I were used for initial AgfA homology build-up procedures ultimately resulting in three structural models; beta barrel, beta prism and parallel beta helix. The beta barrel model is a compact, albeit irregular structure, with the beta strands arranged in two antiparallel beta sheet faces. The beta prism model does not reflect the 5 or 10-fold symmetry of the AgfA primary sequence. However, the favored, parallel beta helix model is a compact coil of ten helically arranged beta strands forming two parallel beta sheet faces. This arrangement predicts a regular, potentially stable, C-terminal core region consistent with the observed tandem repeat sequences, protease-resistance and strong tendency of this fimbrin to oligomerize and aggregate. Positional conservation of amino acid residues in AgfA and the Escherichia coli AgfA homologue, CsgA, provides strong support for this model. The parallel beta helix model of AgfA offers an interesting solution to a multifunctional fimbrin molecular surface having solvent exposed areas, regions for major and minor subunit interactions as well as fiber-fiber interactions common to many bacterial fimbriae.  相似文献   

10.
Prediction of transmembrane spans and secondary structure from the protein sequence is generally the first step in the structural characterization of (membrane) proteins. Preference of a stretch of amino acids in a protein to form secondary structure and being placed in the membrane are correlated. Nevertheless, current methods predict either secondary structure or individual transmembrane states. We introduce a method that simultaneously predicts the secondary structure and transmembrane spans from the protein sequence. This approach not only eliminates the necessity to create a consensus prediction from possibly contradicting outputs of several predictors but bears the potential to predict conformational switches, i.e., sequence regions that have a high probability to change for example from a coil conformation in solution to an α‐helical transmembrane state. An artificial neural network was trained on databases of 177 membrane proteins and 6048 soluble proteins. The output is a 3 × 3 dimensional probability matrix for each residue in the sequence that combines three secondary structure types (helix, strand, coil) and three environment types (membrane core, interface, solution). The prediction accuracies are 70.3% for nine possible states, 73.2% for three‐state secondary structure prediction, and 94.8% for three‐state transmembrane span prediction. These accuracies are comparable to state‐of‐the‐art predictors of secondary structure (e.g., Psipred) or transmembrane placement (e.g., OCTOPUS). The method is available as web server and for download at www.meilerlab.org . Proteins 2013; 81:1127–1140. © 2013 Wiley Periodicals, Inc.  相似文献   

11.
waveTM is a web tool for the prediction of transmembrane segments in alpha-helical membrane proteins. Prediction is performed by a dynamic programming algorithm on wavelet-denoised 'hydropathy' signals. Users submit a protein sequence and receive interactively the results. Topology prediction can also be obtained in conjunction with the algorithm OrienTM. A web server that implements the waveTM algorithm is freely available at http://bioinformatics.biol.uoa.gr/waveTM.  相似文献   

12.
Signal peptides and transmembrane helices both contain a stretch of hydrophobic amino acids. This common feature makes it difficult for signal peptide and transmembrane helix predictors to correctly assign identity to stretches of hydrophobic residues near the N-terminal methionine of a protein sequence. The inability to reliably distinguish between N-terminal transmembrane helix and signal peptide is an error with serious consequences for the prediction of protein secretory status or transmembrane topology. In this study, we report a new method for differentiating protein N-terminal signal peptides and transmembrane helices. Based on the sequence features extracted from hydrophobic regions (amino acid frequency, hydrophobicity, and the start position), we set up discriminant functions and examined them on non-redundant datasets with jackknife tests. This method can incorporate other signal peptide prediction methods and achieve higher prediction accuracy. For Gram-negative bacterial proteins, 95.7% of N-terminal signal peptides and transmembrane helices can be correctly predicted (coefficient 0.90). Given a sensitivity of 90%, transmembrane helices can be identified from signal peptides with a precision of 99% (coefficient 0.92). For eukaryotic proteins, 94.2% of N-terminal signal peptides and transmembrane helices can be correctly predicted with coefficient 0.83. Given a sensitivity of 90%, transmembrane helices can be identified from signal peptides with a precision of 87% (coefficient 0.85). The method can be used to complement current transmembrane protein prediction and signal peptide prediction methods to improve their prediction accuracies.  相似文献   

13.
The prediction of a protein's structure from its amino acid sequence has been a long-standing goal of molecular biology. In this work, a new set of conformational parameters for membrane spanning alpha helices was developed using the information from the topology of 70 membrane proteins. Based on these conformational parameters, a simple algorithm has been formulated to predict the transmembrane alpha helices in membrane proteins. A FORTRAN program has been developed which takes the amino acid sequence as input and gives the predicted transmembrane alpha-helices as output. The present method correctly identifies 295 transmembrane helical segments in 70 membrane proteins with only two overpredictions. Furthermore, this method predicts all 45 transmembrane helices in the photosynthetic reaction center, bacteriorhodopsin and cytochrome c oxidase to an 86% level of accuracy and so is better than all other methods published to date.  相似文献   

14.
The HMMTOP transmembrane topology prediction server   总被引:22,自引:0,他引:22  
The HMMTOP transmembrane topology prediction server predicts both the localization of helical transmembrane segments and the topology of transmembrane proteins. Recently, several improvements have been introduced to the original method. Now, the user is allowed to submit additional information about segment localization to enhance the prediction power. This option improves the prediction accuracy as well as helps the interpretation of experimental results, i.e. in epitope insertion experiments. Availability: HMMTOP 2.0 is freely available to non-commercial users at http://www.enzim.hu/hmmtop. Source code is also available upon request to academic users.  相似文献   

15.
Kaur H  Raghava GP 《FEBS letters》2004,564(1-2):47-57
In this study, an attempt has been made to develop a neural network-based method for predicting segments in proteins containing aromatic-backbone NH (Ar-NH) interactions using multiple sequence alignment. We have analyzed 3121 segments seven residues long containing Ar-NH interactions, extracted from 2298 non-redundant protein structures where no two proteins have more than 25% sequence identity. Two consecutive feed-forward neural networks with a single hidden layer have been trained with standard back-propagation as learning algorithm. The performance of the method improves from 0.12 to 0.15 in terms of Matthews correlation coefficient (MCC) value when evolutionary information (multiple alignment obtained from PSI-BLAST) is used as input instead of a single sequence. The performance of the method further improves from MCC 0.15 to 0.20 when secondary structure information predicted by PSIPRED is incorporated in the prediction. The final network yields an overall prediction accuracy of 70.1% and an MCC of 0.20 when tested by five-fold cross-validation. Overall the performance is 15.2% higher than the random prediction. The method consists of two neural networks: (i) a sequence-to-structure network which predicts the aromatic residues involved in Ar-NH interaction from multiple alignment of protein sequences and (ii) a structure-to structure network where the input consists of the output obtained from the first network and predicted secondary structure. Further, the actual position of the donor residue within the 'potential' predicted fragment has been predicted using a separate sequence-to-structure neural network. Based on the present study, a server Ar_NHPred has been developed which predicts Ar-NH interaction in a given amino acid sequence. The web server Ar_NHPred is available at and (mirror site).  相似文献   

16.
The selective breakdown of newly synthesized proteins retained within the endoplasmic reticulum (ER) is probably mediated by the specific recognition of structural features of protein substrates by components of a degradative system. Within the alpha chain of the multisubunit T-cell antigen receptor (TCR) complex, a transmembrane sequence containing two basic amino acid residues has been shown to act as a determinant for retention and rapid degradation in the ER. We now demonstrate that single basic or acidic amino acid residues can cause targeting for retention and degradation in the ER when placed within the transmembrane domain of an integral membrane protein normally destined for the cell surface. The effect of such potentially charged residues is dependent on their relative position within the transmembrane sequence and on the nature of the amino acid side chains. The phenotypic changes induced by potentially charged transmembrane residues occur without apparent alterations of the global folding or transmembrane topology of the mutant proteins. These observations test the hypothesis that potentially charged residues within transmembrane domains can provide the basis for a motif for ER degradation and explain the selective breakdown of some proteins retained within the ER.  相似文献   

17.
Modeling of integral membrane proteins and the prediction of their functional sites requires the identification of transmembrane (TM) segments and the determination of their angular orientations. Hydrophobicity scales predict accurately the location of TM helices, but are less accurate in computing angular disposition. Estimating lipid-exposure propensities of the residues from statistics of solved membrane protein structures has the disadvantage of relying on relatively few proteins. As an alternative, we propose here a scale of knowledge-based Propensities for Residue Orientation in Transmembrane segments (kPROT), derived from the analysis of more than 5000 non-redundant protein sequences. We assume that residues that tend to be exposed to the membrane are more frequent in TM segments of single-span proteins, while residues that prefer to be buried in the transmembrane bundle interior are present mainly in multi-span TMs. The kPROT value for each residue is thus defined as the logarithm of the ratio of its proportions in single and multiple TM spans. The scale is refined further by defining it for three discrete sections of the TM segment; namely, extracellular, central, and intracellular. The capacity of the kPROT scale to predict angular helical orientation was compared to that of alternative methods in a benchmark test, using a diversity of multi-span alpha-helical transmembrane proteins with a solved 3D structure. kPROT yielded an average angular error of 41 degrees, significantly lower than that of alternative scales (62 degrees -68 degrees ). The new scale thus provides a useful general tool for modeling and prediction of functional residues in membrane proteins. A WWW server (http://bioinfo.weizmann.ac.il/kPROT) is available for automatic helix orientation prediction with kPROT.  相似文献   

18.
MOTIVATION: Membrane domain prediction has recently been re-evaluated by several groups, suggesting that the accuracy of existing methods is still rather limited. In this work, we revisit this problem and propose novel methods for prediction of alpha-helical as well as beta-sheet transmembrane (TM) domains. The new approach is based on a compact representation of an amino acid residue and its environment, which consists of predicted solvent accessibility and secondary structure of each amino acid. A recently introduced method for solvent accessibility prediction trained on a set of soluble proteins is used here to indicate segments of residues that are predicted not to be accessible to water and, therefore, may be 'buried' in the membrane. While evolutionary profiles in the form of a multiple alignment are used to derive these simple 'structural profiles', they are not used explicitly for the membrane domain prediction and the overall number of parameters in the model is significantly reduced. This offers the possibility of a more reliable estimation of the free parameters in the model with a limited number of experimentally resolved membrane protein structures. RESULTS: Using cross-validated training on available sets of structurally resolved and non-redundant alpha and beta membrane proteins, we demonstrate that membrane domain prediction methods based on such a compact representation outperform approaches that utilize explicitly evolutionary profiles and multiple alignments. Moreover, using an external evaluation by the TMH Benchmark server we show that our final prediction protocol for the TM helix prediction is competitive with the state-of-the-art methods, achieving per-residue accuracy of approximately 89% and per-segment accuracy of approximately 80% on the set of high resolution structures used by the TMH Benchmark server. At the same time the observed rates of confusion with signal peptides and globular proteins are the lowest among the tested methods. The new method is available online at http://minnou.cchmc.org.  相似文献   

19.
CHIP28 is a 28-kD hydrophobic integral membrane protein that functions as a water channel in erythrocytes and renal tubule epithelial cell membranes. We examined the transmembrane topology of CHIP28 in the ER by engineering a reporter of translocation (derived from bovine prolactin) into nine sequential sites in the CHIP28 coding region. The resulting chimeras were expressed in Xenopus oocytes, and the topology of the reporter with respect to the ER membrane was determined by protease sensitivity. We found that although hydropathy analysis predicted up to seven potential transmembrane regions, CHIP28 spanned the membrane only four times. Two putative transmembrane helices, residues 52-68 and 143-157, reside on the lumenal and cytosolic surfaces of the ER membrane, respectively. Topology derived from these chimeric proteins was supported by cell-free translation of five truncated CHIP28 cDNAs, by N-linked glycosylation at an engineered consensus site in native CHIP28 (residue His69), and by epitope tagging of the CHIP28 amino terminus. Defined protein chimeras were used to identify internal sequences that direct events of CHIP28 topogenesis. A signal sequence located within the first 52 residues initiated nascent chain translocation into the ER lumen. A stop transfer sequence located in the hydrophobic region from residues 90-120 terminated ongoing translocation. A second internal signal sequence, residues 155-186, reinitiated translocation of a COOH-terminal domain (residues 186-210) into the ER lumen. Integration of the nascent chain into the ER membrane occurred after synthesis of 107 residues and required the presence of two membrane-spanning regions. From this data, we propose a structural model for CHIP28 at the ER membrane in which four membrane- spanning alpha-helices form a central aqueous channel through the lipid bilayer and create a pathway for water transport.  相似文献   

20.
MOTIVATION: The experimental difficulties of alpha-helical transmembrane protein structure determination make this class of protein an important target for sequence-based structure prediction tools. The MEMPACK prediction server allows users to submit a transmembrane protein sequence and returns transmembrane topology, lipid exposure, residue contacts, helix-helix interactions and helical packing arrangement predictions in both plain text and graphical formats using a number of novel machine learning-based algorithms. AVAILABILITY: The server can be accessed as a new component of the PSIPRED portal by at http://bioinf.cs.ucl.ac.uk/psipred/.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号