首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 23 毫秒
1.
Transmembrane helices are the most readily predictable secondary structure components of proteins. They can be predicted to a high degree of accuracy in a variety of ways. Many of these methods compare new sequence data with the sequence characteristics of known transmembrane domains. However, the known transmembrane sequences are not necessarily representative of a particular organism. We attempt to demonstrate that parameters optimized for the known transmembrane domains are far from optimal when predicting transmembrane regions in a given genome. In particular, we have tested the effect of nucleotide bias upon the composition and hence the prediction characteristics of transmembrane helices. Our analysis shows that nucleotide bias of a genome has a strong and predictable influence upon the occurrences of several of the most important hydrophobic amino acids found within transmembrane helices. Thus, we show that nucleotide bias should be taken into account when determining putative transmembrane domains from sequence data.  相似文献   

2.
In contrast to water-soluble proteins, membrane proteins reside in a heterogeneous environment, and their surfaces must interact with both polar and apolar membrane regions. As a consequence, the composition of membrane proteins' residues varies substantially between the membrane core and the interfacial regions. The amino acid compositions of helical membrane proteins are also known to be different on the cytoplasmic and extracellular sides of the membrane. Here we report that in the 16 transmembrane beta-barrel structures, the amino acid compositions of lipid-facing residues are different near the N and C termini of the individual strands. Polar amino acids are more prevalent near the C termini than near the N termini, and hydrophobic amino acids show the opposite trend. We suggest that this difference arises because it is easier for polar atoms to escape from the apolar regions of the bilayer at the C terminus of a beta-strand. This new characteristic of beta-barrel membrane proteins enhances our understanding of how a sequence encodes a membrane protein structure and should prove useful in identifying and predicting the structures of trans-membrane beta-barrels.  相似文献   

3.
Proline-induced distortions of transmembrane helices   总被引:14,自引:0,他引:14  
Proline residues in the transmembrane (TM) alpha-helices of integral membrane proteins have long been suspected to play a key role for helix packing and signal transduction by inducing regions of helix distortion and/or dynamic flexibility (hinges). In this study we try to characterise the effect of proline on the geometric properties of TM alpha-helices. We have examined 199 transmembrane alpha-helices from polytopic membrane proteins of known structure. After examining the location of proline residues within the amino acid sequences of TM helices, we estimated the helix axes either side of a hinge and hence identified a hinge residue. This enabled us to calculate helix kink and swivel angles. The results of this analysis show that proline residues occur with a significant concentration in the centre of sequences of TM alpha-helices. In this location, they may induce formation of molecular hinges, located on average about four residues N-terminal to the proline residue. A superposition of proline-containing TM helices structures shows that the distortion induced is anisotropic and favours certain relative orientations (defined by helix kink and swivel angles) of the two helix segments.  相似文献   

4.
Transmembrane proteins (TMPs) are important drug targets because they are essential for signaling, regulation, and transport. Despite important breakthroughs, experimental structure determination remains challenging for TMPs. Various methods have bridged the gap by predicting transmembrane helices (TMHs), but room for improvement remains. Here, we present TMSEG, a novel method identifying TMPs and accurately predicting their TMHs and their topology. The method combines machine learning with empirical filters. Testing it on a non‐redundant dataset of 41 TMPs and 285 soluble proteins, and applying strict performance measures, TMSEG outperformed the state‐of‐the‐art in our hands. TMSEG correctly distinguished helical TMPs from other proteins with a sensitivity of 98 ± 2% and a false positive rate as low as 3 ± 1%. Individual TMHs were predicted with a precision of 87 ± 3% and recall of 84 ± 3%. Furthermore, in 63 ± 6% of helical TMPs the placement of all TMHs and their inside/outside topology was correctly predicted. There are two main features that distinguish TMSEG from other methods. First, the errors in finding all helical TMPs in an organism are significantly reduced. For example, in human this leads to 200 and 1600 fewer misclassifications compared to the second and third best method available, and 4400 fewer mistakes than by a simple hydrophobicity‐based method. Second, TMSEG provides an add‐on improvement for any existing method to benefit from. Proteins 2016; 84:1706–1716. © 2016 Wiley Periodicals, Inc.  相似文献   

5.
Structures of N-termini of helices in proteins.   总被引:1,自引:7,他引:1       下载免费PDF全文
We have surveyed 393 N-termini of alpha-helices and 156 N-termini of 3(10)-helices in 85 high resolution, non-homologous protein crystal structures for N-cap side-chain rotamer preferences, hydrogen bonding patterns, and solvent accessibilities. We find very strong rotamer preferences that are unique to N-cap sites. The following rules are generally observed for N-capping in alpha-helices: Thr and Ser N-cap side chains adopt the gauche - rotamer, hydrogen bond to the N3 NH and have psi restricted to 164 +/- 8 degrees. Asp and Asn N-cap side chains either adopt the gauche - rotamer and hydrogen bond to the N3 NH with psi = 172 +/- 10 degrees, or adopt the trans rotamer and hydrogen bond to both the N2 and N3 NH groups with psi = 1-7 +/- 19 degrees. With all other N-caps, the side chain is found in the gauche + rotamer so that the side chain does not interact unfavorably with the N-terminus by blocking solvation and psi is unrestricted. An i, i + 3 hydrogen bond from N3 NH to the N-cap backbone C = O in more likely to form at the N-terminus when an unfavorable N-cap is present. In the 3(10)-helix Asn and Asp remain favorable N-caps as they can hydrogen bond to the N2 NH while in the trans rotamer; in contrast, Ser and Thr are disfavored as their preferred hydrogen bonding partner (N3 NH) is inaccessible. This suggests that Ser is the optimum choice of N-cap when alpha-helix formation is to be encouraged while 3(10)-helix formation discouraged. The strong energetic and structural preferences found for N-caps, which differ greatly from positions within helix interiors, suggest that N-caps should be treated explicitly in any consideration of helical structure in peptides or proteins.  相似文献   

6.
Zpred2 is an improved version of ZPRED, a predictor for the Z-coordinates of alpha-helical membrane proteins, that is, the distance of the residues from the center of the membrane. Using principal component analysis and a set of neural networks, Zpred2 analyzes data extracted from the amino acid sequence, the predicted topology, and evolutionary profiles. Zpred2 achieves an average accuracy error of 2.18 A (2.17 A when an independent test set is used), an improvement by 15% compared to the previous version. We show that this accuracy is sufficient to enable the predictions of helix lengths with a correlation coefficient of 0.41. As a comparison, two state-of-the-art HMM-based topology prediction methods manage to predict the helix lengths with a correlation coefficient of less than 0.1. In addition, we applied Zpred2 to two other problems, the re-entrant region identification and model validation. Re-entrants were able to be detected with a certain consistency, but not better than with previous approaches, while incorrect models as well as mispredicted helices of transmembrane proteins could be distinguished based on the Z-coordinate predictions.  相似文献   

7.
Known sequence motifs containing key glycine residues can drive the homo-oligomerization of transmembrane helices. To find other motifs, a randomized library of transmembrane interfaces was generated in which glycine was omitted. The TOXCAT system, which measures transmembrane helix association in the Escherichia coli inner membrane, was used to select high-affinity homo-oligomerizing sequences in this library. The two most frequently occurring motifs were SxxSSxxT and SxxxSSxxT. Isosteric mutations of any one of the serine and threonine residues to non-polar residues abolished oligomerization, indicating that the interaction between these positions is specific and requires an extended motif of serine and threonine hydroxyl groups. Computational modeling of these sequences produced several chemically plausible structures that contain multiple hydrogen bonds between the serine and threonine residues. While single serine or threonine side-chains do not appear to promote helix association, motifs can drive strong and specific association through a cooperative network of interhelical hydrogen bonds.  相似文献   

8.
We report a comprehensive analysis of the numbers, lengths and amino acid compositions of transmembrane helices in 235 high-resolution structures of integral membrane proteins. The properties of 1551 transmembrane helices in the structures were compared with those obtained by analysis of the same amino acid sequences using topology prediction tools. Explanations for the 81 (5.2%) missing or additional transmembrane helices in the prediction results were identified. Main reasons for missing transmembrane helices were mis-identification of N-terminal signal peptides, breaks in α-helix conformation or charged residues in the middle of transmembrane helices and transmembrane helices with unusual amino acid composition. The main reason for additional transmembrane helices was mis-identification of amphipathic helices, extramembrane helices or hairpin re-entrant loops. Transmembrane helix length had an overall median of 24 residues and an average of 24.9 ± 7.0 residues and the most common length was 23 residues. The overall content of residues in transmembrane helices as a percentage of the full proteins had a median of 56.8% and an average of 55.7 ± 16.0%. Amino acid composition was analysed for the full proteins, transmembrane helices and extramembrane regions. Individual proteins or types of proteins with transmembrane helices containing extremes in contents of individual amino acids or combinations of amino acids with similar physicochemical properties were identified and linked to structure and/or function. In addition to overall median and average values, all results were analysed for proteins originating from different types of organism (prokaryotic, eukaryotic, viral) and for subgroups of receptors, channels, transporters and others.  相似文献   

9.
We analyzed the nucleotide contents of several completely sequenced genomes, and we show that nucleotide bias can have a dramatic effect on the amino acid composition of the encoded proteins. By surveying the genes in 21 completely sequenced eubacterial and archaeal genomes, along with the entire Saccharomyces cerevisiae genome and two Plasmodium falciparum chromosomes, we show that biased DNA encodes biased proteins on a genomewide scale. The predicted bias affects virtually all genes within the genome, and it could be clearly seen even when we limited the analysis to sets of homologous gene sequences. Parallel patterns of compositional bias were found within the archaea and the eubacteria. We also found a positive correlation between the degree of amino acid bias and the magnitude of protein sequence divergence. We conclude that mutational bias can have a major effect on the molecular evolution of proteins. These results could have important implications for the interpretation of protein-based molecular phylogenies and for the inference of functional protein adaptation from comparative sequence data.  相似文献   

10.
Shelden MC  Loughlin P  Tierney ML  Howitt SM 《Biochemistry》2003,42(44):12941-12949
The aim of this study was to identify charged amino acid residues important for activity of the sulfate transporter SHST1. We mutated 10 charged amino acids in or near proposed transmembrane helices and expressed the resulting mutants in a sulfate transport-deficient yeast strain. Mutations affecting four residues resulted in a complete loss of sulfate transport; these residues were D107 and D122 in helix 1 and R354 and E366 in helix 8. All other mutants showed some reduction in transport activity. The E366Q mutant was unusual in that expression of the mutant protein was toxic to yeast cells. The R354Q mutant showed reduced trafficking to the plasma membrane, indicating that the protein was misfolded. However, transporter function (to a low level) and wild-type trafficking could be recovered by combining the R354Q mutation with either the E175Q or E270Q mutations. This suggested that R354 interacts with both E175 and E270. The triple mutant E175Q/E270Q/R354Q retained only marginal sulfate transport activity but was trafficked at wild-type levels, suggesting that a charge network between these three residues may be involved in the transport pathway, rather than in folding. D107 was also found to be essential for the ion transport pathway and may form a charge pair with R154, both of which are highly conserved. The information obtained on interactions between charged residues provides the first evidence for the possible spatial arrangement of transmembrane helices within any member of this transporter family. This information is used to develop a model for SHST1 tertiary structure.  相似文献   

11.
The protein-protein interaction energy of 12 nonhomologous serine protease-inhibitor and 15 antibody-antigen complexes is calculated using a molecular mechanics formalism and dissected in terms of the main-chain vs. side-chain contribution, nonrotameric side-chain contributions, and amino acid residue type involvement in the interface interaction. There are major differences in the interactions of the two types of protein-protein complex. Protease-inhibitor complexes interact predominantly through a main-chain-main-chain mechanism while antibody-antigen complexes interact predominantly through a side-chain-side-chain or a side-chain-main-chain mechanism. However, there is no simple correlation between the main-chain-main-chain interaction energy and the percentage of main-chain surface area buried on binding. The interaction energy is equally effected by the presence of nonrotameric side-chain conformations, which constitute approximately 20% of the interaction energy. The ability to reproduce the interface interaction energy of the crystal structure if original side-chain conformations are removed from the calculation is much greater in the protease-inhibitor complexes than the antibody-antigen complexes. The success of a rotameric model for protein-protein docking appears dependent on the extent of the main-chain-main-chain contribution to binding. Analysis of (1) residue type and (2) residue pair interactions at the interface show that antibody-antigen interactions are very restricted with over 70% of the antibody energy attributable to just six residue types (Tyr > Asp > Asn > Ser > Glu > Trp) in agreement with previous studies on residue propensity. However, it is found here that 50% of the antigen energy is attributable to just four residue types (Arg = Lys > Asn > Asp). On average just 12 residue pair interactions (6%) contribute over 40% of the favorable interaction energy in the antibody-antigen complexes, with charge-charge and charge/polar-tyrosine interactions being prominent. In contrast protease inhibitors use a diverse set of residue types and residue pair interactions.  相似文献   

12.
Ashish Shelar  Manju Bansal 《Proteins》2014,82(12):3420-3436
α‐helices are amongst the most common secondary structural elements seen in membrane proteins and are packed in the form of helix bundles. These α‐helices encounter varying external environments (hydrophobic, hydrophilic) that may influence the sequence preferences at their N and C‐termini. The role of the external environment in stabilization of the helix termini in membrane proteins is still unknown. Here we analyze α‐helices in a high‐resolution dataset of integral α‐helical membrane proteins and establish that their sequence and conformational preferences differ from those in globular proteins. We specifically examine these preferences at the N and C‐termini in helices initiating/terminating inside the membrane core as well as in linkers connecting these transmembrane helices. We find that the sequence preferences and structural motifs at capping (Ncap and Ccap) and near‐helical (N' and C') positions are influenced by a combination of features including the membrane environment and the innate helix initiation and termination property of residues forming structural motifs. We also find that a large number of helix termini which do not form any particular capping motif are stabilized by formation of hydrogen bonds and hydrophobic interactions contributed from the neighboring helices in the membrane protein. We further validate the sequence preferences obtained from our analysis with data from an ultradeep sequencing study that identifies evolutionarily conserved amino acids in the rat neurotensin receptor. The results from our analysis provide insights for the secondary structure prediction, modeling and design of membrane proteins. Proteins 2014; 82:3420–3436. © 2014 Wiley Periodicals, Inc.  相似文献   

13.
Cells have developed an incredible machinery to facilitate the insertion of membrane proteins into the membrane. While we have a fairly good understanding of the mechanism and determinants of membrane integration, more data is needed to understand the insertion of membrane proteins with more complex insertion and folding pathways. This review will focus on marginally hydrophobic transmembrane helices and their influence on membrane protein folding. These weakly hydrophobic transmembrane segments are by themselves not recognized by the translocon and therefore rely on local sequence context for membrane integration. How can such segments reside within the membrane? We will discuss this in the light of features found in the protein itself as well as the environment it resides in. Several characteristics in proteins have been described to influence the insertion of marginally hydrophobic helices. Additionally, the influence of biological membranes is significant. To begin with, the actual cost for having polar groups within the membrane may not be as high as expected; the presence of proteins in the membrane as well as characteristics of some amino acids may enable a transmembrane helix to harbor a charged residue. The lipid environment has also been shown to directly influence the topology as well as membrane boundaries of transmembrane helices—implying a dynamic relationship between membrane proteins and their environment.  相似文献   

14.
Local structures in denatured proteins may be important in guiding a polypeptide chain during the folding and misfolding processes. Existence of local structures in chemically denatured proteins is a highly controversial issue. NMR parameters [coupling constants (3) J(H(alpha),H(N)) and chemical shifts] of chemically denatured proteins in general deviate little from their values in small peptides. These peptides were presumed to be completely unstructured; therefore, it was considered that chemically denatured proteins are random coils. But recent experimental studies show that small peptides adopt relatively stable structures in aqueous solutions. Small deviations of the NMR parameters from their values in small peptides may thus actually indicate the existence of local structures in chemically denatured proteins. Using NMR data and theoretical predictions we show here that fluctuating beta-strands exist in urea-denatured ubiquitin (8 M urea at pH 2). Residues in such beta-strands populate more frequently the left side of the broad beta region of -psi space. Urea-denatured ubiquitin contains no detectable beta-sheet secondary structures; nevertheless, the fluctuating beta-strands in urea-denatured ubiquitin coincide to the beta-strands in the native state. Formation of beta-strands is in accord with the electrostatic screening model of unfolded proteins. The free energy of a residue in an unfolded protein is in this model determined by the local backbone electrostatics and its screening by backbone solvation. These energy terms introduce strong electrostatic coupling between neighboring residues, which causes cooperative formation of beta-strands in denatured proteins. We propose that fluctuating beta-strands in denatured proteins may serve as initiation sites to form fibrils.  相似文献   

15.
Given the known high-resolution structures of alpha-helical transmembrane domains, we show that there are statistically distinct classes of transmembrane interfaces which relate to the folding and oligomerization of transmembrane domains. Distinct types of interfaces have been categorized and refer to those between: the same polypeptide chain, different polypeptide chains, helices that are sequential neighbors, and those that are nonsequential. These different interfaces may reflect different phases in the mechanism of transmembrane domain folding and are consistent with the current experimental evidence pertaining to the folding and oligomerization of transmembrane domains. The classes of helix-helix interfaces have been identified in terms of the numbers and different types of pairwise amino acid interactions. The specific measures used are interaction entropy, the information content of interacting partners compared to a random set of contacts, the amino acid composition of the classes and the abundances of specific amino acid pairs in close contact. Knowledge of the clear differences in the types of helix-helix contacts helps with the derivation of knowledge-based constraints which until now have focused on only the interiors of transmembrane domains as compared to the exterior. Taken together, an in vivo model for membrane protein folding is presented, which is distinct from the familiar two-stage model. The model takes into account the different interfaces of membrane helices defined herein, and the available data regarding folding in the translocation channel.  相似文献   

16.
We have developed an empirical residue-based potential (E(z) potential) for protein insertion in lipid membranes. Propensities for occurrence as a function of depth in the bilayer were calculated for the individual amino acid types from their distribution in known structures of helical membrane proteins. The propensities were then fit to continuous curves and converted to a potential using a reverse-Boltzman relationship. The E(z) potential demonstrated a good correlation with experimental data such as amino acid transfer free energy scales (water to membrane center and water to interface), and it incorporates transmembrane helices of varying composition in the membrane with trends similar to those obtained with translocon-mediated insertion experiments. The potential has a variety of applications in the analysis of natural membrane proteins as well as in the design of new ones. It can help in calculating the propensity of single helices to insert in the bilayer and estimate their tilt angle with respect to the bilayer normal. It can be utilized to discriminate amphiphilic helices that assume a parallel orientation at the membrane interface, such as those of membrane-active peptides. In membrane protein design applications, the potential allows an environment-dependent selection of amino acid identities.  相似文献   

17.
The outer membrane proteins (OMPs) are β-barrel membrane proteins that performed lots of biology functions. The discriminating OMPs from other non-OMPs is a very important task for understanding some biochemical process. In this study, a method that combines increment of diversity with modified Mahalanobis Discriminant, called IDQD, is presented to predict 208 OMPs, 206 transmembrane helical proteins (TMHPs) and 673 globular proteins (GPs) by using Chou's pseudo amino acid compositions as parameters. The overall accuracy of jackknife cross-validation is 93.2% and 96.1%, respectively, for three datasets (OMPs, TMHPs and GPs) and two datasets (OMPs and non-OMPs). These predicted results suggest that the method can be effectively applied to discriminate OMPs, TMHPs and GPs. And it also indicates that the pseudo amino acid composition can better reflect the core feature of membrane proteins than the classical amino acid composition.  相似文献   

18.
The evolution of protein folds is under strong constraints from their surrounding environment. Although folding in water‐soluble proteins is driven primarily by hydrophobic forces, the nature of the forces that determine the folding and stability of transmembrane proteins are still not fully understood. Furthermore, the chemically heterogeneous lipid bilayer has a non‐uniform effect on protein structure. In this article, we attempt to get an insight into the nature of this effect by examining the impact of various types of local structure environment on amino acid substitution, based on alignments of high‐resolution structures of polytopic helical transmembrane proteins combined with sequences of close homologs. Compared to globular proteins, burying amino acid sidechains, especially hydrophilic ones, led to a lower increase in conservation in both the lipid‐water interface region and the hydrocarbon core region. This observation is due to surface residues in HTM proteins especially in the HC region being relatively highly conserved, suggesting higher evolutionary constraints from their specific interactions with the surrounding lipid molecules. Polar and small residues, particularly Pro and Gly, show a noticeable increase in conservation as they are positioned more towards the centre of the membrane, which is consistent with their recognized key roles in structural stability. In addition, the examination of hydrogen bonds in the membrane environment identified some exposed hydrophilic residues being better conserved when not hydrogen‐bonded to other residues, supporting the importance of lipid‐protein sidechain interactions. The conclusions presented in this study highlight the distinct features of substitution matrices that take into account the membrane environment, and their potential role in improving sequence‐structure alignments of transmembrane proteins. Proteins 2010; © 2010 Wiley‐Liss, Inc.  相似文献   

19.
The Alacoil is an antiparallel (rather than the usual parallel) coiled-coil of α-helices with Ala or another small residue in every seventh position, allowing a very close spacing of the helices (7.5–8.5 Å between local helix axes), often over four or five helical turns. It occurs in two distinct types that differ by which position of the heptad repeat is occupied by Ala and by whether the closest points on the backbone of the two helices are aligned or are offset by half a turn. The aligned, or ROP, type has Ala in position “d” of the heptad repeat, which occupies the “tip-to-tip” side of the helix contact where the Cα–Cβ bonds point toward each other. The more common offset, or ferritin, type of Alacoil has Ala in position “a” of the heptad repeat (where the Cα-Cβ bonds lie back-to-back, on the “knuckle-touch” side of the helix contact), and the backbones of the two helices are offset vertically by half a turn. In both forms, successive layers of contact have the Ala first on one and then on the other helix. The Alacoil structure has much in common with the coiled-coils of fibrous proteins or leucine zippers: both are α-helical coiled-coils, with a critical amino acid repeated every seven residues (the Leu or the Ala) and a secondary contact position in between. However, Leu zippers are between aligned, parallel helices (often identical, in dimers), whereas Alacoils are between antiparallel helices, usually offset, and much closer together. The Alacoil, then, could be considered as an “Ala anti-zipper.” Leu zippers have a classic “knobs-into-holes” packing of the Leu side chain into a diamond of four residues on the opposite helix; for Alacoils, the helices are so close together that the Ala methyl group must choose one side of the diamond and pack inside a triangle of residues on the other helix. We have used the ferritin-type Alacoil as the basis for the de novo design of a 66-residue, coiled helix hairpin called “Alacoilin.” Its sequence is: cmSP DQWDKE A AQYDAHA QE FEKKS HRNng TPEA DQYRHM A SQY QAMA QK LKAIA NQLKK Gseter (with “a” heptad positions underlined and nonhelical parts in lowercase), which we will produce and test for both stability and uniqueness of structure.  相似文献   

20.
The amino acid compositions of proteins from halophilic archaea were compared with those from non-halophilic mesophiles and thermophiles, in terms of the protein surface and interior, on a genome-wide scale. As we previously reported for proteins from thermophiles, a biased amino acid composition also exists in halophiles, in which an abundance of acidic residues was found on the protein surface as compared to the interior. This general feature did not seem to depend on the individual protein structures, but was applicable to all proteins encoded within the entire genome. Unique protein surface compositions are common in both halophiles and thermophiles. Statistical tests have shown that significant surface compositional differences exist among halophiles, non-halophiles, and thermophiles, while the interior composition within each of the three types of organisms does not significantly differ. Although thermophilic proteins have an almost equal abundance of both acidic and basic residues, a large excess of acidic residues in halophilic proteins seems to be compensated by fewer basic residues. Aspartic acid, lysine, asparagine, alanine, and threonine significantly contributed to the compositional differences of halophiles from meso- and thermophiles. Among them, however, only aspartic acid deviated largely from the expected amount estimated from the dinucleotide composition of the genomic DNA sequence of the halophile, which has an extremely high G+C content (68%). Thus, the other residues with large deviations (Lys, Ala, etc.) from their non-halophilic frequencies could have arisen merely as "dragging effects" caused by the compositional shift of the DNA, which would have changed to increase principally the fraction of aspartic acid alone.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号