首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 453 毫秒
1.
The point of view that a uniquely folded protein tertiary structure is required for the protein functioning has been prevailing in the literature quite recently. However of lately it has been found that many proteins in a cell have no such structure in an isolated state, though they have a well-defined function in physiological conditions. These proteins were named as proteins with natural or internal disorder. The portion of disordered regions in such proteins may vary from a sequence of several amino acids to a completely disordered sequence containing from tens to hundreds of amino acids. The main difference of these proteins from the structured (globular) ones is that they have no unique tertiary structure in an isolated state and acquire it after interaction with their partners. Their conformation in such a complex depends on the interacting partner and not only on their own amino acid sequence, which is specific for structured (globular) proteins. The problem of structural and functional relations in the structured proteins and proteins with internal disorder is discussed in this review. The complexity of the problem and its potential solutions are illustrated by the example of elongation factors EFlA.  相似文献   

2.
A growing number of proteins are being identified that are biologically active though intrinsically disordered, in sharp contrast with the classic notion that proteins require a well-defined globular structure in order to be functional. At the same time recent work showed that aggregation and amyloidosis are initiated in amino acid sequences that have specific physico-chemical properties in terms of secondary structure propensities, hydrophobicity and charge. In intrinsically disordered proteins (IDPs) such sequences would be almost exclusively solvent-exposed and therefore cause serious solubility problems. Further, some IDPs such as the human prion protein, synuclein and Tau protein are related to major protein conformational diseases. However, this scenario contrasts with the large number of unstructured proteins identified, especially in higher eukaryotes, and the fact that the solubility of these proteins is often particularly good. We have used the algorithm TANGO to compare the beta aggregation tendency of a set of globular proteins derived from SCOP and a set of 296 experimentally verified, non-redundant IDPs but also with a set of IDPs predicted by the algorithms DisEMBL and GlobPlot. Our analysis shows that the beta-aggregation propensity of all-alpha, all-beta and mixed alpha/beta globular proteins as well as membrane-associated proteins is fairly similar. This illustrates firstly that globular structures possess an appreciable amount of structural frustration and secondly that beta-aggregation is not determined by hydrophobicity and beta-sheet propensity alone. We also show that globular proteins contain almost three times as much aggregation nucleating regions as IDPs and that the formation of highly structured globular proteins comes at the cost of a higher beta-aggregation propensity because both structure and aggregation obey very similar physico-chemical constraints. Finally, we discuss the fact that although IDPs have a much lower aggregation propensity than globular proteins, this does not necessarily mean that they have a lower potential for amyloidosis.  相似文献   

3.
Amino acid residues can be divided into similar groups by frequencies of interreplacements in the evolutionary pathway and by trends to spatial contacts at the tertiary structures of globular proteins. Each residue was compared to the cluster of spatial surrounding--the totality of residues spacially drawn together. 5210 clusters in 32 unhomologous proteins with established tertiary structure and 6447 clusters formed only by variables amino acid residues were analysed. Spatial contacts among residues were studied depending on the secondary structure and the amount of residues in a cluster. It was assumed that functionally admissible mutations may be defined, first of all, by the degree of neighboring of amino acid residues in the spatial surrounding.  相似文献   

4.
Extended proteins such as calmodulin and troponin C have two globular terminal domains linked by a central region that is exposed to water and often acts as a function-regulating element. The mechanisms that stabilize the tertiary structure of extended proteins appear to differ greatly from those of globular proteins. Identifying such differences in physical properties of amino acid sequences between extended proteins and globular proteins can provide clues useful for identification of extended proteins from complete genomes including orphan sequences. In the present study, we examined the structure and amino acid sequence of extended proteins. We found that extended proteins have a large net electric charge, high charge density, and an even balance of charge between the terminal domains, indicating that electrostatic interaction is a dominant factor in stabilization of extended proteins. Additionally, the central domain exposed to water contained many amphiphilic residues. Extended proteins can be identified from these physical properties of the tertiary structure, which can be deduced from the amino acid sequence. Analysis of physical properties of amino acid sequences can provide clues to the mechanism of protein folding. Also, structural changes in extended proteins may be caused by formation of molecular complexes. Long-range effects of electrostatic interactions also appear to play important roles in structural changes of extended proteins.  相似文献   

5.
Because intrinsically disordered proteins are incapable of forming unique tertiary structures in isolation, their interaction with partner structures enables them to play important roles in many different biological functions. Therefore, such proteins are usually multifunctional, and their ability to perform their major function, as well as accessory functions, depends on the characteristics of a given interaction. The present paper demonstrates, using predictions from two programs, that the transmembrane proteoglycans syndecans are natively disordered because of their diverse functions and large number of interaction partners. Syndecans perform multiple functions during development, damage repair, tumor growth, angiogenesis, and neurogenesis. By mediating the binding of a large number of extracellular ligands to their receptors, these proteoglycans trigger a cascade of reactions that subsequently regulate various cell processes: cytoskeleton formation, proliferation, differentiation, adhesion, and migration. The occurrences of 20 amino acids in syndecans 1–4 from 25 animals were compared with those in 17 animal proteomes. Gly?+?Ala, Thr, Glu, and Pro were observed to predominate in the syndecans, contributing to the lack of an ordered structure. In contrast, there were many fewer amino acids in syndecans that promote an ordered structure, such as Cys, Trp, Asn, and His. In addition, a region rich in Asp has been identified between two heparan sulfate-binding sites in the ectodomains, and a region rich in Lys has been identified in the conserved C1 site of the cytoplasmic domain. These particular regions play an essential role in the various functions of syndecans due to their lack of structure.  相似文献   

6.
Globular proteins composed of different secondary structures and fold types were examined by synchrotron radiation circular dichroism spectroscopy to determine the effects of dehydration on their secondary structures. They exhibited only minor changes upon removal of bulk water during film formation, contrary to previously reported studies of proteins dehydrated by lyophilization (where substantial loss of helical structure and gain in sheet structure was detected). This near lack of conformational change observed for globular proteins contrasts with intrinsically disordered proteins (IDPs) dried in the same manner: the IDPs, which have almost completely unordered structures in solution, exhibited increased amounts of regular (mostly helical) secondary structures when dehydrated, suggesting formation of new intra‐protein hydrogen bonds replacing solvent‐protein hydrogen bonds, in a process which may mimic interactions that occur when IDPs bind to partner molecules. This study has thus shown that the secondary structures of globular and intrinsically disordered proteins behave very differently upon dehydration, and that films are a potentially useful format for examining dehydrated soluble proteins and assessing IDPs structures.  相似文献   

7.
8.
Serine/arginine-rich (SR) splicing factors play an important role in constitutive and alternative splicing as well as during several steps of RNA metabolism. Despite the wealth of functional information about SR proteins accumulated to-date, structural knowledge about the members of this family is very limited. To gain a better insight into structure-function relationships of SR proteins, we performed extensive sequence analysis of SR protein family members and combined it with ordered/disordered structure predictions. We found that SR proteins have properties characteristic of intrinsically disordered (ID) proteins. The amino acid composition and sequence complexity of SR proteins were very similar to those of the disordered protein regions. More detailed analysis showed that the SR proteins, and their RS domains in particular, are enriched in the disorder-promoting residues and are depleted in the order-promoting residues as compared to the entire human proteome. Moreover, disorder predictions indicated that RS domains of SR proteins were completely unstructured. Two different classification methods, the charge-hydropathy measure and the cumulative distribution function (CDF) of the disorder scores, were in agreement with each other, and they both strongly predicted members of the SR protein family to be disordered. This study emphasizes the importance of the disordered structure for several functions of SR proteins, such as for spliceosome assembly and for interaction with multiple partners. In addition, it demonstrates the usefulness of order/disorder predictions for inferring protein structure from sequence.  相似文献   

9.
The structural stability of a protein requires a large number of interresidue interactions. The energetic contribution of these can be approximated by low-resolution force fields extracted from known structures, based on observed amino acid pairing frequencies. The summation of such energies, however, cannot be carried out for proteins whose structure is not known or for intrinsically unstructured proteins. To overcome these limitations, we present a novel method for estimating the total pairwise interaction energy, based on a quadratic form in the amino acid composition of the protein. This approach is validated by the good correlation of the estimated and actual energies of proteins of known structure and by a clear separation of folded and disordered proteins in the energy space it defines. As the novel algorithm has not been trained on unstructured proteins, it substantiates the concept of protein disorder, i.e. that the inability to form a well-defined 3D structure is an intrinsic property of many proteins and protein domains. This property is encoded in their sequence, because their biased amino acid composition does not allow sufficient stabilizing interactions to form. By limiting the calculation to a predefined sequential neighborhood, the algorithm was turned into a position-specific scoring scheme that characterizes the tendency of a given amino acid to fall into an ordered or disordered region. This application we term IUPred and compare its performance with three generally accepted predictors, PONDR VL3H, DISOPRED2 and GlobPlot on a database of disordered proteins.  相似文献   

10.
Compared to eukaryotes, the occurrence of "intrinsically disordered" or "natively unfolded" proteins in prokaryotes has not been explored extensively. Here, we report the occurrence of an intrinsically disordered protein from the mesophilic human pathogen Mycobacterium tuberculosis. The Histidine-tagged recombinant Rv3221c biotin-binding protein is intrinsically disordered at ambient and physiological growth temperatures as revealed by circular dichroism and Fourier transform infrared (FTIR) spectroscopic studies. However, an increase in temperature induces a transition from disordered to structured state with a folding temperature of approximately 53 degrees C. Addition of a structure inducing solvent trifluoroethanol (TFE) causes the protein to fold at lower temperatures suggesting that TFE fosters hydrophobic interactions, which drives protein folding. Differential Scanning Calorimetry studies revealed that folding is endothermic and the transition from a disordered to structured state is continuous (higher-order), implying existence of intermediates during folding process. Secondary structure analysis revealed that the protein has propensity to form beta-sheets. This is in conformity with FTIR spectrum that showed an absorption peak at wave number of 1636 cm(-1), indicative of disordered beta-sheet conformation in the native state. These data suggest that although Rv3221c may be disordered under ambient or optimal growth temperature conditions, it has the potential to fold into ordered structure at high temperature driven by increased hydrophobic interactions. In contrast to the generally known behavior of other intrinsically disordered proteins folding at high temperature, Rv3221c does not appear to oligomerize or aggregate as revealed through numerous experiments including Congo red binding, Thioflavin T-binding, turbidity measurements, and examining molar ellipticity as a function of protein concentration. The amino acid composition of Rv3221c reveals that it has 24% charged and 54.9% hydrophobic amino acid residues. In this respect, this protein, although belonging to the class of intrinsically disordered proteins, has distinct features. The intrinsically disordered state and the biotin-binding feature of this protein suggest that it may participate in many biochemical processes requiring biotin as a cofactor and adopt suitable conformations upon binding other folded targets.  相似文献   

11.
Lise S  Jones DT 《Proteins》2005,58(1):144-150
The relationship between amino acid sequence and intrinsic disorder in proteins is investigated. Two databases, one of disordered proteins and the other of globular proteins, are analyzed and compared in order to extract simple sequence patterns of a few amino acids or amino acid properties that characterize disordered segments. It is found that a number of reliable, nonrandom associations exists. In particular, two types of patterns appear to be recurrent: a proline-rich pattern and a (positively or negatively) charged pattern. These results indicate that local sequence information can determine disordered regions in proteins. The derived patterns provide some insights into the physical reasons for disordered structures. They should also be helpful in improving currently available prediction methods.  相似文献   

12.
This study investigates the relationship between the thermal stability of a globular protein and its amino acid composition. The method deals with the relationship between the amino acid compositions and melting points in a set of proteins by computing single-residue and group correlations. Groups of residues are shown to stabilize or destabilize the molecule against temperature. The stabilizing group consists of polar-charged residues and nonpolar residues possessing high surrounding hydrophobicity. The polar-uncharged residues destabilize the molecule against temperature, serine being the most destabilizing residue. A very high cooperativity exists among the stabilizing nonpolar residues suggesting that their characteristic clustering inside the globule may enhance the thermostability of a protein. In small globular proteins which act as single cooperative units, the melting temperature remains mainly a function of amino acid composition, whereas in complex molecules it depends on other factors also.  相似文献   

13.
We explore the question of whether local effects (originating from the amino acids intrinsic secondary structure propensities) or nonlocal effects (reflecting the sequence of amino acids as a whole) play a larger role in determining the fold of globular proteins. Earlier circular dichroism studies have shown that the pattern of polar, non polar amino acids (nonlocal effect) dominates over the amino acid intrinsic propensity (local effect) in determining the secondary structure of oligomeric peptides. In this article, we present a coarse grained computational model that allows us to quantitatively estimate the role of local and nonlocal factors in determining both the secondary and tertiary structure of small, globular proteins. The amino acid intrinsic secondary structure propensity is modeled by a dihedral potential term. This dihedral potential is parametrized to match with experimental measurements of secondary structure propensity. Similarly, the magnitude of the attraction between hydrophobic residues is parametrized to match the experimental transfer free energies of hydrophobic amino acids. Under these parametrization conditions, we systematically explore the degree of frustration a given polar, non polar pattern can tolerate when the secondary structure intrinsic propensities are in opposition to it. When the parameters are in the biophysically relevant range, we observe that the fold of small, globular proteins is determined by the pattern of polar, non polar amino acids regardless of their instrinsic secondary structure propensities. Our simulations shed new light on previous observations that tertiary interactions are more influential in determining protein structure than secondary structure propensity. The fact that this can be inferred using a simple polymer model that lacks most of the biochemical details points to the fundamental importance of binary patterning in governing folding.  相似文献   

14.
Intrinsically disordered proteins (IDPs) do not adopt stable three-dimensional structures in physiological conditions, yet these proteins play crucial roles in biological phenomena. In most cases, intrinsic disorder manifests itself in segments or domains of an IDP, called intrinsically disordered regions (IDRs), but fully disordered IDPs also exist. Although IDRs can be detected as missing residues in protein structures determined by X-ray crystallography, no protocol has been developed to identify IDRs from structures obtained by Nuclear Magnetic Resonance (NMR). Here, we propose a computational method to assign IDRs based on NMR structures. We compared missing residues of X-ray structures with residue-wise deviations of NMR structures for identical proteins, and derived a threshold deviation that gives the best correlation of ordered and disordered regions of both structures. The obtained threshold of 3.2 Å was applied to proteins whose structures were only determined by NMR, and the resulting IDRs were analyzed and compared to those of X-ray structures with no NMR counterpart in terms of sequence length, IDR fraction, protein function, cellular location, and amino acid composition, all of which suggest distinct characteristics. The structural knowledge of IDPs is still inadequate compared with that of structured proteins. Our method can collect and utilize IDRs from structures determined by NMR, potentially enhancing the understanding of IDPs.  相似文献   

15.
固有无序蛋白质是一类在生理条件下缺乏稳定三维结构而具有正常功能,参与信号转导、转录调控、胁迫应答等多种生物学过程的蛋白质.植物中许多逆境响应蛋白是固有无序蛋白质,通过其结构无序或部分无序区域在蛋白质 蛋白质、蛋白质 膜脂、蛋白质 核酸的互作中发挥重要作用.本文主要对固有无序蛋白质的类别、氨基酸组成和结构特点以及在逆境胁迫下其稳定细胞膜、保护核酸和蛋白质、调控基因表达等分子功能进行综述,以拓展对逆境胁迫下蛋白质作用分子机制的认识.  相似文献   

16.
Intrinsically disordered proteins (IDPs) lack a well-defined three-dimensional structure under physiological conditions. Intrinsic disorder is a common phenomenon, particularly in multicellular eukaryotes, and is responsible for important protein functions including regulation and signaling. Many disease-related proteins are likely to be intrinsically disordered or to have disordered regions. In this paper, a new predictor model based on the Bayesian classification methodology is introduced to predict for a given protein or protein region if it is intrinsically disordered or ordered using only its primary sequence. The method allows to incorporate length-dependent amino acid compositional differences of disordered regions by including separate statistical representations for short, middle and long disordered regions. The predictor was trained on the constructed data set of protein regions with known structural properties. In a Jack-knife test, the predictor achieved the sensitivity of 89.2% for disordered and 81.4% for ordered regions. Our method outperformed several reported predictors when evaluated on the previously published data set of Prilusky et al. [2005. FoldIndex: a simple tool to predict whether a given protein sequence is intrinsically unfolded. Bioinformatics 21 (16), 3435-3438]. Further strength of our approach is the ease of implementation.  相似文献   

17.
The prion protein (PrP) is composed of two major domains of similar size. The structured C-terminal domain contains three alpha-helical regions and a short two-stranded beta-sheet, while the N-terminal domain is intrinsically disordered. The analysis of PrP mutants with deletions in the C-terminal globular domain provided the first hint that intrinsically disordered domains are inefficiently transported into the endoplasmic reticulum through the Sec61 translocon. Interestingly, C-terminally truncated PrP mutants have been linked to inherited prion disease in humans and are characterized by inefficient ER import and the formation of neurotoxic PrP conformers. In a recent study we found that the Sec61 translocon in eukaryotic cells as well as the SecY translocon in bacteria is inherently deficient in translocating intrinsically disordered proteins. Moreover, our results suggest that translocon-associated components in eukaryotic cells enable the Sec61 complex to transport secretory proteins with extended unstructured domains such as PrP and shadoo.  相似文献   

18.
Many protein regions have been shown to be intrinsically disordered, lacking unique structure under physiological conditions. These intrinsically disordered regions are not only very common in proteomes, but also crucial to the function of many proteins, especially those involved in signaling, recognition, and regulation. The goal of this work was to identify the prevalence, characteristics, and functions of conserved disordered regions within protein domains and families. A database was created to store the amino acid sequences of nearly one million proteins and their domain matches from the InterPro database, a resource integrating eight different protein family and domain databases. Disorder prediction was performed on these protein sequences. Regions of sequence corresponding to domains were aligned using a multiple sequence alignment tool. From this initial information, regions of conserved predicted disorder were found within the domains. The methodology for this search consisted of finding regions of consecutive positions in the multiple sequence alignments in which a 90% or more of the sequences were predicted to be disordered. This procedure was constrained to find such regions of conserved disorder prediction that were at least 20 amino acids in length. The results of this work included 3,653 regions of conserved disorder prediction, found within 2,898 distinct InterPro entries. Most regions of conserved predicted disorder detected were short, with less than 10% of those found exceeding 30 residues in length.  相似文献   

19.
Intrinsic disorder in the Protein Data Bank   总被引:2,自引:0,他引:2  
The Protein Data Bank (PDB) is the preeminent source of protein structural information. PDB contains over 32,500 experimentally determined 3-D structures solved using X-ray crystallography or nuclear magnetic resonance spectroscopy. Intrinsically disordered regions fail to form a fixed 3-D structure under physiological conditions. In this study, we compare the amino-acid sequences of proteins whose structures are determined by X-ray crystallography with the corresponding sequences from the Swiss-Prot database. The analyzed dataset includes 16,370 structures, which represent 18,101 PDB chains and 5,434 different proteins from 910 different organisms (2,793 eukaryotic, 2,109 bacterial, 288 viral, and 244 archaeal). In this dataset, on average, each Swiss-Prot protein is represented by 7 PDB chains with 76% of the crystallized regions being represented by more than one structure. Intriguingly, the complete sequences of only approximately 7% of proteins are observed in the corresponding PDB structures, and only approximately 25% of the total dataset have >95% of their lengths observed in the corresponding PDB structures. This suggests that the vast majority of PDB proteins is shorter than their corresponding Swiss-Prot sequences and/or contain numerous residues, which are not observed in maps of electron density. To determine the prevalence of disordered regions in PDB, the residues in the Swiss-Prot sequences were grouped into four general categories, "Observed" (which correspond to structured regions), "Not observed" (regions with missing electron density, potentially disordered), "Uncharacterized," and "Ambiguous," depending on their appearance in the corresponding PDB entries. This non-redundant set of residues can be viewed as a 'fragment' or empirical domain database that contains a set of experimentally determined structured regions or domains and a set of experimentally verified disordered regions or domains. We studied the propensities and properties of residues in these four categories and analyzed their relations to the predictions of disorder using several algorithms. "Non-observed," "Ambiguous," and "Uncharacterized" regions were shown to possess the amino acid compositional biases typical of intrinsically disordered proteins. The application of four different disorder predictors (PONDR(R) VL-XT, VL3-BA, VSL1P, and IUPred) revealed that the vast majority of residues in the "Observed" dataset are ordered, and that the "Not observed" regions are mostly disordered. The "Uncharacterized" regions possess some tendency toward order, whereas the predictions for the short "Ambiguous" regions are really ambiguous. Long "Ambiguous" regions (>70 amino acid residues) are mostly predicted to be ordered, suggesting that they are likely to be "wobbly" domains. Overall, we showed that completely ordered proteins are not highly abundant in PDB and many PDB sequences have disordered regions. In fact, in the analyzed dataset approximately 10% of the PDB proteins contain regions of consecutive missing or ambiguous residues longer than 30 amino-acids and approximately 40% of the proteins possess short regions (> or =10 and < 30 amino-acid long) of missing and ambiguous residues.  相似文献   

20.
Many biologically active proteins, which are usually called intrinsically disordered or natively unfolded proteins, lack stable tertiary and/or secondary structure under physiological conditions in vitro. Their functions complement the functional repertoire of ordered proteins, with intrinsically disordered proteins (IDPs) often being involved in regulation, signaling and control. Their amino acid sequences and compositions are very different from those of ordered proteins, making reliable identification of IDPs possible at the proteome level. IDPs are highly abundant in various human diseases, including neurodegeneration and other protein dysfunction maladies and, therefore, represent attractive novel drug targets. Some of the aspects of IDPs, as well as their roles in neurodegeneration and protein dysfunction diseases, are discussed in this article, together with the peculiarities of IDPs as potential drug targets.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号