How do mostly disordered proteins coordinate the specific assembly of very large signal transduction protein complexes? A newly emerging hypothesis may provide some clues towards a molecular mechanism.

All proteomes contain proteins and polypeptide segments that do not form a defined three-dimensional structure yet are biologically active; these are called intrinsically disordered proteins and regions (IDPs and IDRs). Most of these IDPs/IDRs lack useful functional annotation, which limits our understanding of their importance for organism fitness. Here we characterized IDRs using protein sequence annotations of functional sites and regions available in the UniProt knowledgebase (“UniProt features”: active site, ligand-binding pocket, regions mediating protein-protein interactions, etc.). By measuring the statistical enrichment of twenty-five UniProt features in 981 IDRs of 561 human proteins, we identified eight features that are commonly located in IDRs. We then collected genetic variant data from general-population and patient-based databases and evaluated the prevalence of population and pathogenic variations in IDPs/IDRs. We observed that some IDRs tolerate 2 to 12 times more single amino acid-substituting missense mutations than synonymous changes in the general population. However, we also found that 37% of all germline pathogenic mutations are located in disordered regions of 96 proteins. Based on the observed-to-expected frequency of mutations, we categorized 34 IDRs in 20 proteins (DDX3X, KIT, RB1, etc.) as intolerant to mutation. Finally, using statistical analysis and a machine learning approach, we demonstrate that mutation-intolerant IDRs carry a distinct signature of functional features. Our study presents a novel approach to assign functional importance to IDRs by leveraging the wealth of available genetic data, which will aid in a deeper understanding of the role of IDRs in biological processes and disease mechanisms.
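
The two ratios mentioned in this abstract (missense versus synonymous counts, and observed-to-expected mutation frequency) can be illustrated with a minimal sketch. The abstract does not say how the expected count was derived, so the uniform per-residue expectation used here is an assumption, and the function names are hypothetical.

```python
def missense_to_synonymous(missense_count, synonymous_count):
    """Ratio of missense to synonymous variant counts in one region."""
    if synonymous_count == 0:
        raise ValueError("ratio is undefined without synonymous variants")
    return missense_count / synonymous_count


def observed_to_expected(observed_in_region, total_in_protein,
                         region_length, protein_length):
    """Observed variant count in a region divided by the expected count.

    ASSUMPTION: the expected count spreads the protein's variants
    uniformly over its residues. The abstract does not state its model.
    """
    if not 0 < region_length <= protein_length:
        raise ValueError("region must be a non-empty part of the protein")
    expected = total_in_protein * region_length / protein_length
    if expected == 0:
        raise ValueError("no variants in the protein; ratio is undefined")
    return observed_in_region / expected
```

Under this assumed model, a ratio well below 1 would mark a region as carrying fewer variants than its length predicts. The abstract gives no cutoff for "intolerant", so none is encoded here.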

Why intrinsically disordered regions evolve within the human proteome has been an interesting question for a decade. To date, it remains unresolved why some disordered regions evolve rapidly while others are highly conserved across mammalian species. Identifying the key biological factors responsible for the variation in the conservation rate of different disordered regions within the human proteome may help address this issue. Among the biological features considered in our study (multifunctionality, gene essentiality, protein connectivity, number of unique domains, gene expression level and expression breadth), the number of unique protein domains acts as a strong determinant that negatively influences the conservation of disordered regions. In this context, we argue that proteins with fewer types of domains need to conserve their disordered regions to enhance their structural flexibility, which in turn facilitates their molecular interactions. In contrast, the selection pressure acting on stretches of disordered regions is weaker in multi-domain proteins. We therefore reasoned that conserved disordered stretches in a single-domain protein may compensate for the functions that multiple domains would otherwise provide. Interestingly, we noticed that the unique domain number and expression level influence the evolution of disordered regions differently from that of well-structured regions.

Intrinsically disordered proteins (IDPs)/regions do not have well-defined secondary and tertiary structures; however, they are functional, and it is critical to gain a deep understanding of their residue packing. The shape distributions methodology, which is usually utilized in pattern recognition, clustering, and classification studies in computer science, may be adopted to study the residue packing of proteins. In this study, shape distributions of globular proteins and IDPs were obtained to shed light on the residue packing of their structures. The shape feature used is the sphericity of the tetrahedra obtained by Delaunay tessellation of the Cα coordinates. The sphericity probability distributions were then compared using Principal Component Analysis. This computational structural study shows that the set of IDPs constitutes a more diverse set than the set of globular proteins in terms of the geometrical properties of their network structures.
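
As a rough illustration of the shape feature named above, the sketch below computes a sphericity value for one tetrahedron from four Cα positions. The abstract does not give its exact sphericity formula, so the classical definition (surface area of the equal-volume sphere divided by the surface area of the solid) is an assumption. The tessellation step itself is left out: the four points are taken as one tetrahedron already produced by a Delaunay tessellation.

```python
import math


def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])


def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def _triangle_area(a, b, c):
    n = _cross(_sub(b, a), _sub(c, a))
    return 0.5 * math.sqrt(_dot(n, n))


def tetrahedron_sphericity(p0, p1, p2, p3):
    """Sphericity = pi^(1/3) * (6V)^(2/3) / A (assumed definition)."""
    # Volume from the scalar triple product of three edge vectors.
    volume = abs(_dot(_sub(p1, p0), _cross(_sub(p2, p0), _sub(p3, p0)))) / 6.0
    # Total surface area is the sum of the four triangular faces.
    area = (_triangle_area(p0, p1, p2) + _triangle_area(p0, p1, p3)
            + _triangle_area(p0, p2, p3) + _triangle_area(p1, p2, p3))
    if area == 0.0:
        raise ValueError("all four points coincide or are collinear")
    return math.pi ** (1.0 / 3.0) * (6.0 * volume) ** (2.0 / 3.0) / area
```

With this definition a regular tetrahedron scores about 0.67 and a flat (coplanar) one scores 0, so a histogram of the values over all tetrahedra gives one possible form of the shape distribution.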

To assess the potential of intrinsically disordered proteins (IDPs) as drug design targets, we have analyzed the ligand-binding cavities of two datasets of IDPs (containing 37 and 16 entries, respectively) and compared their properties with those of conventional ordered (folded) proteins. IDPs were predicted to possess more binding cavities than ordered proteins of similar length, supporting the proposed advantage of IDPs in economizing genome and protein resources. The cavity number has a wide distribution within each conformational ensemble for IDPs. The geometries of the cavities of IDPs differ from those of ordered proteins: for example, the cavities of IDPs have larger surface areas and volumes and are more likely to be composed of a single segment. The druggability of the cavities was examined, and the average druggable probability is estimated to be 9% for IDPs, which is almost twice that for ordered proteins (5%). Some IDPs with druggable cavities that are associated with diseases are listed. The optimism versus obstacles for drug design for IDPs is also briefly discussed.

A new possibility of predicting short disordered regions (loops) at a small window size (three amino acid residues) by the FoldUnfold program is described. As demonstrated with the example of three G proteins, FoldUnfold predicted almost all existing loops at positions that fit the X-ray structural data well. The loops predicted in the Ras p21 structure were classified into two types. The loops of the first type display high Debye-Waller factor values, characteristic of the so-called functional loops (flexible loops). The second-type loops had lower Debye-Waller factor values and, consequently, were regarded as the loops connecting secondary structure elements (rigid loops). Comparison of the results predicted by FoldUnfold with the predictions of other programs (PONDR, RONN, DisEMBL, PreLINK, IUPred, GlobPlot 2, and FoldIndex) demonstrated that FoldUnfold was much better at predicting the positions of short loops. FoldUnfold made it possible to solve a problem that is difficult for the other programs, that is, to determine the boundary between the ordered and disordered regions in proteins with a large fraction of disordered regions, exemplified by the ubiquitin-like domain. In particular, FoldUnfold predicted a boundary between the ordered and disordered regions at residues 30 and 31, whereas the other programs predicted the boundary in the range of 28–70 amino acid residues.

High-speed atomic force microscopy (HS-AFM) is a powerful tool established 13 years ago. This methodology can capture individual protein molecules carrying out functional activities under near-physiological conditions, without chemical labeling, at 2–3 nm lateral and ∼0.1 nm vertical spatial resolution, and at sub-100 ms temporal resolution. Although most biological HS-AFM studies thus far target structured proteins, HS-AFM is also ideally suited to study the dynamics of intrinsically disordered proteins. Here we review some of the dynamic structures and processes of intrinsically disordered proteins that have been unveiled by HS-AFM imaging.

Small-angle scattering of X-rays (SAXS) is an established method to study the overall structure and structural transitions of biological macromolecules in solution. For folded proteins, the technique provides low-resolution three-dimensional structures ab initio, or it can be used to drive rigid-body modeling. SAXS is also a powerful tool for the quantitative analysis of flexible systems, including intrinsically disordered proteins (IDPs), and is highly complementary to the high-resolution methods of X-ray crystallography and NMR. Here we present the basic principles of SAXS and review the main approaches to the characterization of IDPs and flexible multidomain proteins using SAXS. Together with the standard approaches based on the analysis of overall parameters, a recently developed Ensemble Optimization Method (EOM) is now available. The latter method allows for the co-existence of multiple protein conformations in solution compatible with the scattering data. Analysis of the selected ensembles provides quantitative information about flexibility and also offers insights into structural features. Examples of the use of SAXS and combined approaches with NMR, X-ray crystallography, and computational methods to characterize completely or partially disordered proteins are presented.
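
One overall parameter commonly extracted from SAXS data is the radius of gyration, Rg, via the Guinier approximation I(q) ≈ I(0)·exp(−q²Rg²/3) at small q. The sketch below fits a straight line to ln I versus q² to recover Rg and I(0). It is a generic illustration rather than the procedure of the review, and the function names are hypothetical.

```python
import math


def synthetic_guinier(q_values, rg, i0):
    """Noise-free intensities that follow the Guinier approximation exactly."""
    return [i0 * math.exp(-(q * rg) ** 2 / 3.0) for q in q_values]


def guinier_fit(q_values, intensities):
    """Least-squares line through ln I versus q^2; returns (Rg, I0)."""
    if len(q_values) != len(intensities) or len(q_values) < 2:
        raise ValueError("need at least two matching (q, I) points")
    x = [q * q for q in q_values]
    y = [math.log(i) for i in intensities]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    if slope >= 0:
        raise ValueError("non-negative slope: data are not in a Guinier regime")
    # slope = -Rg^2 / 3 and intercept = ln I(0)
    return math.sqrt(-3.0 * slope), math.exp(intercept)
```

The fit is only meaningful in the low-q range where the approximation holds. That range tends to be narrower for flexible or disordered chains than for compact proteins, which is one reason ensemble approaches are used alongside it.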

PagP, a beta-barrel membrane protein found in Gram-negative bacteria, expresses robustly in inclusion bodies when its signal sequence is removed. We have developed a new fusion protein expression system based on PagP and demonstrated its utility in the expression of the unstructured N-terminal region of human cardiac troponin I (residues 1-71). A yield of 100 mg of fusion protein per liter of M9 minimal medium was obtained. The troponin I fragment was removed from PagP using cyanogen bromide cleavage at methionine residues followed by nickel affinity chromatography. We further demonstrate that optimal cleavage requires complete reduction of methionine residues prior to cyanogen bromide treatment, and this is effectively accomplished using potassium iodide under acidic conditions. The PagP-based fusion protein system is more effective at targeting proteins into inclusion bodies than a commercially available system that uses ketosteroid isomerase; it thus represents an important advance for producing large quantities of unfolded peptides or proteins in Escherichia coli.

Previous bioinformatics studies showed a sharp distinction in structural features and residue composition between intrinsically disordered proteins and folded proteins. What induces such a composition-related structural transition? How do various kinds of interactions work in such processes? In this work, we investigate these problems through a survey of peptides randomly composed of charged residues (glutamic acids and lysines) and residues with different hydrophobicity, such as alanines, glycines, or phenylalanines. In simulations using an all-atom model and the replica-exchange Monte Carlo method, a coil-globule transition is observed for each peptide. The corresponding transition temperature is found to depend on the contents of hydrophobic and charged residues. In several cases, when the mean hydrophobicity is larger than a certain threshold, the transition temperature is higher than room temperature, and vice versa. These thresholds of hydrophobicity and net charge are quantitatively consistent with the border line observed in bioinformatics studies. These results outline the basic physical reasons for the compositional distinction between intrinsically disordered proteins and folded proteins. Furthermore, the contributions of various interactions to the structural variation of the peptides are analyzed based on contact statistics and the charge-pattern dependence of the gyration radii of the peptides. Our observations imply that hydrophobicity contributes essentially to such composition-related transitions. Thus, we achieve a better understanding of the composition–structure relation of natural proteins and the underlying physics.
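
The two composition variables discussed here, mean hydrophobicity and mean net charge, can be computed directly from a sequence. The sketch below assumes that the "border line" is the widely cited charge-hydropathy boundary, ⟨R⟩ = 2.785⟨H⟩ − 1.151, with Kyte-Doolittle hydropathy rescaled to the range 0 to 1. The abstract does not name the boundary or scale it used, so both choices are assumptions, as are the function names.

```python
# Kyte-Doolittle hydropathy values (range -4.5 to 4.5).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
CHARGE = {"K": 1, "R": 1, "D": -1, "E": -1}  # histidine treated as neutral


def mean_hydropathy(sequence):
    """Mean Kyte-Doolittle hydropathy, rescaled so each residue lies in 0..1."""
    return sum((KYTE_DOOLITTLE[aa] + 4.5) / 9.0 for aa in sequence) / len(sequence)


def mean_net_charge(sequence):
    """Absolute net charge per residue, counting K/R as +1 and D/E as -1."""
    return abs(sum(CHARGE.get(aa, 0) for aa in sequence)) / len(sequence)


def on_disordered_side(sequence):
    """True if the sequence lies on the disordered side of the assumed boundary."""
    boundary = 2.785 * mean_hydropathy(sequence) - 1.151
    return mean_net_charge(sequence) > boundary
```

For example, the peptide EEEEKKKK has zero net charge but very low mean hydropathy and falls on the disordered side, whereas FFFFFFFF falls on the compact side. This mirrors, only qualitatively, the hydrophobicity threshold described in the abstract.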

The p53 transactivation domain (p53TAD) is an intrinsically disordered protein (IDP) domain that undergoes coupled folding and binding when interacting with partner proteins such as the E3 ligase MDM2 and the 70 kDa subunit of replication protein A, RPA70. The secondary structure and dynamics of six closely related mammalian homologues of p53TAD were investigated using nuclear magnetic resonance (NMR) spectroscopy. Differences in both transient secondary structure and backbone dynamics were observed for the homologues. Many of these differences were localized to the binding sites for MDM2 and RPA70. The amount of transient helical secondary structure observed for the MDM2 binding site was lower for the dog and mouse homologues, compared with human, and the amount of transient helical secondary structure observed for the RPA70 binding site was higher for guinea pig and rabbit, compared with human. Differences in the amount of transient helical secondary structure observed for the MDM2 binding site were directly related to amino acid substitutions occurring on the solvent-exposed side of the amphipathic helix that forms during the p53TAD/MDM2 interaction. Differences in the amount of transient helical secondary structure were not as easily explained for the RPA70 binding site because of its extensive sequence divergence. Clustering analysis shows that the divergence in the transient secondary structure of the p53TAD homologues exceeds the amino acid sequence divergence. In contrast, strong correlations were observed between the backbone dynamics of the homologues and the sequence identity matrix, suggesting that the dynamic behavior of IDPs is a conserved evolutionary feature. Proteins 2013; 81:1686–1698. © 2013 Wiley Periodicals, Inc.

Proteins provide much of the scaffolding for life, as well as undertaking a variety of essential catalytic reactions. These characteristic functions have led us to presuppose that proteins are in general functional only when well structured and correctly folded. As we begin to explore the repertoire of possible protein sequences inherent in the human and other genomes, two stark facts that belie this supposition become clear: first, the number of apparent open reading frames in the human genome is significantly smaller than appears to be necessary to code for all of the diverse proteins in higher organisms; second, a significant proportion of the protein sequences that would be coded by the genome would not be expected to form stable three-dimensional (3D) structures. Clearly the genome must include coding for a multitude of alternative forms of proteins, some of which may be partly or fully disordered or incompletely structured in their functional states. At the same time as this likelihood was recognized, experimental studies also began to uncover examples of important protein molecules and domains that were incompletely structured or completely disordered in solution, yet remained perfectly functional. In the ensuing years, we have seen an explosion of experimental and genome-annotation studies that have mapped the extent of the intrinsic disorder phenomenon and explored the possible biological rationales for its widespread occurrence. Answers to the question 'why would a particular domain need to be unstructured?' are as varied as the systems where such domains are found. This review provides a survey of recent new directions in this field, and includes an evaluation of the role not only of intrinsically disordered proteins but also of partially structured and highly dynamic members of the disorder-order continuum.

We propose a new approach, based on alpha proton detection, for the sequential assignment of natively unfolded proteins. The proposed protocol builds on the following features of HA-detection: (1) it enables assignment of natively unfolded proteins at any pH, i.e., it is not sensitive to the rapid chemical exchange that occurs in natively unfolded proteins even at moderately high pH; (2) it allows straightforward assignment of proline-rich polypeptides without additional proline-customized experiments; (3) it offers more streamlined and less ambiguous assignment based solely on intraresidual 15N(i)-13C′(i)-Hα(i) (or 15N(i)-13Cα(i)-Hα(i)) and sequential 15N(i + 1)-13C′(i)-Hα(i) (or 15N(i + 1)-13Cα(i)-Hα(i)) correlation experiments, together with efficient use of the chemical shifts of 15N and 13C′ nuclei, which show a smaller dependence on residue type. We have tested the proposed protocol on two proteins: the small globular 56-residue GB1 and the highly disordered, proline-rich 47-residue fifth repeat of EspFU. Using the proposed approach, we were able to assign 90% of the 1Hα, 13Cα, 13C′, and 15N chemical shifts in EspFU. We expect that the HA-detection based strategy will be very useful in the assignment of natively unfolded proline-rich proteins or polypeptide chains.

Identifying wheat leaf protein expression is a major challenge of functional genomics. Using two-dimensional gel electrophoresis, 541 wheat leaf proteins were separated, and 55 of them were sequenced by nano liquid chromatography-tandem mass spectrometry. Peptide sequence data were screened against protein banks and public expressed sequence tag banks. Among these 55 spots, 20 proteins were found in wheat and 21 in other grass families (http://www.ncbi.nlm.nih.gov/). Twelve proteins showed similarities with other eukaryotic plant species. One protein showed homology to a bacterial sequence and another protein remained unknown. In 18 cases a significant score was found for the wheat TUC (Tentative Unique Contigs) of the PlantGDB (http://www.plantgdb.org/) data. In several cases, different spots were identified as corresponding to the same protein, which can probably be attributed to the hexaploid structure of wheat. The identified proteins were classified into six groups and their roles are discussed. Most of them (31/55) are involved in carbohydrate metabolism.

Glycosylation is an intricate process requiring the coordinated action of multiple proteins, including glycosyltransferases, glycosidases, sugar nucleotide transporters and trafficking proteins. Work by several groups points to a role for microRNA (miRNA) in controlling the levels of specific glycosyltransferases involved in cancer, neural migration and osteoblast formation. Recent work in our laboratory suggests that miRNA are a principal regulator of the glycome, translating genomic information into the glycocode through tuning of enzyme levels. Herein we overlay predicted miRNA regulation of glycosylation-related genes (glycogenes) onto maps of the common N-linked and O-linked glycan biosynthetic pathways to identify key regulatory nodes of the glycome. Our analysis provides insights into glycan regulation and suggests that, at the regulatory level, glycogenes are non-redundant.

Phosphorylation of intrinsically disordered proteins (IDPs) can produce changes in structural and dynamical properties and thereby mediate critical biological functions. How phosphorylation affects intrinsically disordered proteins has been studied for an increasing number of IDPs, but a systematic understanding is still lacking. Here, we compare the collapse propensity of four disordered proteins, Ash1, the C-terminal domain of RNA polymerase (CTD2’), the cytosolic domain of E-Cadherin, and a fragment of p130Cas, in unphosphorylated and phosphorylated forms using extensive all-atom molecular dynamics (MD) simulations. We find that all proteins show V-shaped changes in their collapse propensity upon multi-site phosphorylation according to their initial net charge: phosphorylation expands neutral or overall negatively charged IDPs and shrinks positively charged IDPs. However, force fields, including those tailored towards and commonly used for IDPs, overestimate these changes. We find quantitative agreement of MD results with SAXS and NMR data for Ash1 and CTD2’ only when attenuating protein electrostatic interactions by using a higher salt concentration (e.g. 350 mM), highlighting the overstabilization of salt bridges in current force fields. We show that phosphorylation of IDPs also has a strong impact on the solvation of the protein, a factor that, in addition to the actual collapse or expansion of the IDP, should be considered when analyzing SAXS data. Compared to the overall mild change in global IDP dimension, the exposure of active sites can change significantly upon phosphorylation, underlining the large susceptibility of IDP ensembles to regulation through post-translational modifications.
