首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Traditionally, protein-protein interactions were thought to be mediated by large, structured domains. However, it has become clear that the interactome comprises a wide range of binding interfaces with varying degrees of flexibility, ranging from rigid globular domains to disordered regions that natively lack structure. Enrichment for disorder in highly connected hub proteins and its correlation with organism complexity hint at the functional importance of disordered regions. Nevertheless, they have not yet been extensively characterised. Shifting the attention from globular domains to disordered regions of the proteome might bring us closer to elucidating the dense and complex connectivity of the interactome. An important class of disordered interfaces are the compact mono-partite, short linear motifs (SLiMs, or eukaryotic linear motifs (ELMs)). They are evolutionarily plastic and interact with relatively low affinity due to the limited number of residues that make direct contact with the binding partner. These features confer to SLiMs the ability to evolve convergently and mediate transient interactions, which is imperative to network evolution and to maintain robust cell signalling, respectively. The ability to discriminate biologically relevant SLiMs by means of different attributes will improve our understanding of the complexity of the interactome and aid development of bioinformatics tools for motif discovery. In this paper, the curated instances currently available in the Eukaryotic Linear Motif (ELM) database are analysed to provide a clear overview of the defining attributes of SLiMs. These analyses suggest that functional SLiMs have higher levels of conservation than their surrounding residues, frequently evolve convergently, preferentially occur in disordered regions and often form a secondary structure when bound to their interaction partner. These results advocate searching for small groupings of residues in disordered regions with higher relative conservation and a propensity to form the secondary structure. Finally, the most interesting conclusions are examined in regard to their functional consequences.  相似文献   

2.
线性短模体是天然无序蛋白实现生物学功能的重要组件.线性短模体具有柔性结构和短小的序列,可以介导瞬时、可逆的蛋白质相互作用,并在发生相互作用时表现出杂泛性.随着实验技术的更新和预测手段的发展,越来越多的线性短模体被发现和重新定义,例如BH3线性短模体.本文重点总结了线性短模体在结构、生物学功能以及进化等方面的特点.对线性短模体功能的研究将为解析细胞信号转导网络、疾病靶标确认、新药发现等领域带来新的思路.  相似文献   

3.
Pretranslational modification by alternative splicing, alternative promoter usage and RNA editing enables the production of multiple protein isoforms from a single gene. A large quantity of data now supports the notion that short linear motifs (SLiMs), which are protein interaction modules enriched within intrinsically disordered regions, are key for the functional diversification of these isoforms. The inclusion or removal of these SLiMs can switch the subcellular localisation of an isoform, promote cooperative associations, refine the affinity of an interaction, coordinate phase transitions within the cell, and even create isoforms of opposing function. This article discusses the novel functionality enabled by the addition or removal of SLiM-containing exons by pretranslational modifications, such as alternative splicing and alternative promoter usage, and how these alterations enable the creation and modulation of complex regulatory and signalling pathways.  相似文献   

4.
Many of the specific functions of intrinsically disordered protein segments are mediated by Short Linear Motifs (SLiMs) interacting with other proteins. Well known examples include SLiMs that interact with 14-3-3, PDZ, SH2, SH3, and WW domains but the true extent and diversity of SLiM-mediated interactions is largely unknown. Here, we attempt to expand our knowledge of human SLiMs by applying in silico SLiM prediction to the human interactome. Combining data from seven different interaction databases, we analysed approximately 6000 protein-centred and 1600 domain-centred human interaction datasets of 3+ unrelated proteins that interact with a common partner. Results were placed in context through comparison to randomised datasets of similar size and composition. The search returned thousands of evolutionarily conserved, intrinsically disordered occurrences of hundreds of significantly enriched recurring motifs, including many that have never been previously identified (). In addition to True Positive results for at least 25 different known SLiMs, a striking number of "off-target" proteins/domains also returned significantly enriched known motifs. Often, this was due to the non-independence of the datasets, with many proteins sharing interaction partners or contributing interactions to multiple domain datasets. The majority of these motif classes, however, were also found to be significantly enriched in one or more randomised datasets. This highlights the need for care when interpreting motif predictions of this nature but also raises the possibility that SLiM occurrences may be successfully identified independently of interaction data. Although not as compositionally biased as previous studies, patterns matching known SLiMs tended to cluster into a few large groups of similar sequence, while novel predictions tended to be more distinctive and less abundant. Whether this is due to ascertainment bias or a true functional composition bias of SLiMs is not clear and warrants further investigation.  相似文献   

5.
ABSTRACT: BACKGROUND: Short linear protein motifs are attracting increasing attention as functionally independent sites, typically 3-10 amino acids in length that are enriched in disordered regions of proteins. Multiple methods have recently been proposed to discover over-represented motifs within a set of proteins based on simple regular expressions. Here, we extend these approaches to profile-based methods, which provide a richer motif representation. RESULTS: The profile motif discovery method MEME performed relatively poorly for motifs in disordered regions of proteins. However, when we applied evolutionary weighting to account for redundancy amongst homologous proteins, and masked out poorly conserved regions of disordered proteins, the performance of MEME is equivalent to that of regular expression methods. However, the two approaches returned different subsets within both a benchmark dataset, and a more realistic discovery dataset. CONCLUSIONS: Profile-based motif discovery methods complement regular expression based methods. Whilst profile-based methods are computationally more intensive, they are likely to discover motifs currently overlooked by regular expression methods.  相似文献   

6.
Autoinhibition of p53 binding to MDMX requires two short-linear motifs (SLiMs) containing adjacent tryptophan (WW) and tryptophan-phenylalanine (WF) residues. NMR spectroscopy was used to show the WW and WF motifs directly compete for the p53 binding site on MDMX and circular dichroism spectroscopy was used to show the WW motif becomes helical when it is bound to the p53 binding domain (p53BD) of MDMX. Binding studies using isothermal titration calorimetry showed the WW motif is a stronger inhibitor of p53 binding than the WF motif when they are both tethered to p53BD by the natural disordered linker. We also investigated how the WW and WF motifs interact with the DNA binding domain (DBD) of p53. Both motifs bind independently to similar sites on DBD that overlap the DNA binding site. Taken together our work defines a model for complex formation between MDMX and p53 where a pair of disordered SLiMs bind overlapping sites on both proteins.  相似文献   

7.
A major challenge to the characterization of intrinsically disordered regions (IDRs), which are widespread in the proteome, but relatively poorly understood, is the identification of molecular features that mediate functions of these regions, such as short motifs, amino acid repeats and physicochemical properties. Here, we introduce a proteome-scale feature discovery approach for IDRs. Our approach, which we call “reverse homology”, exploits the principle that important functional features are conserved over evolution. We use this as a contrastive learning signal for deep learning: given a set of homologous IDRs, the neural network has to correctly choose a held-out homolog from another set of IDRs sampled randomly from the proteome. We pair reverse homology with a simple architecture and standard interpretation techniques, and show that the network learns conserved features of IDRs that can be interpreted as motifs, repeats, or bulk features like charge or amino acid propensities. We also show that our model can be used to produce visualizations of what residues and regions are most important to IDR function, generating hypotheses for uncharacterized IDRs. Our results suggest that feature discovery using unsupervised neural networks is a promising avenue to gain systematic insight into poorly understood protein sequences.  相似文献   

8.

Background

Analyzing the amino acid sequence of an intrinsically disordered protein (IDP) in an evolutionary context can yield novel insights on the functional role of disordered regions and sequence element(s). However, in the case of many IDPs, the lack of evolutionary conservation of the primary sequence can hamper the study of functionality, because the conservation of their disorder profile and ensuing function(s) may not appear in a traditional analysis of the evolutionary history of the protein.

Results

Here we present DisCons (Disorder Conservation), a novel pipelined tool that combines the quantification of sequence- and disorder conservation to classify disordered residue positions. According to this scheme, the most interesting categories (for functional purposes) are constrained disordered residues and flexible disordered residues. The former residues show conservation of both the sequence and the property of disorder and are associated mainly with specific binding functionalities (e.g., short, linear motifs, SLiMs), whereas the latter class correspond to segments where disorder as a feature is important for function as opposed to the identity of the underlying sequence (e.g., entropic chains and linkers). DisCons therefore helps with elucidating the function(s) arising from the disordered state by analyzing individual proteins as well as large-scale proteomics datasets.

Conclusions

DisCons is an openly accessible sequence analysis tool that identifies and highlights structurally disordered segments of proteins where the conformational flexibility is conserved across homologs, and therefore potentially functional. The tool is freely available both as a web application and as stand-alone source code hosted at http://pedb.vib.be/discons.  相似文献   

9.
10.
11.
12.
13.
The Atg8/LC3/GABARAP family of proteins, a group that has structural homology with ubiquitin, connects with a large set of binding partners to function in macroautophagy (hereafter autophagy). This interaction occurs primarily via a conserved motif termed the LC3-interacting region (LIR), or the Atg8-interacting motif (AIM). The consensus sequence for this motif, [W/F/Y]xx[L/I/V], can be found in many proteins, but only some of them are physiological partners containing a functional LIR/AIM. Because the structure of many full-length partners has not been, or cannot be, solved, the structural context of the LIR/AIM within the native protein conformation is not obvious. Here we suggest that the functional LIR/AIM is a short linear motif (SLiM) protein-binding module, arising from an intrinsically disordered region. This finding enables the rapid elimination of some false Atg8/LC3/GABARAP-binding proteins, and connects the exponentially growing knowledge on disordered SLiMs with autophagy.  相似文献   

14.
Intracellular juxtamembrane regions of transmembrane proteins play pivotal roles in cell signalling, mediated by protein-protein interactions. Disordered protein regions, and short conserved motifs within them, are emerging as key determinants of many such interactions. Here, we investigated whether disorder and conserved motifs are enriched in the juxtamembrane area of human single-pass transmembrane proteins. Conserved motifs were defined as short disordered regions that were much more conserved than the adjacent disordered residues. Human single-pass proteins had higher mean disorder in their cytoplasmic segments than their extracellular parts. Some, but not all, of this effect reflected the shorter length of the cytoplasmic tail. A peak of cytoplasmic disorder was seen at around 30 residues from the membrane. We noted a significant increase in the incidence of conserved motifs within the disordered regions at the same location, even after correcting for the extent of disorder. We conclude that elevated disorder within the cytoplasmic tail of many transmembrane proteins is likely to be associated with enrichment for signalling interactions mediated by conserved short motifs.  相似文献   

15.
《Biophysical journal》2022,121(16):3049-3060
Intrinsically disordered proteins (IDPs) play critical roles in regulatory protein interactions, but detailed structural/dynamic characterization of their ensembles remain challenging, both in isolation and when they form dynamic “fuzzy” complexes. Such is the case for mRNA cap-dependent translation initiation, which is regulated by the interaction of the predominantly folded eukaryotic initiation factor 4E (eIF4E) with the intrinsically disordered eIF4E binding proteins (4E-BPs) in a phosphorylation-dependent manner. Single-molecule Förster resonance energy transfer showed that the conformational changes of 4E-BP2 induced by binding to eIF4E are non-uniform along the sequence; while a central region containing both motifs that bind to eIF4E expands and becomes stiffer, the C-terminal region is less affected. Fluorescence anisotropy decay revealed a non-uniform segmental flexibility around six different labeling sites along the chain. Dynamic quenching of these fluorescent probes by intrinsic aromatic residues measured via fluorescence correlation spectroscopy report on transient intra- and inter-molecular contacts on nanosecond-to-microsecond timescales. Upon hyperphosphorylation, which induces folding of ~40 residues in 4E-BP2, the quenching rates decreased at most labeling sites. The chain dynamics around sites in the C-terminal region far away from the two binding motifs significantly increased upon binding to eIF4E, suggesting that this region is also involved in the highly dynamic 4E-BP2:eIF4E complex. Our time-resolved fluorescence data paint a sequence-level rigidity map of three states of 4E-BP2 differing in phosphorylation or binding status and distinguish regions that form contacts with eIF4E. This study adds complementary structural and dynamics information to recent studies of 4E-BP2, and it constitutes an important step toward a mechanistic understanding of this important IDP via integrative modeling.  相似文献   

16.
The eukaryotic translation initiation factor (eIF) 4B promotes the RNA-dependent ATP hydrolysis activity and ATP-dependent RNA helicase activity of eIF4A and eIF4F during translation initiation. Although this function is conserved among plants, animals, and yeast, eIF4B is one of the least conserved of initiation factors at the sequence level. To gain insight into its functional conservation, the organization of the functional domains of eIF4B from wheat has been investigated. Plant eIF4B contains three RNA binding domains, one more than reported for mammalian or yeast eIF4B, and each domain exhibits a preference for purine-rich RNA. In addition to a conserved RNA recognition motif and a C-terminal RNA binding domain, wheat eIF4B contains a novel N-terminal RNA binding domain that requires a short, lysine-rich containing sequence. Both the lysine-rich motif and an adjacent, C-proximal motif are conserved with an N-proximal sequence in human and yeast eIF4B. The C-proximal motif within the N-terminal RNA binding domain in wheat eIF4B is required for interaction with eIFiso4G, an interaction not reported for other eIF4B proteins. Moreover, each RNA binding domain requires dimerization for binding activity. Two binding sites for the poly(A)-binding protein were mapped to a region within each of two conserved 41-amino acid repeat domains on either side of the C-terminal RNA binding domain. eIF4A bound to an adjacent region within each repeat, supporting a central role for these conserved eIF4B domains in facilitating interaction with other components of the translational machinery. These results support the notion that eIF4B functions by organizing multiple components of the translation initiation machinery and RNA.  相似文献   

17.
Eukaryotic translation initiation is a highly regulated process involving multiple steps, from 43S pre-initiation complex (PIC) assembly, to ribosomal subunit joining. Subunit joining is controlled by the G-protein eukaryotic translation initiation factor 5B (eIF5B). Another protein, eIF1A, is involved in virtually all steps, including subunit joining. The intrinsically disordered eIF1A C-terminal tail (eIF1A-CTT) binds to eIF5B Domain-4 (eIF5B-D4). The ribosomal complex undergoes conformational rearrangements at every step of translation initiation; however, the underlying molecular mechanisms are poorly understood. Here we report three novel interactions involving eIF5B and eIF1A: (i) a second binding interface between eIF5B and eIF1A; (ii) a dynamic intramolecular interaction in eIF1A between the folded domain and eIF1A-CTT; and (iii) an intramolecular interaction between eIF5B-D3 and -D4. The intramolecular interactions within eIF1A and eIF5B interfere with one or both eIF5B/eIF1A contact interfaces, but are disrupted on the ribosome at different stages of translation initiation. Therefore, our results indicate that the interactions between eIF1A and eIF5B are being continuously rearranged during translation initiation. We present a model how the dynamic eIF1A/eIF5B interaction network can promote remodeling of the translation initiation complexes, and the roles in the process played by intrinsically disordered protein segments.  相似文献   

18.
19.
Universal scale of the sequence conservation has been recently introduced based on omnipresence of the protein sequence motifs across species. A large spectrum of short sequences, up to eight residues has been found to reside in all or almost all prokaryotic organisms. By this discovery a principally novel quantitative approach is introduced to the problem of reconstruction of the last universal common ancestor (LUCA). The most conserved elements (protein modules) with defined structures and sequences harboring the omnipresent motifs are outlined in this work, by combining the sequence and protein crystal structure data. The structurally conserved modules involve 25–30 amino acid residues and have appearance of closed loops, loop-n-lock structures. This confirms earlier conclusions on the loop-fold structure of globular proteins. Many of the topmost conserved modules represent the primary closed loop prototypes, that have been derived by whole genome sequence searches. The data presented, thus, make a basis for further developments toward the earliest stages of protein evolution. [Reviewing Editor: Dr. Martin Kreitman]  相似文献   

20.
Voltage-gated ion channels (VGCs) mediate selective diffusion of ions across cell membranes to enable many vital cellular processes. Three-dimensional structure data are lacking for VGC proteins; hence, to better understand their function, there is a need to identify the conserved motifs using sequence analysis methods. In this study, we have used a profile-to-profile alignment method to identify several new conserved motifs specific to each transmembrane segment (TMS) of the voltage-sensing and the pore-forming modules of Ca2+, Na+, and K+ channel subfamilies. For Ca2+ and Na+, the functional theme of motif conservation is similar in all segments while they differ with those of the K+ channel proteins. Nevertheless, the conservation is strikingly similar in the S4 segment of the voltage-sensing module across all subfamilies. In each subfamily and for each TMS, we have identified conserved motifs/residues and correlated their functional significance and disease associations in human, using mutational data from the literature.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号