Similar Literature
Found 20 similar documents (search time: 31 ms)
1.
Bcl-2 homology 3 (BH3) domains are short sequence motifs that mediate nearly all protein-protein interactions between B cell lymphoma 2 (Bcl-2) family proteins in the intrinsic apoptotic cell death pathway. These sequences are found on both pro-survival and pro-apoptotic members, although their primary function is believed to be associated with induction of cell death. Here, we identify critical features of the BH3 domains of pro-survival proteins that distinguish them functionally from their pro-apoptotic counterparts. Biochemical and x-ray crystallographic studies demonstrate that these differences reduce the capacity of most pro-survival proteins to form high affinity “BH3-in-groove” complexes that are critical for cell death induction. Switching these residues for the corresponding residues in Bcl-2 homologous antagonist/killer (Bak) increases the binding affinity of isolated BH3 domains for pro-survival proteins; however, their exchange in the context of the parental protein causes rapid proteasomal degradation due to protein destabilization. This is supported by further x-ray crystallographic studies that capture elements of this destabilization in one pro-survival protein, Bcl-w. In pro-apoptotic Bak, we demonstrate that the corresponding distinguishing residues are important for its cell-killing capacity and antagonism by pro-survival proteins.

2.
The amino acid sequences of proteins determine their three-dimensional structures and functions. However, how sequence information is related to structures and functions is still enigmatic. In this study, we show that at least a part of the sequence information can be extracted by treating amino acid sequences of proteins as a collection of English words, based on a working hypothesis that amino acid sequences of proteins are composed of short constituent amino acid sequences (SCSs) or “words”. We first confirmed that the English language very likely follows Zipf's law, a special case of a power law. We found that the rank-frequency plot of SCSs in proteins exhibits a similar distribution when low-rank tails are excluded. In comparison with natural English and “compressed” English without spaces between words, amino acid sequences of proteins show larger linear ranges and smaller exponents with heavier low-rank tails, demonstrating that the SCS distribution in proteins is largely scale-free. The distribution pattern of SCSs in proteins is similar among species, but species-specific features are also present. Based on the availability scores of SCSs, we found that sequence motifs are enriched in high-availability sites (i.e., “key words”) and vice versa. In fact, the highest availability peak within a given protein sequence often directly corresponds to a sequence motif. The amino acid composition of high-availability sites within motifs is different from that of entire motifs and all protein sequences, suggesting the possible functional importance of specific SCSs and their compositional amino acids within motifs. We anticipate that our availability-based word decoding approach is complementary to sequence alignment approaches in predicting functionally important sites of unknown proteins from their amino acid sequences.
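As a rough illustration of the rank-frequency analysis mentioned in this abstract, the sketch below counts overlapping fixed-length "words" in a set of sequences and estimates the slope of the log-log rank-frequency line. It is only a minimal sketch under stated assumptions: the abstract does not say how SCSs are delimited or how exponents are fitted, so the fixed word length `k`, the function names, and the toy sequences are all hypothetical.

```python
from collections import Counter
import math


def kmer_rank_frequency(sequences, k=3):
    """Return counts of overlapping k-mers, sorted from most to least frequent."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    return sorted(counts.values(), reverse=True)


def loglog_slope(frequencies):
    """Least-squares slope of log(frequency) against log(rank), ranks starting at 1."""
    xs = [math.log(rank) for rank in range(1, len(frequencies) + 1)]
    ys = [math.log(f) for f in frequencies]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


# Made-up sequences, for illustration only (not data from the study).
toy_sequences = ["MKVLAAGIVGLLLAQ", "MKVLGAGIVALLLSQ"]
frequencies = kmer_rank_frequency(toy_sequences, k=3)
print(frequencies[:5], loglog_slope(frequencies))
```

A slope near -1 on such a plot would correspond to the classical form of Zipf's law; real analyses would need far larger sequence sets than this toy input.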

3.
PDZ domains are small globular building blocks that are amongst the most abundant protein interaction domains in organisms. Over the past several years an avalanche of data has implicated these modules in the clustering, targeting and routing of associating proteins. An overview is given of the types of interactions displayed by PDZ domains and how this relates to the current knowledge on their spatial structure. Furthermore, the different levels on which PDZ – ligand binding can be regulated and the consequences of PDZ domain-mediated clustering for activity, routing and targeting of interacting proteins will be addressed. Finally, some cell and animal models that illustrate the impact of PDZ domain-containing proteins on (multi-) cellular processes will be discussed.

4.
Macromolecule condensates, phase separation, and membraneless compartments have become an important area of cell biology research where new biophysical concepts are emerging. This article discusses the possibility that condensates assemble on multivalent surfaces such as DNA, microtubules, or lipid bilayers by multilayer adsorption. Langmuir isotherm theory conceptualized saturable surface binding and deeply influenced physical biochemistry. Brunauer-Emmett-Teller (BET) theory extended Langmuir’s ideas to multilayer adsorption. A BET-inspired biochemical model predicts that surface-binding proteins with a tendency to self-associate will form multilayered condensates on binding surfaces. These “bound condensates” are expected to assemble well below the saturation concentration for liquid–liquid phase separation, so they can compete subunits away from phase-separated droplets and are thermodynamically pinned to the binding surface. Tau binding to microtubules is an interesting test case. The nonsaturable binding isotherm is reminiscent of BET predictions, but assembly of Tau-rich domains at low concentrations requires a different model. Surface-bound condensates may find multiple biological uses, particularly in situations where it is important that condensate assembly is spatially constrained, such as gene regulation.
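For orientation, the two classical isotherms named in this abstract can be written in their textbook forms. These are the standard gas-adsorption expressions, not the article's BET-inspired biochemical model, and the symbols ($\theta$ for fractional coverage, $K$ for the binding constant, $c$ for free ligand concentration, $v/v_m$ for adsorbed amount relative to one monolayer, $x$ for concentration relative to saturation, $C$ for the BET constant) are assumptions introduced here.

```latex
% Langmuir: saturable single-layer binding
\theta = \frac{K c}{1 + K c}

% BET: multilayer adsorption; v/v_m grows without bound as x \to 1
\frac{v}{v_m} = \frac{C x}{(1 - x)\,\bigl(1 + (C - 1)\,x\bigr)}
```

The Langmuir form saturates at $\theta = 1$, whereas the BET form keeps rising as $x$ approaches 1, which is the sense in which a binding isotherm can be called nonsaturable.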

5.
The power of genome sequencing depends on the ability to understand what those genes and their protein products actually do. The automated methods used to assign functions to putative proteins in newly sequenced organisms are limited by the size of our library of proteins with both known function and sequence. Unfortunately, this library grows slowly, lagging well behind the rapid increase in novel protein sequences produced by modern genome sequencing methods. One potential source for rapidly expanding this functional library is the “back catalog” of enzymology – “orphan enzymes,” those enzymes that have been characterized and yet lack any associated sequence. There are hundreds of orphan enzymes in the Enzyme Commission (EC) database alone. In this study, we demonstrate how this orphan enzyme “back catalog” is a fertile source for rapidly advancing the state of protein annotation. Starting from three orphan enzyme samples, we applied mass-spectrometry based analysis and computational methods (including sequence similarity networks, sequence and structural alignments, and operon context analysis) to rapidly identify the specific sequence for each orphan while avoiding the most time- and labor-intensive aspects of typical sequence identifications. We then used these three new sequences to more accurately predict the catalytic function of 385 previously uncharacterized or misannotated proteins. We expect that this kind of rapid sequence identification could be efficiently applied on a larger scale to make enzymology’s “back catalog” another powerful tool to drive accurate genome annotation.

6.
Protein alignments are commonly used to evaluate the similarity of protein residues, and the derived consensus sequence is used for identifying functional units (e.g., domains). Traditional consensus-building models fail to account for interpositional dependencies – functionally required covariation of residues that tend to appear simultaneously throughout evolution and across the phylogenetic tree. These relationships can reveal important clues about the processes of protein folding, thermostability, and the formation of functional sites, which in turn can be used to inform the engineering of synthetic proteins. Unfortunately, these relationships essentially form sub-motifs which cannot be predicted by simple “majority rule” or even HMM-based consensus models, and the result can be a biologically invalid “consensus” which is not only never seen in nature but is less viable than any extant protein. We have developed a visual analytics tool, StickWRLD, which creates an interactive 3D representation of a protein alignment and clearly displays covarying residues. The user has the ability to pan and zoom, as well as dynamically change the statistical threshold underlying the identification of covariants. StickWRLD has previously been successfully used to identify functionally-required covarying residues in proteins such as Adenylate Kinase and in DNA sequences such as endonuclease target sites.
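To make the idea of interpositional dependency concrete, the sketch below scores covariation between two alignment columns with mutual information. This is a commonly used stand-in, not StickWRLD's method: the abstract does not state which statistic the tool uses, and the function name and toy alignment are hypothetical.

```python
from collections import Counter
import math


def column_mutual_information(alignment, i, j):
    """Mutual information (in bits) between columns i and j of an alignment.

    `alignment` is a list of equal-length strings, one per aligned sequence.
    """
    n = len(alignment)
    pair_counts = Counter((seq[i], seq[j]) for seq in alignment)
    i_counts = Counter(seq[i] for seq in alignment)
    j_counts = Counter(seq[j] for seq in alignment)
    mi = 0.0
    for (a, b), count in pair_counts.items():
        p_ab = count / n
        p_a = i_counts[a] / n
        p_b = j_counts[b] / n
        mi += p_ab * math.log2(p_ab / (p_a * p_b))
    return mi


# Made-up alignment: columns 0 and 1 always change together, column 2 is constant.
toy_alignment = ["AKL", "AKL", "GEL", "GEL"]
print(column_mutual_information(toy_alignment, 0, 1))
print(column_mutual_information(toy_alignment, 0, 2))
```

In the toy alignment a majority-rule consensus would hide the fact that A pairs only with K and G only with E, which is the kind of sub-motif the abstract describes.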

7.
Interactions in protein networks may place constraints on protein interface sequences to maintain correct and avoid unwanted interactions. Here we describe a “multi-constraint” protein design protocol to predict sequences optimized for multiple criteria, such as maintaining sets of interactions, and apply it to characterize the mechanism and extent to which 20 multi-specific proteins are constrained by binding to multiple partners. We find that multi-specific binding is accommodated by at least two distinct patterns. In the simplest case, all partners share key interactions, and sequences optimized for binding to either single or multiple partners recover only a subset of native amino acid residues as optimal. More interestingly, for signaling interfaces functioning as network “hubs,” we identify a different, “multi-faceted” mode, where each binding partner prefers its own subset of wild-type residues within the promiscuous binding site. Here, integration of preferences across all partners results in sequences much more “native-like” than seen in optimization for any single binding partner alone, suggesting these interfaces are substantially optimized for multi-specificity. The two strategies make distinct predictions for interface evolution and design. Shared interfaces may be better small molecule targets, whereas multi-faceted interactions may be more “designable” for altered specificity patterns. The computational methodology presented here is generalizable for examining how naturally occurring protein sequences have been selected to satisfy a variety of positive and negative constraints, as well as for rationally designing proteins to have desired patterns of altered specificity.

8.
The RNase II family of 3′–5′ exoribonucleases is present in all domains of life, and eukaryotic family members Dis3 and Dis3L2 play essential roles in RNA degradation. Ascomycete yeasts contain both Dis3 and inactive RNase II-like “pseudonucleases.” The latter function as RNA-binding proteins that affect cell growth, cytokinesis, and fungal pathogenicity. However, the evolutionary origins of these pseudonucleases are unknown: What sequence of events led to their novel function, and when did these events occur? Here, we show how RNase II pseudonuclease homologs, including Saccharomyces cerevisiae Ssd1, are descended from active Dis3L2 enzymes. During fungal evolution, active site mutations in Dis3L2 homologs have arisen at least four times, in some cases following gene duplication. In contrast, N-terminal cold-shock domains and regulatory features are conserved across diverse dikarya and mucoromycota, suggesting that the nonnuclease function requires these regions. In the basidiomycete pathogenic yeast Cryptococcus neoformans, the single Ssd1/Dis3L2 homolog is required for cytokinesis from polyploid “titan” growth stages. This phenotype of C. neoformans Ssd1/Dis3L2 deletion is consistent with those of inactive fungal pseudonucleases, yet the protein retains an active site sequence signature. We propose that a nuclease-independent function for Dis3L2 arose in an ancestral hyphae-forming fungus. This second function has been conserved across hundreds of millions of years, whereas the RNase activity was lost repeatedly in independent lineages.

9.
The aim of this study is to explore whether matrices and MP trees used to produce systematic categories of organisms could be useful to produce categories of ideas in the history of science. We study the history of the use of trees in systematics to represent the diversity of life from 1766 to 1991. We apply to those ideas a method inspired by the coding of homologous parts of organisms. We discretize conceptual parts of ideas, writings and drawings about trees contained in 41 main writings; we detect shared parts among authors, code them into a 91-character matrix, and use a tree representation to show who shares what with whom. In other words, we propose a hierarchical representation of the shared ideas about trees among authors: this produces a “tree of trees.” Then, we categorize schools of tree-representations. Classical schools like “cladists” and “pheneticists” are recovered but others are not: “gradists” are separated into two blocks, one of them being called here “grade theoreticians.” We propose new interesting categories like the “buffonian school,” the “metaphoricians,” and those using “strictly genealogical classifications.” We consider that networks are not useful to represent shared ideas at the present step of the study. A cladogram is made to show who is sharing what with whom, but also heterobathmy and homoplasy of characters. The present cladogram does not model processes of transmission of ideas about trees, and here it is mostly used to test for proximity of ideas of the same age and for categorization.

10.
To investigate novel patterns and processes of protein evolution, we have focused on the metallothioneins (MTs), a singular group of metal-binding, cysteine-rich proteins that, due to their high degree of sequence diversity, still represents a “black hole” in Evolutionary Biology. We have identified and analyzed more than 160 new MTs in nonvertebrate chordates (especially in 37 species of ascidians, 4 thaliaceans, and 3 appendicularians), showing that prototypic tunicate MTs are mono-modular proteins with a pervasive preference for cadmium ions, whereas vertebrate and cephalochordate MTs are bimodular proteins with diverse metal preferences. These structural and functional differences imply a complex evolutionary history of chordate MTs (including de novo emergence of genes and domains, processes of convergent evolution, events of gene gains and losses, and recurrent amplifications of functional domains) that would represent an unprecedented case in the field of protein evolution.

11.
Intrinsically disordered regions have been associated with various cellular processes and are implicated in several human diseases, but their exact roles remain unclear. We previously defined two classes of conserved disordered regions in budding yeast, referred to as “flexible” and “constrained” conserved disorder. In flexible disorder, the property of disorder has been positionally conserved during evolution, whereas in constrained disorder, both the amino acid sequence and the property of disorder have been conserved. Here, we show that flexible and constrained disorder are widespread in the human proteome, and are particularly common in proteins with regulatory functions. Both classes of disordered sequences are highly enriched in regions of proteins that undergo tissue-specific (TS) alternative splicing (AS), but not in regions of proteins that undergo general (i.e., not tissue-regulated) AS. Flexible disorder is more highly enriched in TS alternative exons, whereas constrained disorder is more highly enriched in exons that flank TS alternative exons. These latter regions are also significantly more enriched in potential phosphosites and other short linear motifs associated with cell signaling. We further show that cancer driver mutations are significantly enriched in regions of proteins associated with TS and general AS. Collectively, our results point to distinct roles for TS alternative exons and flanking exons in the dynamic regulation of protein interaction networks in response to signaling activity, and they further suggest that alternatively spliced regions of proteins are often functionally altered by mutations responsible for cancer.

12.
Immunoglobulin heavy chain-binding protein (BiP) is a member of the hsp70 family of chaperones and one of the most abundant proteins in the ER lumen. It is known to interact transiently with many nascent proteins as they enter the ER and more stably with protein subunits produced in stoichiometric excess or with mutant proteins. However, there also exists a large number of secretory pathway proteins that do not apparently interact with BiP. To begin to understand what controls the likelihood that a nascent protein entering the ER will associate with BiP, we have examined the in vivo folding of a murine λI immunoglobulin (Ig) light chain (LC). This LC is composed of two Ig domains that can fold independently of each other and that each possess multiple potential BiP-binding sequences. To detect BiP binding to the LC during folding, we used BiP ATPase mutants, which bind irreversibly to proteins, as “kinetic traps.” Although both the wild-type and mutant BiP clearly associated with the unoxidized variable region domain, we were unable to detect binding of either BiP protein to the constant region domain. A combination of in vivo and in vitro folding studies revealed that the constant domain folds rapidly and stably even in the absence of an intradomain disulfide bond. Thus, the simple presence of a BiP-binding site on a nascent chain does not ensure that BiP will bind and play a role in its folding. Instead, it appears that the rate and stability of protein folding determine whether or not a particular site is recognized, with BiP preferentially binding to proteins that fold slowly or somewhat unstably.

13.
How the same DNA sequences can function in the three-dimensional architecture of the interphase nucleus, fold into the very compact structure of metaphase chromosomes, and return precisely to the original interphase architecture in the following cell cycle remains an unresolved question to this day. The strategy used to address this issue was to analyze the correlations between chromosome architecture and the compositional patterns of DNA sequences spanning a size range from a few hundred to a few thousand kilobases. This is a critical range that encompasses isochores, interphase chromatin domains and boundaries, and chromosomal bands. The solution rests on the following key points: 1) the transition from the looped domains and sub-domains of interphase chromatin to the 30-nm fiber loops of early prophase chromosomes goes through the unfolding into an extended chromatin structure (probably a 10-nm “beads-on-a-string” structure); 2) the architectural proteins of interphase chromatin, such as CTCF and cohesin sub-units, are retained in mitosis and are part of the discontinuous protein scaffold of mitotic chromosomes; 3) the conservation of the link between architectural proteins and their binding sites on DNA through the cell cycle explains the “mitotic memory” of interphase architecture and the reversibility of the interphase to mitosis process. The results presented here also lead to a general conclusion which concerns the existence of correlations between the isochore organization of the genome and the architecture of chromosomes from interphase to metaphase.

14.
Occludin is the only known integral membrane protein localizing at tight junctions (TJ), but recent targeted disruption analysis of the occludin gene indicated the existence of as yet unidentified integral membrane proteins in TJ. We therefore re-examined the isolated junction fraction from chicken liver, from which occludin was first identified. Among numerous components of this fraction, only a broad silver-stained band ~22 kD was detected with the occludin band through 4 M guanidine-HCl extraction as well as sonication followed by stepwise sucrose density gradient centrifugation. Two distinct peptide sequences were obtained from the lower and upper halves of the broad band, and similarity searches of databases allowed us to isolate two full-length cDNAs encoding related mouse 22-kD proteins consisting of 211 and 230 amino acids, respectively. Hydrophilicity analysis suggested that both bore four transmembrane domains, although they did not show any sequence similarity to occludin. Immunofluorescence and immunoelectron microscopy revealed that both proteins tagged with FLAG or GFP were targeted to and incorporated into the TJ strand itself. We designated them as “claudin-1” and “claudin-2”, respectively. Although the precise structure/function relationship of the claudins to TJ still remains elusive, these findings indicated that multiple integral membrane proteins with four putative transmembrane domains, occludin and claudins, constitute TJ strands.

15.
Haspel N, Tsai CJ, Wolfson H, Nussinov R. Proteins. 2003;51(2):203-215.
We have previously presented a building block folding model. The model postulates that protein folding is a hierarchical top-down process. The basic unit from which a fold is constructed, referred to as a hydrophobic folding unit, is the outcome of combinatorial assembly of a set of "building blocks." Results obtained by the computational cutting procedure yield fragments that are in agreement with those obtained experimentally by limited proteolysis. Here we show that as expected, proteins from the same family give very similar building blocks. However, different proteins can also give building blocks that are similar in structure. In such cases the building blocks differ in sequence, stability, contacts with other building blocks, and in their 3D locations in the protein structure. This result, which we have repeatedly observed in many cases, leads us to conclude that while a building block is influenced by its environment, nevertheless, it can be viewed as a stand-alone unit. For small-sized building blocks existing in multiple conformations, interactions with sister building blocks in the protein will increase the population time of the native conformer. With this conclusion in hand, it is possible to develop an algorithm that predicts the building block assignment of a protein sequence whose structure is unknown. Toward this goal, we have created sequentially nonredundant databases of building block sequences. A protein sequence can be aligned against these, in order to be matched to a set of potential building blocks.

16.
Annotation of genes and their products is largely guided by inferred homology. Sequence similarity is the primary measure used for annotation; however, domain content and order have been given less importance, even though domain insertions, deletions, and positional changes can introduce functional variety. Of late, several methods have been developed to quantify domain architecture similarity, but they depend on sequence alignments and focus only on homologous proteins. We present an alignment-free domain architecture-similarity search (ADASS) algorithm that identifies proteins that share very poor sequence similarity yet have similar domain architectures. We introduce a “singlet matching-triplet comparison” method in ADASS, wherein each triplet of domains is compared with other triplets in a pairwise comparison of two domain architectures. Different events in the triplet comparison are scored according to a scoring scheme, and an average pairwise distance score (Domain Architecture Distance score, or DAD score) is calculated between protein domain architectures. We take the domain architectures containing a selected domain, termed the centric domain, and cluster them based on DAD score. The algorithm has a high positive predictive value (PPV) with respect to the clustering of the sequences of selected domain architectures. A comparison of domain architecture-based dendrograms from the ADASS method and an existing method revealed that ADASS can classify proteins according to the extent of domain architecture-level similarity. ADASS is more relevant for proteins with tiny domains that contribute little to the overall sequence similarity but contribute significantly to the overall function.

17.
We have determined X-ray crystal structures of four members of an archaeal-specific family of proteins of unknown function (UPF0201; Pfam classification: DUF54) to advance our understanding of the genetic repertoire of archaea. Despite low pairwise amino acid sequence identities (10–40%) and the absence of conserved sequence motifs, the three-dimensional structures of these proteins are remarkably similar to one another. Their common polypeptide chain fold, encompassing a five-stranded antiparallel β-sheet and five α-helices, proved to be quite unexpectedly similar to that of the RRM-type RNA-binding domain of the ribosomal L5 protein, which is responsible for binding the 5S rRNA. Structure-based sequence alignments enabled construction of a phylogenetic tree relating UPF0201 family members to L5 ribosomal proteins and other structurally similar RNA binding proteins, thereby expanding our understanding of the evolutionary purview of the RRM superfamily. Analyses of the surfaces of these newly determined UPF0201 structures suggest that they probably do not function as RNA binding proteins, and that this domain-specific family of proteins has acquired a novel function in archaebacteria, which awaits experimental elucidation.

18.
Cells are highly organized machines with functionally specialized compartments. For example, membrane proteins are localized to axons or dendrites in neurons and to apical or basolateral surfaces in epithelial cells. Interestingly, many sensory cells, including vertebrate photoreceptors and olfactory neurons, exhibit both neuronal and epithelial features. Here, we show that Caenorhabditis elegans amphid neurons simultaneously exhibit axon-dendrite sorting like a neuron and apical-basolateral sorting like an epithelial cell. The distal ∼5–10 µm of the dendrite is apical, while the remainder of the dendrite, soma, and axon are basolateral. To determine how proteins are sorted among these compartments, we studied the localization of the conserved adhesion molecule SAX-7/L1CAM. Using minimal synthetic transmembrane proteins, we found that the 91-aa cytoplasmic tail of SAX-7 is necessary and sufficient to direct basolateral localization. Basolateral localization can be fully recapitulated using either of 2 short (10-aa or 19-aa) tail sequences that, respectively, resemble dileucine and Tyr-based motifs known to mediate sorting in mammalian epithelia. The Tyr-based motif is conserved in human L1CAM but had not previously been assigned a function. Disrupting key residues in either sequence leads to apical localization, while “improving” them to match epithelial sorting motifs leads to axon-only localization. Indeed, changing only 2 residues in a short motif is sufficient to redirect the protein between apical, basolateral, and axonal localization. Our results demonstrate that axon-dendrite and apical-basolateral sorting pathways can coexist in a single cell, and suggest that subtle changes to short sequence motifs are sufficient to redirect proteins between these pathways.

19.
The closely related Abl family kinases, Arg and Abl, play important non-redundant roles in the regulation of cell morphogenesis and motility. Despite similar N-terminal sequences, Arg and Abl interact with different substrates and binding partners with varying affinities. This selectivity may be due to slight differences in amino acid sequence leading to differential interactions with target proteins. We report that the Arg Src homology (SH) 2 domain binds two specific phosphotyrosines on cortactin, a known Abl/Arg substrate, with over 10-fold higher affinity than the Abl SH2 domain. We show that this significant affinity difference is due to the substitution of arginine 161 and serine 187 in Abl to leucine 207 and threonine 233 in Arg, respectively. We constructed Abl SH2 domains with R161L and S187T mutations alone and in combination and find that these substitutions are sufficient to convert the low affinity Abl SH2 domain to a higher affinity “Arg-like” SH2 domain in binding to a phospho-cortactin peptide. We crystallized the Arg SH2 domain for structural comparison to existing crystal structures of the Abl SH2 domain. We show that these two residues are important determinants of Arg and Abl SH2 domain binding specificity. Finally, we expressed Arg containing an “Abl-like” low affinity mutant Arg SH2 domain (L207R/T233S) and find that this mutant, although properly localized to the cell periphery, does not support wild type levels of cell edge protrusion. Together, these observations indicate that these two amino acid positions confer different binding affinities and cellular functions on the distinct Abl family kinases.

20.
Generating random number sequences is a popular psychological task often used to measure executive functioning. We explore random generation under “joint cognition” instructions; pairs of participants take turns to compile a shared response sequence. Across three studies, we point to six key findings from this novel format. First, there are both costs and benefits from group performance. Second, repetition avoidance occurs in dyadic as well as individual production settings. Third, individuals modify their choices in a dyadic situation such that the pair becomes the unit of psychological function. Fourth, there is immediate contagion of sequence stereotypy amongst the pairs (i.e., each contributor “owns” their partner’s response). Fifth, dyad effects occur even when participants know their partner is not interacting with them (Experiment 2). Sixth, ironically, directing participants’ efforts away from their shared task responsibility can actually benefit conjoint performance (Experiment 3). These results both constrain models of random generation and illuminate processes of joint cognition.

