Similar articles
 20 similar articles found
1.
Many proteins consist of several structural domains. These multi-domain proteins have likely been generated by selective genome growth dynamics during evolution to perform new functions as well as to create structures that fold on a biologically feasible time scale. Domain units frequently evolved through a variety of genetic shuffling mechanisms. Here we examine the protein domain statistics of more than 1000 organisms including eukaryotic, archaeal and bacterial species. The analysis extends earlier findings on asymmetric statistical laws for the proteome to a wider variety of species. While proteins are composed of a wide range of domains, displaying a power-law decay, the computation of domain families for each protein reveals an exponential distribution, characterizing a protein universe composed of a thin number of unique families. Structural studies in proteomics have shown that domain repeats, or internal duplicated domains, represent a small but significant fraction of the genome. In spite of its importance, this observation has been largely overlooked until recently. We model the evolutionary dynamics of the proteome and demonstrate that these distinct distributions are in fact rooted in an internal duplication mechanism. This process generates the contemporary protein structural domain universe, determines its reduced thickness, and tames its growth. These findings have important implications, ranging from protein interaction network modeling to evolutionary studies based on fundamental mechanisms governing genome expansion.

2.
Protein conjugation with ubiquitin, known as ubiquitination, is a key regulatory mechanism to control protein abundance, localization, and activity in eukaryotic cells. To identify ubiquitin-dependent regulatory steps in plants, we developed a robust affinity purification/identification system for ubiquitinated proteins. Using GST-tagged ubiquitin binding domains, we performed a large scale affinity purification of ubiquitinated proteins from Arabidopsis cell suspension culture. High molecular weight ubiquitinated proteins were separated by SDS-PAGE, and the trypsin-digested samples were then analyzed by a multidimensional protein identification technology (MudPIT) system. A total of 294 proteins specifically bound by the GST-tagged ubiquitin binding domains were identified. From these we determined 85 ubiquitinated lysine residues in 56 proteins, confirming the enrichment of the target class of proteins. Our data provide the first view of the ubiquitinated proteome in plants. We also provide evidence that this technique can be broadly applied to the study of protein ubiquitination in diverse plant species.

3.
Phylomat: an automated protein motif analysis tool for phylogenomics (cited 2 times: 0 self-citations, 2 by others)
Recent progress in genomics, proteomics, and bioinformatics enables unprecedented opportunities to examine the evolutionary history of molecular, cellular, and developmental pathways through phylogenomics. Accordingly, we have developed a motif analysis tool for phylogenomics (Phylomat, http://alg.ncsa.uiuc.edu/pmat) that scans predicted proteome sets for proteins containing highly conserved amino acid motifs or domains for in silico analysis of the evolutionary history of these motifs/domains. Phylomat enables the user to download results as full protein or extracted motif/domain sequences from each protein. Tables containing the percent distribution of a motif/domain in organisms normalized to proteome size are displayed. Phylomat can also align the set of full protein or extracted motif/domain sequences and predict a neighbor-joining tree from relative sequence similarity. Together, Phylomat serves as a user-friendly data-mining tool for the phylogenomic analysis of conserved sequence motifs/domains in annotated proteomes from the three domains of life.

4.
5.
Identification of a family of human F-box proteins (cited 21 times: 0 self-citations, 21 by others)
F-box proteins are an expanding family of eukaryotic proteins characterized by an approximately 40-amino-acid motif, the F box (so named because cyclin F was one of the first proteins in which this motif was identified) [1]. Some F-box proteins have been shown to be critical for the controlled degradation of cellular regulatory proteins [2] [3]. In fact, F-box proteins are one of the four subunits of ubiquitin protein ligases called SCFs. The other three subunits are the Skp1 protein; one of the cullin proteins (Cul1 in metazoans and Cdc53 or Cul A in the yeast Saccharomyces cerevisiae); and the recently identified Roc1 protein (also called Rbx1 or Hrt1). SCF ligases bring ubiquitin conjugating enzymes (either Ubc3 or Ubc4) to substrates that are specifically recruited by the different F-box proteins. The need for high substrate specificity and the large number of known F-box proteins in yeast and worms [2] [4] suggest the existence of a large family of mammalian F-box proteins. Using Skp1 as a bait in a yeast two-hybrid screen and by searching DNA databases, we identified a family of 26 human F-box proteins, 25 of which were novel. Some of these proteins contained WD-40 domains or leucine-rich repeats; others contained either different protein-protein interaction modules or no recognizable motifs. We have named the F-box proteins that contain WD-40 domains Fbws, those containing leucine-rich repeats, Fbls, and the remaining ones Fbxs. We have further characterized representative members of these three classes of F-box proteins.

6.
7.
Protein kinases play important roles in almost all major signaling and regulatory pathways of eukaryotic organisms. Members of the protein kinase family make up a substantial fraction of the eukaryotic proteome. Analysis of the protein kinase repertoire (kinome) would help in better understanding the regulatory processes. In this article, we report the identification and analysis of the repertoire of protein kinases in the intracellular parasite Entamoeba histolytica. Using a combination of various sensitive sequence search methods and manual analysis, we have identified a set of 307 protein kinases in the E. histolytica genome. We have classified these protein kinases into different subfamilies originally defined by Hanks and Hunter and studied these kinases further in the context of noncatalytic domains that are tethered to the catalytic kinase domain. Compared to other eukaryotic organisms, protein kinases from E. histolytica vary in terms of their domain organization and display features that may have a bearing on the unusual biology of this organism. Some of the parasitic kinases show high sequence similarity in the catalytic domain region with the calmodulin/calcium dependent protein kinase subfamily. However, they are unlikely to act like typical calcium/calmodulin dependent kinases as they lack noncatalytic domains characteristic of such kinases in other organisms. Such kinases form the largest subfamily of kinases in E. histolytica. Interestingly, a PKA/PKG-like subfamily member is tethered to a pleckstrin homology domain. Although potential cyclins and cyclin-dependent kinases could be identified in the genome, the likely absence of other cell cycle proteins suggests an unusual nature of the cell cycle in E. histolytica. Some of the unusual features recognized in our analysis include the absence of MEK as a part of the Mitogen Activated Kinase signaling pathway and identification of transmembrane region containing Src kinase-like kinases.
Sequences which could not be classified into known subfamilies of protein kinases have unusual domain architectures. Many such unclassified protein kinases are tethered to domains which are cysteine-rich and to domains known to be involved in protein-protein interactions. Our kinome analysis of E. histolytica suggests that the organism possesses a complex protein phosphorylation network that involves many unusual kinases.

8.
Histidine-containing phosphotransfer (HPt) proteins play an essential role in multistep histidine-aspartate phosphorelay signal transduction systems in prokaryotes and eukaryotes. The putative HPt protein in Schizosaccharomyces pombe, Mpr1p (also known as Spy1p), is a 295 amino acid protein that appears to be composed of more than one functional domain. The amino acid sequence of the N-terminal region of Mpr1p lacks homology to other known proteins, whereas the C-terminal domain is predicted to have structural similarity to the Ypd1p HPt protein from Saccharomyces cerevisiae. This study provides both in vitro and in vivo evidence that the C-terminal domain of Mpr1p indeed functions as an HPt protein in shuttling phosphoryl groups from one response regulator domain to another. Furthermore, we find that various deletions of the N-terminal region diminish both the phosphotransfer activity of Mpr1p and its affinity for response regulator domains, suggesting a possible role for the N-terminal domain in HPt-response regulator domain interactions.

9.
Two-component regulatory systems that utilize a multistep phosphorelay mechanism often involve a histidine-containing phosphotransfer (HPt) domain. These HPt domains serve an essential role as histidine-phosphorylated protein intermediates during phosphoryl transfer from one response regulator domain to another. In Saccharomyces cerevisiae, the YPD1 protein facilitates phosphoryl transfer from a hybrid sensor kinase, SLN1, to two distinct response regulator proteins, SSK1 and SKN7. Because the phosphorylation state largely determines the functional state of response regulator proteins, we have carried out a comparative study of the phosphorylated lifetimes of the three response regulator domains associated with SLN1, SSK1, and SKN7 (R1, R2, and R3, respectively). The isolated regulatory domains exhibited phosphorylated lifetimes within the range previously observed for other response regulator domains (i.e., several minutes to several hours). However, in the presence of YPD1, we found that the half-life of phosphorylated SSK1-R2 was dramatically extended (almost 200-fold longer than in the absence of YPD1). This stabilization effect was specific for SSK1-R2 and was not observed for SLN1-R1 or SKN7-R3. Our findings suggest a mechanism by which SSK1 is maintained in its phosphorylated state under normal physiological conditions and demonstrate an unprecedented regulatory role for an HPt domain in a phosphorelay signaling system.

10.
Zinc is one of the metal ions essential for life, as it is required for the proper functioning of a large number of proteins. Despite its importance, the annotation of zinc-binding proteins in gene banks or protein domain databases still has significant room for improvement. In the present work, we compiled a list of known zinc-binding protein domains and of known zinc-binding sequence motifs (zinc-binding patterns), and then used them jointly to analyze the proteome of 57 different organisms to obtain an overview of zinc usage by archaeal, bacterial, and eukaryotic organisms. Zinc-binding proteins are an abundant fraction of these proteomes, ranging between 4% and 10%. The number of zinc-binding proteins correlates linearly with the total number of proteins encoded by the genome of an organism, but the proportionality constant of Eukaryota (8.8%) is significantly higher than that observed in Bacteria and Archaea (from 5% to 6%). Most of this enrichment is due to the larger portfolio of regulatory proteins in Eukaryota.

11.
The mammalian SURF-6 protein is localized in the nucleolus, yet its function remains elusive in the recently characterized nucleolar proteome. We discovered by searching the Protein families database that a unique evolutionarily conserved SURF-6 domain is present in the carboxy terminus of a novel family of eukaryotic proteins extending from human to yeast. By using the enhanced green fluorescent protein as a fusion protein marker in mammalian cells, we show that proteins from distantly related taxonomic groups containing the SURF-6 domain are localized in the nucleolus. Deletion sequence analysis shows that multiple regions of the SURF-6 protein are capable of nucleolar targeting independently of the evolutionarily conserved domain. We identified that the Saccharomyces cerevisiae member of the SURF-6 family, named rrp14 or ykl082c, has been categorized in yeast databases to interact with proteins involved in ribosomal biogenesis and cell polarity. These results classify SURF-6 as a new family of nucleolar proteins in the eukaryotic kingdom and point out that SURF-6 has a distinct domain within the known nucleolar proteome that may mediate complex protein-protein interactions for analogous processes between yeast and mammalian cells.

12.
Intrinsically disordered proteins and intrinsically disordered protein regions are highly abundant in nature. However, the quantitative and qualitative measures of protein intrinsic disorder in species with known genomes are still not available. Furthermore, although the correlation between a high fraction of disordered residues and advanced species has been reported, the details of this correlation and the connection between the disorder content and proteome complexity have not yet been reported. To fill this gap, we analysed entire proteomes of 3484 species from three domains of life (archaea, bacteria and eukaryotes) and from viruses. Our analysis revealed that the evolution process is characterized by distinctive patterns of changes in the protein intrinsic disorder content. We show here that viruses are characterized by the widest spread of the proteome disorder content (the percentage of disordered residues ranges from 7.3% in human coronavirus NL63 to 77.3% in Avian carcinoma virus). For several organisms, a clear correlation is seen between their disorder contents and habitats. In multicellular eukaryotes, there is a weak correlation between the complexity of an organism (evaluated as a number of different cell types) and its overall disorder content. For both the prokaryotes and eukaryotes, the disorder content is generally independent of the proteome size. However, disorder shows a sharp increase associated with the transition from prokaryotic to eukaryotic cells. This suggests that the increased disorder content in eukaryotic proteomes might be used by nature to deal with the increased cell complexity due to the appearance of the various cellular compartments.

13.
GYF domains are conserved eukaryotic adaptor domains that recognize proline-rich sequences. Although the structure and function of the prototypic GYF domain from the human CD2BP2 protein have been characterized in detail, very little is known about GYF domains from other proteins and species. Here we describe the binding properties of four GYF domains of various origins. Phage display in combination with SPOT analysis revealed the PPG(F/I/L/M/V) motif as a general recognition signature. Based on these results, the proteomes of human, yeast, and Arabidopsis thaliana were searched for potential interaction sites. Binding of several candidate proteins was confirmed by pull-down experiments or yeast two-hybrid analysis. The binding epitope of the GYF domain from the yeast SMY2 protein was mapped by NMR spectroscopy and led to a structural model that accounts for the different binding properties of SMY2-type GYF domains and the CD2BP2-GYF domain.
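
As an illustration of the proteome search mentioned in this abstract, the following is a minimal sketch of scanning protein sequences for the PPG(F/I/L/M/V) signature with a regular expression. It is not the authors' pipeline: the function name `find_gyf_sites` and the toy sequences are hypothetical, and a real search would read sequences from a proteome file and would likely apply further filters.

```python
import re

# Recognition signature reported in the abstract: PPG followed by F, I, L, M, or V.
GYF_MOTIF = re.compile(r"PPG[FILMV]")

def find_gyf_sites(sequence):
    """Return (1-based start position, matched motif) for each occurrence in a sequence."""
    return [(m.start() + 1, m.group(0)) for m in GYF_MOTIF.finditer(sequence.upper())]

# Hypothetical toy sequences for illustration only, not real proteome entries.
toy_proteome = {
    "toy_protein_A": "MSTPPGLAAKPPGFQ",
    "toy_protein_B": "MKKLLPPGAAA",
}

# Map each protein name to its list of candidate interaction sites.
hits = {name: find_gyf_sites(seq) for name, seq in toy_proteome.items()}
```

Here `hits` maps each protein name to its candidate sites; a sequence without the signature, such as `toy_protein_B`, yields an empty list.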

14.
We have identified a new protein domain, which we have named the SHOCT domain (Short C-terminal domain). This domain is widespread in bacteria with over a thousand examples. But we found it is missing from the most commonly studied model organisms, despite being present in closely related species. Its predominantly C-terminal location, co-occurrence with numerous other domains and short size are reminiscent of the Gram-positive anchor motif; however, it is present in a much wider range of species. We suggest several hypotheses about the function of SHOCT, including oligomerisation and nucleic acid binding. Our initial experiments do not support its role as an oligomerisation domain.

15.
16.
Coiled-coil domains in eukaryotic and prokaryotic proteins contribute to diverse structural and regulatory functions. Here we have used in silico analysis to predict which proteins in the proteome of the enteric pathogen, Salmonella enterica serovar Typhimurium, harbour coiled-coil domains. We found that coiled-coil domains are especially prevalent in virulence-associated proteins, including type III effectors. Using SopB as a model coiled-coil domain type III effector, we have investigated the role of this motif in various aspects of effector function including chaperone binding, secretion and translocation, protein stability, localization and biological activity. Compared with wild-type SopB, SopB coiled-coil mutants were unstable, both inside bacteria and after translocation into host cells. In addition, the putative coiled-coil domain was required for the efficient membrane association of SopB in host cells. Since many other Salmonella effectors were predicted to contain coiled-coil domains, we also investigated the role of this motif in their intracellular targeting in mammalian cells. Mutation of the predicted coiled-coil domains in PipB2, SseJ and SopD2 also eliminated their membrane localization in mammalian cells. These findings suggest that coiled-coil domains represent a common membrane-targeting determinant for Salmonella type III effectors.

17.
Hsp90 is critical for the regulation and activation of numerous client proteins critical for diverse functions such as cell growth, differentiation, and reproduction. Cytosolic Hsp90 function is dependent on a battery of co-chaperone proteins that regulate the ATPase activity of Hsp90 or direct Hsp90 to interact with specific client proteins. Little is known about how Hsp90 complexes vary between different organisms and how this affects the scope of clients that are activated by Hsp90. This study determined whether ten distinct Hsp90 co-chaperones were encoded by genes in 19 disparate eukaryotic organisms. Surprisingly, none of the co-chaperones were present in all organisms. The co-chaperone Hop/Sti1 was most widely dispersed (18 out of 19 species), while orthologs of Cdc37, which is critical for the stability and activation of diverse protein kinases in yeast and mammals, were identified in only nine out of 19 species examined. The organism with the smallest proteome, Encephalitozoon cuniculi, contained only three of these co-chaperones, suggesting a correlation between client diversity and the complexity of the Hsp90 co-chaperone machine. Our results suggest co-chaperones are critical for cytosolic Hsp90 function in vivo, but that the composition of Hsp90 complexes varies depending on the specialized protein folding requirements of divergent species.

18.
Peroxisomes are small subcellular compartments responsible for a range of essential metabolic processes. Efforts in predicting peroxisomal protein import are challenged by species variation and sparse sequence data sets with experimentally confirmed localization. We present a predictor of peroxisomal import based on the presence of the dominant peroxisomal targeting signal one (PTS1), a seemingly well-conserved but highly unspecific motif. The signal appears to rely on subtle dependencies with the preceding residues. We evaluate prediction accuracies against two alternative predictor services, PEROXIP and the PTS1 PREDICTOR. We test the integrity of prediction on a range of prokaryotic and eukaryotic proteomes lacking peroxisomes. Similarly we test the accuracy on peroxisomal proteins known to not overlap with training data. The model identified a number of proteins within the RIKEN IPS7 mouse protein dataset as potentially novel peroxisomal proteins. Three were confirmed in vitro using immunofluorescent detection of myc-epitope-tagged proteins in transiently transfected BHK-21 cells (Dhrs2, Serhl, and Ehhadh). The final model has a superior specificity to both alternatives, and an accuracy better than PEROXIP and on par with PTS1 PREDICTOR. Thus, the model we present should prove invaluable for labeling PTS1 targeted proteins with high confidence. We use the predictor to screen several additional eukaryotic genomes to revise previously estimated numbers of peroxisomal proteins. Available at http://pprowler.itee.uq.edu.au.

19.
The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for the PABC domain. This domain is contained in homologs of the poly(A)-binding protein PABP and the ubiquitin-protein ligase HYD. Despite the importance of the PAM2 motif, a comprehensive analysis of its occurrence in different proteins has been missing. Using iterated sequence profile searches, we obtained an extensive list of proteins carrying the PAM2 motif. We discuss their functional context and domain architecture, which often consists of RNA-binding domains. Our list of PAM2 motif proteins includes eukaryotic homologs of eRF3/GSPT1/2, PAIP1/2, Tob1/2, Ataxin-2, RBP37, RBP1, Blackjack, HELZ, TPRD, USP10, ERD15, C1D4.14, and the viral protease P29. The identification of the PAM2 motif in as yet uncharacterized proteins can give valuable hints with respect to their cellular function and potential interaction partners and suggests further experimentation. It is also striking that the PAM2 motif appears to occur solely outside globular protein domains.

20.
Ger MF, Rendon G, Tilson JL, Jakobsson E. PLoS ONE 2010, 5(10): e12827
Voltage-gated and ligand-gated ion channels are used in eukaryotic organisms for the purpose of electrochemical signaling. There are prokaryotic homologues to major eukaryotic channels of these sorts, including voltage-gated sodium, potassium, and calcium channels, Ach-receptor and glutamate-receptor channels. The prokaryotic homologues have been less well characterized functionally than their eukaryotic counterparts. In this study we identify likely prokaryotic functional counterparts of eukaryotic glutamate receptor channels by comprehensive analysis of the prokaryotic sequences in the context of known functional domains present in the eukaryotic members of this family. In particular, we searched the nonredundant protein database for all proteins containing the following motif: the two sections of the extracellular glutamate binding domain flanking two transmembrane helices. We discovered 100 prokaryotic sequences containing this motif, with a wide variety of functional annotations. Two groups within this family have the same topology as eukaryotic glutamate receptor channels. Group 1 has a potassium-like selectivity filter. Group 2 is most closely related to eukaryotic glutamate receptor channels. We present analysis of the functional domain architecture for the group of 100, a putative phylogenetic tree, comparison of the protein phylogeny with the corresponding species phylogeny, consideration of the distribution of these proteins among classes of prokaryotes, and orthologous relationships between prokaryotic and human glutamate receptor channels. We introduce a construct called the Evolutionary Domain Network, which represents a putative pathway of domain rearrangements underlying the domain composition of present channels. We believe that scientists interested in ion channels in general, and ligand-gated ion channels in particular, will be interested in this work. 
The work should also be of interest to bioinformatics researchers who are interested in the use of functional domain-based analysis in evolutionary and functional discovery.
