首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.

Background  

Identifying domains in protein sequences is an important step in protein structural and functional annotation. Existing domain recognition methods typically evaluate each domain prediction independently of the rest. However, the majority of proteins are multidomain, and pairwise domain co-occurrences are highly specific and non-transitive.  相似文献   

2.

Background  

Genes involved in non-self recognition and host defence are typically capable of rapid diversification and exploit specialized genetic mechanism to that end. Fungi display a non-self recognition phenomenon termed heterokaryon incompatibility that operates when cells of unlike genotype fuse and leads to the cell death of the fusion cell. In the fungus Podospora anserina, three genes controlling this allorecognition process het-d, het-e and het-r are paralogs belonging to the same hnwd gene family. HNWD proteins are STAND proteins (signal transduction NTPase with multiple domains) that display a WD-repeat domain controlling recognition specificity. Based on genomic sequence analysis of different P. anserina isolates, it was established that repeat regions of all members of the gene family are extremely polymorphic and undergoing concerted evolution arguing for frequent recombination within and between family members.  相似文献   

3.

Background

Conserved domains are recognized as the building blocks of eukaryotic proteins. Domains showing a tendency to occur in diverse combinations (??promiscuous?? domains) are involved in versatile architectures in proteins with different functions. Current models, based on global-level analyses of domain combinations in multiple genomes, have suggested that the propensity of some domains to associate with other domains in high-level architectures increases with organismal complexity. Alternative models using domain-based phylogenetic trees propose that domains have become promiscuous independently in different lineages through convergent evolution and are, thus, random with no functional or structural preferences. Here we test whether complex protein architectures have occurred by accretion from simpler systems and whether the appearance of multidomain combinations parallels organismal complexity. As a model, we analyze the modular evolution of the PWWP domain and ask whether its appearance in combinations with other domains into multidomain architectures is linked with the occurrence of more complex life-forms. Whether high-level combinations of domains are conserved and transmitted as stable units (cassettes) through evolution is examined in the genomes of plant or metazoan species selected for their established position in the evolution of the respective lineages.

Results

Using the domain-tree approach, we analyze the evolutionary origins and distribution patterns of the promiscuous PWWP domain to understand the principles of its modular evolution and its existence in combination with other domains in higher-level protein architectures. We found that as a single module the PWWP domain occurs only in proteins with a limited, mainly, species-specific distribution. Earlier, it was suggested that domain promiscuity is a fast-changing (volatile) feature shaped by natural selection and that only a few domains retain their promiscuity status throughout evolution. In contrast, our data show that most of the multidomain PWWP combinations in extant multicellular organisms (humans or land plants) are present in their unicellular ancestral relatives suggesting they have been transmitted through evolution as conserved linear arrangements (??cassettes??). Among the most interesting biologically relevant results is the finding that the genes of the two plant Trithorax family subgroups (ATX1/2 and ATX3/4/5) have different phylogenetic origins. The two subgroups occur together in the earliest land plants Physcomitrella patens and Selaginella moellendorffii.

Conclusion

Gain/loss of a single PWWP domain is observed throughout evolution reflecting dynamic lineage- or species-specific events. In contrast, higher-level protein architectures involving the PWWP domain have survived as stable arrangements driven by evolutionary descent. The association of PWWP domains with the DNA methyltransferases in O. tauri and in the metazoan lineage seems to have occurred independently consistent with convergent evolution. Our results do not support models wherein more complex protein architectures involving the PWWP domain occur with the appearance of more evolutionarily advanced life forms.  相似文献   

4.
5.

Background  

The co-translational incorporation of selenocysteine into nascent polypeptides by recoding the UGA stop codon occurs in all domains of life. In eukaryotes, this event requires at least three specific factors: SECIS binding protein 2 (SBP2), a specific translation elongation factor (eEFSec), selenocysteinyl tRNA, and a cis -acting selenocysteine insertion sequence (SECIS) element in selenoprotein mRNAs. While the phylogenetic relationships of selenoprotein families and the evolution of selenocysteine usage are well documented, the evolutionary history of SECIS binding proteins has not been explored.  相似文献   

6.

Background  

Protein domains are the structural and functional units of proteins. The ability to parse proteins into different domains is important for effective classification, understanding of protein structure, function, and evolution and is hence biologically relevant. Several computational methods are available to identify domains in the sequence. Domain finding algorithms often employ stringent thresholds to recognize sequence domains. Identification of additional domains can be tedious involving intense computation and manual intervention but can lead to better understanding of overall biological function. In this context, the problem of identifying new domains in the unassigned regions of a protein sequence assumes a crucial importance.  相似文献   

7.
In this paper, the inventory presented for singlet CH (calponin homology/actin binding) domain containing human multidomain proteins [1] is extended to several duplex and one quadruplet CH containing forms. Invariably, the duplexes are located at the begin of the molecules. The regions connecting the two CH units suggest amino acid conservations which allows the placing of 18 duplex containing molecules into six groups wherein the gene for one member in each group created the others more recently by gene duplication. The ancient multidomain proteins, possibly, were primarily the result of an exon shuffling (transposition) mechanism that also guided the placing of the CH singlet or duplex domain at the amino end of the newly created proteins. A mechanism that creates pseudogenes could conceivably produce genes that encode multi-domain proteins. Intragenomic duplications (slippage) might have facilitated the occurrence of encoding repeats, thus allowing for the creation of multiple identical domains within one molecule. Gene duplication with subsequent modification and small domain gene recombination which formed multidomain proteins are important forces driving evolution.  相似文献   

8.

Background  

CC chemokine receptor proteins (CCR1 through CCR10) are seven-transmembrane G-protein coupled receptors whose signaling pathways are known for their important roles coordinating immune system responses through targeted trafficking of white blood cells. In addition, some of these receptors have been identified as fusion proteins for viral pathogens: for example, HIV-1 strains utilize CCR5, CCR2 and CCR3 proteins to obtain cellular entry in humans. The extracellular domains of these receptor proteins are involved in ligand-binding specificity as well as pathogen recognition interactions.  相似文献   

9.

Background

The engineering of fusion proteins has become increasingly important and most recently has formed the basis of many biosensors, protein purification systems, and classes of new drugs. Currently, most fusion proteins consist of three or fewer domains, however, more sophisticated designs could easily involve three or more domains. Using traditional subcloning strategies, this requires micromanagement of restriction enzymes sites that results in complex workaround solutions, if any at all.

Results

Therefore, to aid in the efficient construction of fusion proteins involving multiple domains, we have created a new expression vector that allows us to rapidly generate a library of cassettes. Cassettes have a standard vector structure based on four specific restriction endonuclease sites and using a subtle property of blunt or compatible cohesive end restriction enzymes, they can be fused in any order and number of times. Furthermore, the insertion of PCR products into our expression vector or the recombination of cassettes can be dramatically simplified by screening for the presence or absence of fluorescence.

Conclusions

Finally, the utility of this new strategy was demonstrated by the creation of basic cassettes for protein targeting to subcellular organelles and for protein purification using multiple affinity tags.
  相似文献   

10.

Background  

FYVE domains have emerged as membrane-targeting domains highly specific for phosphatidylinositol 3-phosphate (PtdIns(3)P). They are predominantly found in proteins involved in various trafficking pathways. Although FYVE domains may function as individual modules, dimers or in partnership with other proteins, structurally, all FYVE domains share a fold comprising two small characteristic double-stranded β-sheets, and a C-terminal α-helix, which houses eight conserved Zn2+ ion-binding cysteines. To date, the structural, biochemical, and biophysical mechanisms for subcellular targeting of FYVE domains for proteins from various model organisms have been worked out but plant FYVE domains remain noticeably under-investigated.  相似文献   

11.

Background  

PDZ domain is a well-conserved, structural protein domain found in hundreds of signaling proteins that are otherwise unrelated. PDZ domains can bind to the C-terminal peptides of different proteins and act as glue, clustering different protein complexes together, targeting specific proteins and routing these proteins in signaling pathways. These domains are classified into classes I, II and III, depending on their binding partners and the nature of bonds formed. Binding specificities of PDZ domains are very crucial in order to understand the complexity of signaling pathways. It is still an open question how these domains recognize and bind their partners.  相似文献   

12.

Background  

Protein domains coordinate to perform multifaceted cellular functions, and domain combinations serve as the functional building blocks of the cell. The available methods to identify functional domain combinations are limited in their scope, e.g. to the identification of combinations falling within individual proteins or within specific regions in a translated genome. Further effort is needed to identify groups of domains that span across two or more proteins and are linked by a cooperative function. Such functional domain combinations can be useful for protein annotation.  相似文献   

13.
Multidomain proteins form in evolution through the concatenation of domains, but structural domains may comprise multiple segments of the chain. In this work, we demonstrate that new multidomain architectures can evolve by an apparent three-dimensional swap of segments between structurally similar domains within a single-chain monomer. By a comprehensive structural search of the current Protein Data Bank (PDB), we identified 32 well-defined segment-swapped proteins (SSPs) belonging to 18 structural families. Nearly 13% of all multidomain proteins in the PDB may have a segment-swapped evolutionary precursor as estimated by more permissive searching criteria. The formation of SSPs can be explained by two principal evolutionary mechanisms: (i) domain swapping and fusion (DSF) and (ii) circular permutation (CP). By large-scale comparative analyses using structural alignment and hidden Markov model methods, it was found that the majority of SSPs have evolved via the DSF mechanism, and a much smaller fraction, via CP. Functional analyses further revealed that segment swapping, which results in two linkers connecting the domains, may impart directed flexibility to multidomain proteins and contributes to the development of new functions. Thus, inter-domain segment swapping represents a novel general mechanism by which new protein folds and multidomain architectures arise in evolution, and SSPs have structural and functional properties that make them worth defining as a separate group.  相似文献   

14.

Background  

Small heat shock proteins (sHSPs) are products of heat shock response and of other stress responses, and ubiquitous in all three domains of life, archaea, bacteria, and eukarya. They mainly function as molecular chaperones to protect proteins from being denatured in extreme conditions. Study on insect sHSPs could provide some insights into evolution of insects that have adapted to diverse niches in the world.  相似文献   

15.
The fusion of soluble partner to the N terminus of aggregation-prone polypeptide has been popularly used to overcome the formation of inclusion bodies in the E. coli cytosol. The chaperone-like functions of the upstream fusion partner in the artificial multidomain proteins could occur in de novo folding of native multidomain proteins. Here, we show that the N-terminal domains of three E. coli multidomain proteins such as lysyl-tRNA synthetase, threonyl-tRNA synthetase, and aconitase are potent solubility enhancers for various C-terminal heterologous proteins. The results suggest that the N-terminal domains could act as solubility enhancers for the folding of their authentic C-terminal domains in vivo. Tandem repeat of N-terminal domain or insertion of aspartic residues at the C terminus of the N-terminal domain also increased the solubility of fusion proteins, suggesting that the solubilizing ability correlates with the size and charge of N-terminal domains. The solubilizing ability of N-terminal domains would contribute to the autonomous folding of multidomain proteins in vivo, and based on these results, we propose a model of how N-terminal domains solubilize their downstream domains.  相似文献   

16.
During evolution, many new proteins have been formed by the process of gene duplication and combination. The genes involved in this process usually code for whole domains. Small proteins contain one domain; medium and large proteins contain two or more domains. We have compared homologous domains that occur in both one-domain proteins and multidomain proteins. We have determined (1) how the functions of the individual domains in the multidomain proteins combine to produce their overall functions and (2) the extent to which these functions are similar to those in the one-domain homologs. We describe how domain combinations increase the specificity of enzymes; act as links between domains that have functional roles; regulate activity; combine within one chain functions that can act either independently, in concert or in new contexts; and provide the structural framework for the evolution of entirely new functions.  相似文献   

17.

Background  

In this paper we describe an analysis of the size evolution of both protein domains and their indels, as inferred by changing sizes of whole domains or individual unaligned regions or "spacers". We studied relatively early evolutionary events and focused on protein domains which are conserved among various taxonomy groups.  相似文献   

18.
Tordai H  Nagy A  Farkas K  Bányai L  Patthy L 《The FEBS journal》2005,272(19):5064-5078
Originally the term 'protein module' was coined to distinguish mobile domains that frequently occur as building blocks of diverse multidomain proteins from 'static' domains that usually exist only as stand-alone units of single-domain proteins. Despite the widespread use of the term 'mobile domain', the distinction between static and mobile domains is rather vague as it is not easy to quantify the mobility of domains. In the present work we show that the most appropriate measure of the mobility of domains is the number of types of local environments in which a given domain is present. Ranking of domains with respect to this parameter in different evolutionary lineages highlighted marked differences in the propensity of domains to form multidomain proteins. Our analyses have also shown that there is a correlation between domain size and domain mobility: smaller domains are more likely to be used in the construction of multidomain proteins, whereas larger domains are more likely to be static, stand-alone domains. It is also shown that shuffling of a limited set of modules was facilitated by intronic recombination in the metazoan lineage and this has contributed significantly to the emergence of novel complex multidomain proteins, novel functions and increased organismic complexity of metazoa.  相似文献   

19.

Background

In prokaryotes and some eukaryotes, genetic material can be transferred laterally among unrelated lineages and recombined into new host genomes, providing metabolic and physiological novelty. Although the process is usually framed in terms of gene sharing (e.g. lateral gene transfer, LGT), there is little reason to imagine that the units of transfer and recombination correspond to entire, intact genes. Proteins often consist of one or more spatially compact structural regions (domains) which may fold autonomously and which, singly or in combination, confer the protein''s specific functions. As LGT is frequent in strongly selective environments and natural selection is based on function, we hypothesized that domains might also serve as modules of genetic transfer, i.e. that regions of DNA that are transferred and recombined between lineages might encode intact structural domains of proteins.

Methodology/Principal Findings

We selected 1,462 orthologous gene sets representing 144 prokaryotic genomes, and applied a rigorous two-stage approach to identify recombination breakpoints within these sequences. Recombination breakpoints are very significantly over-represented in gene sets within which protein domain-encoding regions have been annotated. Within these gene sets, breakpoints significantly avoid the domain-encoding regions (domons), except where these regions constitute most of the sequence length. Recombination breakpoints that fall within longer domons are distributed uniformly at random, but those that fall within shorter domons may show a slight tendency to avoid the domon midpoint. As we find no evidence for differential selection against nucleotide substitutions following the recombination event, any bias against disruption of domains must be a consequence of the recombination event per se.

Conclusions/Significance

This is the first systematic study relating the units of LGT to structural features at the protein level. Many genes have been interrupted by recombination following inter-lineage genetic transfer, during which the regions within these genes that encode protein domains have not been preferentially preserved intact. Protein domains are units of function, but domons are not modules of transfer and recombination. Our results demonstrate that LGT can remodel even the most functionally conservative modules within genomes.  相似文献   

20.

Background  

Ever since the discovery of 'genes in pieces' and mRNA splicing in eukaryotes, origin and evolution of spliceosomal introns have been considered within the conceptual framework of the 'introns early' versus 'introns late' debate. The 'introns early' hypothesis, which is closely linked to the so-called exon theory of gene evolution, posits that protein-coding genes were interrupted by numerous introns even at the earliest stages of life's evolution and that introns played a major role in the origin of proteins by facilitating recombination of sequences coding for small protein/peptide modules. Under this scenario, the absence of spliceosomal introns in prokaryotes is considered to be a result of "genome streamlining". The 'introns late' hypothesis counters that spliceosomal introns emerged only in eukaryotes, and moreover, have been inserted into protein-coding genes continuously throughout the evolution of eukaryotes. Beyond the formal dilemma, the more substantial side of this debate has to do with possible roles of introns in the evolution of eukaryotes.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号