首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 562 毫秒
1.
Protein interactions are fundamental to the functioning of cells, and high throughput experimental and computational strategies are sought to map interactions. Predicting interaction specificity, such as matching members of a ligand family to specific members of a receptor family, is largely an unsolved problem. Here we show that by using evolutionary relationships within such families, it is possible to predict their physical interaction specificities. We introduce the computational method of matrix alignment for finding the optimal alignment between protein family similarity matrices. A second method, 3D embedding, allows visualization of interacting partners via spatial representation of the protein families. These methods essentially align phylogenetic trees of interacting protein families to define specific interaction partners. Prediction accuracy depends strongly on phylogenetic tree complexity, as measured with information theoretic methods. These results, along with simulations of protein evolution, suggest a model for the evolution of interacting protein families in which interaction partners are duplicated in coupled processes. Using these methods, it is possible to successfully find protein interaction specificities, as demonstrated for >18 protein families.  相似文献   

2.
Natural history and functional divergence of protein tyrosine kinases   总被引:3,自引:0,他引:3  
Gu J  Gu X 《Gene》2003,317(1-2):49-57
Cellular signaling is important for many biological processes including growth, differentiation, adhesion, motility and apoptosis. The protein tyrosine kinase (PTK) supergene family is the key mediator in cellular signaling in metazoans, directly associated with a variety of human diseases. All PTKs contain a highly conserved catalytic kinase domain, in spite of variable multi-domain structures. Within each PTK gene family, members exhibit functional divergence in substrate-specificity or temporal/tissue-specific expression, although their primary function is conserved. After conducting phylogenetic analysis on major PTK gene families, we found that the expanding of each PTK family was likely caused by gene or genome duplication event(s) that occurred before the emergence of teleosts but after the vertebrate-amphioxus split. We further investigated the evolutionary pattern of functional divergence after gene duplication in those gene families. Our results show that site-specific shifted evolutionary rate (altered functional constraint) is a common pattern in PTK gene family evolution.  相似文献   

3.
4.
Gene duplication is a key mechanism for the adaptive evolution and neofunctionalization of gene families. Large multigene families often exhibit complex evolutionary histories as a result of frequent gene duplication acting in concordance with positive selection pressures. Alterations in the domain structure of genes, causing changes in the molecular scaffold of proteins, can also result in a complex evolutionary history and has been observed in functionally diverse multigene toxin families. Here, we investigate the role alterations in domain structure have on the tempo of evolution and neofunctionalization of multigene families using the snake venom metalloproteinases (SVMPs) as a model system. Our results reveal that the evolutionary history of viperid (Serpentes: Viperidae) SVMPs is repeatedly punctuated by domain loss, with the single loss of the cysteine-rich domain, facilitating the formation of P-II class SVMPs, occurring prior to the convergent loss of the disintegrin domain to form multiple P-I SVMP structures. Notably, the majority of phylogenetic branches where domain loss was inferred to have occurred exhibited highly significant evidence of positive selection in surface-exposed amino acid residues, resulting in the neofunctionalization of P-II and P-I SVMP classes. These results provide a valuable insight into the mechanisms by which complex gene families evolve and detail how the loss of domain structures can catalyze the accelerated evolution of novel gene paralogues. The ensuing generation of differing molecular scaffolds encoded by the same multigene family facilitates gene neofunctionalization while presenting an evolutionary advantage through the retention of multiple genes capable of encoding functionally distinct proteins.  相似文献   

5.
MOTIVATION: Functional linkages implicate pairwise relationships between proteins that work together to implement biological tasks. During evolution, functionally linked proteins are likely to be preserved or eliminated across a range of genomes in a correlated fashion. Based on this hypothesis, phylogenetic profiling-based approaches try to detect pairs of protein families that show similar evolutionary patterns. Traditionally, the evolutionary pattern of a protein is encoded by either a binary profile of presence and absence of this protein across species or an occurrence profile that indicates the distribution of copies of this protein across species. RESULTS: In our study, we characterize each protein by its enhanced phylogenetic tree, a novel graphical model of the evolution of a protein family with explicitly marked by speciation and duplication events. By topological comparison between enhanced phylogenetic trees, we are able to detect the functionally associated protein pairs. Because the enhanced phylogenetic trees contain more evolutionary information of proteins, our method shows greater performance and discovers functional linkages among proteins more reliably compared with the conventional approaches.  相似文献   

6.
The complete sequence of the Arabidopsis genome enables definitive characterization of multigene families and analysis of their phylogenetic relationships. Using a consensus sequence previously defined for glycosyltransferases that use small-molecular-weight acceptors, 107 gene sequences were identified in the Arabidopsis genome and used to construct a phylogenetic tree. Screening recombinant proteins for their catalytic activities in vitro has revealed enzymes active toward physiologically important substrates, including hormones and secondary metabolites. The aim of this study has been to use the phylogenetic relationships across the entire family to explore the evolution of substrate recognition and regioselectivity of glucosylation. Hydroxycoumarins have been used as the model substrates for the analysis in which 90 sequences have been assayed and 48 sequences shown to recognize these compounds. The study has revealed activity in 6 of the 14 phylogenetic groups of the multigene family, suggesting that basic features of substrate recognition are retained across substantial evolutionary periods.  相似文献   

7.
8.
NCS1 proteins are H+/Na+ symporters specific for the uptake of purines, pyrimidines and related metabolites. In this article, we study the origin, diversification and substrate specificity of fungal NCS1 transporters. We show that the two fungal NCS1 sub‐families, Fur and Fcy, and plant homologues originate through independent horizontal transfers from prokaryotes and that expansion by gene duplication led to the functional diversification of fungal NCS1. We characterised all Fur proteins of the model fungus Aspergillus nidulans and discovered novel functions and specificities. Homology modelling, substrate docking, molecular dynamics and systematic mutational analysis in three Fur transporters with distinct specificities identified residues critical for function and specificity, located within a major substrate binding site, in transmembrane segments TMS1, TMS3, TMS6 and TMS8. Most importantly, we predict and confirm that residues determining substrate specificity are located not only in the major substrate binding site, but also in a putative outward‐facing selective gate. Our evolutionary and structure‐function analysis contributes in the understanding of the molecular mechanisms underlying the functional diversification of eukaryotic NCS1 transporters, and in particular, forward the concept that selective channel‐like gates might contribute to substrate specificity.  相似文献   

9.
The interleukin-1 receptor-associated kinase (IRAK) family comprises critical signaling mediators of the TLR/IL-1R signaling pathways. IRAKs are Ser/Thr kinases. There are 4 members in the vertebrate genome (IRAK1, IRAK2, IRAKM, and IRAK4) and an IRAK homolog, Pelle, in insects. IRAK family members are highly conserved in vertebrates, but the evolutionary relationship between IRAKs in vertebrates and insects is not clear. To investigate the evolutionary history and functional divergence of IRAK members, we performed extensive bioinformatics analysis. The phylogenetic relationship between IRAK sequences suggests that gene duplication events occurred in the evolutionary lineage, leading to early vertebrates. A comparative phylogenetic analysis with insect homologs of IRAKs suggests that the Tube protein is a homolog of IRAK4, unlike the anticipated protein, Pelle. Furthermore, the analysis supports that an IRAK4-like kinase is an ancestral protein in the metazoan lineage of the IRAK family. Through functional analysis, several potentially diverged sites were identified in the common death domain and kinase domain. These sites have been constrained during evolution by strong purifying selection, suggesting their functional importance within IRAKs. In summary, our study highlighted the molecular evolution of the IRAK family, predicted the amino acids that contributed to functional divergence, and identified structural variations among the IRAK paralogs that may provide a starting point for further experimental investigations.  相似文献   

10.
Orthologs typically retain the same function in the course of evolution. Using beta-decarboxylating dehydrogenase family as a model, we demonstrate that orthologs can be confidently identified. The strategy is based on our recent findings that substitutions of only a few amino acid residues in these enzymes are sufficient to exchange substrate and coenzyme specificities. Hence, the few major specificity determinants can serve as reliable markers for determining orthologous or paralogous relationships. The power of this approach has been demonstrated by correcting similarity-based functional misassignment and discovering new genes and related pathways, and should be broadly applicable to other enzyme families.  相似文献   

11.
Zhou D  Zhou J  Meng L  Wang Q  Xie H  Guan Y  Ma Z  Zhong Y  Chen F  Liu J 《Gene》2009,441(1-2):36-44
Plants have evolved diverse adaptive mechanisms that enable them to tolerate abiotic stresses, to varying degrees, and such stresses may have strongly influenced evolutionary changes at levels ranging from molecular to morphological. Previous studies on these phenomena have focused on the adaptive evolution of stress-related orthologous genes in specific lineages. However, heterogenetic evolution of the paralogous genes following duplication has only been examined in a very limited number of stress-response gene families. The COR15 gene encodes a low molecular weight protein that plays an important role in protecting plants from cold stresses. Although two different copies of this gene have been found in the model species, Arabidopsis thaliana, evolutionary patterns of this small gene family in plants have not been previously explored. In this study, we cloned COR15-like sequences and performed evolutionary analyses of these sequences (including those previously reported) in the highly cold-tolerant Draba lineage and related lineages of Brassicaceae. Our phylogenetic analyses indicate that all COR15-like sequences clustered into four clades that corresponded well to the morphological lineages. Gene conversions were found to have probably occurred before/during the divergence of Brassica and Draba lineage. However, repeated, independent duplications of this gene have occurred in different lineages of Brassicaceae. Further comparisons of all sequences suggest that there have been significant inter-lineage differences in evolutionary rates between the duplicated and original genes. We assessed the likelihood that the differences between two well-supported gene subfamilies that appear to have originated from a single duplication, COR15a and COR15b, within the Draba lineage have been driven by adaptive evolution. Comparisons of their non-synonymous/synonymous substitution ratios and rates of predicted amino acid changes indicate that these two gene groups are evolving under different selective pressures and may be functionally divergent. This functional divergence was confirmed by comparing site-specific shifts in evolution indexes of the two groups of predicted proteins. The evidence of differential selection and possible functional divergence suggests that the duplication may be of adaptive significance, with possible implications for the explosive diversification of the Draba lineage during the cooling Quaternary stages and the following worldwide colonization of arid alpine and artic regions.  相似文献   

12.
Kinases that catalyze phosphorylation of sugars, called here sugar kinases, can be divided into at least three distinct nonhomologous families. The first is the hexokinase family, which contains many prokaryotic and eukaryotic sugar kinases with diverse specificities, including a new member, rhamnokinase from Salmonella typhimurium. The three-dimensional structure of hexokinase is known and can be used to build models of functionally important regions of other kinases in this family. The second is the ribokinase family, of unknown three-dimensional structure, and comprises pro- and eukaryotic ribokinases, bacterial fructokinases, the minor 6-phosphofructokinase 2 from Escherichia coli, 6-phosphotagatokinase, 1-phosphofructokinase, and, possibly, inosine-guanosine kinase. The third family, also of unknown three-dimensional structure, contains several bacterial and yeast galactokinases and eukaryotic mevalonate and phosphomevalonate kinases and may have a substrate binding region in common with homoserine kinases. Each of the three families of sugar kinases appears to have a distinct three-dimensional fold, since conserved sequence patterns are strikingly different for the three families. Yet each catalyzes chemically equivalent reactions on similar or identical substrates. The enzymatic function of sugar phosphorylation appears to have evolved independently on the three distinct structural frameworks, by convergent evolution. In addition, evolutionary trees reveal that (1) fructokinase specificity has evolved independently in both the hexokinase and ribokinase families and (2) glucose specificity has evolved independently in different branches of the hexokinase family. These are examples of independent Darwinian adaptation of a structure to the same substrate at different evolutionary times. The flexible combination of active sites and three-dimensional folds observed in nature can be exploited by protein engineers in designing and optimizing enzymatic function.  相似文献   

13.
A major component of the plant nuclear genome is constituted by different classes of repetitive DNA sequences. The structural, functional and evolutionary aspects of the satellite repetitive DNA families, and their organization in the chromosomes is reviewed. The tandem satellite DNA sequences exhibit characteristic chromosomal locations, usually at subtelomeric and centromeric regions. The repetitive DNA family(ies) may be widely distributed in a taxonomic family or a genus, or may be specific for a species, genome or even a chromosome. They may acquire large-scale variations in their sequence and copy number over an evolutionary time-scale. These features have formed the basis of extensive utilization of repetitive sequences for taxonomic and phylogenetic studies. Hybrid polyploids have especially proven to be excellent models for studying the evolution of repetitive DNA sequences. Recent studies explicitly show that some repetitive DNA families localized at the telomeres and centromeres have acquired important structural and functional significance. The repetitive elements are under different evolutionary constraints as compared to the genes. Satellite DNA families are thought to arise de novo as a consequence of molecular mechanisms such as unequal crossing over, rolling circle amplification, replication slippage and mutation that constitute "molecular drive".  相似文献   

14.
The subfamily Iα aminotransferases are typically categorized as having narrow specificity toward carboxylic amino acids (AATases), or broad specificity that includes aromatic amino acid substrates (TATases). Because of their general role in central metabolism and, more specifically, their association with liver‐related diseases in humans, this subfamily is biologically interesting. The substrate specificities for only a few members of this subfamily have been reported, and the reliable prediction of substrate specificity from protein sequence has remained elusive. In this study, a diverse set of aminotransferases was chosen for characterization based on a scoring system that measures the sequence divergence of the active site. The enzymes that were experimentally characterized include both narrow‐specificity AATases and broad‐specificity TATases, as well as AATases with broader‐specificity and TATases with narrower‐specificity than the previously known family members. Molecular function and phylogenetic analyses underscored the complexity of this family's evolution as the TATase function does not follow a single evolutionary thread, but rather appears independently multiple times during the evolution of the subfamily. The additional functional characterizations described in this article, alongside a detailed sequence and phylogenetic analysis, provide some novel clues to understanding the evolutionary mechanisms at work in this family. Proteins 2013. © 2013 Wiley Periodicals, Inc.  相似文献   

15.
Cellular processes are highly interconnected and many proteins are shared in different pathways. Some of these shared proteins or protein families may interact with diverse partners using the same interface regions; such multibinding proteins are the subject of our study. The main goal of our study is to attempt to decipher the mechanisms of specific molecular recognition of multiple diverse partners by promiscuous protein regions. To address this, we attempt to analyze the physicochemical properties of multibinding interfaces and highlight the major mechanisms of functional switches realized through multibinding. We find that only 5% of protein families in the structure database have multibinding interfaces, and multibinding interfaces do not show any higher sequence conservation compared with the background interface sites. We highlight several important functional mechanisms utilized by multibinding families. (a) Overlap between different functional pathways can be prevented by the switches involving nearby residues of the same interfacial region. (b) Interfaces can be reused in pathways where the substrate should be passed from one protein to another sequentially. (c) The same protein family can develop different specificities toward different binding partners reusing the same interface; and finally, (d) inhibitors can attach to substrate binding sites as substrate mimicry and thereby prevent substrate binding.  相似文献   

16.
The "A Disintegrin And Metalloproteinase" (ADAM) protein family and the "A Disintegrin-like And Metalloproteinase with ThromboSpondin motifs" (ADAMTS) protein family are two related families of human proteins. The similarities and differences between these two families have been investigated using phylogenetic trees and homology modeling. The phylogenetic analysis indicates that the two families are well differentiated, even when only the common metalloprotease domain is taken into account. Within the ADAM family, several proteins are lacking the binding motif for the catalytic zinc in the active site and thus presumably lack any catalytic activity. These proteins tend to cluster within the ADAM phylogenetic tree and are expressed in specific tissues, suggesting a functional differentiation. The present analysis allows us to propose the following: (i) ADAMTS proteins have a conserved role in the human organism as proteases, with some differentiation in terms of substrate specificity; (ii) ADAM proteins can act as proteases and/or mediators of intermolecular interactions; (iii) proteolytically active ADAMs tend to be more ubiquitously expressed than the inactive ones.  相似文献   

17.
A comprehensive classification system for transmembrane molecular transporters has been developed and recently approved by the transport panel of the nomenclature committee of the International Union of Biochemistry and Molecular Biology. This system is based on (i) transporter class and subclass (mode of transport and energy coupling mechanism), (ii) protein phylogenetic family and subfamily, and (iii) substrate specificity. Almost all of the more than 250 identified families of transporters include members that function exclusively in transport. Channels (115 families), secondary active transporters (uniporters, symporters, and antiporters) (78 families), primary active transporters (23 families), group translocators (6 families), and transport proteins of ill-defined function or of unknown mechanism (51 families) constitute distinct categories. Transport mode and energy coupling prove to be relatively immutable characteristics and therefore provide primary bases for classification. Phylogenetic grouping reflects structure, function, mechanism, and often substrate specificity and therefore provides a reliable secondary basis for classification. Substrate specificity and polarity of transport prove to be more readily altered during evolutionary history and therefore provide a tertiary basis for classification. With very few exceptions, a phylogenetic family of transporters includes members that function by a single transport mode and energy coupling mechanism, although a variety of substrates may be transported, sometimes with either inwardly or outwardly directed polarity. In this review, I provide cross-referencing of well-characterized constituent transporters according to (i) transport mode, (ii) energy coupling mechanism, (iii) phylogenetic grouping, and (iv) substrates transported. The structural features and distribution of recognized family members throughout the living world are also evaluated. The tabulations should facilitate familial and functional assignments of newly sequenced transport proteins that will result from future genome sequencing projects.  相似文献   

18.
A class of UDP-glycosyltransferases (UGTs) defined by the presence of a C-terminal consensus sequence is found throughout the plant and animal kingdoms. Whereas mammalian enzymes use UDP-glucuronic acid, the plant enzymes typically use UDP-glucose in the transfer reactions. A diverse array of aglycones can be glucosylated by these UGTs. In plants, the aglycones include plant hormones, secondary metabolites involved in stress and defense responses, and xenobiotics such as herbicides. Glycosylation is known to regulate many properties of the aglycones such as their bioactivity, their solubility, and their transport properties within the cell and throughout the plant. As a means of providing a framework to start to understand the substrate specificities and structure-function relationships of plant UGTs, we have now applied a molecular phylogenetic analysis to the multigene family of 99 UGT sequences in Arabidopsis. We have determined the overall organization and evolutionary relationships among individual members with a surprisingly high degree of confidence. Through constructing a composite phylogenetic tree that also includes all of the additional plant UGTs with known catalytic activities, we can start to predict both the evolutionary history and substrate specificities of new sequences as they are identified. The tree already suggests that while the activities of some subgroups of the UGT family are highly conserved among different plant species, others subgroups shift substrate specificity with relative ease.  相似文献   

19.
Sequence similarity and profile searching tools were used to analyze the genome sequences of Arabidopsis thaliana, Saccharomyces cerevisiae, Schizosaccharomyces pombe, Caenorhabditis elegans and Drosophila melanogaster for genes encoding three families of histone deacetylase (HDAC) proteins and three families of histone acetyltransferase (HAT) proteins. Plants, animals and fungi were found to have a single member of each of three subfamilies of the GNAT family of HATs, suggesting conservation of these functions. However, major differences were found with respect to sizes of gene families and multi-domain protein structures within other families of HATs and HDACs, indicating substantial evolutionary diversification. Phylogenetic analysis identified a new class of HDACs within the RPD3/HDA1 family that is represented only in plants and animals. A similar analysis of the plant-specific HD2 family of HDACs suggests a duplication event early in dicot evolution, followed by further diversification in the lineage leading to Arabidopsis. Of three major classes of SIR2-type HDACs that are found in animals, fungi have representatives only in one class, whereas plants have representatives only in the other two. Plants possess five CREB-binding protein (CBP)-type HATs compared with one to two in animals and none in fungi. Domain and phylogenetic analyses of the CBP family proteins showed that this family has evolved three distinct types of CBPs in plants. The domain architecture of CBP and TAF(II)250 families of HATs show significant differences between plants and animals, most notably with respect to bromodomain occurrence and their number. Bromodomain-containing proteins in Arabidopsis differ strikingly from animal bromodomain proteins with respect to the numbers of bromodomains and the other types of domains that are present. The substantial diversification of HATs and HDACs that has occurred since the divergence of plants, animals and fungi suggests a surprising degree of evolutionary plasticity and functional diversification in these core chromatin components.  相似文献   

20.
Sequences and structures of all P-loop-fold proteins were compared with the aim of reconstructing the principal events in the evolution of P-loop-containing kinases. It is shown that kinases and some related proteins comprise a monophyletic assemblage within the P-loop NTPase fold. An evolutionary classification of these proteins was developed using standard phylogenetic methods, analysis of shared sequence and structural signatures, and similarity-based clustering. This analysis resulted in the identification of approximately 40 distinct protein families within the P-loop kinase class. Most of these enzymes phosphorylate nucleosides and nucleotides, as well as sugars, coenzyme precursors, adenosine 5'-phosphosulfate and polynucleotides. In addition, the class includes sulfotransferases, amide bond ligases, pyrimidine and dihydrofolate reductases, and several other families of enzymes that have acquired new catalytic capabilities distinct from the ancestral kinase reaction. Our reconstruction of the early history of the P-loop NTPase fold includes the initial split into the common ancestor of the kinase and the GTPase classes, and the common ancestor of ATPases. This was followed by the divergence of the kinases, which primarily phosphorylated nucleoside monophosphates (NMP), but could have had broader specificity. We provide evidence for the presence of at least two to four distinct P-loop kinases, including distinct forms specific for dNMP and rNMP, and related enzymes in the last universal common ancestor of all extant life forms. Subsequent evolution of kinases seems to have been dominated by the emergence of new bacterial and, to a lesser extent, archaeal families. Some of these enzymes retained their kinase activity but evolved new substrate specificities, whereas others acquired new activities, such as sulfate transfer and reduction. Eukaryotes appear to have acquired most of their kinases via horizontal gene transfer from Bacteria, partly from the mitochondrial and chloroplast endosymbionts and partly at later stages of evolution. A distinct superfamily of kinases, which we designated DxTN after its sequence signature, appears to have evolved in selfish replicons, such as bacteriophages, and was subsequently widely recruited by eukaryotes for multiple functions related to nucleic acid processing and general metabolism. In the course of this analysis, several previously undetected groups of predicted kinases were identified, including widespread archaeo-eukaryotic and archaeal families. The results could serve as a framework for systematic experimental characterization of new biochemical and biological functions of kinases.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号