首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 609 毫秒
1.
2.

Background

The members of cupin superfamily exhibit large variations in their sequences, functions, organization of domains, quaternary associations and the nature of bound metal ion, despite having a conserved β-barrel structural scaffold. Here, an attempt has been made to understand structure-function relationships among the members of this diverse superfamily and identify the principles governing functional diversity. The cupin superfamily also contains proteins for which the structures are available through world-wide structural genomics initiatives but characterized as “hypothetical”. We have explored the feasibility of obtaining clues to functions of such proteins by means of comparative analysis with cupins of known structure and function.

Methodology/Principal Findings

A 3-D structure-based phylogenetic approach was undertaken. Interestingly, a dendrogram generated solely on the basis of structural dissimilarity measure at the level of domain folds was found to cluster functionally similar members. This clustering also reflects an independent evolution of the two domains in bicupins. Close examination of structural superposition of members across various functional clusters reveals structural variations in regions that not only form the active site pocket but are also involved in interaction with another domain in the same polypeptide or in the oligomer.

Conclusions/Significance

Structure-based phylogeny of cupins can influence identification of functions of proteins of yet unknown function with cupin fold. This approach can be extended to other proteins with a common fold that show high evolutionary divergence. This approach is expected to have an influence on the function annotation in structural genomics initiatives.  相似文献   

3.
The restriction endonuclease (REase) R. HphI is a Type IIS enzyme that recognizes the asymmetric target DNA sequence 5'-GGTGA-3' and in the presence of Mg(2+) hydrolyzes phosphodiester bonds in both strands of the DNA at a distance of 8 nucleotides towards the 3' side of the target, producing a 1 nucleotide 3'-staggered cut in an unspecified sequence at this position. REases are typically ORFans that exhibit little similarity to each other and to any proteins in the database. However, bioinformatics analyses revealed that R.HphI is a member of a relatively big sequence family with a conserved C-terminal domain and a variable N-terminal domain. We predict that the C-terminal domains of proteins from this family correspond to the nuclease domain of the HNH superfamily rather than to the most common PD-(D/E)XK superfamily of nucleases. We constructed a three-dimensional model of the R.HphI catalytic domain and validated our predictions by site-directed mutagenesis and studies of DNA-binding and catalytic activities of the mutant proteins. We also analyzed the genomic neighborhood of R.HphI homologs and found that putative nucleases accompanied by a DNA methyltransferase (i.e. predicted REases) do not form a single group on a phylogenetic tree, but are dispersed among free-standing putative nucleases. This suggests that nucleases from the HNH superfamily were independently recruited to become REases in the context of RM systems multiple times in the evolution and that members of the HNH superfamily may be much more frequent among the so far unassigned REase sequences than previously thought.  相似文献   

4.
Lipocalins constitute a superfamily of extracellular proteins that are found in all three kingdoms of life. Although very divergent in their sequences and functions, they show remarkable similarity in 3-D structures. Lipocalins bind and transport small hydrophobic molecules. Earlier sequence-based phylogenetic studies of lipocalins highlighted that they have a long evolutionary history. However the molecular and structural basis of their functional diversity is not completely understood. The main objective of the present study is to understand functional diversity of the lipocalins using a structure-based phylogenetic approach. The present study with 39 protein domains from the lipocalin superfamily suggests that the clusters of lipocalins obtained by structure-based phylogeny correspond well with the functional diversity. The detailed analysis on each of the clusters and sub-clusters reveals that the 39 lipocalin domains cluster based on their mode of ligand binding though the clustering was performed on the basis of gross domain structure. The outliers in the phylogenetic tree are often from single member families. Also structure-based phylogenetic approach has provided pointers to assign putative function for the domains of unknown function in lipocalin family. The approach employed in the present study can be used in the future for the functional identification of new lipocalin proteins and may be extended to other protein families where members show poor sequence similarity but high structural similarity.  相似文献   

5.
Mechanisms of bacterial resistance to chromium compounds   总被引:1,自引:0,他引:1  
Chromium is a non-essential and well-known toxic metal for microorganisms and plants. The widespread industrial use of this heavy metal has caused it to be considered as a serious environmental pollutant. Chromium exists in nature as two main species, the trivalent form, Cr(III), which is relatively innocuous, and the hexavalent form, Cr(VI), considered a more toxic species. At the intracellular level, however, Cr(III) seems to be responsible for most toxic effects of chromium. Cr(VI) is usually present as the oxyanion chromate. Inhibition of sulfate membrane transport and oxidative damage to biomolecules are associated with the toxic effects of chromate in bacteria. Several bacterial mechanisms of resistance to chromate have been reported. The best characterized mechanisms comprise efflux of chromate ions from the cell cytoplasm and reduction of Cr(VI) to Cr(III). Chromate efflux by the ChrA transporter has been established in Pseudomonas aeruginosa and Cupriavidus metallidurans (formerly Alcaligenes eutrophus) and consists of an energy-dependent process driven by the membrane potential. The CHR protein family, which includes putative ChrA orthologs, currently contains about 135 sequences from all three domains of life. Chromate reduction is carried out by chromate reductases from diverse bacterial species generating Cr(III) that may be detoxified by other mechanisms. Most characterized enzymes belong to the widespread NAD(P)H-dependent flavoprotein family of reductases. Several examples of bacterial systems protecting from the oxidative stress caused by chromate have been described. Other mechanisms of bacterial resistance to chromate involve the expression of components of the machinery for repair of DNA damage, and systems related to the homeostasis of iron and sulfur.  相似文献   

6.
The Bunyaviridae family of enveloped RNA viruses includes five genuses, orthobunyaviruses, hantaviruses, phleboviruses, nairoviruses and tospoviruses. It has not been determined which Bunyavirus protein mediates virion:cell membrane fusion. Class II viral fusion proteins (beta-penetrenes), encoded by members of the Alphaviridae and Flaviviridae, are comprised of three antiparallel beta sheet domains with an internal fusion peptide located at the end of domain II. Proteomics computational analyses indicate that the carboxyl terminal glycoprotein (Gc) encoded by Sandfly fever virus (SAN), a phlebovirus, has a significant amino acid sequence similarity with envelope protein 1 (E1), the class II fusion protein of Sindbis virus (SIN), an Alphavirus. Similar sequences and common structural/functional motifs, including domains with a high propensity to interface with bilayer membranes, are located collinearly in SAN Gc and SIN E1. Gc encoded by members of each Bunyavirus genus share several sequence and structural motifs. These results suggest that Gc of Bunyaviridae, and similar proteins of Tenuiviruses and a group of Caenorhabditis elegans retroviruses, are class II viral fusion proteins. Comparisons of divergent viral fusion proteins can reveal features essential for virion:cell fusion, and suggest drug and vaccine strategies.  相似文献   

7.
The chromate resistance determinant of Pseudomonas aeruginosa plasmid pUM505 was cloned into broad-host-range vector pSUP104. The hybrid plasmid containing an 11.1-kilobase insert conferred chromate resistance and reduced uptake of chromate in P. aeruginosa PAO1. Resistance to chromate was not expressed in Escherichia coli. Contiguous 1.6- and 6.3-kilobase HindIII fragments from this plasmid hybridized to pUM505 but not to P. aeruginosa chromosomal DNA and only weakly to chromate resistance plasmids pLHB1 and pMG6. Further subcloning produced a plasmid with an insert of 2,145 base pairs, which was sequenced. Analysis of deletions revealed that a single open reading frame was sufficient to determine chromate resistance. This open reading frame encodes a highly hydrophobic polypeptide, ChrA, of 416 amino acid residues that appeared to be expressed in E. coli under control of the T7 promoter. No significant homology was found between ChrA and proteins in the amino acid sequence libraries, but 29% amino acid identity was found with the ChrA amino acid sequence for another chromate resistance determinant sequenced in this laboratory from an Alcaligenes eutrophus plasmid (A. Nies, D. Nies, and S. Silver, submitted for publication).  相似文献   

8.
We describe a small family of proteins, CHR, which contains members that function in chromate and/or sulfate transport. CHR proteins occur in bacteria and archaea. They consist of about 400 amino acyl residues, appear to have 10 transmembrane α-helical segments in an unusual 4+6 arrangement, and arose by an intragenic duplication event.  相似文献   

9.
A comprehensive, structural and functional, in silico analysis of the medium-chain dehydrogenase/reductase (MDR) superfamily, including 583 proteins, was carried out by use of extensive database mining and the blastp program in an iterative manner to identify all known members of the superfamily. Based on phylogenetic, sequence, and functional similarities, the protein members of the MDR superfamily were classified into three different taxonomic categories: (a) subfamilies, consisting of a closed group containing a set of ideally orthologous proteins that perform the same function; (b) families, each comprising a cluster of monophyletic subfamilies that possess significant sequence identity among them and might share or not common substrates or mechanisms of reaction; and (c) macrofamilies, each comprising a cluster of monophyletic protein families with protein members from the three domains of life, which includes at least one subfamily member that displays activity related to a very ancient metabolic pathway. In this context, a superfamily is a group of homologous protein families (and/or macrofamilies) with monophyletic origin that shares at least a barely detectable sequence similarity, but showing the same 3D fold. The MDR superfamily encloses three macrofamilies, with eight families and 49 subfamilies. These subfamilies exhibit great functional diversity including noncatalytic members with different subcellular, phylogenetic, and species distributions. This results from constant enzymogenesis and proteinogenesis within each kingdom, and highlights the huge plasticity that MDR superfamily members possess. Thus, through evolution a great number of taxa-specific new functions were acquired by MDRs. The generation of new functions fulfilled by proteins, can be considered as the essence of protein evolution. The mechanisms of protein evolution inside MDR are not constrained to conserve substrate specificity and/or chemistry of catalysis. In consequence, MDR functional diversity is more complex than sequence diversity. MDR is a very ancient protein superfamily that existed in the last universal common ancestor. It had at least two (and probably three) different ancestral activities related to formaldehyde metabolism and alcoholic fermentation. Eukaryotic members of this superfamily are more related to bacterial than to archaeal members; horizontal gene transfer among the domains of life appears to be a rare event in modern organisms.  相似文献   

10.
Chen G  Pan D  Zhou Y  Lin S  Ke X 《Journal of biosciences》2007,32(4):713-721
Most plant disease-resistance genes (R-genes) isolated so far encode proteins with a nucleotide binding site (NBS) domain and belong to a superfamily. NBS domains related to R-genes show a highly conserved backbone of an amino acid motif, which makes it possible to isolate resistance gene analogues (RGAs) by degenerate primers. Degenerate primers based on the conserved motif (P-loop and GLPL) of the NBS domain from R -genes were used to isolate RGAs from the genomic DNA of sweet potato cultivar Qingnong no.2. Five distinct clusters of RGAs (22 sequences) with the characteristic NBS representing a highly diverse sample were identified in sweet potato genomic DNA. Sequence identity among the 22 RGA nucleotide sequences ranged from 41.2% to 99.4%, while the deduced amino acid sequence identity from the 22 RGAs ranged from 20.6%to 100%. The analysis of sweet potato RGA sequences suggested mutation as the primary source of diversity. The phylogenetic analyses for RGA nucleotide sequences and deduced amino acids showed that RGAs from sweet potato were classified into two distinct groups--toll and interleukin receptor-1 (TIR)-NBS-LRR and non-TIR-NBS-LRR. The high degree of similarity between sweet potato RGAs and NBS sequences derived from R-genes cloned from tomato, tobacco, flax and potato suggest an ancestral relationship. Further studies showed that the ratio of non-synonymous to synonymous substitution within families was low. These data obtained from sweet potato suggest that the evolution of NBS-encoding sequences in sweet potato occur by the gradual accumulation of mutations leading to purifying selection and slow rates of divergence within distinct R-gene families.  相似文献   

11.
The human CD1 proteins belong to a lipid-glycolipid antigen-presenting gene family and are related in structure and function to the MHC class I molecules. Previous mapping and DNA hybridization studies have shown that five linked genes located within a cluster on human chromosome 1q22-23 encode the CD1 protein family. We have analyzed the complete genomic sequence of the human CD1 gene cluster and found that the five active genes are distributed over 175,600 nucleotides and separated by four expanded intervening genomic regions (IGRs) ranging in length between 20 and 68 kb. The IGRs are composed mostly of retroelements including five full-length L1 PA sequences and various pseudogenes. Some L1 sequences have acted as receptors for other subtypes or families of retroelements. Alu molecular clocks that have evolved during primate history are found distributed within the HLA class I duplicated segments (duplicons) but not within the duplicons of CD1. Phylogeny of the alpha3 domain of the class I-like superfamily of proteins shows that the CD1 cluster is well separated from HLA class I by a number of superfamily members including MIC (PERB11), HFE, Zn-alpha2-GP, FcRn, and MR1. Phylogenetically, the human CD1 sequences are interspersed by CD1 sequences from other mammalian species, whereas the human HLA class I sequences cluster together and are separated from the other mammalian sequences. Genomic and phylogenetic analyses support the view that the human CD1 gene copies were duplicated prior to the evolution of primates and the bulk of the HLA class I genes found in humans. In contrast to the HLA class I genomic structure, the human CD1 duplicons are smaller in size, they lack Alu clocks, and they are interrupted by IGRs at least 4 to 14 times longer than the CD1 genes themselves. The IGRs seem to have been created as "buffer zones" to protect the CD1 genes from disruption by transposable elements.  相似文献   

12.
NBS类植物抗病基因保守结构域的克隆为利用简并引物扩增抗病基因同源序列提供了可能.根据抗病基因Gro1-4、Gpa2、N等的P-loop和GLPL保守结构域设计简并引物,分离甘薯近缘野生种三浅裂野牵牛NBS类型抗病基因同源序列,共获得6条相关序列,核苷酸序列的相似性为48%~97%,推测氨基酸序列的相似性在25.2%~95.1%之间.系统进化分析表明,6条三浅裂野牵牛RGA序列可分为2个不同的类群:TIR-NBS和non-TIR-NBS.三浅裂野牵牛RGA序列与源自甘薯的RGA序列有很高的相似性,这在一定程度上反映了三浅裂野牵牛与甘薯之间的亲缘关系.分离的6条RGA序列分别命名为ItRGA1~ItRGA6,GenBank登录号分别为DQ849027~DQ849032.  相似文献   

13.
AAA ATPases form a large protein family with manifold cellular roles. They belong to the AAA+ superfamily of ringshaped P-loop NTPases, which exert their activity through the energy-dependent unfolding of macromolecules. Phylogenetic analyses have suggested the existence of five major clades of AAA domains (proteasome subunits, metalloproteases, domains D1 and D2 of ATPases with two AAA domains, and the MSP1/katanin/spastin group), as well as a number of deeply branching minor clades. These analyses however have been characterized by a lack of consistency in defining the boundaries of the AAA family. We have used cluster analysis to delineate unambiguously the group of AAA sequences within the AAA+ superfamily. Phylogenetic and cluster analysis of this sequence set revealed the existence of a sixth major AAA clade, comprising the mitochondrial, membrane-bound protein BCS1 and its homologues. In addition, we identified several deep branches consisting mainly of hypothetical proteins resulting from genomic projects. Analysis of the AAA N-domains provided direct support for the obtained phylogeny for most branches, but revealed some deep splits that had not been apparent from phylogenetic analysis and some unexpected similarities between distant clades. It also revealed highly degenerate D1 domains in plant MSP1 sequences and in at least one deeply branching group of hypothetical proteins (YC46), showing that AAA proteins with two ATPase domains arose at least three times independently.  相似文献   

14.
15.
Kinesin superfamily proteins (KIFs) are key players or 'hub' proteins in the intracellular transport system, which is essential for cellular function and morphology. The KIF superfamily is also the first large protein family in mammals whose constituents have been completely identified and confirmed both in silico and in vivo. Numerous studies have revealed the structures and functions of individual family members; however, the relationships between members or a perspective of the whole superfamily structure until recently remained elusive. Here, we present a comprehensive summary based on a large, systematic phylogenetic analysis of the kinesin superfamily. All available sequences in public databases, including genomic information from all model organisms, were analyzed to yield the most complete phylogenetic kinesin tree thus far, comprising 14 families. This comprehensive classification builds on the recently proposed standardized nomenclature for kinesins and allows systematic analysis of the structural and functional relationships within the kinesin superfamily.  相似文献   

16.
Ly-49 (YE1/48, A1) is a dimer protein expressed on subpopulations of murine NK cells. It is a member of a superfamily of type II transmembrane proteins containing carbohydrate recognition domains (CRD). In the mouse genome, the detection of multiple restriction fragments that cross-hybridize with Ly-49 cDNA probes suggests the presence of related genes. In this study, we have isolated several genomic clones encoding portions of CRD sequences highly homologous to the CRD of Ly-49. By using primers based on the consensus sequences of the genomic clones, expression of Ly-49-related genes was detected by the polymerase chain reaction in various organs, including lung, kidney, liver, spleen, and thymus. Two full-length cDNA clones that are highly homologous to the Ly-49 gene were subsequently isolated from a lung cDNA library. At the nucleotide level, the two clones are 72% and 80% identical to Ly-49 in their translated regions, but their sequences are different from those of the genomic clones characterized to date. The two cDNA clones potentially encode type II transmembrane proteins containing CRD that are very similar to Ly-49. These amino acid sequences are also homologous to other members of the superfamily of CRD-containing type II transmembrane proteins, including hepatic lectins and the low affinity IgER (CD23). The homology is most evident in the CRD but is also significant in other domains. These results demonstrate the existence of several functional genes that are highly related to Ly-49. These genes comprise a subfamily within the superfamily of type II transmembrane proteins containing CRD.  相似文献   

17.
The chrA gene of Pseudomonas aeruginosa plasmid pUM505 encodes the hydrophobic protein ChrA, which confers resistance to chromate by the energy-dependent efflux of chromate ions. Chromate-sensitive mutants were isolated by in vivo random mutagenesis. Transport experiments with cell suspensions of selected mutants showed that 51CrO4(2-) extrusion was drastically lowered as compared to suspensions of the strain with the wild-type plasmid, confirming that the mutations affected a chromate efflux system. DNA sequence analysis showed that most point mutations affected amino acids clustered in the N-terminal half of ChrA, altering either cytoplasmic regions or transmembrane segments, and replaced residues moderately to highly conserved in ChrA homologs. PhoA and LacZ translational fusions were used to confirm the membrane topology at the N-terminal half of the ChrA protein.  相似文献   

18.
番茄B3超家族成员鉴定及生物信息学分析   总被引:2,自引:0,他引:2  
B3超家族是一类含有B3功能域(与DNA结合的高度保守结构域)的转录因子,在植物生长发育过程中起重要作用。本研究采用生物信息学的方法,利用Pfam中的B3保守结构域序列检索番茄(Solanum lycopersicum L.)蛋白序列,确定了97个B3超家族基因。对番茄B3超家族成员进行了系统进化树分析、染色体定位、结构域分析、组织表达和诱导表达分析等。番茄B3超家族分为LAV、ARF、RAV和REM 4个亚家族,每个亚家族中的数量分别为4、22、9和62个,且在进化树中形成明显不同的分支,每个亚家族都进行了系统进化和结构域分析;番茄12条染色体都含有B3超家族基因;11个成员的表达模式表明,B3超家族同一亚家族成员也具有不同的时空表达模式;在干旱、盐和高温胁迫处理下,部分成员响应强烈并且响应不同的外界信号;而对于ABA处理响应非常弱。本研究将为B3基因超家族成员的生物学功能研究提供参考。  相似文献   

19.
《The Journal of cell biology》1989,109(4):1633-1641
We used chicken alpha spectrin as a ligand probe to isolate Drosophila beta spectrin cDNA sequences from a lambda gt11 expression library. Analysis of 800 residues of deduced amino acid sequence at the amino- terminal end revealed a strikingly conserved domain of integral of 230 residues that shows a high degree of sequence similarity to the amino- terminal domains of alpha actinin and dystrophin. This conserved domain constitutes a new diagnostic criterion for spectrin-related proteins and allows the known properties of one of these proteins to predict functional properties of the others. The conservation of the amino- terminal domain, and other regions in spectrin, alpha actinin, and dystrophin, demonstrates that a common set of domains were linked in different combinations through evolution to generate the distinctive members of the spectrin superfamily.  相似文献   

20.
OsNifU1A is a NifU-like rice (Oryza sativa) protein, discovered recently. Its amino acid sequence is very homologous to the sequence of cyanobacterial CnfU and to the sequences of NifU C-terminal domains. Based on its sequence, OsNifU1A is probably a modular structure consisting of two CnfU-like domains, with domain I (formed by residues Leu73 to Gly153) and domain II (formed by residues Leu154 to Ser226). Domain I have a conserved Cys-X-X-Cys motif, which may function as an iron-sulfur cluster assembly scaffold. Domain II lacks a Cys-X-X-Cys motif and therefore, cannot function analogously. Other NifU-like proteins, with sequences homologous to OsNifU1A domain II, have been identified during plant genomic projects; however, the biological roles of these domains remain unknown. We successfully constructed an Escherichia coli expression system for OsNifU1A domain II that enabled us to synthesize and purify milligram quantities of protein for use in structural and functional studies. Using the Gateway system, we built DNA sequences corresponding to two OsNifU1A domain II fusion proteins. One construct has a (His)6 sequence upstream of the OsNifU1A domain II sequence; the other has an upstream thioredoxin-(His)6 sequence. Recombinant OsNifU1A domain II fusion proteins were extracted from E. coli inclusion bodies by dissolving them in 6 M guanidine-HCl. About 36% of the total (His)6/OsNifU1A domain II fusion protein initially present remained soluble after guanidine-HCl was completely removed by step-wise dialysis; whereas, recovery of soluble Trx-(His)6 fusion protein was about 60% of the total cell lysate. About 2 mg of 15N-labeled OsNifU1A domain II was purified for NMR spectral studies. Examination of the OsNifU1A domain II 1H-15N HSQC NMR spectrum indicated that the purified protein was monomeric and correctly folded. Therefore, we established an efficient procedure for synthesis and purification of 15N-labeled OsNifU1A domain II in quantities sufficient for heteronuclear NMR solution structure studies.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号