首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.

Background  

PD-(D/E)XK nucleases constitute a large and highly diverse superfamily of enzymes that display little sequence similarity despite retaining a common core fold and a few critical active site residues. This makes identification of new PD-(D/E)XK nuclease families a challenging task as they usually escape detection with standard sequence-based methods. We developed a modified transitive meta profile search approach and to consider the structural diversity of PD-(D/E)XK nuclease fold more thoroughly we analyzed also lower than threshold Meta-BASIC hits to select potentially correct predictions placed among unreliable or incorrect ones.  相似文献   

2.
MOTIVATION: Restriction endonucleases (REases) and homing endonucleases (HEases) are biotechnologically important enzymes. Nearly all structurally characterized REases belong to the PD-(D/E)XK superfamily of nucleases, while most HEases belong to an unrelated LAGLIDADG superfamily. These two protein folds are typically associated with very different modes of protein-DNA recognition, consistent with the different mechanisms of action required to achieve high specificity. REases recognize short DNA sequences using multiple contacts per base pair, while HEases recognize very long sites using a few contacts per base pair, thereby allowing for partial degeneracy of the target sequence. Thus far, neither REases with the LAGLIDADG fold, nor HEases with the PD-(D/E)XK fold, have been found. RESULTS: Using protein fold recognition, we have identified the first member of the PD-(D/E)XK superfamily among homing endonucleases, a cyanobacterial enzyme I-Ssp6803I. We present a model of the I-Ssp6803I-DNA complex based on the structure of Type II restriction endonuclease R.BglI and predict the active site and residues involved in specific DNA sequence recognition by I-Ssp6803I. Our finding reveals a new unexpected evolutionary link between HEases and REases and suggests how PD-(D/E)XK nucleases may develop a 'HEase-like' way of interacting with the extended DNA sequence. This in turn may be exploited to study the evolution of DNA sequence specificity and to engineer nucleases with new substrate specificities.  相似文献   

3.
PD-(D/E)XK nucleases, initially represented by only Type II restriction enzymes, now comprise a large and extremely diverse superfamily of proteins. They participate in many different nucleic acids transactions including DNA degradation, recombination, repair and RNA processing. Different PD-(D/E)XK families, although sharing a structurally conserved core, typically display little or no detectable sequence similarity except for the active site motifs. This makes the identification of new superfamily members using standard homology search techniques challenging. To tackle this problem, we developed a method for the detection of PD-(D/E)XK families based on the binary classification of profile-profile alignments using support vector machines (SVMs). Using a number of both superfamily-specific and general features, SVMs were trained to identify true positive alignments of PD-(D/E)XK representatives. With this method we identified several PFAM families of uncharacterized proteins as putative new members of the PD-(D/E)XK superfamily. In addition, we assigned several unclassified restriction enzymes to the PD-(D/E)XK type. Results show that the new method is able to make confident assignments even for alignments that have statistically insignificant scores. We also implemented the method as a freely accessible web server at http://www.ibt.lt/bioinformatics/software/pdexk/.  相似文献   

4.
Proteins belonging to PD-(D/E)XK phosphodiesterases constitute a functionally diverse superfamily with representatives involved in replication, restriction, DNA repair and tRNA–intron splicing. Their malfunction in humans triggers severe diseases, such as Fanconi anemia and Xeroderma pigmentosum. To date there have been several attempts to identify and classify new PD-(D/E)KK phosphodiesterases using remote homology detection methods. Such efforts are complicated, because the superfamily exhibits extreme sequence and structural divergence. Using advanced homology detection methods supported with superfamily-wide domain architecture and horizontal gene transfer analyses, we provide a comprehensive reclassification of proteins containing a PD-(D/E)XK domain. The PD-(D/E)XK phosphodiesterases span over 21 900 proteins, which can be classified into 121 groups of various families. Eleven of them, including DUF4420, DUF3883, DUF4263, COG5482, COG1395, Tsp45I, HaeII, Eco47II, ScaI, HpaII and Replic_Relax, are newly assigned to the PD-(D/E)XK superfamily. Some groups of PD-(D/E)XK proteins are present in all domains of life, whereas others occur within small numbers of organisms. We observed multiple horizontal gene transfers even between human pathogenic bacteria or from Prokaryota to Eukaryota. Uncommon domain arrangements greatly elaborate the PD-(D/E)XK world. These include domain architectures suggesting regulatory roles in Eukaryotes, like stress sensing and cell-cycle regulation. Our results may inspire further experimental studies aimed at identification of exact biological functions, specific substrates and molecular mechanisms of reactions performed by these highly diverse proteins.  相似文献   

5.
The PD-(D/E)XK nuclease domains, initially identified in type II restriction enzymes, serve as models for studying aspects of protein-DNA interactions, mechanisms of phosphodiester hydrolysis, and provide indispensable tools for techniques in genetic engineering and molecular medicine. However, the low degree of amino acid conservation hampers the possibility of identification of PD-(D/E)XK superfamily members based solely on sequence comparisons. In several proteins implicated in DNA recombination and repair the restriction enzyme-like nuclease domain has been found only after the corresponding structures were determined experimentally. Here, we identified highly diverged variants of the PD-(D/E)XK domain in many proteins and open reading frames using iterative database searches and progressive, structure-guided alignment of sequence profiles. We predicted the possible cellular function for many hypothetical proteins based on their relative similarity to characterized nucleases or observed presence of additional domains. We also identified the nuclease domain in genuine recombinases and restriction enzymes, whose homology to other PD-(D/E)XK enzymes has not been demonstrated previously. The first superfamily-wide comparative analysis, not limited to nucleases of known structure, will guide cloning and characterization of novel enzymes and planning new experiments to better understand those already studied.  相似文献   

6.
The GIY-YIG nuclease domain was originally identified in homing endonucleases and enzymes involved in DNA repair and recombination. Many of the GIY-YIG family enzymes are functional as monomers. We show here that the Cfr42I restriction endonuclease which belongs to the GIY-YIG family and recognizes the symmetric sequence 5′-CCGC/GG-3′ (‘/’ indicates the cleavage site) is a tetramer in solution. Moreover, biochemical and kinetic studies provided here demonstrate that the Cfr42I tetramer is catalytically active only upon simultaneous binding of two copies of its recognition sequence. In that respect Cfr42I resembles the homotetrameric Type IIF restriction enzymes that belong to the distinct PD-(E/D)XK nuclease superfamily. Unlike the PD-(E/D)XK enzymes, the GIY-YIG nuclease Cfr42I accommodates an extremely wide selection of metal-ion cofactors, including Mg2+, Mn2+, Co2+, Zn2+, Ni2+, Cu2+ and Ca2+. To our knowledge, Cfr42I is the first tetrameric GIY-YIG family enzyme. Similar structural arrangement and phenotypes displayed by restriction enzymes of the PD-(E/D)XK and GIY-YIG nuclease families point to the functional significance of tetramerization.  相似文献   

7.
The tRNA splicing endoribonuclease EndA from Methanococcus jannaschii is a homotetramer formed via heterologous interaction between the two pairs of homodimers. Each monomer consists of two alpha/beta domains, the N-terminal domain (NTD) and the C-terminal domain (CTD) containing the RNase A-like active site. Comparison of the EndA coordinates with the publicly available protein structure database revealed the similarity of both domains to site-specific deoxyribonucleases: the NTD to the LAGLIDADG family and the CTD to the PD-(D/E)XK family. Superposition of the NTD on the catalytic domain of LAGLIDADG homing endonucleases allowed a suggestion to be made about which amino acid residues of the tRNA splicing nuclease might participate in formation of a presumptive cryptic deoxyribonuclease active site. On the other hand, the CTD and PD-(D/E)XK endonucleases, represented by restriction enzymes and a phage lambda exonuclease, were shown to share extensive similarities of the structural framework, to which entirely different active sites might be attached in two alternative locations. These findings suggest that EndA evolved from a fusion protein with at least two distinct endonuclease activities: the ribonuclease, which made it an essential "antitoxin" for the cells whose RNA genes were interrupted by introns, and the deoxyribonuclease, which provided the means for homing-like mobility. The residues of the noncatalytic CTDs from the positions corresponding to the catalytic side chains in PD-(D/E)XK deoxyribonucleases map to the surface at the opposite side to the tRNA binding site, for which no function has been implicated. Many restriction enzymes from the PD-(D/E)XK superfamily might have the potential to maintain an additional active or binding site at the face opposite the deoxyribonuclease active site, a property that can be utilized in protein engineering.  相似文献   

8.

Background  

Catalytic domains of Type II restriction endonucleases (REases) belong to a few unrelated three-dimensional folds. While the PD-(D/E)XK fold is most common among these enzymes, crystal structures have been also determined for single representatives of two other folds: PLD (R.BfiI) and half-pipe (R.PabI). Bioinformatics analyses supported by mutagenesis experiments suggested that some REases belong to the HNH fold (e.g. R.KpnI), and that a small group represented by R.Eco29kI belongs to the GIY-YIG fold. However, for a large fraction of REases with known sequences, the three-dimensional fold and the architecture of the active site remain unknown, mostly due to extreme sequence divergence that hampers detection of homology to enzymes with known folds.  相似文献   

9.
Using Meta-BASIC, a highly sensitive method for detection of distant similarity between proteins, we have identified another potential PD-(D/E)XK endonuclease in human herpesvirus 1 (HHV-1) encoded by the UL24 gene. The universal presence of UL24 in completed herpesviral genomes of three major subfamilies, Alphaherpesvirinae, Betaherpesvirinae, and Gammaherpesvirinae, suggests a fundamental role for this predicted PD-(D/E)XK endonuclease activity in the viral life cycle.  相似文献   

10.

Background

The majority of experimentally determined crystal structures of Type II restriction endonucleases (REases) exhibit a common PD-(D/E)XK fold. Crystal structures have been also determined for single representatives of two other folds: PLD (R.BfiI) and half-pipe (R.PabI), and bioinformatics analyses supported by mutagenesis suggested that some REases belong to the HNH fold. Our previous bioinformatic analysis suggested that REase R.Eco29kI shares sequence similarities with one more unrelated nuclease superfamily, GIY-YIG, however so far no experimental data were available to support this prediction. The determination of a crystal structure of the GIY-YIG domain of homing endonuclease I-TevI provided a template for modeling of R.Eco29kI and prompted us to validate the model experimentally.

Results

Using protein fold-recognition methods we generated a new alignment between R.Eco29kI and I-TevI, which suggested a reassignment of one of the putative catalytic residues. A theoretical model of R.Eco29kI was constructed to illustrate its predicted three-dimensional fold and organization of the active site, comprising amino acid residues Y49, Y76, R104, H108, E142, and N154. A series of mutants was constructed to generate amino acid substitutions of selected residues (Y49A, R104A, H108F, E142A and N154L) and the mutant proteins were examined for their ability to bind the DNA containing the Eco29kI site 5'-CCGCGG-3' and to catalyze the cleavage reaction. Experimental data reveal that residues Y49, R104, E142, H108, and N154 are important for the nuclease activity of R.Eco29kI, while H108 and N154 are also important for specific DNA binding by this enzyme.

Conclusion

Substitutions of residues Y49, R104, H108, E142 and N154 predicted by the model to be a part of the active site lead to mutant proteins with strong defects in the REase activity. These results are in very good agreement with the structural model presented in this work and with our prediction that R.Eco29kI belongs to the GIY-YIG superfamily of nucleases. Our study provides the first experimental evidence for a Type IIP REase that does not belong to the PD-(D/E)XK or HNH superfamilies of nucleases, and is instead a member of the unrelated GIY-YIG superfamily.  相似文献   

11.
The restriction endonuclease fold [a three-layer α-β sandwich containing variations of the PD-(D/E)XK nuclease motif] has been greatly diversified during evolution, facilitating its use for many biological functions. Here we characterize DNA binding and cleavage by the PD-(D/E)XK homing endonuclease I-Ssp6803I. Unlike most restriction endonucleases harboring the same core fold, the specificity profile of this enzyme extends over a long (17 bp) target site. The DNA binding and cleavage specificity profiles of this enzyme were independently determined and found to be highly correlated. However, the DNA target sequence contains several positions where binding and cleavage activities are not tightly coupled: individual DNA base-pair substitutions at those positions that significantly decrease cleavage activity have minor effects on binding affinity. These changes in the DNA target sequence appear to correspond to substitutions that uniquely increase the free energy change between the ground state and the transition state, rather than simply decreasing the overall DNA binding affinity. The specificity of the enzyme reflects constraints on its host gene and limitations imposed by the enzyme's quaternary structure and illustrate the highly diverse repertoire of DNA recognition specificities that can be adopted by the related folds surrounding the PD-(D/E)XK nuclease motif.  相似文献   

12.
Type II restriction endonucleases recognize 4-8 base-pair-long DNA sequences and catalyze their cleavage with remarkable specificity. Crystal structures of the PD-(DE)XK superfamily revealed a common alpha/beta core motif and similar active site. In contrast, these enzymes show little sequence similarity and use different strategies to interact with their substrate DNA. The intriguing question is whether this enzyme family could have evolved from a common origin. In our present work, protein structure stability elements were analyzed and compared in three parts of PD-(DE)XK type II restriction endonucleases: (1) core motif, (2) active-site residues, and (3) residues playing role in DNA recognition. High correlation was found between the active-site residues and those stabilization factors that contribute to preventing structural decay. DNA recognition sites were also observed to participate in stabilization centers. It indicates that recognition motifs and active sites in PD-(DE)XK type II restriction endonucleases should have been evolutionary more conserved than other parts of the structure. Based on this observation it is proposed that PD-(DE)XK type II restriction endonucleases have developed from a common ancestor with divergent evolution.  相似文献   

13.

Background  

The PD-(D/E)xK superfamily, containing a wide variety of other exo- and endonucleases, is a notable example of general function conservation in the face of extreme sequence and structural variation. Almost all members employ a small number of shared conserved residues to bind catalytically essential metal ions and thereby effect DNA cleavage. The crystal structure of the RecBCD prokaryotic DNA repair machinery shows that RecB contains such a nuclease domain at its C-terminus. The RecC C-terminal region was reported as having a novel fold.  相似文献   

14.
Bujnicki JM  Rychlewski L 《Gene》2001,267(2):183-191
The Escherichia coli K-12 restriction enzyme Mrr recognizes and cleaves N6-methyladenine- and 5-methylcytosine-containing DNA. Its amino acid sequence has been subjected to structure prediction and comparison with other sequences from publicly available sources. The results obtained suggest that Mrr and related putative endonucleases possess a cleavage domain typical for all the so far structurally characterized type II restriction enzymes, however with an unusual glutamine residue at the central position of the (D/E)-(D/E)XK hallmark of the active site. The "missing" acidic side chain was instead found anchored in a different, unusual position, suggesting that Mrr represents a third topological variant of the endonuclease active site in addition to the two alternatives determined previously (Skirgaila et al., 1998. J. Mol. Biol. 279, 473-481). One of the newly identified putative endonucleases from the Mrr family is composed of two diverged cleavage domains, which possess both the "typical" D-EXK and the "Mrr-like" D-QXK variants of the active site. Among the Mrr homologs there are also proteins from yeast, in which restriction phenotype has not been observed, suggesting that the free-standing Eukaryotic PD-(D/E)XK superfamily members might be implicated in other cellular processes involving enzymatic DNA cleavage.  相似文献   

15.
Guzzo CR  Nagem RA  Barbosa JA  Farah CS 《Proteins》2007,69(3):644-651
The YaeQ family of proteins are found in many Gram-negative and a few Gram-positive bacteria. We have determined the first structure of a member of the YaeQ family by X-ray crystallography. Comparisons with other structures indicate that YaeQ represents a new compact protein fold built around a variation of the PD-(D/E)XK nuclease motif found in type II endonucleases and enzymes involved in DNA replication, repair, and recombination. We show that catalytically important residues in the PD-(D/E)XK nuclease superfamily are spatially conserved in YaeQ and other highly conserved YaeQ residues may be poised to interact with nucleic acid structures.  相似文献   

16.
The GGCC-specific restriction endonuclease BspRI is one of the few Type IIP restriction endonucleases, which were suggested to be a monomer. Amino acid sequence information obtained by Edman sequencing and mass spectrometry analysis was used to clone the gene encoding BspRI. The bspRIR gene is located adjacently to the gene of the cognate modification methyltransferase and encodes a 304 aa protein. Expression of the bspRIR gene in Escherichia coli was dependent on the replacement of the native TTG initiation codon with an ATG codon, explaining previous failures in cloning the gene using functional selection. A plasmid containing a single BspRI recognition site was used to analyze kinetically nicking and second-strand cleavage under steady-state conditions. Cleavage of the supercoiled plasmid went through a relaxed intermediate indicating sequential hydrolysis of the two strands. Results of the kinetic analysis of the first- and second-strand cleavage are consistent with cutting the double-stranded substrate site in two independent binding events. A database search identified eight putative restriction-modification systems in which the predicted endonucleases as well as the methyltransferases share high sequence similarity with the corresponding protein of the BspRI system. BspRI and the related putative restriction endonucleases belong to the PD-(D/E)XK nuclease superfamily.  相似文献   

17.

Background  

The detection of relationships between a protein sequence of unknown function and a sequence whose function has been characterised enables the transfer of functional annotation. However in many cases these relationships can not be identified easily from direct comparison of the two sequences. Methods which compare sequence profiles have been shown to improve the detection of these remote sequence relationships. However, the best method for building a profile of a known set of sequences has not been established. Here we examine how the type of profile built affects its performance, both in detecting remote homologs and in the resulting alignment accuracy. In particular, we consider whether it is better to model a protein superfamily using a single structure-based alignment that is representative of all known cases of the superfamily, or to use multiple sequence-based profiles each representing an individual member of the superfamily.  相似文献   

18.
Several unique protein folds that catalyze the hydrolysis of phosphodiester bonds have arisen independently in nature, including the PD(D/E)XK superfamily (typified by type II restriction endonucleases and many recombination and repair enzymes) and the HNH superfamily (found in an equally wide array of enzymes, including bacterial colicins and homing endonucleases). Whereas the identity and position of catalytic residues within the PD(D/E)XK superfamily are highly variable, the active sites of HNH nucleases are much more strongly conserved. In this study, the ability of an HNH nuclease to tolerate a mutation of its most conserved catalytic residue (its histidine general base), and the mechanism of the most active enzyme variant, were characterized. Conversion of this residue into several altered chemistries, glutamine, lysine, or glutamate, resulted in measurable activity. The histidine to glutamine mutant displays the highest residual activity and a pH profile similar to that of the wild-type enzyme. This activity is dependent on the presence of a neighboring imidazole ring, which has taken over as a less efficient general base for the reaction. This result implies that mutational pathways to alternative HNH-derived catalytic sites do exist but are not as extensively or successfully diverged or reoptimized in nature as variants of the PD(D/E)XK nuclease superfamily. This is possibly due to multiple steric constraints placed on the compact HNH motif, which is simultaneously involved in protein folding, DNA binding, and catalysis, as well as the use of a planar, aromatic imidazole group as a general base.  相似文献   

19.
The restriction endonuclease (REase) R. HphI is a Type IIS enzyme that recognizes the asymmetric target DNA sequence 5'-GGTGA-3' and in the presence of Mg(2+) hydrolyzes phosphodiester bonds in both strands of the DNA at a distance of 8 nucleotides towards the 3' side of the target, producing a 1 nucleotide 3'-staggered cut in an unspecified sequence at this position. REases are typically ORFans that exhibit little similarity to each other and to any proteins in the database. However, bioinformatics analyses revealed that R.HphI is a member of a relatively big sequence family with a conserved C-terminal domain and a variable N-terminal domain. We predict that the C-terminal domains of proteins from this family correspond to the nuclease domain of the HNH superfamily rather than to the most common PD-(D/E)XK superfamily of nucleases. We constructed a three-dimensional model of the R.HphI catalytic domain and validated our predictions by site-directed mutagenesis and studies of DNA-binding and catalytic activities of the mutant proteins. We also analyzed the genomic neighborhood of R.HphI homologs and found that putative nucleases accompanied by a DNA methyltransferase (i.e. predicted REases) do not form a single group on a phylogenetic tree, but are dispersed among free-standing putative nucleases. This suggests that nucleases from the HNH superfamily were independently recruited to become REases in the context of RM systems multiple times in the evolution and that members of the HNH superfamily may be much more frequent among the so far unassigned REase sequences than previously thought.  相似文献   

20.

Background  

Annotation of sequences that share little similarity to sequences of known function remains a major obstacle in genome annotation. Some of the best methods of detecting remote relationships between protein sequences are based on matching sequence profiles. We analyse the superfamily specific performance of sequence profile-profile matching. Our benchmark consists of a set of 16 protein superfamilies that are highly diverse at the sequence level. We relate the performance to the number of sequences in the profiles, the profile diversity and the extent of structural conservation in the superfamily.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号