Similar literature
 Found 20 similar documents (search time: 31 ms)
1.
Proteins recognize DNA sequences by two different mechanisms. The first is direct readout, in which recognition is mediated by direct interactions between the protein and the DNA bases. The second is indirect readout, which arises from the dependence of the conformation and deformability of the DNA structure on the sequence. Various energy functions have been proposed to evaluate the contribution of indirect readout to the free-energy change upon complex formation. We developed a new generalized energy function to estimate the dependence of the deformability of DNA on the sequence. This function was derived from molecular dynamics simulations previously conducted on B-DNA dodecamers, each of which had one possible tetramer sequence embedded at its center. The energy function was obtained by taking the logarithm of the probability distribution function (PDF) of the base-step parameters of the central base-pair step of each tetramer; its ability to distinguish the native sequence from random ones was superior to that of the previous method, which approximated the energy function in harmonic form. From a comparison of the energy profiles calculated with these two methods, we found that the harmonic approximation caused significant errors in the conformational energies of the tetramers that adopted multiple stable conformations.
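The contrast between a PDF-derived energy and a harmonic one can be illustrated with a minimal sketch. Everything numeric here is an assumption made for illustration: the two-state distribution of a single base-step parameter, its peak positions and widths, and the temperature are hypothetical and are not taken from the simulations described in the abstract.

```python
import math

KT = 0.593  # kcal/mol near 298 K (assumed temperature)

def gaussian(x, mu, sigma):
    """Normal density with mean mu and standard deviation sigma."""
    return math.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * math.sqrt(2 * math.pi))

def bimodal_pdf(x):
    """Hypothetical PDF of one step parameter with two equally populated states."""
    return 0.5 * gaussian(x, 28.0, 3.0) + 0.5 * gaussian(x, 40.0, 3.0)

def pdf_energy(x):
    """Energy from the logarithm of the PDF: E(x) = -kT ln P(x)."""
    return -KT * math.log(bimodal_pdf(x))

# Moments of the mixture above: mean 34, variance 3^2 + 6^2 = 45.
MEAN = 34.0
VARIANCE = 45.0

def harmonic_energy(x):
    """Harmonic form built from the mean and variance of the same distribution."""
    return 0.5 * KT * (x - MEAN) ** 2 / VARIANCE

for x in (28.0, 34.0, 40.0):
    print(x, round(pdf_energy(x), 3), round(harmonic_energy(x), 3))
```

In this toy case the harmonic form places its minimum at the overall mean (34), which is a barrier between the two populated states in the PDF-derived profile. That is the kind of error the abstract attributes to tetramers with multiple stable conformations.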

2.
3.
Protein-DNA recognition plays an essential role in the regulation of gene expression. Regulatory proteins are known to recognize specific DNA sequences directly through atomic contacts (intermolecular readout) and/or indirectly through the conformational properties of the DNA (intramolecular readout). However, little is known about the respective contributions made by these so-called direct and indirect readout mechanisms. We addressed this question by making use of information extracted from a structural database containing many protein-DNA complexes. We quantified the specificity of intermolecular (direct) readout by statistical analysis of base-amino acid interactions within protein-DNA complexes. The specificity of the intramolecular (indirect) readout due to DNA was quantified by statistical analysis of the sequence-dependent DNA conformation. Systematic comparison of these specificities in a large number of protein-DNA complexes revealed that both intermolecular and intramolecular readouts contribute to the specificity of protein-DNA recognition, and that their relative contributions vary from one protein-DNA complex to another. We demonstrated that combining the intermolecular and intramolecular energies derived from the statistical analyses leads to enhanced specificity, and that the combined energy could explain experimental data on binding affinity changes caused by base mutations. These results provide new insight into the relationship between specificity and structure in the process of protein-DNA recognition, which should help in predicting specific protein-DNA binding sites.

4.
Proteins recognize a specific DNA sequence not only through direct contact (direct readout) with base pairs but also through sequence-dependent conformation and/or flexibility of DNA (indirect readout). However, it is difficult to assess the contribution of indirect readout to the sequence specificity. What is needed is a straightforward method for quantifying its contributions to specificity. Using Bayesian statistics, we derived the probability of a particular sequence for a given DNA structure from the trajectories of molecular dynamics (MD) simulations of DNAs containing all possible tetramer sequences. Then, we quantified the specificity of indirect readout based on the information entropy associated with the probability. We tested this method with known structures of protein-DNA complexes. This method enabled us to correctly predict those regions where experiments suggested the involvement of indirect readout. The results also indicated new regions where the indirect readout mechanism makes major contributions to the recognition. The present method can be used to estimate the contribution of indirect readout without approximations to the distributions in the conformational ensembles of DNA, and would serve as a powerful tool to study the mechanism of protein-DNA recognition.
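The Bayesian step and the entropy-based specificity measure can be sketched as follows. This is only one plausible reading: the abstract does not state the prior or how entropy is converted to a specificity score, so the uniform prior and the "maximum entropy minus observed entropy" measure below are assumptions, and the function names are hypothetical.

```python
import math

def posterior(likelihoods, priors=None):
    """Bayes' rule: P(seq | structure) is proportional to P(structure | seq) * P(seq).

    likelihoods maps each candidate sequence to P(structure | seq); in the
    abstract's setting these would come from MD conformational ensembles.
    A uniform prior over sequences is assumed when none is given.
    """
    if priors is None:
        priors = {seq: 1.0 / len(likelihoods) for seq in likelihoods}
    joint = {seq: likelihoods[seq] * priors[seq] for seq in likelihoods}
    total = sum(joint.values())
    return {seq: value / total for seq, value in joint.items()}

def entropy_bits(probabilities):
    """Shannon entropy of a sequence distribution, in bits."""
    return -sum(p * math.log2(p) for p in probabilities.values() if p > 0)

def specificity_bits(probabilities):
    """Assumed specificity measure: maximum possible entropy minus observed entropy."""
    return math.log2(len(probabilities)) - entropy_bits(probabilities)

# Hypothetical likelihoods for four candidate sequences at one site.
example = posterior({"AATT": 0.6, "ACGT": 0.2, "CGCG": 0.1, "TATA": 0.1})
print(round(specificity_bits(example), 3))
```

A structure that is equally compatible with every sequence scores 0 bits, and a structure compatible with a single sequence scores log2(N) bits for N candidates.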

5.
MOTIVATION: Direct recognition, or direct readout, of DNA bases by a DNA-binding protein involves amino acids that interact directly with features specific to each base. Experimental evidence also shows that in many cases the protein achieves partial sequence specificity by indirect recognition, i.e., by recognizing structural properties of the DNA. (1) Could threading a DNA sequence onto a crystal structure of bound DNA help explain the indirect recognition component of sequence specificity? (2) Might the resulting pure-structure computational motif manifest itself in familiar sequence-based computational motifs? RESULTS: The starting structure motif was a crystal structure of DNA bound to the integration host factor protein (IHF) of E. coli. IHF is known to exhibit both direct and indirect recognition of its binding sites. (1) Threading DNA sequences onto the crystal structure showed statistically significant partial separation of 60 IHF binding sites from random and intragenic sequences and was positively correlated with binding affinity. (2) The crystal structure was shown to be equivalent to a linear Markov network, and so, to a joint probability distribution over sequences, computable in linear time. It was transformed algorithmically into several common pure-sequence representations, including (a) small sets of short exact strings, (b) weight matrices, (c) consensus regular patterns, (d) multiple sequence alignments, and (e) phylogenetic trees. In all cases the pure-sequence motifs retained statistically significant partial separation of the IHF binding sites from random and intragenic sequences. Most exhibited positive correlation with binding affinity. The multiple alignment showed some conserved columns, and the phylogenetic tree partially mixed low-energy sequences with IHF binding sites but separated high-energy sequences.
CONCLUSION: Deformation energy explains part of indirect recognition, which explains part of IHF sequence-specific binding.
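The claim that a chain-like (linear Markov) model gives a joint distribution over sequences computable in linear time can be sketched as below. The sketch assumes the threading energy is a sum of terms over adjacent base pairs and that probabilities are Boltzmann weights; the energy tables are hypothetical toy values, not the deformation energies used in the study.

```python
import itertools
import math

BASES = "ACGT"

def toy_step_energies(n_steps):
    """Hypothetical energy table per step position (illustration only)."""
    return [
        {a + b: ((i + 1) * (BASES.index(a) + 2 * BASES.index(b))) % 5 * 0.3
         for a in BASES for b in BASES}
        for i in range(n_steps)
    ]

def sequence_energy(seq, step_energy):
    """Energy of threading seq onto the structure: sum over adjacent-base steps."""
    return sum(step_energy[i][seq[i] + seq[i + 1]] for i in range(len(seq) - 1))

def partition_function(step_energy, kt=1.0):
    """Sum of Boltzmann weights over all sequences, by a forward pass along the chain.

    Cost grows linearly with the number of steps instead of as 4**length.
    """
    forward = {base: 1.0 for base in BASES}
    for table in step_energy:
        forward = {
            nxt: sum(forward[prev] * math.exp(-table[prev + nxt] / kt) for prev in BASES)
            for nxt in BASES
        }
    return sum(forward.values())

def sequence_probability(seq, step_energy, kt=1.0):
    """Boltzmann probability of one sequence under the chain model."""
    return math.exp(-sequence_energy(seq, step_energy) / kt) / partition_function(step_energy, kt)

tables = toy_step_energies(3)  # three steps, i.e. sequences of length four
print(sequence_probability("ACGT", tables))
```

Summing `sequence_probability` over all 256 length-four sequences with `itertools.product` should give 1, which is a convenient check that the forward pass matches brute-force enumeration.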

6.
Indirect readout of tRNA for aminoacylation   Total citations: 1 (self-citations: 0; citations by others: 1)
Perona JJ, Hou YM. Biochemistry, 2007, 46(37): 10419-10432
Aminoacylation of tRNA by aminoacyl-tRNA synthetases is the essential reaction that matches protein amino acids with the trinucleotide sequences specified in mRNA. Direct electrostatic interactions made by tRNA synthetases with discriminating functional groups on the tRNA bases have long been known to determine aminoacylation specificity. However, structural and biochemical studies have revealed a second "indirect readout" mechanism that makes an important contribution as well. In indirect readout, the sequence-dependent conformations of tRNA are recognized through protein contacts with the sugar-phosphate backbone and with nonspecific portions of the bases. This mechanism appears to function in single-stranded regions, in canonical A-type duplex segments, and in the complex tertiary core portion of the tRNA. Operation of the indirect mechanism is not exclusive of the direct mechanism, and both are further mediated by induced-fit rearrangements, in which enzyme and tRNA undergo precise conformational changes after formation of an initial encounter complex. The examples of indirect readout in tRNA synthetase complexes extend the concept beyond its traditional application to DNA duplexes and serve as models for the operation of this mechanism in more complex systems such as the ribosome.

7.
8.
9.
Journal of Molecular Biology, 2019, 431(19): 3845-3859
The rules governing sequence-specific DNA–protein recognition are under a long-standing debate regarding the prevalence of base versus shape readout mechanisms to explain sequence specificity and of the conformational selection versus induced fit binding paradigms to explain binding-related conformational changes in DNA. Using a combination of atomistic simulations on a subset of representative sequences and mesoscopic simulations at the protein–DNA interactome level, we demonstrate the prevalence of the shape readout model in determining sequence-specificity and of the conformational selection paradigm in defining the general mechanism for binding-related conformational changes in DNA. Our results suggest that the DNA uses a double mechanism to adapt its structure to the protein: it moves along the easiest deformation modes to approach the bioactive conformation, while final adjustments require localized rearrangements at the base-pair step and backbone level. Our study highlights the large impact of B-DNA dynamics in modulating DNA–protein binding.

10.
M.HgiDII is a methyltransferase (MTase) from Herpetosiphon giganteus that recognizes the sequence GTCGAC. This enzyme belongs to a group of MTases that share a high degree of amino acid similarity, although none of them has been thoroughly characterized. To study the catalytic mechanism of M.HgiDII and its interactions with DNA, we performed molecular dynamics simulations with a homology model of M.HgiDII complexed with DNA and S-adenosyl-methionine. Our results indicate that M.HgiDII may not rely only on Glu119 to activate the cytosine ring, which is an early step in the catalysis of cytosine methylation; Arg160 and Arg162 may also participate in the activation by interacting with cytosine O2. Another residue from the catalytic site, Val118, also played a relevant role in the catalysis of M.HgiDII. Val118 interacted with the target cytosine and kept water molecules from accessing the region of the catalytic pocket where Cys79 interacts with cytosine, thus preventing water-mediated disruption of interactions in the catalytic site. Specific recognition of DNA was mediated mainly by amino acids of the target recognition domain (TRD), although some amino acids (loop 80–88) of the catalytic domain may also contribute to DNA recognition. These interactions involved direct contacts between M.HgiDII and DNA, as well as indirect contacts through water bridges. Additionally, analysis of sequence alignments with closely related MTases helped us to identify a motif in the TRD of M.HgiDII that may be relevant to specific DNA recognition.

11.
Proteins recognize specific DNA sequences not only through direct contact between amino acids and bases, but also indirectly, based on the sequence-dependent conformation and deformability of the DNA (indirect readout). We used molecular dynamics simulations to analyze the sequence-dependent DNA conformations of all 136 possible tetrameric sequences sandwiched between CGCG sequences. The deformability of dimeric steps obtained from the simulations is consistent with that derived from crystal structures. The simulation results further showed that the conformation and deformability of the tetramers can depend strongly on the flanking base pairs. The xATx tetramers are the most rigid and are not affected by the flanking base pairs, whereas the xYRx tetramers are the most flexible and change their conformations depending on the base pairs at both ends, suggesting that tetramers with the same central dimer can show different deformabilities. These results suggest that analysis of dimeric steps alone may overlook some conformational features of DNA, and they provide insight into the mechanism of indirect readout during protein–DNA recognition. Moreover, the sequence dependence of DNA conformation and deformability may be used to estimate the contribution of indirect readout to the specificity of protein–DNA recognition as well as nucleosome positioning and large-scale behavior of nucleic acids.
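The figure of 136 tetramers follows from counting double-stranded sequences: a tetramer and its reverse complement describe the same duplex, so the 256 single-strand tetramers collapse to (256 + 16) / 2 = 136, where 16 is the number of self-complementary tetramers. A short check, with hypothetical function names:

```python
from itertools import product

COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Sequence of the opposite strand, read 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]

def unique_duplex_sequences(length):
    """One representative per duplex: the alphabetically smaller of the two strands."""
    strands = ("".join(p) for p in product("ACGT", repeat=length))
    return sorted({min(s, reverse_complement(s)) for s in strands})

print(len(unique_duplex_sequences(4)))  # 136 unique tetramers
print(len(unique_duplex_sequences(2)))  # 10 unique dimer steps
```

The same counting gives the 10 unique dimer steps commonly used in analyses of base-step deformability.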

12.
The M.EcoRV DNA methyltransferase recognizes GATATC sites. It is related to EcoDam, which methylates GATC sites. The DNA binding domain of M.EcoRV is similar to that of EcoDam, suggesting a similar mechanism of DNA recognition. We show that amino acid residue Lys11 of M.EcoRV is involved in recognition of Gua1 and that Arg128 contacts the Gua in base pair 6. These residues correspond to Lys9 and Arg124 in EcoDam, which recognize the Gua residues in both strands of the Dam recognition sequence, indicating that M.EcoRV and EcoDam make similar contacts to the outermost base pairs of their recognition sequences and that M.EcoRV recognizes its target site as an expanded GATC site. In contrast to EcoDam, M.EcoRV considerably bends the DNA (59 ± 4 degrees), suggesting indirect readout of the AT-rich inner sequence. Recognition of an expanded target site by DNA bending is a new principle for changing DNA recognition specificity of proteins during molecular evolution. R128A is inefficient in DNA bending and binding, whereas K11A bends DNA with relaxed sequence specificity. These results suggest a temporal order of the formation of protein-DNA contacts in which the Gua6-Arg128 contact forms early, followed by DNA bending and, finally, the formation of the Lys11-Gua1 contact.

13.
The DNA interaction of the Escherichia coli cyclic AMP receptor protein (CRP) represents a typical example of a dual recognition mechanism exhibiting both direct and indirect readout. We have dissected the direct and indirect components of DNA recognition by CRP employing in vitro selection of a random library of DNA-binding sites containing inosine (I) and 2,6-diaminopurine (D) instead of guanine and adenine, respectively. Accordingly, the DNA helix minor groove is structurally altered due to the ‘transfer’ of the 2-amino group of guanine (now I) to adenine (now D), whereas the major groove is functionally intact. The majority of the selected sites contain the natural consensus sequence TGTGAN6TCACA (i.e. TITIDN6TCDCD). Thus, direct readout of the consensus sequence is independent of minor groove conformation. Consequently, the indirect readout known to occur in the TG/CA base pair step (primary kink site) in the consensus sequence is not affected by I–D substitutions. In contrast, the flanking regions are selected as I/C rich sequences (mostly I-tracts) instead of A/T rich sequences which are known to strongly increase CRP binding, thereby demonstrating almost exclusive indirect readout of helix structure/flexibility in this region through (anisotropic) flexibility of I-tracts.

14.
The crystal structure of the HincII restriction endonuclease-DNA complex shows that degenerate specificity for blunt-ended cleavage at GTPyPuAC sequences arises from indirect readout of conformational preferences at the center pyrimidine-purine step. Protein-induced distortion of the DNA is accomplished by intercalation of glutamine side chains into the major groove on either side of the recognition site, generating bending by either tilt or roll at three distinct loci. The intercalated side chains propagate a concerted shift of all six target-site base pairs toward the minor groove, producing an unusual cross-strand purine stacking at the center pyrimidine-purine step. Comparison of the HincII and EcoRV cocrystal structures suggests that sequence-dependent differences in base-stacking free energies are a crucial underlying factor mediating protein recognition by indirect readout.

15.
The multiprotein factor composed of XPA and replication protein A (RPA) is an essential subunit of the mammalian nucleotide excision repair system. Although XPA-RPA has been implicated in damage recognition, its activity in the DNA repair pathway remains controversial. By replacing DNA adducts with mispaired bases or non-hybridizing analogues, we found that the weak preference of XPA and RPA for damaged substrates is entirely mediated by indirect readout of DNA helix conformations. Further screening with artificially distorted substrates revealed that XPA binds most efficiently to rigidly bent duplexes but not to single-stranded DNA. Conversely, RPA recognizes single-stranded sites but not backbone bending. Thus, the association of XPA with RPA generates a double-check sensor that detects, simultaneously, backbone and base pair distortion of DNA. The affinity of XPA for sharply bent duplexes, characteristic of architectural proteins, is not compatible with a direct function during recognition of nucleotide lesions. Instead, XPA in conjunction with RPA may constitute a regulatory factor that monitors DNA bending and unwinding to verify the damage-specific localization of repair complexes or control their correct three-dimensional assembly.

16.
The DNA-binding domain of Myb consists of three imperfect tandem repeats; the third, which is essential for sequence-specific binding, has been shown to contain a helix-turn-helix-related motif. DNA sequences recognized by Myb have been reported to contain the sequence TAACPy. Here we have examined the Myb-binding sequence in detail. Using DNAs carrying a single mutation at various sites of two specific DNAs, together with fragments of the DNA-binding domain of Myb, we have found that (i) in a specific DNA containing only one AAC sequence, each nucleotide of AAC is essential for the specific binding of Myb, whereas mutations at other positions cause no serious loss of binding; (ii) in a specific DNA containing two separate AAC sequences, one AAC is less important for binding; and (iii) at least repeats 2 and 3 of Myb are both required for specific binding to DNA. These findings suggest that repeat 3, which contains a helix-turn-helix-related structure, recognizes the core AAC sequence and that repeat 2 supports this recognition through interactions with phosphate groups of DNA.

17.
18.
19.
Recognition of DNA sequences by the repressor of bacteriophage 434   Total citations: 2 (self-citations: 0; citations by others: 2)
The structure of a complex between the DNA-binding domain of phage 434 repressor and a 14 base-pair synthetic DNA operator reveals the molecular interactions important for sequence-specific recognition. A set of contacts with DNA backbone, notably involving hydrogen bonds between peptide-NH groups and DNA phosphates, position the repressor and fix the DNA configuration. Direct interactions between amino acid side chains and DNA bases involve nonpolar van der Waals contacts as well as hydrogen bonds. The structures of the repressor domain and of the 434 cro protein are extremely similar. There appear to be no major conformational changes in the proteins when they bind to DNA.

20.
DNA sequence recognition by the homodimeric C-terminal domain of the human papillomavirus type 16 E2 protein (E2C) is known to involve both direct readout and DNA-dependent indirect readout mechanisms, while protein-dependent indirect readout has been deduced but not directly observed. We have investigated coupling between specific DNA binding and the dynamics of the unusual E2C fold, using pH as an external variable. Nuclear magnetic resonance and isothermal titration calorimetry show that pH titration of His318 in the complex interface and His288 in the core of the domain is coupled to both binding and the dynamics of the β-barrel core of E2C, with a tradeoff between dimer stability and function. Specific DNA binding is, in turn, coupled to the slow dynamics and amide hydrogen exchange in the entire β-barrel, reaching residues far apart from the DNA recognition elements but not affecting the two helices of each monomer. The changes are largest in the dimerization interface, suggesting that the E2C β-barrel acts as a hinge that regulates the relative position of the DNA recognition helices. In conclusion, the cooperative dynamics of the human papillomavirus type 16 E2C β-barrel is coupled to sequence recognition in a protein-dependent indirect readout mechanism. The patterns of residue substitution in genital papillomaviruses support the importance of the protonation states of His288 and His318 and suggest that protein-dependent indirect readout and histidine pH titration may regulate DNA binding in the cell.
