首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Proteins recognize DNA sequences by two different mechanisms. The first is direct readout, in which recognition is mediated by direct interactions between the protein and the DNA bases. The second is indirect readout, which is caused by the dependence of conformation and the deformability of the DNA structure on the sequence. Various energy functions have been proposed to evaluate the contribution of indirect readout to the free-energy changes in complex formations. We developed a new generalized energy function to estimate the dependence of the deformability of DNA on the sequence. This function was derived from molecular dynamics simulations previously conducted on B-DNA dodecamers, each of which had one possible tetramer sequence embedded at its center. By taking the logarithm of the probability distribution function (PDF) for the base-step parameters of the central base-pair step of the tetramer, its ability to distinguish the native sequence from random ones was superior to that with the previous method that approximated the energy function in harmonic form. From a comparison of the energy profiles calculated with these two methods, we found that the harmonic approximation caused significant errors in the conformational energies of the tetramers that adopted multiple stable conformations.  相似文献   

2.
Proteins recognize a specific DNA sequence not only through direct contact (direct readout) with base pairs but also through sequence-dependent conformation and/or flexibility of DNA (indirect readout). However, it is difficult to assess the contribution of indirect readout to the sequence specificity. What is needed is a straightforward method for quantifying its contributions to specificity. Using Bayesian statistics, we derived the probability of a particular sequence for a given DNA structure from the trajectories of molecular dynamics (MD) simulations of DNAs containing all possible tetramer sequences. Then, we quantified the specificity of indirect readout based on the information entropy associated with the probability. We tested this method with known structures of protein-DNA complexes. This method enabled us to correctly predict those regions where experiments suggested the involvement of indirect readout. The results also indicated new regions where the indirect readout mechanism makes major contributions to the recognition. The present method can be used to estimate the contribution of indirect readout without approximations to the distributions in the conformational ensembles of DNA, and would serve as a powerful tool to study the mechanism of protein-DNA recognition.  相似文献   

3.
It is well recognized that base sequence exerts a significant influence on the properties of DNA and plays a significant role in protein–DNA interactions vital for cellular processes. Understanding and predicting base sequence effects requires an extensive structural and dynamic dataset which is currently unavailable from experiment. A consortium of laboratories was consequently formed to obtain this information using molecular simulations. This article describes results providing information not only on all 10 unique base pair steps, but also on all possible nearest-neighbor effects on these steps. These results are derived from simulations of 50–100 ns on 39 different DNA oligomers in explicit solvent and using a physiological salt concentration. We demonstrate that the simulations are converged in terms of helical and backbone parameters. The results show that nearest-neighbor effects on base pair steps are very significant, implying that dinucleotide models are insufficient for predicting sequence-dependent behavior. Flanking base sequences can notably lead to base pair step parameters in dynamic equilibrium between two conformational sub-states. Although this study only provides limited data on next-nearest-neighbor effects, we suggest that such effects should be analyzed before attempting to predict the sequence-dependent behavior of DNA.  相似文献   

4.
Structural, regulatory and enzymatic proteins interact with DNA to maintain a healthy and functional genome. Yet, our structural understanding of how proteins interact with DNA is limited. We present MELD-DNA, a novel computational approach to predict the structures of protein–DNA complexes. The method combines molecular dynamics simulations with general knowledge or experimental information through Bayesian inference. The physical model is sensitive to sequence-dependent properties and conformational changes required for binding, while information accelerates sampling of bound conformations. MELD-DNA can: (i) sample multiple binding modes; (ii) identify the preferred binding mode from the ensembles; and (iii) provide qualitative binding preferences between DNA sequences. We first assess performance on a dataset of 15 protein–DNA complexes and compare it with state-of-the-art methodologies. Furthermore, for three selected complexes, we show sequence dependence effects of binding in MELD predictions. We expect that the results presented herein, together with the freely available software, will impact structural biology (by complementing DNA structural databases) and molecular recognition (by bringing new insights into aspects governing protein–DNA interactions).  相似文献   

5.
Looking into DNA recognition: zinc finger binding specificity   总被引:5,自引:2,他引:3       下载免费PDF全文
We present a quantitative, theoretical analysis of the recognition mechanisms used by two zinc finger proteins: Zif268, which selectively binds to GC-rich sequences, and a Zif268 mutant, which binds to a TATA box site. This analysis is based on a recently developed method (ADAPT), which allows binding specificity to be analyzed via the calculation of complexation energies for all possible DNA target sequences. The results obtained with the zinc finger proteins show that, although both mainly select their targets using direct, pairwise protein–DNA interactions, they also use sequence-dependent DNA deformation to enhance their selectivity. A new extension of our methodology enables us to determine the quantitative contribution of these two components and also to measure the contributions of individual residues to overall specificity. The results show that indirect recognition is particularly important in the case of the TATA box binding mutant, accounting for 30% of the total selectivity. The residue-by-residue analysis of the protein–DNA interaction energy indicates that the existence of amino acid–base contacts does not necessarily imply sequence selectivity, and that side chains without contacts can nevertheless contribute to defining the protein's target sequence.  相似文献   

6.
The various nearest neighbor stacking interaction energies of stacked base pairs in the DNA double helix are calculated for both A- and B-type conformations using an ab initio molecular orbital method. It is demonstrated that the sequence-dependent conformational preference for A- or B-type results from the stacking interaction. In particular, the base sequence showing the highest preference for an A-type conformation is revealed as GC/GC, and the one with the next highest preference, AT/AT; for a B-type conformation, the respective sequences are CG/CG and CA/TG. The overall conformation of a DNA fragment is not determined by these particular sequences only but is influenced by all base pair steps. An intrinsically favorable conformation is predicted from the constituent stacking interaction.  相似文献   

7.
Integration host factor (IHF) is a bacterial histone-like protein whose primary biological role is to condense the bacterial nucleoid and to constrain DNA supercoils. It does so by binding in a sequence-independent manner throughout the genome. However, unlike other structurally related bacterial histone-like proteins, IHF has evolved a sequence-dependent, high affinity DNA-binding motif. The high affinity binding sites are important for the regulation of a wide range of cellular processes. A remarkable feature of IHF is that it employs an indirect readout mechanism to bind and wrap DNA at both the nonspecific and high affinity (sequence-dependent) DNA sites. In this study we assessed the contributions of pre-formed and protein-induced DNA conformations to the energetics of IHF binding. Binding energies determined experimentally were compared with energies predicted for the IHF-induced deformation of the DNA helix (DNA deformation energy) in the IHF-DNA complex. Combinatorial sets of de novo DNA sequences were designed to systematically evaluate the influence of sequence-dependent structural characteristics of the conserved IHF recognition elements of the consensus DNA sequence. We show that IHF recognizes pre-formed conformational characteristics of the consensus DNA sequence at high affinity sites, whereas at all other sites relative affinity is determined by the deformational energy required for nearest-neighbor base pairs to adopt the DNA structure of the bound DNA-IHF complex.  相似文献   

8.
The nucleic-acid bases carry structural and energetic signatures that contribute to the unique features of genetic sequences. Here we review the connection between the chemical structure of the constituent nucleotides and the polymeric properties of DNA. The sequence-dependent accumulation of charge on the major- and minor-groove edges of the Watson-Crick base pairs, obtained from ab initio calculations, presents unique motifs for direct sequence recognition. The optimization of base interactions generates a propellering of base-pair planes of the same handedness as that found in high-resolution double-helical structures. The optimized base pairs also deform along conformational pathways, i.e., normal modes, of the same type induced by the binding of proteins. Empirical energy computations that incorporate the properties of the base pairs account satisfactorily for general features of the next level of double-helical structure, but miss key sequence-dependent differences in dimeric structure and deformability. The latter discrepancies appear to reflect factors other than intrinsic base-pair structure.  相似文献   

9.
‘Indirect readout’ refers to the proposal that proteins can recognize the intrinsic three-dimensional shape or flexibility of a DNA binding sequence apart from direct protein contact with DNA base pairs. The differing affinities of human papillomavirus (HPV) E2 proteins for different E2 binding sites have been proposed to reflect indirect readout. DNA bending has been observed in X-ray structures of E2 protein–DNA complexes. X-ray structures of three different E2 DNA binding sites revealed differences in intrinsic curvature. DNA sites with intrinsic curvature in the direction of protein-induced bending were bound more tightly by E2 proteins, supporting the indirect readout model. We now report solution measurements of intrinsic DNA curvature for three E2 binding sites using a sensitive electrophoretic phasing assay. Measured E2 site curvature agrees well the predictions of a dinucleotide model and supports an indirect readout hypothesis for DNA recognition by HPV E2.  相似文献   

10.
The crystal structure of the HincII restriction endonuclease-DNA complex shows that degenerate specificity for blunt-ended cleavage at GTPyPuAC sequences arises from indirect readout of conformational preferences at the center pyrimidine-purine step. Protein-induced distortion of the DNA is accomplished by intercalation of glutamine side chains into the major groove on either side of the recognition site, generating bending by either tilt or roll at three distinct loci. The intercalated side chains propagate a concerted shift of all six target-site base pairs toward the minor groove, producing an unusual cross-strand purine stacking at the center pyrimidine-purine step. Comparison of the HincII and EcoRV cocrystal structures suggests that sequence-dependent differences in base-stacking free energies are a crucial underlying factor mediating protein recognition by indirect readout.  相似文献   

11.
The DNA interaction of the Escherichia coli cyclic AMP receptor protein (CRP) represents a typical example of a dual recognition mechanism exhibiting both direct and indirect readout. We have dissected the direct and indirect components of DNA recognition by CRP employing in vitro selection of a random library of DNA-binding sites containing inosine (I) and 2,6-diaminopurine (D) instead of guanine and adenine, respectively. Accordingly, the DNA helix minor groove is structurally altered due to the ‘transfer’ of the 2-amino group of guanine (now I) to adenine (now D), whereas the major groove is functionally intact. The majority of the selected sites contain the natural consensus sequence TGTGAN6TCACA (i.e. TITIDN6TCDCD). Thus, direct readout of the consensus sequence is independent of minor groove conformation. Consequently, the indirect readout known to occur in the TG/CA base pair step (primary kink site) in the consensus sequence is not affected by I–D substitutions. In contrast, the flanking regions are selected as I/C rich sequences (mostly I-tracts) instead of A/T rich sequences which are known to strongly increase CRP binding, thereby demonstrating almost exclusive indirect readout of helix structure/flexibility in this region through (anisotropic) flexibility of I-tracts.  相似文献   

12.
The binding of proteins to specific sequences of DNA is an important feature of virtually all DNA transactions. Proteins recognize specific DNA sequences using both direct readout (sensing types and positions of DNA functional groups) and indirect readout (sensing DNA conformation and deformability). Previously we showed that the P22 c2 repressor N-terminal domain (P22R NTD) forces the central non-contacted 5'-ATAT-3' sequence of the DNA operator into the B′ state, a state known to affect DNA hydration, rigidity and bending. Usually the B′ state, with a narrow minor groove and a spine of hydration, is reserved for A-tract DNA (TpA steps disrupt A-tracts). Here, we have co-crystallized P22R NTD with an operator containing a central 5′-ACGT-3′ sequence in the non-contacted region. C·G base pairs have not previously been observed in the B′ state and are thought to prevent it. However, P22R NTD induces a narrow minor groove and a spine of hydration to 5'-ACGT-3'. We observe that C·G base pairs have distinctive destabilizing and disordering effects on the spine of hydration. It appears that the reduced stability of the spine results in a higher energy cost for the B to B′ transition. The differential effect of DNA sequence on the barrier to this transition allows the protein to sense the non-contacted DNA sequence.  相似文献   

13.
14.
15.
The repressor of bacteriophage P22 (P22R) discriminates between its various DNA binding sites by sensing the identity of non-contacted base pairs at the center of its binding site. The “indirect readout” of these non-contacted bases is apparently based on DNA's sequence-dependent conformational preferences. The structures of P22R–DNA complexes indicate that the non-contacted base pairs at the center of the binding site are in the B′ state. This finding suggests that indirect readout and therefore binding site discrimination depend on P22R's ability to either sense and/or impose the B′ state on the non-contacted bases of its binding sites. We show here that the affinity of binding sites for P22R depends on the tendency of the central bases to assume the B′-DNA state. Furthermore, we identify functional groups in the minor groove of the non-contacted bases as the essential modulators of indirect readout by P22R. In P22R–DNA complexes, the negatively charged E44 and E48 residues are provocatively positioned near the negatively charged DNA phosphates of the non-contacted nucleotides. The close proximity of the negatively charged groups on protein and DNA suggests that electrostatics may play a key role in the indirect readout process. Changing either of two negatively charged residues to uncharged residues eliminates the ability of P22R to impose structural changes on DNA and to recognize non-contacted base sequence. These findings suggest that these negatively charged amino acids function to force the P22R-bound DNA into the B′ state and therefore play a key role in indirect readout by P22R.  相似文献   

16.
Protein-DNA recognition plays an essential role in the regulation of gene expression. Regulatory proteins are known to recognize specific DNA sequences directly through atomic contacts (intermolecular readout) and/or indirectly through the conformational properties of the DNA (intramolecular readout). However, little is known about the respective contributions made by these so-called direct and indirect readout mechanisms. We addressed this question by making use of information extracted from a structural database containing many protein-DNA complexes. We quantified the specificity of intermolecular (direct) readout by statistical analysis of base-amino acid interactions within protein-DNA complexes. The specificity of the intramolecular (indirect) readout due to DNA was quantified by statistical analysis of the sequence-dependent DNA conformation. Systematic comparison of these specificities in a large number of protein-DNA complexes revealed that both intermolecular and intramolecular readouts contribute to the specificity of protein-DNA recognition, and that their relative contributions vary depending upon the protein-DNA complexes. We demonstrated that combination of the intermolecular and intramolecular energies derived from the statistical analyses lead to enhanced specificity, and that the combined energy could explain experimental data on binding affinity changes caused by base mutations. These results provided new insight into the relationship between specificity and structure in the process of protein-DNA recognition, which would lead to prediction of specific protein-DNA binding sites.  相似文献   

17.
DNA-binding drugs have numerous applications in the engineered gene regulation. However, the drug-DNA recognition mechanism is poorly understood. Drugs can recognize specific DNA sequences not only through direct contacts but also indirectly through sequence-dependent conformation, in a similar manner to the indirect readout mechanism in protein-DNA recognition. We used a knowledge-based technique that takes advantage of known DNA structures to evaluate the conformational energies. We built a dataset of non-redundant free B-DNA crystal structures to calculate the distributions of adjacent base-step and base-pair conformations, and estimated the effective harmonic potentials of mean force (PMF). These PMFs were used to calculate the conformational energy of drug-DNA complexes, and the Z-score as a measure of the binding specificity. Comparing the Z-scores for drug-DNA complexes with those for free DNA structures with the same sequence, we observed that in several cases the Z-scores became more negative upon drug binding. Furthermore, the specificity is position-dependent within the drug-bound region of DNA. These results suggest that DNA conformation plays an important role in the drug-DNA recognition. The presented method provides a tool for the analysis of drug-DNA recognition and can facilitate the development of drugs for targeting a specific DNA sequence.  相似文献   

18.
Specific recognition by EcoRV endonuclease of its cognate, sharply bent GATATC site at the center TA step occurs solely via hydrophobic interaction with thymine methyl groups. Mechanistic kinetic analyses of base analog-substituted DNAs at this position reveal that direct readout provides 5 kcal mol(-1) toward specificity, with an additional 6-10 kcal mol(-1) arising from indirect readout. Crystal structures of several base analog complexes show that the major-groove hydrophobic contacts are crucial to forming required divalent metal-binding sites, and that indirect readout operates in part through the sequence-dependent free-energy cost of unstacking the center base-pair step of the DNA.  相似文献   

19.
The width of the DNA minor groove varies with sequence and can be a major determinant of DNA shape recognition by proteins. For example, the minor groove within the center of the Fis–DNA complex narrows to about half the mean minor groove width of canonical B-form DNA to fit onto the protein surface. G/C base pairs within this segment, which is not contacted by the Fis protein, reduce binding affinities up to 2000-fold over A/T-rich sequences. We show here through multiple X-ray structures and binding properties of Fis–DNA complexes containing base analogs that the 2-amino group on guanine is the primary molecular determinant controlling minor groove widths. Molecular dynamics simulations of free-DNA targets with canonical and modified bases further demonstrate that sequence-dependent narrowing of minor groove widths is modulated almost entirely by the presence of purine 2-amino groups. We also provide evidence that protein-mediated phosphate neutralization facilitates minor groove compression and is particularly important for binding to non-optimally shaped DNA duplexes.  相似文献   

20.
Aristolochic acids I and II are prevalent plant toxicants found in the Aristolochiaceae plant family. Metabolic activation of the aristolochic acids leads to the formation of a cyclic N-hydroxylactam product that can react with the peripheral amino group of purine bases generating bulky DNA adducts. These lesions are mutagenic and established human carcinogens. Interestingly, although AL-dG adducts progressively disappear from the DNA of laboratory animals, AL-dA lesions has lasting persistence in the genome. We describe here NMR structural studies of an undecameric duplex damaged at its center by the presence of an ALII-dA adduct. Our data establish a locally perturbed double helical structure that accommodates the bulky adduct by displacing the counter residue into the major groove and stacking the ALII moiety between flanking bases. The presence of the ALII-dA perturbs the conformation of the 5'-side flanking base pair, but all other pairs of the duplex adopt standard conformations. Thermodynamic studies reveal that the lesion slightly decreases the energy of duplex formation in a sequence-dependent manner. We discuss our results in terms of its implications for the repair of ALII-dA adducts in mammalian cells.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号