首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The mean (G + C) composition (51.0%) and standard deviation (+/- 3.8%) of published DNA sequences accounting for 10% of the E. coli genome is in excellent agreement with the principal overall distribution determined by high resolution melting. While differences in base and neighbor characteristics are small and uniform throughout all regions of the genome, it is found that the (G + C) content of sequences varies in segmented fashion within boundaries corresponding to coding (53% G + C) and noncoding (46% G + C) regions; with variances in the latter being six-fold greater than in coding regions. The variance in different regions shows a strong negative dependence on (G + C) content of the region, reflecting the condition that A-T and G-C base pairs are preferred neighbors of A-T and C-G pairs, respectively; with the bias increasing with decreasing (G + C) content. Neighbor analysis indicates the most extreme positive biases occur in AA, TT, GC and CG throughout all regions, but particularly in noncoding regions. Extraordinary numbers of oligomeric strings of (A)n, etc., are the further consequence of this bias. These and other characteristics point to the existence of inherent biases in neighbor frequencies levied during replication or repair, and which reflect, in turn, neighbor influences during mutation. The bias in codon usage noted by Grantham and others is seen here as due, in part, to the adaptation of coding sequences to this microenvironment through selection among synonymous codons so as to preserve inherent neighbor biases.  相似文献   

2.
Temperature-Gradient Gel Electrophoresis (TGGE) was employed to determine the thermal stabilities of 28 DNA fragments, 373 bp long, with two adjacent mismatched base pairs, and eight DNAs with Watson-Crick base pairs at the same positions. Heteroduplex DNAs containing two adjacent mismatches were formed by melting and reannealing pairs of homologous 373 bp DNA fragments differing by two adjacent base pairs. Product DNAs were separated based on their thermal stability by parallel and perpendicular TGGE. The polyacrylamide gel contained 3.36 M urea and 19.2 % formamide to lower the DNA melting temperatures. The order of stability was determined in the sequence context d(CXYG).d(CY'X'G) where X.X' and Y.Y" represent the mismatched or Watson-Crick base pairs. The identity of the mismatched bases and their stacking interactions influence DNA stability. Mobility transition melting temperatures (T u) of the DNAs with adjacent mismatches were 1.0-3.6 degrees C (+/-0.2 degree C) lower than the homoduplex DNA with the d(CCAG).d(CTGG) sequence. Two adjacent G.A pairs, d(CGAG).d(CGAG), created a more stable DNA than DNAs with Watson-Crick A.T pairs at the same sites. The d(GA).d(GA) sequence is estimated to be 0.4 (+/-30%) kcal/mol more stable in free energy than d(AA).d(TT) base pairs. This result confirms the unusual stability of the d(GA).d(GA) sequence previously observed in DNA oligomers. All other DNAs with adjacent mismatched base pairs were less stable than Watson-Crick homoduplex DNAs. Their relative stabilities followed an order expected from previous results on single mismatches. Two homoduplex DNAs with identical nearest neighbor sequences but different next-nearest neighbor sequences had a small but reproducible difference in T u value. This result indicates that sequence dependent next neighbor stacking interactions influence DNA stability.  相似文献   

3.
Temperature-gradient gel electrophoresis (TGGE) was employed to determine the thermal stabilities of 48 DNA fragments that differ by single base pair mismatches. The approach provides a rapid way for studying how specific base mismatches effect the stability of a long DNA fragment. Homologous 373 bp DNA fragments differing by single base pair substitutions in their first melting domain were employed. Heteroduplexes were formed by melting and reannealing pairs of DNAs, one of which was 32P-labeled on its 5'-end. Product DNAs were separated based on their thermal stability by parallel and perpendicular temperature-gradient gel electrophoresis. The order of stability was determined for all common base pairs and mismatched bases in four different nearest neighbor environments; d(GXT).d(AYC), d(GXG).d(CYC), d(CXA).d(TYG), and d(TXT).d(AYA) with X,Y = A, T, C, or G. DNA fragments containing a single mismatch were destabilized by 1 to 5 degrees C with respect to homologous DNAs with complete Watson-Crick base pairing. Both the bases at the mismatch site and neighboring stacking interactions influence the destabilization caused by a mismatch. G.T, G.G and G.A mismatches were always among the most stable mismatches for all nearest neighbor environments examined. Purine.purine mismatches were generally more stable than pyrimidine.pyrimidine mispairs. Our results are in very good agreement with data where available from solution studies of short DNA oligomers.  相似文献   

4.
Recent studies of homooligomer tracts suggest different characteristics from random sequence DNA (dA).(dT) and (dG).(dC) tracts are frequent in upstream regions and in some cases have been shown to be essential for regulation. Here we examine homooligomer occurrences in non-coding and coding eukaryotic sequences, focusing on the context in which the homooligomers occur. This analysis of sequences in the junction areas yields distinct and consistent characteristics. In particular, the nucleotide interrupting a run is most frequently complementary to the run. The base next to it is most frequently identical to the one constituting the run. For A or T runs the least frequent nearest and next to nearest neighbors are G or C. For G or C tracts the least frequent are A or T. Complementary oligomers behave similarly. These and additional trends are strongest for run lengths greater than or equal to 3. The computations are carried out on the whole eukaryotic database of greater than 4 x 10(6) nucleotides, separately for coding and non-coding regions. These same trends are evident for both groups, but are somewhat stronger for the non-coding regions. The context in which the homooligomers occur may yield some clues to DNA conformation and its biological implications.  相似文献   

5.
MOTIVATION: MELTSIM is a windows-based statistical mechanical program for simulating melting curves of DNAs of known sequence and genomic dimensions under different conditions of ionic strength with great accuracy. The program is useful for mapping variations of base compositions of sequences, conducting studies of denaturation, establishing appropriate conditions for hybridization and renaturation, determinations of sequence complexity, and sequence divergence. RESULTS: Good agreement is achieved between experimental and calculated melting curves of plasmid, bacterial, yeast and human DNAs. Denaturation maps that accompany the calculated curves indicate non-coding regions have a significantly lower (G+C) composition than coding regions in all species examined. Curves of partially sequenced human DNA suggest the current database may be heavily biased with coding regions, and excluding large (A+T)-rich elements. AVAILABILITY: MELTSIM 1.0 is available at: //www.uml.edu/Dept/Chem/UMLBIC/Apps/MEL TSIM/MELTSIM-1.0-Win/meltsim. zip. Melting curve plots in this paper were made with GNUPLOT 3.5, available at: http://www.cs.dartmouth.edu/gnuplot_inf o.html Contact : blake@maine.maine.edu;  相似文献   

6.
Early biochemical experiments measuring nearest neighbor frequencies established that the set of dinucleotide relative abundance values (dinucleotide biases) is a remarkably stable property of the DNA of an organism. Analyses of currently available genomic sequence data have extended these earlier results, showing that the dinucleotide biases evaluated for successive 50 kb segments of a genome are significantly more similar to each other than to those of sequences from more distant organisms. From this perspective, the set of dinucleotide biases constitutes a 'genomic signature' that can discriminate sequences from different organisms. The dinucleotide biases appear to reflect species-specific properties of DNA stacking energies, modification, replication, and repair mechanisms. The genomic signature is useful for detecting pathogenicity islands in bacterial genomes.  相似文献   

7.
Some rules in the ordering of nucleotides in the DNA.   总被引:2,自引:11,他引:2       下载免费PDF全文
Natural DNA sequences contain distinct nearest neighbor patterns. Eukaryotic as well as prokaryotic sequences show a consistent hierarchy in the frequencies of appearance of most doublets.  相似文献   

8.
The accumulation of the cosolutes ethylene glycol, urea, glycine, sarcosine, and glycine betaine at the single-stranded DNA surface exposed upon melting the double helix has been quantified for DNA samples of different guanine-cytosine (GC) content using the local-bulk partitioning model [Record, M. T., Jr., Zhang, W., and Anderson, C. F. (1998) Adv. Protein Chem. 51, 281-353]. Urea and ethylene glycol are both locally accumulated at single-stranded DNA relative to bulk solution. Urea exhibits a stronger affinity for adenine (A) and thymine (T) bases, leading to a greater net dehydration of these bases upon DNA melting; ethylene glycol local accumulation is practically independent of base composition. However, glycine, sarcosine, and glycine betaine are not necessarily locally accumulated at single strands after melting relative to bulk solution, although they are locally accumulated relative to double-stranded DNA. The local accumulation of glycine, sarcosine, and glycine betaine at single strands relative to double-stranded DNA decreases with bulk cosolute molality and increases with GC content for all N-methylated glycines, demonstrating a stronger affinity for G and C bases. Glycine also shows a minimum in melting temperature T(m) at 1-2 m for DNA samples of 50% GC content or less. Increasing ionic strength attenuates the local accumulation of urea, glycine, sarcosine, and glycine betaine and removes the minimum in T(m) with glycine. This attenuation in local accumulation results in counterion release during the melting transition that is dependent on water activity and, hence, cosolute molality.  相似文献   

9.
Summary Natural DNA sequences were recently found to contain distinct nearest neighbor patterns. Hetero-dinucleotides were demonstrated to appear consistently more (less) than their mirror-image counterparts. This paper shows that this assymmetric behavior does not stem from the coding requirements of the DNA. It also shows some codon patterns in prokaryotic and eukaryotic genomes which came up in the course of this work.  相似文献   

10.
Lando DY  Fridman AS 《Biopolymers》2001,58(4):374-389
Short melted regions less than 100 base pairs (bp) in length are rarely found in the differential melting curves (DMC) of natural DNAs. Therefore, it is supposed that their characteristics do not affect DNA melting behavior. However, in our previous study, a strong influence of the form of the entropy factor of small loops on melting of cross-linked DNAs was established (D. Y. Lando, A. S. Fridman et al., Journal of Biomolecular Structure and Dynamics, 1997, Vol. 15, pp. 141-150; Journal of Biomolecular Structure and Dynamics, 1998, Vol. 16, pp. 59-67). Quite different dependencies of the melting temperature on the relative concentration of interstrand cross-links were obtained for the loop entropy factors given by the Fixman-Freire (Jacobson-Stockmayer) and Wartell-Benight relations. In the present study, the influence of the entropy factor of small loops on the melting of natural DNAs, cross-linked DNAs and periodical double-stranded polynucleotides is compared using computer simulation. A fast combined computational method for calculating DNA melting curves was developed for this investigation. It allows us to assign an arbitrary dependence of the loop entropy factor on the length of melted regions for the terms corresponding to small loops (less than tau bp in length). These terms are calculated using Poland's approach. The Fixman-Freire approach is used for long loops. Our calculations have shown that the temperature dependence of the average length of interior melted regions (loops) has a maximum at T approximately T(m) (T(m) is the DNA melting temperature) in contrast to the dependence of the total average length of melted regions, which increases almost monotonously. Computer modeling demonstrates that prohibition of formation of loops less than tau base pairs in length does not markedly change the DMC for tau < 150 bp. However, the same prohibition strongly affects the average length of internal melted regions for much smaller tau's. The effect is already noticeable for tau = 1 bp and increases with tau. A tenfold increase in the entropy factor of all loops with length less than tau bp causes a noticeable alteration of the DMC for tau > or = 30 bp. It is shown that DMCs are identical for the Wartell-Benight and for the Fixman-Freire (Jacobson-Stockmayer) form of the loop entropy factor. However, for low degree of denaturation, the average length of internal melted regions is 40% lower for the Wartell-Benight form due to the fluctuational opening of short AT-rich regions less than 10 bp in length. The same calculations carried out for periodical polynucleotides demonstrate a much stronger difference in melting behavior for different forms of entropy factors of short loops. The strongest difference occurs if the length of stable GC-rich and unstable AT-rich stretches is equal to 30 bp. However, the comparison carried out in this work demonstrates that the entropy factor of short loops influences melting behavior of cross-linked DNA much stronger than of unmodified DNA with random or periodical sequences.  相似文献   

11.
The one-dimensional Ising model, with nearest neighbor correlation only, suitably modified, is used to explain the observed linear dependence of melting temperature of copolymeric DNA with GC content. Transition curves are plotted for regular, random, and Markoff distribution of base pairs for various values of a correlation parameter U between nearest neighbor bonds. Exact analytic formulas are given for fraction of bonds intact at a particular temperature for various regular distributions for all U and approximate ones for random and Markoff distributions for small U. A scheme is indicated for further improvement. The model, in principle, makes it possible to estimate the statistical distribution of base pairs from the detailed shape of the transition curve.  相似文献   

12.
Relationship between pyrimidine distribution patterns and radiosensitivity (Z) of DNA molecules of different species was derived by computer analysis of recurrence frequency of pyrimidine clusters. Blocking factors (beta) and Z for coding and non-coding DNA sequences of species from different taxonomic classes have been calculated within a new model. The radiosensitivity of coding DNA sequences practically does not vary whereas Z values were increased during evolution from simplest to higher organisms. The beta and Z values calculated for several groups of individual genes were shown to vary considerably.  相似文献   

13.
Phosphate-methylated (P.M.) DNA possesses a very high affinity for complementary natural DNA, as a result of the absence of interstrand electrostatic repulsions. In this study, a model system phosphate-methylated d[Cn] with natural d(Gk) (n less than k) is chosen for an investigation of the thermodynamic properties that determine duplex stability. The enthalpy change of a melting transition is shown to be considerably larger than is observed for corresponding natural DNA duplexes. It is found that delta Hn0 of GG/CC nearest neighbor pairwise interaction equals -15.6 kcal/mol, compared to -11.0 kcal/mol for the natural analog. The entropy change is strongly dependent on the length of the natural DNA strand and the number of phosphate-methylated DNA oligomers hybridized. The results are explained by means of a model in which a cooperative effect for subsequent hybridizations of phosphate-methylated DNA oligomers is assumed, thus giving additional stability.  相似文献   

14.
Summary The compositional distributions of coding sequences and DNA molecules (in the 50-100-kb range) are remarkably narrower in murids (rat and mouse) compared to humans (as well as to all other mammals explored so far). In murids, both distributions begin at higher and end at lower GC values. A comparison of homologous coding sequences from murids and humans revealed that their different compositional distributions are due to differences in GC levels in all three codon positions, particularly of genes located at both ends of the distribution. In turn, these differences are responsible for differences in both codon usage and amino acids. When GC levels at first+second codon positions and third codon positions, respectively, of murid genes are plotted against corresponding GC levels of homologous human genes, linear relationships (with very high correlation coefficients and slopes of about 0.78 and 0.60, respectively) are found. This indicates a conservation of the order of GC levels in homologous genes from humans and murids. (The same comparison for mouse and rat genes indicates a conservation of GC levels of homologous genes.) A similar linear relationship was observed when plotting GC levels of corresponding DNA fractions (as obtained by density gradient centrifugation in the presence of a sequence-specific ligand) from mouse and human. These findings indicate that orderly compositional changes affecting not only coding sequences but also noncoding sequences took place since the divergence of murids. Such directional fixations of mutations point to the existence of selective pressures affecting the genome as a whole.  相似文献   

15.
Oligonucleotide and codon frequencies have been determined in published sequences of E. coli DNA totaling 103,100bp with 18,459 reading frame trinucleotides; corresponding to 2.5% of the total genome. Dinucleotide frequencies are in excellent agreement with those determined by nearest neighbor chemical analysis, indicating the computer count of a limited sampling to be a good representation of the overall frequencies in total genomic DNA. The distinctive nonrandom codon pattern is found to be uniformly distributed and contributes to a distinctive nonrandom oligonucleotide pattern; enabling correlations between frequency levels to be extended beyond reading frame sequences. Correlation analysis indicates a surprisingly high degree of correlation everywhere in the genome. Coefficients of correlation between oligonucleotide frequencies overall and those in specific segments vary as follows: primary strands of individual coding sequences greater than 0.9 greater than lambda DNA greater than noncoding, non-RNA greater than phi X174 DNA greater than complementary strands greater than RNA genes congruent to 0.6 greater than transposon-insertion elements greater than T7DNA much greater than eukaryotic sequences congruent to 0. It is concluded that this high degree of oligonucleotide and codon correspondence in E. coli reflects the widespread distribution of remnants of an early and slowly changing codon pattern that has been continually dispersed by duplication-divergence processes, leading to the present genome.  相似文献   

16.
We demonstrate the elongation of various hexanucleotide sequences with thermophilic DNA polymerase, under isothermal or thermal cyclic reaction conditions. We prepared 10 types of double repeat hexanucleotide duplexes with various GC compositions containing between 0 and 6 GC nucleotides per repeat and incubated these duplexes with thermophilic Taq DNA polymerase and dNTPs at various temperatures. All of the model repetitive short duplexes were elongated under the isothermal incubation conditions, although there were some differences in the elongation efficiencies derived from the GC composition in the repetitive sequences. It was also found that all of the model repetitive duplexes were extended more effectively by a 3-step thermal cyclic reaction involving denaturation, annealing, and extension. On the basis of this technique, we prepared a glutamate-encoding short repetitive duplex and created long repetitive DNAs under isothermal and thermal cyclic reaction conditions. DNA sequencing analysis of the cloned repetitive DNA revealed that well-ordered long repetitive DNAs of various chain lengths were created by this DNA polymerase-catalyzed ligation method, and these were easily cloned into vectors by the TA-cloning method. This method could be useful for obtaining DNAs encoding arbitrary long repetitive amino acid sequences more effectively than the conventional T4 ligase-catalyzed ligation method.  相似文献   

17.
Vallone PM  Benight AS 《Biochemistry》2000,39(26):7835-7846
Effects of different end sequences on melting, circular dichroism spectra (CD), and enzyme binding properties were investigated for four 40 base pair, non-self-complementary duplex DNA oligomers. The center sequences of these oligoduplexes have either of two 22 base pair modules flanked on both sides by sequences differing in AT content. Temperature-induced melting transitions monitored by differential scanning calorimetry (DSC) and ultraviolet absorbance were measured for the six duplexes in buffered 115 mM Na(+) solutions. Values of the melting transition enthalpy, DeltaH(cal), and entropy, DeltaS(cal), were obtained directly from DSC experiments. Melting transition parameters, DeltaH(vH) and DeltaS(vH), were also estimated from a van't Hoff analysis of optical melting curves collected as a function of DNA concentration, assuming that the melting transition is two-state. Melting free energies (20 degrees C) evaluated from DSC melting experiments on the four duplex DNAs ranged from -52.2 to -77.5 kcal/mol. Free energies based on the van't Hoff analysis were -37.9 to -58.8 kcal/mol. Although the values are different, trends in the melting free energies of the four duplex DNAs as a function of sequence were identical in both DSC and optical analyses. Subject to several assumptions, values for the initiation free energy were estimated for each duplex, defined as DeltaG(int) = DeltaG(cal) - DeltaG(pred), where DeltaG(cal) is the experimental free energy at 20 degrees C determined from the experimentially measured values of the transition enthalpy, DeltaH(cal), and entropy, DeltaS(cal). The predicted free energy of the sequence, DeltaG(pred)(20 degrees C), is obtained using published nearest-neighbor sequence stability values. For three of the four duplexes, values of DeltaG(int) are essentially nil. In contrast, the duplex with 81.8% GC has a considerably higher estimate of DeltaG(int) = 7.1 kcal/mol. The CD spectra for the six duplexes collected over the wavelength range from 200 to 320 nm are also sequence-dependent. Factor analysis of the CD spectra by singular value decomposition revealed that the experimental CD spectra could be reconstructed from linear combinations of two minor and one major subspectra. Changes in the coefficients of the major subspectrum for different sequences reflect incremental sequence-dependent variations of the CD spectra. Equilibrium binding by BamHI restriction endonuclease to the 40 base pair DNAs whose central eight base pairs contain the recognition sequence for BamHI restriction enzyme bounded by A.T base pairs, 5'-A-GGATCC-A-3' was investigated. Binding assays were performed by titering BamHI against a constant concentration of each of the duplex DNA substrates, in the absence of Mg(2+), followed by analysis by gel retardation. Under the conditions employed, the enzyme binds but does not cleave the DNAs. Results of the assays revealed two binding modes with retarded gel mobilities. Binding isotherms for the fraction of bound DNA species versus enzyme concentration for each binding mode were constructed and analyzed with a simple two-step equilibrium binding model. This analysis provided semiquantitative estimates on the equilibrium binding constants for BamHI to the four DNAs. Values obtained for the binding constants varied only 7-fold and ranged from 6 x 10(-)(8) to 42 x 10(-)(8) M, with binding free energies from -8.6 to -9.7 (+/- 0.2) kcal/mol depending on the sequence that flanks the enzyme binding site. Unlike what was found earlier in binding studies of the 22 base pair duplexes that constitute the core modules of the present 40-mers [Riccelli, P. V., Vallone, P. M., Kashin, I., Faldasz, B. D., Lane, M. J., and Benight, A. S. (1999) Biochemistry 38, 11197-11208], no obvious relationship between binding and stability was found for these longer DNAs. Apparently, effects of flanking sequence stability on restriction enzyme binding may only be measurable in very short duplex deoxyoligonucl  相似文献   

18.
Stacking energies in DNA   总被引:12,自引:0,他引:12  
Variations in base mono- and dipoles result in variations in stacking energies for the 10 unique neighbor pairs in DNA. Stacking energies for pair M on N, expressed as TMN, were derived by matrix decomposition of a large set of linear algebraic expressions relating the measured Tm for subtransitions emanating from large polymeric DNAs, and the fractional neighbor frequencies, fMN, for the domains responsible for the transitions, Tm = sigma fMNTMN. Tm were determined for subtransitions that dissociate in approximately all-or-none fashion in high resolution melting profiles of partially deleted and recombinant forms of pBR322 DNA. Three different analytical maneuvers were undertaken to resolve subtransitions: site-specific cleavage of domains; deletion of domains; and addition of domains. Three dozen domains of widely divergent, quasi-random neighbor frequencies were identified and assigned, resulting in a unique set of values for TMN with standard deviation, sigma = +/- 0.23 degree C. The average difference between calculated and experimental Tm for domains is only +/- 0.17 degree C, indicating that the thermodynamic properties of these domains are not in any way unusual. Assuming delta S to be constant for all pairs, the corresponding delta HMN are found to have a precision of +/- 10 calories.mol-1 and an accuracy of +/- 606 calories.mol-1. TMN used to calculate melting curves by statistical mechanical analysis of sequences of the different plasmid specimens in this study were in quantitative agreement with observed curves for most sequences. These TMN differ significantly from those determined previously and also correlate poorly with values determined by quantum chemical analysis. Stabilities of neighbor pairs, expressed as the difference in free energy between that for a given pair (MN) and that for the average of like pairs (M, N), depend on the relationship of stacked purines and pyrimidines as follows. delta delta Gpu-py(-466 cal) greater than delta delta Gpu-pu(+52 cal) greater than delta delta Gpy-pu(+335 cal) Differences between experimental Tm and Tm calculated with TMN for the isolated neighbor pairs in the B-conformation are useful in the identification of altered structures and unusual modes of dissociation of helixes. A significantly higher Tm is observed for the highly biased repeated sequence synthetic helixes dA.dT, d(AGC).d(GCT), and d(GAT).d(ATC), reflecting auxiliary sources of stability such as bifurcated hydrogen bonds and/or altered structures for these helixes.  相似文献   

19.
秦丹  徐存拴 《遗传》2013,35(11):1253-1264
非编码DNA序列是指基因组中不编码蛋白质的DNA序列。这些序列可以结合调节因子、转录为功能性RNA、单独或协同地调节生理活动和病理过程。文章围绕基因表达调控作用, 总结了近几年非编码DNA序列的研究成果, 对其结构、功能和可能的作用机制进行了初步阐述, 介绍了目前鉴定非编码DNA序列中功能元件的计算方法和实验技术, 并对非编码DNA未来的研究进行了展望。  相似文献   

20.
By electron microscopy and melting of the DNA of some bacteriophages from the soil bacterium Rhizobium lupini, it was found that molecular weights range between 27 and 50 Mdaltons and GC contents between 53 and 62%. All DNAs studied are linear; one has exposed single-stranded terminals. Partial heat denaturation of two phage DNA permitted mapping of AT-rich sites within unique nucleotide sequences. The maps are asymmetric with respect to the midpoints of the DNA.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号