首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Wang JP  Widom J 《Nucleic acids research》2005,33(21):6743-6755
DNA sequences that are present in nucleosomes have a preferential approximately 10 bp periodicity of certain dinucleotide signals, but the overall sequence similarity of the nucleosomal DNA is weak, and traditional multiple sequence alignment tools fail to yield meaningful alignments. We develop a mixture model that characterizes the known dinucleotide periodicity probabilistically to improve the alignment of nucleosomal DNAs. We assume that a periodic dinucleotide signal of any type emits according to a probability distribution around a series of 'hot spots' that are equally spaced along nucleosomal DNA with 10 bp period, but with a 1 bp phase shift across the middle of the nucleosome. We model the three statistically most significant dinucleotide signals, AA/TT, GC and TA, simultaneously, while allowing phase shifts between the signals. The alignment is obtained by maximizing the likelihood of both Watson and Crick strands simultaneously. The resulting alignment of 177 chicken nucleosomal DNA sequences revealed that all 10 distinct dinucleotides are periodic, however, with only two distinct phases and varying intensity. By Fourier analysis, we show that our new alignment has enhanced periodicity and sequence identity compared with center alignment. The significance of the nucleosomal DNA sequence alignment is evaluated by comparing it with that obtained using the same model on non-nucleosomal sequences.  相似文献   

2.
Positional distributions of various dinucleotides in experimentally derived human nucleosome DNA sequences are analyzed. Nucleosome positioning in this species is found to depend largely on GG and CC dinucleotides periodically distributed along the nucleosome DNA sequence, with the period of 10.4 bases. The GG and CC dinucleotides oscillate counterphase, i.e., their respective preferred positions are shifted about a half-period from one another, as it was observed earlier for AA and TT dinucleotides. Other purine-purine and pyrimidine-pyrimidine dinucleotides (RR and YY) display the same periodical and counterphase pattern. The dominance of oscillating GG and CC dinucleotides in human nucleosomes and the contribution of AG(CT), GA(TC), and AA(TT) suggest a general nucleosome DNA sequence pattern - counterphase oscillation of RR and YY dinucleotides. AA and TT dinucleotides, commonly accepted as major players, are only weak contributors in the case of human nucleosomes.  相似文献   

3.
Sequence-directed DNA curvature is most commonly associated with AA dinucleotides in the form of polyadenine runs. We demonstrate inherent curvature in DNA which lacks AA/TT dinucleotides using the criteria of polyacrylamide gel mobility and efficiency of DNA cyclization. These studies are based upon two 21-base pair synthetic DNA fragments designed to exhibit fixed curvature according to deflections made to the helical axis by non-AA dinucleotide stacks. Repeats of these sequences display anomalously slow migration in polyacrylamide gels. Moreover, both sequences describe helical conformations that are closed into circles by DNA ligase at much smaller sizes than is typical of nondeformed DNA. Chemical cleavage of these DNA molecules with hydroxyl radical is also consistent with local variation in helical conformation at specific dinucleotide steps.  相似文献   

4.
Positional correlation analysis for the complete genome of Saccharomyces cerevisiae is performed with the aim to reveal possible chromatin-related sequence features. A strong periodicity with the period 10.4 bases is detected in the distance histograms for the dinucleotides AA and TT, with the characteristic decay distance of approximately 50 base pairs. The oscillations are observed as well in the distributions of other dinucleotides. However, the respective amplitudes are small, consistent with secondary effects, due to dominant periodicity of AA and TT. The observations are in accord with earlier data on the chromatin sequence periodicities and nucleosome DNA sequence patterns. The autocorrelations of AA and TT dinucleotides in yeast include also a counter-phase component. A tentative DNA sequence pattern for the yeast nucleosomes is suggested and verified by comparison of its autocorrelation plots with the respective natural autocorrelations. The nucleosome mapping guided by the pattern is in accord with experimental data on the linker length distribution in yeast.  相似文献   

5.
Abstract

Positional correlation analysis for the complete genome of Saccharomyces cerevisiae is performed with the aim to reveal possible chromatin-related sequence features. A strong periodicity with the period 10.4 bases is detected in the distance histograms for the dinucleotides AA and TT, with the characteristic decay distance of approximately 50 base pairs. The oscillations are observed as well in the distributions of other dinucleotides. However, the respective amplitudes are small, consistent with secondary effects, due to dominant periodicity of AA and TT. The observations are in accord with earlier data on the chromatin sequence periodicities and nucleosome DNA sequence patterns. The autocorrelations of AA and TT dinucleotides in yeast include also a counter-phase component. A tentative DNA sequence pattern for the yeast nucleosomes is suggested and verified by comparison of its autocorrelation plots with the respective natural autocorrelations. The nucleosome mapping guided by the pattern is in accord with experimental data on the linker length distribution in yeast.  相似文献   

6.
Nucleosome formation and positioning, which play important roles in a number of biological processes, are thought to be related to the distinctive periodic dinucleotide patterns observed in the DNA sequence wrapped around the protein octamer. Previous research shows that flexibility is a key structural property of a nucleosomal DNA sequence. However, the relationship between the flexibility and the periodic dinucleotide patterns has received little attention in research in the past. In this study, we propose the use of three different models to measure the flexibility of yeast DNA sequences. Although the three models involve different parameters, they deliver consistent results showing that yeast nucleosomal DNA sequences are more flexible than non-nucleosomal ones. In contrast to random flexibility values along non-nucleosomal DNA sequences, the flexibility of nucleosomal DNA sequences shows a clear periodicity of 10.14 base pairs, which is consistent with the periodicity of dinucleotide distributions. We also demonstrate that there is a strong relationship between the peak positions of the flexibility and the dinucleotide frequencies. Correlation between the flexibility and the dinucleotide patterns of CA/TG, CG, GC, GG/CC, AG/CT, AC/GT and GA/TC are positive with an average value of 0.5946. The highest correlation is shown by CA/TG with a value of 0.7438 and the lowest correlation is shown by AA/TT with a value of −0.7424. The source codes and data sets are available for downloading on http://www.hy8.com/bioinformatics.htm.  相似文献   

7.
Extensive DNA sequence analysis of three eukaryotes, S. cerevisiae, C. elegans, and D. melanogaster, reveals two different AA/TT periodical patterns associated with the nucleosome positioning. The first pattern is the counter-phase oscillation of AA and TT dinucleotides, which has been frequently considered as the nucleosome DNA pattern. This represents the sequence rule I for chromatin structure. The second pattern is the in-phase oscillation of the AA and TT dinucleotides with the same nucleosome DNA period, 10.4 bases. This pattern apparently corresponds to curved DNA, that also participates in the nucleosome formation, and represents the sequence rule II for chromatin. The positional correlations of AA and TT dinucleotides also indicate that the nucleosomes are separated by specific linker sizes (preferably 8, 18, ... bases), dictated by the steric exclusion rules. Thus, the sequence positions of the neighboring nucleosomes are correlated, and this represents the sequence rule III.  相似文献   

8.
An estimated 80% of genomic DNA in eukaryotes is packaged as nucleosomes, which, together with the remaining interstitial linker regions, generate higher order chromatin structures [1]. Nucleosome sequences isolated from diverse organisms exhibit ∼10 bp periodic variations in AA, TT and GC dinucleotide frequencies. These sequence elements generate intrinsically curved DNA and help establish the histone-DNA interface. We investigated an important unanswered question concerning the interplay between chromatin organization and genome evolution: do the DNA sequence preferences inherent to the highly conserved histone core exert detectable natural selection on genomic divergence and polymorphism? To address this hypothesis, we isolated nucleosomal DNA sequences from Drosophila melanogaster embryos and examined the underlying genomic variation within and between species. We found that divergence along the D. melanogaster lineage is periodic across nucleosome regions with base changes following preferred nucleotides, providing new evidence for systematic evolutionary forces in the generation and maintenance of nucleosome-associated dinucleotide periodicities. Further, Single Nucleotide Polymorphism (SNP) frequency spectra show striking periodicities across nucleosomal regions, paralleling divergence patterns. Preferred alleles occur at higher frequencies in natural populations, consistent with a central role for natural selection. These patterns are stronger for nucleosomes in introns than in intergenic regions, suggesting selection is stronger in transcribed regions where nucleosomes undergo more displacement, remodeling and functional modification. In addition, we observe a large-scale (∼180 bp) periodic enrichment of AA/TT dinucleotides associated with nucleosome occupancy, while GC dinucleotide frequency peaks in linker regions. Divergence and polymorphism data also support a role for natural selection in the generation and maintenance of these super-nucleosomal patterns. Our results demonstrate that nucleosome-associated sequence periodicities are under selective pressure, implying that structural interactions between nucleosomes and DNA sequence shape sequence evolution, particularly in introns.  相似文献   

9.
Wang DD  Yan H 《Physical biology》2011,8(6):066004
Nucleosomes, which contain DNA and proteins, are the basic unit of eukaryotic chromatins. Polymers such as DNA and proteins are dynamic, and their conformational changes can lead to functional changes. Periodic dinucleotide patterns exist in nucleosomal DNA chains and play an important role in the nucleosome structure. In this paper, we use normal mode analysis to detect significant structural deformations of nucleosomal DNA and investigate the relationship between periodic dinucleotides and DNA motions. We have found that periodic dinucleotides are usually located at the peaks or valleys of DNA and protein motions, revealing that they dominate the nucleosome dynamics. Also, a specific dinucleotide pattern CA/TG appears most frequently.  相似文献   

10.
Liu H  Wu J  Xie J  Yang X  Lu Z  Sun X 《Biophysical journal》2008,94(12):4597-4604
By analyzing dinucleotide position-frequency data of yeast nucleosome-bound DNA sequences, dinucleotide periodicities of core DNA sequences were investigated. Within frequency domains, weakly bound dinucleotides (AA, AT, and the combinations AA-TT-TA and AA-TT-TA-AT) present doublet peaks in a periodicity range of 10-11 bp, and strongly bound dinucleotides present a single peak. A time-frequency analysis, based on wavelet transformation, indicated that weakly bound dinucleotides of core DNA sequences were spaced smaller (∼10.3 bp) at the two ends, with larger (∼11.1 bp) spacing in the middle section. The finding was supported by DNA curvature and was prevalent in all core DNA sequences. Therefore, three approaches were developed to predict nucleosome positions. After analyzing a 2200-bp DNA sequence, results indicated that the predictions were feasible; areas near protein-DNA binding sites resulted in periodicity profiles with irregular signals. The effects of five dinucleotide patterns were evaluated, indicating that the AA-TT pattern exhibited better performance. A chromosome-scale prediction demonstrated that periodicity profiles perform better than previously described, with up to 59% accuracy. Based on predictions, nucleosome distributions near the beginning and end of open reading frames were analyzed. Results indicated that the majority of open reading frames’ start and end sites were occupied by nucleosomes.  相似文献   

11.
Abstract

Extensive DNA sequence analysis of three eukaryotes, S. cerevisiae, C. elegans, and D. melanogaster, reveals two different AA/TT periodical patterns associated with the nucleosome positioning. The first pattern is the counter-phase oscillation of AA and TT dinucleotides, which has been frequently considered as the nucleosome DNA pattern. This represents the sequence rule I for chromatin structure. The second pattern is the in-phase oscillation of the AA and TT dinucleotides with the same nucleosome DNA period, 10.4 bases. This pattern apparently corresponds to curved DNA, that also participates in the nucleosome formation, and represents the sequence rule II for chromatin. The positional correlations of AA and TT dinucleotides also indicate that the nucleosomes are separated by specific linker sizes (preferably 8, 18,…bases), dictated by the steric exclusion rules. Thus, the sequence positions of the neighboring nucleosomes are correlated, and this represents the sequence rule III.  相似文献   

12.
By measuring prevailing distances between YY, YR, RR, and RY dinucleotides in the large database of the nucleosome DNA fragments from C. elegans, the consensus sequence structure of the nucleosome DNA repeat of C. elegans was reconstructed: (YYYYYRRRRR)n. An actual period was estimated to be 10.4 bases. The pattern is fully consistent with the nucleosome DNA patterns of other eukaryotes, as established earlier, and, thus, the YYYYYRRRRR repeat can be considered as consensus nucleosome DNA sequence repeat across eukaryotic species. Similar distance analysis for [A, T] dinucleotides suggested the related pattern (TTTYTARAAA)n where the TT and AA dinucleotides display rather out of phase behavior, contrary to the "AA or TT" in-phase periodicity, considered in some publications. A weak 5-base periodicity in the distribution of TA dinucleotides was detected.  相似文献   

13.
It is generally accepted that the organization of eukaryotic DNA into chromatin is strongly governed by a code inherent in the genomic DNA sequence. This code, as well as other codes, is superposed on the triplets coding for amino acids. The history of the chromatin code started three decades ago with the discovery of the periodic appearance of certain dinucleotides, with AA/TT and RR/YY giving the strongest signals, all with a period of 10.4 bases. Every base-pair stack in the DNA duplex has specific deformation properties, thus favoring DNA bending in a specific direction. The appearance of the corresponding dinucleotide at the distance 10.4 xn bases will facilitate DNA bending in that direction, which corresponds to the minimum energy of DNA folding in the nucleosome. We have analyzed the periodic appearances of all 16 dinucleotides in the genomes of thirteen different eukaryotic organisms. Our data show that a large variety of dinucleotides (if not all) are, apparently, contributing to the nucleosome positioning code. The choice of the periodical dinucleotides differs considerably from one organism to another. Among other 10.4 base periodicities, a strong and very regular 10.4 base signal was observed for CG dinucleotides in the genome of the honey bee A. mellifera. Also, the dinucleotide CG appears as the only periodical component in the human genome. This observation seems especially relevant since CpG methylation is well known to modulate chromatin packing and regularity. Thus, the selection of the dinucleotides contributing to the chromatin code is species specific, and may differ from region to region, depending on the sequence context.  相似文献   

14.
Dinucleosome formation is the first step in the organization of the higher order chromatin structure. With the ultimate aim of elucidating the dinucleosome structure, we constructed a library of human dinucleosome DNA. The library consists of PCR-amplifiable DNA fragments obtained by treatment of nuclei of erythroid K562 cells with micrococcal nuclease followed by extraction of DNA and adaptor ligation to the blunt-ended DNA fragments. The library was then cloned using a plasmid vector and the sequences of the clones were determined. The dominating clones containing the Alu elements were removed. A total of 1002 clones, which comprised a dinucleosome database, contained 84 and 918 clones from the clones before and after removing Alu elements, respectively. Approximately 70% of the clones were between 300 and 400 bp in size and they were distributed to various locations of all chromosomes except the Y chromosome. The clones containing A(2)N(8)A(2)N(8)A(2) or T(2)N(8)T(2)N(8)T(2) sequences were classified into three types, Type I (N shape), Type II (V shape) and Type III (M shape) according to DNA curvature plots. The locations of experimentally determined curved DNA segments matched well with the calculated ones though the clones of Types I and III showed additional curved DNA segments as revealed by the curvature plots. The distributions of complementary dinucleotides in the nucleosome DNA, at the ends of the dinucleosome DNA clones, allowed us to predict the positions of the nucleosome dyad axis, and estimate the size of the nucleosome core DNA, 125nt. The distributions of AA and TT dinucleotides, as well as other RR and YY dinucleotides, showed a periodicity with an average period of 10.4 bases, close to the values observed before. Mapping of nucleosome positions in the dinucleosome database based on the observed periodicity revealed that the nucleosomes were separated by a linker of 7.5+ approximately 10 x n nt. This indicates that the nucleosome-nucleosome orientations are, typically, halfway between parallel and antiparallel. Also an important finding is that the distributions of AA/TT and other RR/YY dinucleotides, apparently, reflect both DNA curvature and DNA bendability, cooperatively contributing to the nucleosome formation.  相似文献   

15.
Abstract

By measuring prevailing distances between YY, YR, RR, and RY dinucleotides in the large database of the nucleosome DNA fragments from C. elegans, the consensus sequence structure of the nucleosome DNA repeat of C. elegans was reconstructed: (YYYYYRRRRR)n. An actual period was estimated to be 10.4 bases. The pattern is fully consistent with the nucleosome DNA patterns of other eukaryotes, as established earlier, and, thus, the YYYYYRRRRR repeat can be considered as consensus nucleosome DNA sequence repeat across eukaryotic species. Similar distance analysis for [A, T] dinucleotides suggested the related pattern (TTTYTARAAA)n where the TT and AA dinucleotides display rather out of phase behavior, contrary to the “AA or TT” in-phase periodicity, considered in some publications. A weak 5-base periodicity in the distribution of TA dinucleotides was detected.  相似文献   

16.
A protocol was devised to select for DNA molecules that efficiently form circles from a library of 126 base pair DNAs containing 90 randomized base pairs. After six rounds of selection, individual molecules from the library showed 20‐ to 100‐fold greater j‐factors compared with the starting library, validating the selection protocol. High‐throughput sequencing revealed a sinusoidal pattern of enrichment and de‐enrichment of A/T dinucleotides in the random region with a 10.4 base pair period associated with the helicity of DNA. A similar, but more moderate pattern of C/G dinucleotides was offset by precisely half a helical turn. While C/G dinucleotide enrichments were evenly distributed, A/T dinucleotide enrichments displayed a preference to cluster in individual DNA molecules. The most highly enriched 10 base pair sequences in the random region contained adjacent blocks of A/T and C/G trinucleotides present in some, but not all, rapidly cyclizing molecules. The phased dinucleotide enrichments closely match those present in accurately mapped yeast nucleosomes, confirming the importance of DNA bending in nucleosome formation. However, at certain sites the nucleosomal DNAs show dinucleotide enrichments that differ substantially from the cyclization data. These discrepancies can often be correlated with sequence specific contacts that form between histones and DNA. © 2015 Wiley Periodicals, Inc. Biopolymers 103: 303–320, 2015.  相似文献   

17.
Prokaryotic sequences are responsible for more than just protein coding. There are two 10- to 11-base periodical patterns superimposed on the protein coding message within the same sequence. Positional auto- and cross-correlation analysis of the sequences shows that these two patterns are a short-range counter-phase oscillation of AA and TT dinucleotides and a medium-range in-phase oscillation of the same dinucleotides, spanning distances of up to ∼30 and ∼100 bases, respectively. The short-range oscillation is encoded by the amino acid sequences themselves, apparently, due to the presence of amphipathic α-helices in the proteins. The medium-range oscillation, related to DNA folding in the cell, is created largely by a special choice of the bases in the third positions of the codons. Interestingly, the amino acid sequences do contribute to that signal as well. That is, the very amino acid sequences are, to some extent, degenerate to serve the same oscillating pattern that is associated with the degenerate third codon positions. [Reviewing Editor: Dr. Richard Kliman]  相似文献   

18.
19.
Trypanosomatid mitochondrial DNA (kDNA) possesses thousands of copies of small circular molecules called minicircles. Due to a high level of nucleotide polymorphism among copies, sequence alignment for species or strain characterization is not appropriate. In this work we report dinucleotide absolute frequency as a method to analyze minicircle sequences heterogeneity in trypanosomatids. Using Trypanosoma rangeli and Leishmania guyanensis minicircles as example of sequence length heterogeneity, we show that dinucleotide frequency of minicircles whose length variation is less than to 10% is relatively constant. Dinucleotide frequencies in Leishmania genus point out three clusters of predominant dinucleotide profiles: GG/TT/TG for Old World species; ii) TT/AA/TA for New World species and iii) TT/GG(AA) TA(AT) for Sauroleishmania. Trypanosoma species displayed broad range composition and the highest frequency values. Their dinucleotide profile appears to be species specific, except for African trypanosomes which exhibit similar composition. The low number of sequences from Crithidia, Herpetomonas, Phytomonas and Wallaceina did not allow a generalized analysis, however some species present highly similar compositional profile, e.g., Wallaceina species. Distinct signatures for Trypanosomatidae family members can be generated by using values of absolute frequencies, range and composition of most/least frequent dinucleotides from minicircles. Each species can be graphically represented by a diagram of frequencies along with a box plot of summary statistics.  相似文献   

20.
Sequence-dependent deformational anisotropy of chromatin DNA.   总被引:26,自引:14,他引:12       下载免费PDF全文
As found in previous work (E.N. Trifonov and J.L. Sussman, Proc. Natl. Acad. Sci. USA, in press) some dinucleotides of the chromatin DNA sequences have a clear tendency to be repeated along the sequences with a period of about 10.5 bases. A special iteration procedure is developed to find if there are phase relationships between different periodically repeating dinucleotides of chromatin DNA. A very specific symmetrical pattern of preferences of different dinucleotides to certain positions within a repeating 10.5 base frame is indeed found. This is interpreted as a manifestation of sequence-dependent deformational anisotropy of the chromatin DNA which facilitates its smooth folding in chromatin. The pattern found can be used for locating unidirectionally curved portions of the DNA molecules, possibly corresponding to nucleosomal DNA. This implies that the DNA is bound to the nucleosomes by one specific side which corresponds to the direction of the sequence-dependent curving of the DNA axis. The 10.5 base periodicity found can be considered as the second message present in chromatin DNA sequences together with 3 base frame coding message.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号