首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.

Background  

Bisulfite sequencing is a powerful technique to study DNA cytosine methylation. Bisulfite treatment followed by PCR amplification specifically converts unmethylated cytosines to thymine. Coupled with next generation sequencing technology, it is able to detect the methylation status of every cytosine in the genome. However, mapping high-throughput bisulfite reads to the reference genome remains a great challenge due to the increased searching space, reduced complexity of bisulfite sequence, asymmetric cytosine to thymine alignments, and multiple CpG heterogeneous methylation.  相似文献   

2.
Deep sequencing after bisulfite conversion (BS-Seq) is the method of choice to generate whole genome maps of cytosine methylation at single base-pair resolution. Its application to genomic DNA of Arabidopsis flower bud tissue resulted in the first complete methylome, determining a methylation rate of 6.7% in this tissue. BS-Seq reads were mapped onto an in silico converted reference genome, applying the so-called 3-letter genome method. Here, we present BiSS (Bisufite Sequencing Scorer), a new method applying Smith-Waterman alignment to map bisulfite-converted reads to a reference genome. In addition, we introduce a comprehensive adaptive error estimate that accounts for sequencing errors, erroneous bisulfite conversion and also wrongly mapped reads. The re-analysis of the Arabidopsis methylome data with BiSS mapped substantially more reads to the genome. As a result, it determines the methylation status of an extra 10% of cytosines and estimates the methylation rate to be 7.7%. We validated the results by individual traditional bisulfite sequencing for selected genomic regions. In addition to predicting the methylation status of each cytosine, BiSS also provides an estimate of the methylation degree at each genomic site. Thus, BiSS explores BS-Seq data more extensively and provides more information for downstream analysis.  相似文献   

3.
Sodium bisulfite modification-based fine mapping of methylated cytosines represents the gold standard technique for DNA methylation studies. A major problem with this approach, however is that it results in considerable DNA degradation, and large quantities of genomic DNA material are needed if numerous genomic regions are to be profiled. In this study, we examined whether whole genome amplification (WGA) techniques can be applied to sodium bisulfite-treated DNA and whether WGA would bias DNA methylation results. Sodium bisulfite-treated DNA was amplified using a standard WGA method: optimized primer-extension preamplification (PEP) with degenerate primers. Following the PCR of bisulfite-treated DNA, the DNA methylation profiles of specific DNA fragments were assessed using three approaches: (i) direct sequencing of the overall product; (ii) the sequencing of cloned PCR products; and (iii) methylation-sensitive single nucleotide primer extension (MS-SNuPE)--and compared with those obtained from bisulfite-treated DNA not subjected to WGA. Our data indicates that the DNA methylation profiles obtained from WGA of sodium bisulfite-treated DNA are consistent with those obtained from non-WGA DNA. The average difference in methylation percentage calculated from the two sets of template using MS-SNuPE was 4%. If our results are replicated on other genomic loci, WGA may become a useful technique in DNA methylation studies.  相似文献   

4.

Background

Bisulfite sequencing using next generation sequencers yields genome-wide measurements of DNA methylation at single nucleotide resolution. Traditional aligners are not designed for mapping bisulfite-treated reads, where the unmethylated Cs are converted to Ts. We have developed BS Seeker, an approach that converts the genome to a three-letter alphabet and uses Bowtie to align bisulfite-treated reads to a reference genome. It uses sequence tags to reduce mapping ambiguity. Post-processing of the alignments removes non-unique and low-quality mappings.

Results

We tested our aligner on synthetic data, a bisulfite-converted Arabidopsis library, and human libraries generated from two different experimental protocols. We evaluated the performance of our approach and compared it to other bisulfite aligners. The results demonstrate that among the aligners tested, BS Seeker is more versatile and faster. When mapping to the human genome, BS Seeker generates alignments significantly faster than RMAP and BSMAP. Furthermore, BS Seeker is the only alignment tool that can explicitly account for tags which are generated by certain library construction protocols.

Conclusions

BS Seeker provides fast and accurate mapping of bisulfite-converted reads. It can work with BS reads generated from the two different experimental protocols, and is able to efficiently map reads to large mammalian genomes. The Python program is freely available at http://pellegrini.mcdb.ucla.edu/BS_Seeker/BS_Seeker.html.  相似文献   

5.
DNA methylation has been proposed to be important in many biological processes and is the subject of intense study. Traditional bisulfite genomic sequencing allows detailed high-resolution methylation pattern analysis of each molecule with haplotype information across a few hundred bases at each locus, but lacks the capacity to gather voluminous data. Although recent technological developments are aimed at assessing DNA methylation patterns in a high-throughput manner across the genome, the haplotype information cannot be accurately assembled when the sequencing reads are short or when each hybridization target only includes one or two cytosine-phosphate-guanine (CpG) sites. Whether a distinct and nonrandom DNA methylation pattern is present at a given locus is difficult to discern without the haplotype information, and the DNA methylation patterns are much less apparent because the data are often obtained only as methylation frequencies at each CpG site with some of these methods. It would facilitate the interpretation of data obtained from high-throughput bisulfite sequencing if the loci with nonrandom DNA methylation patterns could be distinguished from those that are randomly methylated. In this study, we carried out traditional genomic bisulfite sequencing using the normal diploid human embryonic stem (hES) cell lines, and utilized Hamming distance analysis to evaluate the existence of a distinct and nonrandom DNA methylation pattern at each locus studied. Our findings suggest that Hamming distance is a simple, quick, and useful tool to identify loci with nonrandom DNA methylation patterns and may be utilized to discern links between biological changes and DNA methylation patterns in the high-throughput bisulfite sequencing data sets.  相似文献   

6.
Methylation, the addition of methyl groups to cytosine (C), plays an important role in the regulation of gene expression in both normal and dysfunctional cells. During bisulfite conversion and subsequent PCR amplification, unmethylated Cs are converted into thymine (T), while methylated Cs will not be converted. Sequencing of this bisulfite-treated DNA permits the detection of methylation at specific sites. Through the introduction of next-generation sequencing technologies (NGS) simultaneous analysis of methylation motifs in multiple regions provides the opportunity for hypothesis-free study of the entire methylome. Here we present a whole methylome sequencing study that compares two different bisulfite conversion methods (in solution versus in gel), utilizing the high throughput of the SOLiD™ System. Advantages and disadvantages of the two different bisulfite conversion methods for constructing sequencing libraries are discussed. Furthermore, the application of the SOLiD™ bisulfite sequencing to larger and more complex genomes is shown with preliminary in silico created bisulfite converted reads.  相似文献   

7.
Xu YH  Manoharan HT  Pitot HC 《BioTechniques》2007,43(3):334, 336-340, 342
The bisulfite genomic sequencing technique is one of the most widely used techniques to study sequence-specific DNA methylation because of its unambiguous ability to reveal DNA methylation status to the order of a single nucleotide. One characteristic feature of the bisulfite genomic sequencing technique is that a number of sample sequence files will be produced from a single DNA sample. The PCR products of bisulfite-treated DNA samples cannot be sequenced directly because they are heterogeneous in nature; therefore they should be cloned into suitable plasmids and then sequenced. This procedure generates an enormous number of sample DNA sequence files as well as adding extra bases belonging to the plasmids to the sequence, which will cause problems in the final sequence comparison. Finding the methylation status for each CpG in each sample sequence is not an easy job. As a result CpG PatternFinder was developed for this purpose. The main functions of the CpG PatternFinder are: (i) to analyze the reference sequence to obtain CpG and non-CpG-C residue position information. (ii) To tailor sample sequence files (delete insertions and mark deletions from the sample sequence files) based on a configuration of ClustalW multiple alignment. (iii) To align sample sequence files with a reference file to obtain bisulfite conversion efficiency and CpG methylation status. And, (iv) to produce graphics, highlighted aligned sequence text and a summary report which can be easily exported to Microsoft Office suite. CpG PatternFinder is designed to operate cooperatively with BioEdit, a freeware on the internet. It can handle up to 100 files of sample DNA sequences simultaneously, and the total CpG pattern analysis process can be finished in minutes. CpG PatternFinder is an ideal software tool for DNA methylation studies to determine the differential methylation pattern in a large number of individuals in a population. Previously we developed the CpG Analyzer program; CpG PatternFinder is our further effort to create software tools for DNA methylation studies.  相似文献   

8.
Cytosine methylation is the quintessential epigenetic mark. Two well-established methods, bisulfite sequencing and methyl-DNA immunoprecipitation (MeDIP) lend themselves to the genome-wide analysis of DNA methylation by high throughput sequencing. Here we provide an overview and brief review of these methods. We summarize our experience with MeDIP followed by high throughput Illumina/Solexa sequencing, exemplified by the analysis of the methylated fraction of the Neurospora crassa genome ("methylome"). We provide detailed methods for DNA isolation, processing and the generation of in vitro libraries for Illumina/Solexa sequencing. We discuss potential problems in the generation of sequencing libraries. Finally, we provide an overview of software that is appropriate for the analysis of high throughput sequencing data generated by Illumina/Solexa-type sequencing by synthesis, with a special emphasis on approaches and applications that can generate more accurate depictions of sequence reads that fall in repeated regions of a chosen reference genome.  相似文献   

9.
10.
High-throughput bisulfite sequencing is widely used to measure cytosine methylation at single-base resolution in eukaryotes. It permits systems-level analysis of genomic methylation patterns associated with gene expression and chromatin structure. However, methods for large-scale identification of methylation patterns from bisulfite sequencing are lacking. We developed a comprehensive tool, CpG_MPs, for identification and analysis of the methylation patterns of genomic regions from bisulfite sequencing data. CpG_MPs first normalizes bisulfite sequencing reads into methylation level of CpGs. Then it identifies unmethylated and methylated regions using the methylation status of neighboring CpGs by hotspot extension algorithm without knowledge of pre-defined regions. Furthermore, the conservatively and differentially methylated regions across paired or multiple samples (cells or tissues) are identified by combining a combinatorial algorithm with Shannon entropy. CpG_MPs identified large amounts of genomic regions with different methylation patterns across five human bisulfite sequencing data during cellular differentiation. Different sequence features and significantly cell-specific methylation patterns were observed. These potentially functional regions form candidate regions for functional analysis of DNA methylation during cellular differentiation. CpG_MPs is the first user-friendly tool for identifying methylation patterns of genomic regions from bisulfite sequencing data, permitting further investigation of the biological functions of genome-scale methylation patterns.  相似文献   

11.
DNA methylation is an epigenetic modification critical for normal development and diseases. The determination of genome-wide DNA methylation at single-nucleotide resolution is made possible by sequencing bisulfite treated DNA with next generation high-throughput sequencing. However, aligning bisulfite short reads to a reference genome remains challenging as only a limited proportion of them (around 50–70%) can be aligned uniquely; a significant proportion, known as multireads, are mapped to multiple locations and thus discarded from downstream analyses, causing financial waste and biased methylation inference. To address this issue, we develop a Bayesian model that assigns multireads to their most likely locations based on the posterior probability derived from information hidden in uniquely aligned reads. Analyses of both simulated data and real hairpin bisulfite sequencing data show that our method can effectively assign approximately 70% of the multireads to their best locations with up to 90% accuracy, leading to a significant increase in the overall mapping efficiency. Moreover, the assignment model shows robust performance with low coverage depth, making it particularly attractive considering the prohibitive cost of bisulfite sequencing. Additionally, results show that longer reads help improve the performance of the assignment model. The assignment model is also robust to varying degrees of methylation and varying sequencing error rates. Finally, incorporating prior knowledge on mutation rate and context specific methylation level into the assignment model increases inference accuracy. The assignment model is implemented in the BAM-ABS package and freely available at https://github.com/zhanglabvt/BAM_ABS.  相似文献   

12.
MOTIVATION: Methylation of cytosines in DNA plays an important role in the regulation of gene expression, and the analysis of methylation patterns is fundamental for the understanding of cell differentiation, aging processes, diseases and cancer development. Such analysis has been limited, because technologies for detailed and efficient high-throughput studies have not been available. We have developed a novel quantitative methylation analysis algorithm and workflow based on direct DNA sequencing of PCR products from bisulfite-treated DNA with high-throughput sequencing machines. This technology is a prerequisite for success of the Human Epigenome Project, the first large genome-wide sequencing study for DNA methylation in many different tissues. Methylation in tissue samples which are compositions of different cells is a quantitative information represented by cytosine/thymine proportions after bisulfite conversion of unmethylated cytosines to uracil and PCR. Calculation of quantitative methylation information from base proportions represented by different dye signals in four-dye sequencing trace files needs a specific algorithm handling imbalanced and overscaled signals, incomplete conversion, quality problems and basecaller artifacts. RESULTS: The algorithm we developed has several key properties: it analyzes trace files from PCR products of bisulfite-treated DNA sequenced directly on ABI machines; it yields quantitative methylation measurements for individual cytosine positions after alignment with genomic reference sequences, signal normalization and estimation of effectiveness of bisulfite treatment; it works in a fully automated pipeline including data quality monitoring; it is efficient and avoids the usual cost of multiple sequencing runs on subclones to estimate DNA methylation. The power of our new algorithm is demonstrated with data from two test systems based on mixtures with known base compositions and defined methylation. In addition, the applicability is proven by identifying CpGs that are differentially methylated in real tissue samples.  相似文献   

13.
While cytosine methylation has been widely studied in extant populations, relatively few studies have analyzed methylation in ancient DNA. Most existing studies of epigenetic marks in ancient DNA have inferred patterns of methylation in highly degraded samples using post-mortem damage to cytosines as a proxy for cytosine methylation levels. However, this approach limits the inference of methylation compared with direct bisulfite sequencing, the current gold standard for analyzing cytosine methylation at single nucleotide resolution. In this study, we used direct bisulfite sequencing to assess cytosine methylation in ancient DNA from the skeletal remains of 30 Native Americans ranging in age from approximately 230 to 4500 years before present. Unmethylated cytosines were converted to uracils by treatment with sodium bisulfite, bisulfite products of a CpG-rich retrotransposon were pyrosequenced, and C-to-T ratios were quantified for a single CpG position. We found that cytosine methylation is readily recoverable from most samples, given adequate preservation of endogenous nuclear DNA. In addition, our results indicate that the precision of cytosine methylation estimates is inversely correlated with aDNA preservation, such that samples of low DNA concentration show higher variability in measures of percent methylation than samples of high DNA concentration. In particular, samples in this study with a DNA concentration above 0.015 ng/μL generated the most consistent measures of cytosine methylation. This study presents evidence of cytosine methylation in a large collection of ancient human remains, and indicates that it is possible to analyze epigenetic patterns in ancient populations using direct bisulfite sequencing approaches.  相似文献   

14.
DNA methylation plays a crucial role in higher organisms. Coupling bisulfite treatment with next generation sequencing enables the interrogation of 5-methylcytosine sites in the genome. However, bisulfite conversion introduces mismatches between the reads and the reference genome, which makes mapping of Illumina and SOLiD reads slow and inaccurate. BatMeth is an algorithm that integrates novel Mismatch Counting, List Filtering, Mismatch Stage Filtering and Fast Mapping onto Two Indexes components to improve unique mapping rate, speed and precision. Experimental results show that BatMeth is faster and more accurate than existing tools. BatMeth is freely available at http://code.google.com/p/batmeth/.  相似文献   

15.
DNA methylation is an epigenetic mark at the interface of genetic and environmental factors relevant to human disease. Quantitative assessments of global DNA methylation levels have therefore become important tools in epidemiology research, particularly for understanding effects of environmental exposures in complex diseases. Among the available methods of quantitative DNA methylation measurements, bisulfite sequencing is considered the gold standard, but whole-genome bisulfite sequencing (WGBS) has previously been considered too costly for epidemiology studies with high sample numbers. Pyrosequencing of repetitive sequences within bisulfite-treated DNA has been routinely used as a surrogate for global DNA methylation, but a comparison of pyrosequencing to WGBS for accuracy and reproducibility of methylation levels has not been performed. This study compared the global methylation levels measured from uniquely mappable (non-repetitive) WGBS sequences to pyrosequencing assays of several repeat sequences and repeat assay-matched WGBS data and determined uniquely mappable WGBS data to be the most reproducible and accurate measurement of global DNA methylation levels. We determined sources of variation in repetitive pyrosequencing assays to be PCR amplification bias, PCR primer selection bias in methylation levels of targeted sequences, and inherent variability in methylation levels of repeat sequences. Low-coverage, uniquely mappable WGBS showed the strongest correlation between replicates of all assays. By using multiplexing by indexed bar codes, the cost of WGBS can be lowered significantly to improve the accuracy of global DNA methylation assessments for human studies.  相似文献   

16.
Bisulfite genomic sequencing is a widely used technique foranalyzing cytosine-methylation of DNA. By treating DNA withbisulfite, cytosine residues are deaminated to uracil, whileleaving 5-methylcytosine largely intact. Subsequent PCR andnucleotide sequence analysis permit unequivocal determinationof the methylation status at cytosine residues. A major caveatassociated with the currently practiced procedure is that ittakes 16–20 hr for completion of the conversion of cytosineto uracil. Here we report that a complete deamination of cytosineto uracil can be achieved in shorter periods by using a highlyconcentrated bisulfite solution at an elevated temperature.Time course experiments demonstrated that treating DNA with9 M bisulfite for 20 min at 90°C or 40 min at 70°C allcytosine residues in the DNA were converted to uracil. Underthese conditions, the majority of 5-methylcytosines remainedintact. When a high molecular weight DNA derived from a cellline (containing a number of genes whose methylation statuswas known) was treated with bisulfite under the above conditionsand amplified and sequenced, the results obtained were consistentwith those reported in the literature. Although some degradationof DNA occurred during this process, the amount of treated DNArequired for the amplification was nearly equal to that requiredfor the conventional bisulfite genomic sequencing procedure.The increased speed of DNA methylation analysis with this novelprocedure is expected to advance various aspects of DNA sciences.  相似文献   

17.
Bisulfite sequencing is a key methodology in epigenetics. However, the standard workflow of bisulfite sequencing involves heat and strongly basic conditions to convert the intermediary product 5,6-dihydrouridine-6-sulfonate (dhU6S) (generated by reaction of bisulfite with deoxycytidine (dC)) to uracil (dU). These harsh conditions generally lead to sample loss and DNA damage while milder conditions may result in incomplete conversion of intermediates to uracil. Both can lead to poor recovery of bisulfite-treated DNA by the polymerase chain reaction (PCR) as either damaged DNA and/or intermediates of bisulfite treatment are poor substrate for standard DNA polymerases. Here we describe an engineered DNA polymerase (5D4) with an enhanced ability to replicate and PCR amplify bisulfite-treated DNA due to an ability to bypass both DNA lesions and bisulfite intermediates, allowing significantly milder conversion conditions and increased sensitivity in the PCR amplification of bisulfite-treated DNA. Incorporation of the 5D4 DNA polymerase into the bisulfite sequencing workflow thus promises significant sensitivity and efficiency gains.  相似文献   

18.
DNA methylation strongly affects chromatin structure and the regulation of gene expression. For many years, bisulfite sequencing PCR (BSP) has served as the “gold standard” for measuring DNA methylation. However, with the evolution of pyrosequencing as a tool to evaluate DNA methylation, the need arises to compare the relative efficiencies of the two techniques in measuring DNA methylation. We provide for the first time a direct assessment of BSP and pyrosequencing to detect and quantify hypomethylation, hypermethylation, and mixed methylation of the ABCB1 promoter in various drug-sensitive and drug-resistant MCF-7 breast cancer cell lines through head-to-head experimentation. Our findings indicate that although both methods can reliably detect increased, decreased, and mixed methylation of DNA, BSP appears to be more sensitive than pyrosequencing at detecting strong hypermethylation of DNA. However, we also observed greater variability in the methylation of CpG sites by BSP, possibly due to the additional bacterial cloning step required by BSP over pyrosequencing. BSP and pyrosequencing equally detected hypomethylation and mixed methylation of DNA. The ability of pyrosequencing to reliably detect differences in DNA methylation across cell populations without requiring the cloning of bisulfite-treated DNA into bacterial expression vectors was seen as a major advantage of this technique.  相似文献   

19.
20.
Bisulfite sequencing is a valuable tool for mapping the position of 5-methylcytosine in the genome at single base resolution. However, the associated chemical treatment causes strand scission, which depletes the number of sequenceable DNA fragments in a library and thus necessitates PCR amplification. The AT-rich nature of the library generated from bisulfite treatment adversely affects this amplification, resulting in the introduction of major biases that can confound methylation analysis. Here, we report a method that enables more accurate methylation analysis, by rebuilding bisulfite-damaged components of a DNA library. This recovery after bisulfite treatment (ReBuilT) approach enables PCR-free bisulfite sequencing from low nanogram quantities of genomic DNA. We apply the ReBuilT method for the first whole methylome analysis of the highly AT-rich genome of Plasmodium berghei. Side-by-side comparison to a commercial protocol involving amplification demonstrates a substantial improvement in uniformity of coverage and reduction of sequence context bias. Our method will be widely applicable for quantitative methylation analysis, even for technically challenging genomes, and where limited sample DNA is available.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号