首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
One of the main endeavors in today's life science remains the efficient sequencing of long DNA molecules. Today, most de novo sequencing of DNA is still performed using the electrophoresis-based Sanger concept of 1977, in spite of certain restrictions of this method. Methods using mass spectrometry to acquire the Sanger sequencing data are limited by short sequencing lengths of 15-25 nt. We propose a new method for DNA sequencing using base-specific cleavage and mass spectrometry that appears to be a promising alternative to classical DNA sequencing approaches. A single stranded DNA or RNA molecule is cleaved by a base-specific (bio-)chemical reaction using, for example, RNAses. The cleavage reaction is modified such that not all, but only a certain percentage of bases are cleaved. The resulting mixture of fragments is then analyzed using MALDI-TOF mass spectrometry, whereby we acquire the molecular masses of fragments. For every peak in the mass spectrum, we calculate those base compositions that will potentially create a peak of the observed mass and, repeating the cleavage reaction for all four bases, finally try to uniquely reconstruct the underlying sequence from these observed spectra. This leads us to the combinatorial problem of sequencing from compomers and, finally, to the graph-theoretical problem of finding a walk in a subgraph of the de Bruijn graph. Application of this method to simulated data indicates that it might be capable of sequencing DNA molecules with 200+ nt.  相似文献   

2.
3.
Different protocols based on Illumina high-throughput DNA sequencing and denaturing gradient gel electrophoresis (DGGE)-cloning were developed and applied for investigating hot spring related samples. The study was focused on three target genes: archaeal and bacterial 16S rRNA and mcrA of methanogenic microflora. Shorter read lengths of the currently most popular technology of sequencing by Illumina do not allow analysis of the complete 16S rRNA region, or of longer gene fragments, as was the case of Sanger sequencing. Here, we demonstrate that there is no need for special indexed or tailed primer sets dedicated to short variable regions of 16S rRNA since the presented approach allows the analysis of complete bacterial 16S rRNA amplicons (V1–V9) and longer archaeal 16S rRNA and mcrA sequences. Sample augmented with transposon is represented by a set of approximately 300 bp long fragments that can be easily sequenced by Illumina MiSeq. Furthermore, a low proportion of chimeric sequences was observed. DGGE-cloning based strategies were performed combining semi-nested PCR, DGGE and clone library construction. Comparing both investigation methods, a certain degree of complementarity was observed confirming that the DGGE-cloning approach is not obsolete. Novel protocols were created for several types of laboratories, utilizing the traditional DGGE technique or using the most modern Illumina sequencing.  相似文献   

4.
All currently available DNA sequencing protocols rest fundamentally upon the homogeneity of the template. In this paper we describe the parallel DNA sequencing of various templates in one sample by a combination of the Sanger method and MALDI-TOF mass spectrometric analysis of the products. PCR-amplified hypervariable 16S rDNA fragments of the bacterium Escherichia coli DF1020 and cDNA of the 6-phosphofructo-1-kinase isoenzymes (PFK-1, EC 2.7.1.11) in rat brain were chosen as model systems for essentially heterogeneous templates. Avoiding cloning of the inhomogeneous PCR products we were able to read three sequences for both the 16S rDNA fragment of E.coli DF1020 and the cDNA of 6-phosphofructo-1-kinase from the peak lists of the Sanger sequencing reactions. Short sequences with a length between 21 and 25 nt were sufficient to reflect the heterogeneity of the 16S rDNA genes in E.coli and the existence of three isoenzymes of PFK-1 in rat brain.  相似文献   

5.
DNA barcoding is an effective approach for species identification and for discovery of new and/or cryptic species. Sanger sequencing technology is the method of choice for obtaining standard 650 bp cytochrome c oxidase subunit I (COI) barcodes. However, DNA degradation/fragmentation makes it difficult to obtain a full-length barcode from old specimens. Mini-barcodes of 130 bp from the standard barcode region have been shown to be effective for accurate identification in many animal groups and may be readily obtained from museum samples. Here we demonstrate the application of an alternative sequencing technology, the four-enzymes single-specimen pyrosequencing, in rapid, cost-effective mini-barcode analysis. We were able to generate sequences of up to 100 bp from mini-barcode fragments of COI in 135 fresh and 50 old Lepidoptera specimens (ranging from 53-97 year-old). The sequences obtained using pyrosequencing were of high quality and we were able to robustly match all the tested pyro-sequenced samples to their respective Sanger-sequenced standard barcode sequences, where available. Simplicity of the protocol and instrumentation coupled with higher speed and lower cost per sequence than Sanger sequencing makes this approach potentially useful in efforts to link standard barcode sequences from unidentified specimens to known museum specimens with only short DNA fragments.  相似文献   

6.
Targeted high-throughput sequencing of tagged nucleic acid samples   总被引:6,自引:2,他引:4  
High-throughput 454 DNA sequencing technology allows much faster and more cost-effective sequencing than traditional Sanger sequencing. However, the technology imposes inherent limitations on the number of samples that can be processed in parallel. Here we introduce parallel tagged sequencing (PTS), a simple, inexpensive and flexible barcoding technique that can be used for parallel sequencing any number and type of double-stranded nucleic acid samples. We demonstrate that PTS is particularly powerful for sequencing contiguous DNA fragments such as mtDNA genomes: in theory as many as 250 mammalian mtDNA genomes can be sequenced in a single GS FLX run. PTS dramatically increases the sequencing throughput of samples in parallel and thus fully mobilizes the resources of the 454 technology for targeted sequencing.  相似文献   

7.
Bisulfite sequencing is widely used for analysis of DNA methylation status (i.e., 5-methylcytosine [5mC] vs. cytosine [C]) in CpG-rich or other loci in genomic DNA (gDNA). Such methods typically involve reaction of gDNA with bisulfite followed by polymerase chain reaction (PCR) amplification of specific regions of interest that, overall, converts C→T (thymine) and 5mC→C and then capillary sequencing to measure C versus T composition at CpG sites. Massively parallel sequencing by oligonucleotide ligation and detection (SOLiD) has recently enabled relatively low-cost whole genome sequencing, and it would be highly desirable to apply such massively parallel sequencing to bisulfite-converted whole genomes to determine DNA methylation status of an entire genome, which has heretofore not been reported. As an initial step toward achieving this goal, we have extended our ongoing interest in improving bisulfite conversion sample preparation to include a human genome-wide fragment library for SOliD. The current article features novel use of formamide denaturant during bisulfite conversion of a suitably constructed library directly in a band slice from polyacryamide gel electrophoresis (PAGE). To validate this new protocol for 5mC-protected fragment library conversion, which we refer to as Bis-PAGE, capillary-based size analysis and Sanger sequencing were carried out for individual amplicons derived from single-molecule PCR (smPCR) of randomly selected library fragments. smPCR/Capillary Sanger sequencing of approximately 200 amplicons unambiguously demonstrated greater than 99% C→T conversion. All of these approximately 200 Sanger sequences were analyzed with a previously published web-accessible bioinformatics tool (methBLAST) for mapping to human chromosomes, the results of which indicated random distribution of analyzed fragments across all chromosomes. Although these particular Bis-PAGE conversion and quality control methods were exemplified in the context of a fragment library for SOLiD, the concepts can be generalized to include other genome-wide library constructions intended for DNA methylation analysis by alternative high-throughput or massively parallelized methods that are currently available.  相似文献   

8.
Next-Generation-Sequencing is advantageous because of its much higher data throughput and much lower cost compared with the traditional Sanger method. However, NGS reads are shorter than Sanger reads, making de novo genome assembly very challenging. Because genome assembly is essential for all downstream biological studies, great efforts have been made to enhance the completeness of genome assembly, which requires the presence of long reads or long distance information. To improve de novo genome assembly, we develop a computational program, ARF-PE, to increase the length of Illumina reads. ARF-PE takes as input Illumina paired-end (PE) reads and recovers the original DNA fragments from which two ends the paired reads are obtained. On the PE data of four bacteria, ARF-PE recovered >87% of the DNA fragments and achieved >98% of perfect DNA fragment recovery. Using Velvet, SOAPdenovo, Newbler, and CABOG, we evaluated the benefits of recovered DNA fragments to genome assembly. For all four bacteria, the recovered DNA fragments increased the assembly contiguity. For example, the N50 lengths of the P. brasiliensis contigs assembled by SOAPdenovo and Newbler increased from 80,524 bp to 166,573 bp and from 80,655 bp to 193,388 bp, respectively. ARF-PE also increased assembly accuracy in many cases. On the PE data of two fungi and a human chromosome, ARF-PE doubled and tripled the N50 length. However, the assembly accuracies dropped, but still remained >91%. In general, ARF-PE can increase both assembly contiguity and accuracy for bacterial genomes. For complex eukaryotic genomes, ARF-PE is promising because it raises assembly contiguity. But future error correction is needed for ARF-PE to also increase the assembly accuracy. ARF-PE is freely available at http://140.116.235.124/~tliu/arf-pe/.  相似文献   

9.
Targeted genome capture combined with next-generation sequencing was used to analyze 2.9 Mb of the DFNB79 interval on chromosome 9q34.3, which includes 108 candidate genes. Genomic DNA from an affected member of a consanguineous family segregating recessive, nonsyndromic hearing loss was used to make a library of fragments covering the DFNB79 linkage interval defined by genetic analyses of four pedigrees. Homozygosity for eight previously unreported variants in transcribed sequences was detected by evaluating a library of 402,554 sequencing reads and was later confirmed by Sanger sequencing. Of these variants, six were determined to be polymorphisms in the Pakistani population, and one was in a noncoding gene that was subsequently excluded genetically from the DFNB79 linkage interval. The remaining variant was a nonsense mutation in a predicted gene, C9orf75, renamed TPRN. Evaluation of the other three DFNB79-linked families identified three additional frameshift mutations, for a total of four truncating alleles of this gene. Although TPRN is expressed in many tissues, immunolocalization of the protein product in the mouse cochlea shows prominent expression in the taper region of hair cell stereocilia. Consequently, we named the protein taperin.  相似文献   

10.
Interdigitating dendritic cell sarcoma (IDCS) is an aggressive neoplasm and is an extremely rare disease, with a challenging diagnosis. Etiology of IDCS is also unknown and most studies with only case reports. In our case, immunohistochemistry showed that the tumor cells were positive for S100, CD45, and CD68, but negative for CD1a and CD21. This study aimed to investigate the causative factors of IDCS by sequencing the protein-coding regions of IDCS. We performed whole-exome sequencing with genomic DNA from blood and sarcoma tissue of the IDCS patient using the Illumina Hiseq 2500 platform. After that, we conducted Sanger sequencing for validation of sarcoma-specific variants and gene ontology analysis using DAVID bioinformatics resources. Through comparing sequencing data of sarcoma with normal blood, we obtained 15 nonsynonymous single nucleotide polymorphisms (SNPs) as sarcoma-specific variants. Although the 15 SNPs were not validated by Sanger sequencing due to tumor heterogeneity and low sensitivity of Sanger sequencing, we examined the function of the genes in which each SNP is located. Based on previous studies and gene ontology database, we found that POLQ encoding DNA polymerase theta enzyme and FNIP1 encoding tumor suppressor folliculin-interacting protein might have contributed to the IDCS. Our study provides potential causative genetic factors of IDCS and plays a role in advancing the understanding of IDCS pathogenesis.  相似文献   

11.
Direct sequencing of polymerase chain reaction (PCR)-generated templates is a commonly used technique in molecular biology laboratories. We describe an improved method for direct sequencing of PCR fragments longer than 20 kb obtained with a commercial mixture ofTaq andPwo DNA polymerases. The sequencing protocol was optimized for an automated infrared DNA sequencer, consistently yielding long reads (500–600 bases).  相似文献   

12.
A conventional method of DNA sequencing can determine up to 1000 base pairs at one time. Therefore, long DNA should be cut into many short fragments that are suitable for DNA sequencing. Those fragments, however, lose their order information. If the fragments are prepared from the terminus of the long DNA, the reorganization process can be omitted. This process consists of following unit operations; manipulation of genomic DNA, fixation with a stretched form, cutting from the terminus, recovery and amplification. In these unit operations, manipulation and cutting of DNA are focused in this report. Globular transformation suppresses break down of long genome DNA and permits manipulation of large DNA. Because globular transition is reversible, the coiled DNA can be sequentially spun from the globular DNA like a spindle. Thespun DNA was successfully fixed on a glass surface in an arbitrary pattern. To prepare fragments from the stretched DNA molecule, a method to cut DNA moleculen was developed. Since most restriction enzyme requires magnesium ion for their activation, the restriction enzyme was successfully activated only when magnesium ion was electrochemically supplied.  相似文献   

13.
False terminations occurring in fluorescent dye-primer DNA sequencing, and nonsequencing primer extension DNA fragments generated in dye-terminator sequencing cause background noise in fluorescent electropherograms, leading to errors in sequence determination. We describe here a DNA sequencing chemistry that produces accurate and clean sequencing data on a fluorescent DNA sequencer, eliminating the false terminations and background noise. The procedure involves coupling fluorescence energy transfer (ET) primers that produce high fluorescent signals with solid-phase-capturable biotinylated dideoxynucleotides to generate Sanger DNA sequencing fragments. After the sequencing reaction,the DNA extension fragments that carry a biotin at the 3' end are captured with streptavidin-coated magnetic beads, while the other components in the sequencing reaction are washed away. Only pure DNA extension products terminated by the biotinylated dideoxynucleotides are released from the magnetic beads and are loaded onto a sequencing gel to produce accurate sequencing data.  相似文献   

14.
Heterogeneity is a ubiquitous feature of biological systems. A complete understanding of such systems requires a method for uniquely identifying and tracking individual components and their interactions with each other. We have developed a novel method of uniquely tagging individual cells in vivo with a genetic ‘barcode’ that can be recovered by DNA sequencing. Our method is a two-component system comprised of a genetic barcode cassette whose fragments are shuffled by Rci, a site-specific DNA invertase. The system is highly scalable, with the potential to generate theoretical diversities in the billions. We demonstrate the feasibility of this technique in Escherichia coli. Currently, this method could be employed to track the dynamics of populations of microbes through various bottlenecks. Advances of this method should prove useful in tracking interactions of cells within a network, and/or heterogeneity within complex biological samples.  相似文献   

15.

Background

Usually, next generation sequencing (NGS) technology has the property of ultra-high throughput but the read length is remarkably short compared to conventional Sanger sequencing. Paired-end NGS could computationally extend the read length but with a lot of practical inconvenience because of the inherent gaps. Now that Illumina paired-end sequencing has the ability of read both ends from 600 bp or even 800 bp DNA fragments, how to fill in the gaps between paired ends to produce accurate long reads is intriguing but challenging.

Results

We have developed a new technology, referred to as pseudo-Sanger (PS) sequencing. It tries to fill in the gaps between paired ends and could generate near error-free sequences equivalent to the conventional Sanger reads in length but with the high throughput of the Next Generation Sequencing. The major novelty of PS method lies on that the gap filling is based on local assembly of paired-end reads which have overlaps with at either end. Thus, we are able to fill in the gaps in repetitive genomic region correctly. The PS sequencing starts with short reads from NGS platforms, using a series of paired-end libraries of stepwise decreasing insert sizes. A computational method is introduced to transform these special paired-end reads into long and near error-free PS sequences, which correspond in length to those with the largest insert sizes. The PS construction has 3 advantages over untransformed reads: gap filling, error correction and heterozygote tolerance. Among the many applications of the PS construction is de novo genome assembly, which we tested in this study. Assembly of PS reads from a non-isogenic strain of Drosophila melanogaster yields an N50 contig of 190 kb, a 5 fold improvement over the existing de novo assembly methods and a 3 fold advantage over the assembly of long reads from 454 sequencing.

Conclusions

Our method generated near error-free long reads from NGS paired-end sequencing. We demonstrated that de novo assembly could benefit a lot from these Sanger-like reads. Besides, the characteristic of the long reads could be applied to such applications as structural variations detection and metagenomics.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-14-711) contains supplementary material, which is available to authorized users.  相似文献   

16.
Scanning for mutations by DNA melting analysis (DMA) is based on asymmetric PCR followed by the melting of duplexes formed by single-stranded amplicons with TaqMan probes. The method is optimally suited for clinical genetic testing; it is easy to perform, high-throughput, and sensitive. The detection limit of mutant alleles by the DMA method is about 3%, which is much higher than the sensitivity of Sanger sequencing. In addition, the DMA method is realized in a closed-tube format, while 2-h assay is carried out in a single tube without any intermediate or additional procedures thereby minimizing the risk of cross contamination of the samples. The validation of the DMA method was performed by scanning for mutations of clinically significant genes KRAS, NRAS, BRAF, and PIK3CA in 324 DNA samples from tumors of patients with melanoma, colorectal and lung cancer. DNA was isolated either directly from tumor tissues, or from formalin- fixed paraffin-embedded tumor tissues. The detected mutations were verified by Sanger sequencing. The spectra of mutations identified in each tumor type correspond to the literature data and, thus, validate the use of DMA.  相似文献   

17.
DNA barcoding has become one of the most important techniques in plant species identification. Successful application of this technology is dependent on the availability of reference database of high species coverage. Unfortunately, there are experimental and data processing challenges to construct such a library within a short time. Here, we present our solutions to these challenges. We sequenced six conventional DNA barcode fragments (ITS1, ITS2, matK1, matK2, rbcL1, and rbcL2) of 380 flowering plants on next‐generation sequencing (NGS) platforms (Illumina Hiseq 2500 and Ion Torrent S5) and the Sanger sequencing platform. After comparing the sequencing depths, read lengths, base qualities, and base accuracies, we conclude that Illumina Hiseq2500 PE250 run is suitable for conventional DNA barcoding. We developed a new “Cotu” method to create consensus sequences from NGS reads for longer output sequences and more reliable bases than the other three methods. Step‐by‐step instructions to our method are provided. By using high‐throughput machines (PCR and NGS), labeling PCR, and the Cotu method, it is possible to significantly reduce the cost and labor investments for DNA barcoding. A regional or even global DNA barcoding reference library with high species coverage is likely to be constructed in a few years.  相似文献   

18.
A series of charge-modified, dye-labeled 2′, 3′-dideoxynucleoside-5′-triphosphates have been synthesized and evaluated as reagents for dye-terminator DNA sequencing. Unlike the commonly used dye-labeled terminators, these terminators possess a net positive charge and migrate in the opposite direction to dye-labeled Sanger fragments during electrophoresis. Post-sequencing reaction purification is not required to remove unreacted nucleotide or associated breakdown products prior to electrophoresis. Thus, DNA sequencing reaction mixtures can be loaded directly onto a separating medium such as a sequencing gel. The charge-modified nucleotides have also been shown to be more efficiently incorporated by a number of DNA polymerases than regular dye-labeled dideoxynucleotide terminators or indeed normal dideoxynucleoside-5′-triphosphates.  相似文献   

19.
20.
DNA barcoding is an efficient method to identify specimens and to detect undescribed/cryptic species. Sanger sequencing of individual specimens is the standard approach in generating large‐scale DNA barcode libraries and identifying unknowns. However, the Sanger sequencing technology is, in some respects, inferior to next‐generation sequencers, which are capable of producing millions of sequence reads simultaneously. Additionally, direct Sanger sequencing of DNA barcode amplicons, as practiced in most DNA barcoding procedures, is hampered by the need for relatively high‐target amplicon yield, coamplification of nuclear mitochondrial pseudogenes, confusion with sequences from intracellular endosymbiotic bacteria (e.g. Wolbachia) and instances of intraindividual variability (i.e. heteroplasmy). Any of these situations can lead to failed Sanger sequencing attempts or ambiguity of the generated DNA barcodes. Here, we demonstrate the potential application of next‐generation sequencing platforms for parallel acquisition of DNA barcode sequences from hundreds of specimens simultaneously. To facilitate retrieval of sequences obtained from individual specimens, we tag individual specimens during PCR amplification using unique 10‐mer oligonucleotides attached to DNA barcoding PCR primers. We employ 454 pyrosequencing to recover full‐length DNA barcodes of 190 specimens using 12.5% capacity of a 454 sequencing run (i.e. two lanes of a 16 lane run). We obtained an average of 143 sequence reads for each individual specimen. The sequences produced are full‐length DNA barcodes for all but one of the included specimens. In a subset of samples, we also detected Wolbachia, nontarget species, and heteroplasmic sequences. Next‐generation sequencing is of great value because of its protocol simplicity, greatly reduced cost per barcode read, faster throughout and added information content.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号