首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 78 毫秒
1.
The advent and widespread application of next-generation sequencing (NGS) technologies to the study of microbial genomes has led to a substantial increase in the number of studies in which whole genome sequencing (WGS) is applied to the analysis of microbial genomic epidemiology. However, microorganisms such as Mycobacterium tuberculosis (MTB) present unique problems for sequencing and downstream analysis based on their unique physiology and the composition of their genomes. In this study, we compare the quality of sequence data generated using the Nextera and TruSeq isolate preparation kits for library construction prior to Illumina sequencing-by-synthesis. Our results confirm that MTB NGS data quality is highly dependent on the purity of the DNA sample submitted for sequencing and its guanine-cytosine content (or GC-content). Our data additionally demonstrate that the choice of library preparation method plays an important role in mitigating downstream sequencing quality issues. Importantly for MTB, the Illumina TruSeq library preparation kit produces more uniform data quality than the Nextera XT method, regardless of the quality of the input DNA. Furthermore, specific genomic sequence motifs are commonly missed by the Nextera XT method, as are regions of especially high GC-content relative to the rest of the MTB genome. As coverage bias is highly undesirable, this study illustrates the importance of appropriate protocol selection when performing NGS studies in order to ensure that sound inferences can be made regarding mycobacterial genomes.  相似文献   

2.
Here we demonstrate a method for unbiased multiplexed deep sequencing of RNA and DNA libraries using a novel, efficient and adaptable barcoding strategy called Post Amplification Ligation-Mediated (PALM). PALM barcoding is performed as the very last step of library preparation, eliminating a potential barcode-induced bias and allowing the flexibility to synthesize as many barcodes as needed. We sequenced PALM barcoded micro RNA (miRNA) and DNA reference samples and evaluated the quantitative barcode-induced bias in comparison to the same reference samples prepared using the Illumina TruSeq barcoding strategy. The Illumina TruSeq small RNA strategy introduces the barcode during the PCR step using differentially barcoded primers, while the TruSeq DNA strategy introduces the barcode before the PCR step by ligation of differentially barcoded adaptors. Results show virtually no bias between the differentially barcoded miRNA and DNA samples, both for the PALM and the TruSeq sample preparation methods. We also multiplexed miRNA reference samples using a pre-PCR barcode ligation. This barcoding strategy results in significant bias.  相似文献   

3.
Hybridization-based target enrichment protocols require relatively large starting amounts of genomic DNA, which is not always available. Here, we tested three approaches to pre-capture library preparation starting from 10 ng of genomic DNA: (i and ii) whole-genome amplification of DNA samples with REPLI-g (Qiagen) and GenomePlex (Sigma) kits followed by standard library preparation, and (iii) library construction with a low input oriented ThruPLEX kit (Rubicon Genomics). Exome capture with Agilent SureSelectXT2 Human AllExon v4+UTRs capture probes, and HiSeq2000 sequencing were performed for test libraries along with the control library prepared from 1 µg of starting DNA. Tested protocols were characterized in terms of mapping efficiency, enrichment ratio, coverage of the target region, and reliability of SNP genotyping. REPLI-g- and ThruPLEX-FD-based protocols seem to be adequate solutions for exome sequencing of low input samples.  相似文献   

4.
5.
Resting eggs banks are unique windows that allow us to directly observe shifts in population genetics, and phenotypes over time as natural populations evolve. Though a variety of planktonic organisms also produce resting stages, the keystone freshwater consumer, Daphnia, is a well‐known model for paleogenetics and resurrection ecology. Nevertheless, paleogenomic investigations are limited largely because resting eggs do not contain enough DNA for genomic sequencing. In fact, genomic studies even on extant populations include a laborious preparatory phase of batch culturing dozens of individuals to generate sufficient genomic DNA. Here, we furnish a protocol to generate whole genomes of single ephippial (resting) eggs and single daphniids. Whole genomes of single ephippial eggs and single adults were amplified using Qiagen REPLI‐g Single Cell kit reaction, followed by NEBNext Ultra DNA Library Prep Kit for library construction and Illumina sequencing. We compared the quality of the single‐egg and single‐individual amplified genomes to the standard batch genomic DNA extraction in the absence of genome amplification. At mean 20× depth, coverage was essentially identical for the amplified single individual relative to the unamplified batch extracted genome (>90% of the genome was covered and callable). Finally, while amplification resulted in the slight loss of heterozygosity for the amplified genomes, estimates were largely comparable and illustrate the utility and limitations of this approach in estimating population genetic parameters over long periods of time in natural populations of Daphnia and also other small species known to produce resting stages.  相似文献   

6.
7.
Next-generation sequencing (NGS) is emerging as a powerful tool for elucidating genetic information for a wide range of applications. Unfortunately, the surging popularity of NGS has not yet been accompanied by an improvement in automated techniques for preparing formatted sequencing libraries. To address this challenge, we have developed a prototype microfluidic system for preparing sequencer-ready DNA libraries for analysis by Illumina sequencing. Our system combines droplet-based digital microfluidic (DMF) sample handling with peripheral modules to create a fully-integrated, sample-in library-out platform. In this report, we use our automated system to prepare NGS libraries from samples of human and bacterial genomic DNA. E. coli libraries prepared on-device from 5 ng of total DNA yielded excellent sequence coverage over the entire bacterial genome, with >99% alignment to the reference genome, even genome coverage, and good quality scores. Furthermore, we produced a de novo assembly on a previously unsequenced multi-drug resistant Klebsiella pneumoniae strain BAA-2146 (KpnNDM). The new method described here is fast, robust, scalable, and automated. Our device for library preparation will assist in the integration of NGS technology into a wide variety of laboratories, including small research laboratories and clinical laboratories.  相似文献   

8.
9.
miRNAs act as important regulators of gene expression by promoting mRNA degradation or by attenuating protein translation. Since miRNAs are stably expressed in bodily fluids, there is growing interest in profiling these miRNAs, as it is minimally invasive and cost-effective as a diagnostic matrix. A technical hurdle in studying miRNA dynamics is the ability to reliably extract miRNA as small sample volumes and low RNA abundance create challenges for extraction and downstream applications. The purpose of this study was to develop a pipeline for the recovery of miRNA using small volumes of archived serum samples. The RNA was extracted employing several widely utilized RNA isolation kits/methods with and without addition of a carrier. The small RNA library preparation was carried out using Illumina TruSeq small RNA kit and sequencing was carried out using Illumina platform. A fraction of five microliters of total RNA was used for library preparation as quantification is below the detection limit. We were able to profile miRNA levels in serum from all the methods tested. We found out that addition of nucleic acid based carrier molecules had higher numbers of processed reads but it did not enhance the mapping of any miRBase annotated sequences. However, some of the extraction procedures offer certain advantages: RNA extracted by TRIzol seemed to align to the miRBase best; extractions using TRIzol with carrier yielded higher miRNA-to-small RNA ratios. Nuclease free glycogen can be carrier of choice for miRNA sequencing. Our findings illustrate that miRNA extraction and quantification is influenced by the choice of methodologies. Addition of nucleic acid- based carrier molecules during extraction procedure is not a good choice when assaying miRNA using sequencing. The careful selection of an extraction method permits the archived serum samples to become valuable resources for high-throughput applications.  相似文献   

10.
Marine habitats are Earth''s largest aquatic ecosystems, yet little is known about marine organism''s genomes. Molecular studies can unravel their genetics print, thus shedding light on specie''s adaptation and speciation with precise authentication. However, extracting high molecular weight DNA from marine organisms and subsequent DNA library preparation for whole genome sequencing is challenging. The challenges can be explained by excessive metabolites secretion that co‐precipitates with DNA and barricades their sequencing. In this work, we sought to resolve this issue by describing an optimized isolation method and comparing its performance with the most commonly reported protocols or commercial kits: SDS/phenol–chloroform method, Qiagen Genomic Tips kit, Qiagen DNeasy Plant mini kit, a modified protocol of Qiagen DNeasy Plant kit, Qiagen DNeasy Blood and Tissue kit, and Qiagen Qiamp DNA Stool mini kit. Our method proved to work significantly better for different marine species regardless of their shape, consistency, and sample preservation, improving Oxford Nanopore Technologies sequencing yield by 39 folds for Spirobranchus sp. and enabling generation of almost 10 GB data per flow cell/run for Chrysaora sp. and Palaemon sp. samples.  相似文献   

11.
12.
DNA barcoding has become one of the most important techniques in plant species identification. Successful application of this technology is dependent on the availability of reference database of high species coverage. Unfortunately, there are experimental and data processing challenges to construct such a library within a short time. Here, we present our solutions to these challenges. We sequenced six conventional DNA barcode fragments (ITS1, ITS2, matK1, matK2, rbcL1, and rbcL2) of 380 flowering plants on next‐generation sequencing (NGS) platforms (Illumina Hiseq 2500 and Ion Torrent S5) and the Sanger sequencing platform. After comparing the sequencing depths, read lengths, base qualities, and base accuracies, we conclude that Illumina Hiseq2500 PE250 run is suitable for conventional DNA barcoding. We developed a new “Cotu” method to create consensus sequences from NGS reads for longer output sequences and more reliable bases than the other three methods. Step‐by‐step instructions to our method are provided. By using high‐throughput machines (PCR and NGS), labeling PCR, and the Cotu method, it is possible to significantly reduce the cost and labor investments for DNA barcoding. A regional or even global DNA barcoding reference library with high species coverage is likely to be constructed in a few years.  相似文献   

13.
DNA-methylation is an important epigenetic feature in health and disease. Methylated sequence capturing by Methyl Binding Domain (MBD) based enrichment followed by second-generation sequencing provides the best combination of sensitivity and cost-efficiency for genome-wide DNA-methylation profiling. However, existing implementations are numerous, and quality control and optimization require expensive external validation. Therefore, this study has two aims: 1) to identify a best performing kit for MBD-based enrichment using independent validation data, and 2) to evaluate whether quality evaluation can also be performed solely based on the characteristics of the generated sequences. Five commercially available kits for MBD enrichment were combined with Illumina GAIIx sequencing for three cell lines (HCT15, DU145, PC3). Reduced representation bisulfite sequencing data (all three cell lines) and publicly available Illumina Infinium BeadChip data (DU145 and PC3) were used for benchmarking. Consistent large-scale differences in yield, sensitivity and specificity between the different kits could be identified, with Diagenode''s MethylCap kit as overall best performing kit under the tested conditions. This kit could also be identified with the Fragment CpG-plot, which summarizes the CpG content of the captured fragments, implying that the latter can be used as a tool to monitor data quality. In conclusion, there are major quality differences between kits for MBD-based capturing of methylated DNA, with the MethylCap kit performing best under the used settings. The Fragment CpG-plot is able to monitor data quality based on inherent sequence data characteristics, and is therefore a cost-efficient tool for experimental optimization, but also to monitor quality throughout routine applications.  相似文献   

14.
目的:建立新的线粒体基因组DNA杂交捕获探针制备方法并用进行初步应用。方法:通过PCR技术扩增特异线粒体DNA片段,并与生物素偶联,最后与标记磁珠的亲和素混合获得捕获探针。并自行制备的线粒体基因组DNA文库捕获探针与肝癌全基因组测序文库进行液相杂交。分离捕获产物后PCR扩增并进行测序分析。结果:成功建立了线粒体基因组杂交捕获探针制备方法并成功分离线粒体基因组DNA;对测序数据的分析显示:90%以上测序数据来自线粒体基因组DNA,且覆盖率达到100%,且均一性良好。检测到的同质性变异位点数量和异质性变异位点数量与全基因组测序数据产生的结果接近(P=0.9152,P=0.8409)。结论:新方法制备的线粒体基因组DNA杂交捕获探针可以从全基因组文库中高效捕获线粒体基因组DNA测序文库。  相似文献   

15.
High-throughput DNA sequencing technologies have revolutionized genomic analysis, including the de novo assembly of whole genomes. Nevertheless, assembly of complex genomes remains challenging, in part due to the presence of dispersed repeats which introduce ambiguity during genome reconstruction. Transposable elements (TEs) can be particularly problematic, especially for TE families exhibiting high sequence identity, high copy number, or complex genomic arrangements. While TEs strongly affect genome function and evolution, most current de novo assembly approaches cannot resolve long, identical, and abundant families of TEs. Here, we applied a novel Illumina technology called TruSeq synthetic long-reads, which are generated through highly-parallel library preparation and local assembly of short read data and which achieve lengths of 1.5–18.5 Kbp with an extremely low error rate (0.03% per base). To test the utility of this technology, we sequenced and assembled the genome of the model organism Drosophila melanogaster (reference genome strain y; cn, bw, sp) achieving an N50 contig size of 69.7 Kbp and covering 96.9% of the euchromatic chromosome arms of the current reference genome. TruSeq synthetic long-read technology enables placement of individual TE copies in their proper genomic locations as well as accurate reconstruction of TE sequences. We entirely recovered and accurately placed 4,229 (77.8%) of the 5,434 annotated transposable elements with perfect identity to the current reference genome. As TEs are ubiquitous features of genomes of many species, TruSeq synthetic long-reads, and likely other methods that generate long-reads, offer a powerful approach to improve de novo assemblies of whole genomes.  相似文献   

16.
新一代测序技术(NGS)的文库制备方法在基因组的拼装中起着重要作用。但是NGS技术制备的普通DNA文库片段只有500 bp左右,难以满足复杂基因组的从头(de novo)拼装要求。三代测序技术的读长可以达到20 kb,但是其高错误率及测序成本过高使得其又不易推广。因此二代测序的Mate-paired文库制备技术一直在基因组的de novo拼装中扮演着非常重要的角色。目前主流的NGS平台Illumina制备的Mate-paired文库的片段范围只有2~5 kb,为了得到更长的可用于Illumina平台测序的Mate-paired文库,本研究首次整合并优化了Illumina和Roche/454两种测序平台的Mate-paired文库制备技术,采用诱导环化酶来提高基因组长片段DNA的环化效率,成功建立了20 kb Mate-paired文库制备技术,并已将该技术应用于人类基因组20 kb Mate-paired文库制备。该技术为Illumina平台制备长片段Mate-paired库提供了方法指导。  相似文献   

17.
Although per-base sequencing costs have decreased during recent years, library preparation for targeted massively parallel sequencing remains constrained by high reagent cost, limited design flexibility, and protocol complexity. To address these limitations, we previously developed Hi-Plex, a polymerase chain reaction (PCR) massively parallel sequencing strategy for screening panels of genomic target regions. Here, we demonstrate that Hi-Plex applied with hybrid adapters can generate a library suitable for sequencing with both the Ion Torrent and the TruSeq chemistries and that adjusting primer concentrations improves coverage uniformity. These results expand Hi-Plex capabilities as an accurate, affordable, flexible, and rapid approach for various genetic screening applications.  相似文献   

18.
19.
Cost-effectiveness, quality, time-effectiveness and ease of the methodology are the most crucial factors in isolating quality DNA from wide variety of samples. Thus, research efforts focusing on the development of an efficient DNA extraction protocol is the need of the hour. The present study therefore, focuses on development of an efficient, rapid and free of inhibitory substances based methodology for extracting metagenomic DNA from diverse environmental samples viz. anaerobic biogas digesta, ruminant stomach, human feces, soil, and microbial starter cultures used for preparation of fermented food. PCR–DGGE based analysis and quality metagenomic library preparation, using DNA extraction methodology, validates the developed protocol. The developed protocol is cost effective, capable of isolating DNA from small sample size (100–1000 µl), time efficient (1.5–2.0 h protocol) and results in significantly higher DNA yield (4–8 times increased yield) when compared to previously available DNA extraction method and a commercial DNA extraction kit. The DNA extracted from the samples using different protocols was evaluated based on its ability to identify diverse microbial species using PCR–DGGE profiles targeting variable region within the 16S rRNA gene. The results of microbial community analysis revealed comparability of the developed protocol to commercial kits, in effectively identifying dominant representatives of the microbial community in different samples. Using the DNA extracted from the presented methodology, metagenomic libraries were prepared, which were found suitable for sequencing on Illumina platform.  相似文献   

20.
Next-generation sequencing (NGS) is getting routinely used in the diagnosis of hereditary diseases, such as human cardiomyopathies. Hence, it is of utter importance to secure high quality sequencing data, enabling the identification of disease-relevant mutations or the conclusion of neg-ative test results. During the process of sample preparation, each protocol for target enrichment library preparation has its own requirements for quality control (QC); however, there is little evi-dence on the actual impact of these guidelines on resulting data quality. In this study, we analyzed the impact of QC during the diverse library preparation steps of Agilent SureSelect XT target enrichment and Illumina sequencing. We quantified the parameters for a cohort of around 600 sam-ples, which include starting amount of DNA, amount of sheared DNA, smallest and largest frag-ment size of the starting DNA; amount of DNA after the pre-PCR, and smallest and largest fragment size of the resulting DNA;as well as the amount of the final library, the corresponding smallest and largest fragment size, and the number of detected variants. Intriguingly, there is a high tolerance for variations in all QC steps, meaning that within the boundaries proposed in the current study, a considerable variance at each step of QC can be well tolerated without compromising NGS quality.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号