首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Screening large numbers of target regions in multiple DNA samples for sequence variation is an important application of next-generation sequencing but an efficient method to enrich the samples in parallel has yet to be reported. We describe an advanced method that combines DNA samples using indexes or barcodes prior to target enrichment to facilitate this type of experiment. Sequencing libraries for multiple individual DNA samples, each incorporating a unique 6-bp index, are combined in equal quantities, enriched using a single in-solution target enrichment assay and sequenced in a single reaction. Sequence reads are parsed based on the index, allowing sequence analysis of individual samples. We show that the use of indexed samples does not impact on the efficiency of the enrichment reaction. For three- and nine-indexed HapMap DNA samples, the method was found to be highly accurate for SNP identification. Even with sequence coverage as low as 8x, 99% of sequence SNP calls were concordant with known genotypes. Within a single experiment, this method can sequence the exonic regions of hundreds of genes in tens of samples for sequence and structural variation using as little as 1 μg of input DNA per sample.  相似文献   

2.
Viral infections cause many different diseases stemming both from well-characterized viral pathogens but also from emerging viruses, and the search for novel viruses continues to be of great importance. High-throughput sequencing is an important technology for this purpose. However, viral nucleic acids often constitute a minute proportion of the total genetic material in a sample from infected tissue. Techniques to enrich viral targets in high-throughput sequencing have been reported, but the sensitivity of such methods is not well established. This study compares different library preparation techniques targeting both DNA and RNA with and without virion enrichment. By optimizing the selection of intact virus particles, both by physical and enzymatic approaches, we assessed the effectiveness of the specific enrichment of viral sequences as compared to non-enriched sample preparations by selectively looking for and counting read sequences obtained from shotgun sequencing. Using shotgun sequencing of total DNA or RNA, viral targets were detected at concentrations corresponding to the predicted level, providing a foundation for estimating the effectiveness of virion enrichment. Virion enrichment typically produced a 1000-fold increase in the proportion of DNA virus sequences. For RNA virions the gain was less pronounced with a maximum 13-fold increase. This enrichment varied between the different sample concentrations, with no clear trend. Despite that less sequencing was required to identify target sequences, it was not evident from our data that a lower detection level was achieved by virion enrichment compared to shotgun sequencing.  相似文献   

3.
Recent advances in sequencing technology allow for accurate detection of mitochondrial sequence variants, even those in low abundance at heteroplasmic sites. Considerable sequencing cost savings can be achieved by enriching samples for mitochondrial (relative to nuclear) DNA. Reduction in nuclear DNA (nDNA) content can also help to avoid false positive variants resulting from nuclear mitochondrial sequences (numts). We isolate intact mitochondrial organelles from both human cell lines and blood components using two separate methods: a magnetic bead binding protocol and differential centrifugation. DNA is extracted and further enriched for mitochondrial DNA (mtDNA) by an enzyme digest. Only 1 ng of the purified DNA is necessary for library preparation and next generation sequence (NGS) analysis. Enrichment methods are assessed and compared using mtDNA (versus nDNA) content as a metric, measured by using real-time quantitative PCR and NGS read analysis. Among the various strategies examined, the optimal is differential centrifugation isolation followed by exonuclease digest. This strategy yields >35% mtDNA reads in blood and cell lines, which corresponds to hundreds-fold enrichment over baseline. The strategy also avoids false variant calls that, as we show, can be induced by the long-range PCR approaches that are the current standard in enrichment procedures. This optimization procedure allows mtDNA enrichment for efficient and accurate massively parallel sequencing, enabling NGS from samples with small amounts of starting material. This will decrease costs by increasing the number of samples that may be multiplexed, ultimately facilitating efforts to better understand mitochondria-related diseases.  相似文献   

4.
Next-generation sequencing (NGS) technologies have transformed genomic research and have the potential to revolutionize clinical medicine. However, the background error rates of sequencing instruments and limitations in targeted read coverage have precluded the detection of rare DNA sequence variants by NGS. Here we describe a method, termed CypherSeq, which combines double-stranded barcoding error correction and rolling circle amplification (RCA)-based target enrichment to vastly improve NGS-based rare variant detection. The CypherSeq methodology involves the ligation of sample DNA into circular vectors, which contain double-stranded barcodes for computational error correction and adapters for library preparation and sequencing. CypherSeq is capable of detecting rare mutations genome-wide as well as those within specific target genes via RCA-based enrichment. We demonstrate that CypherSeq is capable of correcting errors incurred during library preparation and sequencing to reproducibly detect mutations down to a frequency of 2.4 × 10−7 per base pair, and report the frequency and spectra of spontaneous and ethyl methanesulfonate-induced mutations across the Saccharomyces cerevisiae genome.  相似文献   

5.
Enriching target sequences in sequencing libraries via capture hybridization to bait/probes is an efficient means of leveraging the capabilities of next-generation sequencing for obtaining sequence data from target regions of interest. However, homologous sequences from non-target regions may also be enriched by such methods. Here we investigate the fidelity of capture enrichment for complete mitochondrial DNA (mtDNA) genome sequencing by analyzing sequence data for nuclear copies of mtDNA (NUMTs). Using capture-enriched sequencing data from a mitochondria-free cell line and the parental cell line, and from samples previously sequenced from long-range PCR products, we demonstrate that NUMT alleles are indeed present in capture-enriched sequence data, but at low enough levels to not influence calling the authentic mtDNA genome sequence. However, distinguishing NUMT alleles from true low-level mutations (e.g. heteroplasmy) is more challenging. We develop here a computational method to distinguish NUMT alleles from heteroplasmies, using sequence data from artificial mixtures to optimize the method.  相似文献   

6.
7.
Intramolecular integration within Moloney murine leukemia virus DNA   总被引:36,自引:19,他引:17       下载免费PDF全文
By screening a library of unintegrated, circular Moloney murine leukemia virus (M-MuLV) DNA cloned in lambda phage, we found that approximately 20% of the M-MuLV DNA inserts contained internal sequence deletions or inversions. Restriction enzyme mapping demonstrated tht the deleted segments frequently abutted a long terminal repeat (LTR) sequence, whereas the inverted segments were usually flanked by LTR sequences, suggesting that many of the variants arose as a consequence of M-MuLV DNA molecules integrating within their own DNA. Nucleotide sequencing also suggested that most of the variant inserts were generated by autointegration. One of the recombinant M-MuLV DNA inserts contained a large inverted repeat of a unique M-MuLV sequence abutting an LTR. This molecule was shown by nucleotide sequencing to have arisen by an M-MuLV DNA Molecule integrating within a second M-MuLV DNA molecule before cloning. The autointegrated M-MuLV DNA had generally lost two base pairs from the LTR sequence at each junction with target site DNA, whereas a four-base-pair direct repeat of target site DNA flanked the integrated viral DNA. Nucleotide sequencing of preintegration target site DNA showed that this four-base-pair direct repeat was present only once before integration and was thus reiterated by the integration event. The results obtained from the autointegrated clones were supported by nucleotide sequencing of the host-virus junction of two cloned M-MuLV integrated proviruses obtained from infected rat cells. Detailed analysis of the different unique target site sequences revealed no obvious common features.  相似文献   

8.
A protocol was devised to select for DNA molecules that efficiently form circles from a library of 126 base pair DNAs containing 90 randomized base pairs. After six rounds of selection, individual molecules from the library showed 20‐ to 100‐fold greater j‐factors compared with the starting library, validating the selection protocol. High‐throughput sequencing revealed a sinusoidal pattern of enrichment and de‐enrichment of A/T dinucleotides in the random region with a 10.4 base pair period associated with the helicity of DNA. A similar, but more moderate pattern of C/G dinucleotides was offset by precisely half a helical turn. While C/G dinucleotide enrichments were evenly distributed, A/T dinucleotide enrichments displayed a preference to cluster in individual DNA molecules. The most highly enriched 10 base pair sequences in the random region contained adjacent blocks of A/T and C/G trinucleotides present in some, but not all, rapidly cyclizing molecules. The phased dinucleotide enrichments closely match those present in accurately mapped yeast nucleosomes, confirming the importance of DNA bending in nucleosome formation. However, at certain sites the nucleosomal DNAs show dinucleotide enrichments that differ substantially from the cyclization data. These discrepancies can often be correlated with sequence specific contacts that form between histones and DNA. © 2015 Wiley Periodicals, Inc. Biopolymers 103: 303–320, 2015.  相似文献   

9.
Structure of the rat alpha 1-acid glycoprotein gene.   总被引:4,自引:2,他引:2  
  相似文献   

10.
Conditions have been established for demonstrating small numbers of genes of the Epstein-Barr virus (EBV) in B-lymphoid cells by in situ hybridization using biotinylated EBV-specific DNA from cloned BamHI fragments of the viral genome. Single copies of EBV genomes were successfully visualized with minimal background when the probe concentration was 0.2 micrograms/ml, the DNA denaturation step was performed at 100 degrees C, and the immunochemical detection system employed a three-layer peroxidase protocol with gold-silver amplification of the diaminobenzidine substrate. The minimal target DNA detectable was about 10 kilobase pairs. In the case of sectioned cells fixed overnight with formalin, simulating conditions used in routine tissue fixation, this approach failed to demonstrate EBV DNA present at less than 100 copies per cell, that is, at the level found in Raji cells. However, when denaturation was performed using microwave irradiation with the other optimized conditions maintained, EBV DNA could be visualized in 10-20% of such cells, although not in cells known to contain fewer than 10 copies per cell. Thus, microwave irradiation partially overcomes the limit of DNA target detection imposed by formalin.  相似文献   

11.
12.
Targeted DNA enrichment coupled with next generation sequencing has been increasingly used for interrogation of select sub-genomic regions at high depth of coverage in a cost effective manner. Specificity measured by on-target efficiency is a key performance metric for target enrichment. Non-specific capture leads to off-target reads, resulting in waste of sequencing throughput on irrelevant regions. Microdroplet-PCR allows simultaneous amplification of up to thousands of regions in the genome and is among the most commonly used strategies for target enrichment. Here we show that carryover of single-stranded template genomic DNA from microdroplet-PCR constitutes a major contributing factor for off-target reads in the resultant libraries. Moreover, treatment of microdroplet-PCR enrichment products with a nuclease specific to single-stranded DNA alleviates off-target load and improves enrichment specificity. We propose that nuclease treatment of enrichment products should be incorporated in the workflow of targeted sequencing using microdroplet-PCR for target capture. These findings may have a broad impact on other PCR based applications for which removal of template DNA is beneficial.  相似文献   

13.
Eukaryotic DNA is organized into a macromolecular structure called chromatin. The basic repeating unit of chromatin is the nucleosome, which consists of two copies of each of the four core histones and DNA. The nucleosomal organization and the positions of nucleosomes have profound effects on all DNA-dependent processes. Understanding the factors that influence nucleosome positioning is therefore of general interest. Among the many determinants of nucleosome positioning, the DNA sequence has been proposed to have a major role. Here, we analyzed more than 860,000 nucleosomal DNA sequences to identify sequence features that guide the formation of nucleosomes in vivo. We found that both a periodic enrichment of AT base pairs and an out-of-phase oscillating enrichment of GC base pairs as well as the overall preference for GC base pairs are determinants of nucleosome positioning. The preference for GC pairs can be related to a lower energetic cost required for deformation of the DNA to wrap around the histones. In line with this idea, we found that only incorporation of both signal components into a sequence model for nucleosome formation results in maximal predictive performance on a genome-wide scale. In this manner, one achieves greater predictive power than published approaches. Our results confirm the hypothesis that the DNA sequence has a major role in nucleosome positioning in vivo.  相似文献   

14.
靶向富集技术具有良好的均一性、特异性和灵敏性,只需较低的测序覆盖度即可满足后续的分析。该技术被广泛应用于特定位点重测序和染色体构象等研究。为了靶向富集测序文库中特定的染色体DNA片段,本研究以21号染色体为例,采用人胚肝二倍体细胞CCC-HEL-1建立了一种单染色体靶向富集方法。具体步骤如下:(1)首先将人正常核型二倍体细胞同步化到G2/M 期,制备染色体悬液;然后流式分选出21号单染色体。(2)采用多重退火环形扩增技术(MALBAC)对21号染色体DNA进行扩增;用扩增的DNA产物构建文库。(3)利用体外转录合成、生物素标记的RNA探针与文库进行液相杂交,捕获并验证捕获效率。荧光定量PCR检测结果证明,特定的21号染色体呈上百倍的富集,而靶标文库则富集了近60 倍。结果提示,特异单染色体靶向富集方法成功建立。该法不仅可实现任一单染色体的靶向富集,用于单条染色体功能组学研究,而且还可能用于查找特定疾病相关的染色体异常(如DNA突变、异位、重复及倒置等),具有一定的实用意义。  相似文献   

15.
Chen D  Zhang W  Zhu ZD  Huang Y  Wang P  Zhou BB  Yang XN  Xiao HS  Zhang QH 《遗传》2010,32(12):1296-1303
文章旨在建立一种基因组目标靶序列捕捉文库的方法,并结合第二代测序技术,以实现候选基因区段的深度测序。利用Agilent公司的eArray在线平台,对1250个基因的11824个外显子共2414977bp的基因组序列进行120个碱基长度的捕捉探针(钓饵)设计,并制备成SureSelect液相靶序列捕获试剂。选用2例人基因组DNA,超声打断后末端补平并磷酸化,连接SOLiD接头,回收150bp~200bp的DNA片段,与靶序列探针杂交捕获目标序列,油包水微乳滴PCR扩增后,磁珠分离富集,上SOLiD测序系统通过工作流程分析(WFA)进行文库质量的评价,或正式测序反应。结果显示对所包含的11147个基因外显子片段设计出并合成了46509个捕捉探针,制备成SureSelect试剂盒。探针可有效地捕捉并富集基因组DNA的目标靶片段,定量PCR显示富集效率可达29倍。WFA分析表明文库可以在SOLiD仪器进行正式测序。测序结果显示靶序列区域的测序数占有效总测序数的比例达到70%,覆盖率均在200×以上。结果表明本研究所建立的SureSelect基因组靶序列捕捉、富集建立测序文库的技术路线可行,可直接用于SOLiD测序仪的测序。  相似文献   

16.
The complete nucleotide sequences of two copies of a putative insertion sequence IS1000 from Thermus thermophilus HB8 are presented. IS1000 is 1196 base pairs long, contains a long open reading frame which could code for a protein of 317 amino acids, and has imperfect terminal inverted repeats of 6 base pairs (confirmed by the terminal sequencing of 4.5 copies of IS1000), but does not cause a target site duplication. There are at least 6 copies of IS1000 in the genome of T. thermophilus HB8. A search of the GEN-EMBL data base revealed that the putative 317 amino acid protein had significant homology with open reading frames in the transposable elements IS110 of Streptomyces coelicolor and IS492 of Pseudomonas atlantica.  相似文献   

17.
18.
Marine experimental stem cell transplantations require the accurate discrimination and quantification of donor cells from host cells. A Y-chromosome-specific, quantitative real-time PCR (kinetic PCR) protocol for blood-derived DNA was developed. The assay sensitivity was extremely high with accurate detection of only 10 pg (six copies of Y target DNA) in a variable background of female DNA background ranging from 2.5 to 50 ng. The dynamic range of the assay provided accurate results ranging from 2.2 x 10(-2)% to 100% of male DNA in female background. The kinetic PCR assay can be used in all mouse strains, and a sample size as low as 2.5 ng total DNA is sufficient for analysis. Therefore, kinetic PCR allows engraftment kinetic studies on repeated blood draws of individual animals with no need for sacrifice. Compared to conventional PCR, the assay is much simplified, as neither the accurate adjustment of sample DNA concentration nor a post-reaction analysis procedure is required. The procedure is simple, free of radioactivity, and permits a throughput of 500-600 reactions per day.  相似文献   

19.
Hybridization-based target enrichment protocols require relatively large starting amounts of genomic DNA, which is not always available. Here, we tested three approaches to pre-capture library preparation starting from 10 ng of genomic DNA: (i and ii) whole-genome amplification of DNA samples with REPLI-g (Qiagen) and GenomePlex (Sigma) kits followed by standard library preparation, and (iii) library construction with a low input oriented ThruPLEX kit (Rubicon Genomics). Exome capture with Agilent SureSelectXT2 Human AllExon v4+UTRs capture probes, and HiSeq2000 sequencing were performed for test libraries along with the control library prepared from 1 µg of starting DNA. Tested protocols were characterized in terms of mapping efficiency, enrichment ratio, coverage of the target region, and reliability of SNP genotyping. REPLI-g- and ThruPLEX-FD-based protocols seem to be adequate solutions for exome sequencing of low input samples.  相似文献   

20.
DNA samples derived from vertebrate skin, bodily cavities and body fluids contain both host and microbial DNA; the latter often present as a minor component. Consequently, DNA sequencing of a microbiome sample frequently yields reads originating from the microbe(s) of interest, but with a vast excess of host genome-derived reads. In this study, we used a methyl-CpG binding domain (MBD) to separate methylated host DNA from microbial DNA based on differences in CpG methylation density. MBD fused to the Fc region of a human antibody (MBD-Fc) binds strongly to protein A paramagnetic beads, forming an effective one-step enrichment complex that was used to remove human or fish host DNA from bacterial and protistan DNA for subsequent sequencing and analysis. We report enrichment of DNA samples from human saliva, human blood, a mock malaria-infected blood sample and a black molly fish. When reads were mapped to reference genomes, sequence reads aligning to host genomes decreased 50-fold, while bacterial and Plasmodium DNA sequences reads increased 8–11.5-fold. The Shannon-Wiener diversity index was calculated for 149 bacterial species in saliva before and after enrichment. Unenriched saliva had an index of 4.72, while the enriched sample had an index of 4.80. The similarity of these indices demonstrates that bacterial species diversity and relative phylotype abundance remain conserved in enriched samples. Enrichment using the MBD-Fc method holds promise for targeted microbiome sequence analysis across a broad range of sample types.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号