Similar articles
 20 similar articles found (search time: 187 ms)
1.

Background

Single-cell genome sequencing has the potential to allow the in-depth exploration of the vast genetic diversity found in uncultured microbes. We used the marine cyanobacterium Prochlorococcus as a model system for addressing important challenges facing high-throughput whole genome amplification (WGA) and complete genome sequencing of individual cells.

Methodology/Principal Findings

We describe a pipeline that enables single-cell WGA on hundreds of cells at a time while virtually eliminating non-target DNA from the reactions. We further developed a post-amplification normalization procedure that mitigates extreme variations in sequencing coverage associated with multiple displacement amplification (MDA), and demonstrated that the procedure increased sequencing efficiency and facilitated genome assembly. We report genome recovery as high as 99.6% with reference-guided assembly, and 95% with de novo assembly starting from a single cell. We also analyzed the impact of chimera formation during MDA on de novo assembly, and discuss strategies to minimize the presence of incorrectly joined regions in contigs.

Conclusions/Significance

The methods described in this paper will be useful for sequencing genomes of individual cells from a variety of samples.

2.
The nature and pace of genome mutation are largely unknown. Because standard methods sequence DNA from populations of cells, the genetic composition of individual cells is lost, de novo mutations in cells are concealed within the bulk signal and per cell cycle mutation rates and mechanisms remain elusive. Although single-cell genome analyses could resolve these problems, such analyses are error-prone because of whole-genome amplification (WGA) artefacts and are limited in the types of DNA mutation that can be discerned. We developed methods for paired-end sequence analysis of single-cell WGA products that enable (i) detecting multiple classes of DNA mutation, (ii) distinguishing DNA copy number changes from allelic WGA-amplification artefacts by the discovery of matching aberrantly mapping read pairs among the surfeit of paired-end WGA and mapping artefacts and (iii) delineating the break points and architecture of structural variants. By applying the methods, we capture DNA copy number changes acquired over one cell cycle in breast cancer cells and in blastomeres derived from a human zygote after in vitro fertilization. Furthermore, we were able to discover and fine-map a heritable inter-chromosomal rearrangement t(1;16)(p36;p12) by sequencing a single blastomere. The methods will expedite applications in basic genome research and provide a stepping stone to novel approaches for clinical genetic diagnosis.

3.
Whole genome amplification (WGA) is essential for obtaining genome sequences from single bacterial cells because the quantity of template DNA contained in a single cell is very low. Multiple displacement amplification (MDA), using Phi29 DNA polymerase and random primers, is the most widely used method for single-cell WGA. However, single-cell MDA usually results in uneven genome coverage because of amplification bias, background amplification of contaminating DNA, and formation of chimeras by linking of non-contiguous chromosomal regions. Here, we present a novel MDA method, termed droplet MDA, that minimizes amplification bias and amplification of contaminants by using picoliter-sized droplets for compartmentalized WGA reactions. Extracted DNA fragments from a lysed cell in MDA mixture are divided into 10^5 droplets (67 pL) within minutes via flow through simple microfluidic channels. Compartmentalized genome fragments can be individually amplified in these droplets without the risk of encounter with reagent-borne or environmental contaminants. Following quality assessment of WGA products from single Escherichia coli cells, we showed that droplet MDA minimized unexpected amplification and improved the percentage of genome recovery from 59% to 89%. Our results demonstrate that microfluidic-generated droplets show potential as an efficient tool for effective amplification of low-input DNA for single-cell genomics and greatly reduce the cost and labor investment required for determination of nearly complete genome sequences of uncultured bacteria from environmental samples.
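As a rough plausibility check on the droplet figures in this abstract, the total emulsified reaction volume can be worked out from the droplet count and droplet size. The sketch below reads the droplet count as 10^5 (an assumption about a superscript lost in extraction) and uses a hypothetical helper name, `total_volume_ul`, that does not come from the paper.

```python
# Back-of-the-envelope check of the droplet numbers quoted in the abstract.
# The helper name and the 10^5 reading of the droplet count are assumptions.
PICOLITERS_PER_MICROLITER = 1_000_000  # 1 µL = 10^6 pL


def total_volume_ul(n_droplets: int, droplet_volume_pl: float) -> float:
    """Return the summed volume of all droplets, in microliters."""
    return n_droplets * droplet_volume_pl / PICOLITERS_PER_MICROLITER


if __name__ == "__main__":
    volume = total_volume_ul(10**5, 67.0)  # 10^5 droplets of 67 pL each
    print(f"{volume:.1f} µL")  # 6.7 µL
```

A total of a few microliters is in the range of an ordinary small-volume MDA reaction, which is why the 10^5 reading seems more plausible than a count of 105 droplets (about 7 nL in total).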

4.
Since only a small fraction of environmental bacteria are amenable to laboratory culture, there is great interest in genomic sequencing directly from single cells. Sufficient DNA for sequencing can be obtained from one cell by the Multiple Displacement Amplification (MDA) method, thereby eliminating the need to develop culture methods. Here we used a microfluidic device to isolate individual Escherichia coli and amplify genomic DNA by MDA in 60-nl reactions. Our results confirm a report that reduced MDA reaction volume lowers nonspecific synthesis that can result from contaminant DNA templates and unfavourable interaction between primers. The quality of the genome amplification was assessed by qPCR and compared favourably to single-cell amplifications performed in standard 50-μl volumes. Amplification bias was greatly reduced in nanoliter volumes, thereby providing a more even representation of all sequences. Single-cell amplicons from both microliter and nanoliter volumes provided high-quality sequence data by high-throughput pyrosequencing, thereby demonstrating a straightforward route to sequencing genomes from single cells.

5.
We present a protocol for reliably detecting DNA copy number aberrations in a single human cell. Multiple displacement-amplified DNAs of a cell are hybridized to a 3,000-bacterial artificial chromosome (BAC) array and to an Affymetrix 250,000 (250K)-SNP array. Subsequent copy number calling is based on the integration of BAC probe-specific copy number probabilities that are estimated by comparing probe intensities with a single-cell whole-genome amplification (WGA) reference model for diploid chromosomes, as well as SNP copy number and loss-of-heterozygosity states estimated by hidden Markov models (HMM). All methods for detecting DNA copy number aberrations in single human cells have difficulty in confidently discriminating WGA artifacts from true genetic variants. Furthermore, some methods lack thorough validation for segmental DNA imbalance detection. Our protocol minimizes false-positive variant calling and enables uniparental isodisomy detection in single cells. Additionally, it provides quality assessment, allowing the exclusion of uninterpretable single-cell WGA samples. The protocol takes 5-7 d.

6.
Somatic mosaicism occurs throughout normal development and contributes to numerous disease etiologies, including tumorigenesis and neurological disorders. Intratumor genetic heterogeneity is inherent to many cancers, creating challenges for effective treatments. Unfortunately, analysis of bulk DNA masks subclonal phylogenetic architectures created by the acquisition and distribution of somatic mutations amongst cells. As a result, single-cell genetic analysis is becoming recognized as vital for accurately characterizing cancers. Despite this, methods for single-cell genetics are lacking. Here we present an automated microfluidic workflow enabling efficient cell capture, lysis, and whole genome amplification (WGA). We find that ~90% of the genome is accessible in single cells with improved uniformity relative to current single-cell WGA methods. Allelic dropout (ADO) rates were limited to 13.75% and variant false discovery rates (SNV FDR) were 4.11 × 10^-6, on average. Application to ER-/PR-/HER2+ breast cancer cells and matched normal controls identified novel mutations that arose in a subpopulation of cells and effectively resolved the segregation of known cancer-related mutations with single-cell resolution. Finally, we demonstrate effective cell classification using mutation profiles with 10× average exome coverage depth per cell. Our data demonstrate an efficient automated microfluidic platform for single-cell WGA that enables the resolution of somatic mutation patterns in single cells.
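The abstract reports an allelic dropout (ADO) rate without saying how it was computed. One common definition, assumed here, is the fraction of sites known to be heterozygous in a bulk reference that are called homozygous in the single cell. The sketch below illustrates that definition on toy data; the function name, the two-letter genotype encoding, and the site labels are all hypothetical.

```python
# Minimal sketch of an allelic dropout (ADO) estimate, under an assumed
# definition: among sites heterozygous in the bulk sample, the fraction
# that appear homozygous in the single cell. Not the authors' pipeline.
from typing import Dict


def allelic_dropout_rate(bulk: Dict[str, str], cell: Dict[str, str]) -> float:
    """Genotypes are two-letter strings such as 'AG' (het) or 'GG' (hom)."""
    # Only sites that are heterozygous in bulk and called in the cell count.
    het_sites = [s for s, gt in bulk.items() if gt[0] != gt[1] and s in cell]
    if not het_sites:
        raise ValueError("no informative heterozygous sites")
    dropped = sum(1 for s in het_sites if cell[s][0] == cell[s][1])
    return dropped / len(het_sites)


if __name__ == "__main__":
    bulk = {"chr1:100": "AG", "chr1:200": "CT", "chr2:50": "GG",
            "chr3:10": "AC", "chr3:90": "TC"}
    cell = {"chr1:100": "AA", "chr1:200": "CT", "chr2:50": "GG",
            "chr3:10": "AC", "chr3:90": "TC"}
    print(allelic_dropout_rate(bulk, cell))  # one of four het sites lost
```

Sites that are homozygous in bulk are ignored because dropout cannot be observed there.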

7.
8.
Metagenomics and single-cell genomics have enabled the discovery of relevant uncultured microbes. Recently, single-virus genomics (SVG), although still in an incipient stage, has opened new avenues in viral ecology by allowing the sequencing of one single virus at a time. The investigation of methodological alternatives and optimization of existing procedures for SVG is paramount to deliver high-quality genomic data. We report a sequencing dataset of viral single-amplified genomes (vSAGs) from cultured and uncultured viruses obtained by applying different conditions in each SVG step, from viral preservation and novel whole-genome amplification (WGA) to sequencing platforms and genome assembly. Sequencing data showed that cryopreservation and mild fixation were compatible with WGA, although fresh samples delivered better genome quality data. The novel TruPrime WGA, based on primase-polymerase features, and WGA-X, employing a thermostable phi29 polymerase, proved sufficiently sensitive for SVG. The Oxford Nanopore (ON) sequencing platform did not provide a significant improvement of vSAG assembly compared to Illumina alone. Finally, the SPAdes assembler performed the best. Overall, our results represent a valuable genomic dataset that will help to standardize and advance new tools in viral ecology.

9.
Multiple displacement amplification (MDA) is a recently described method of whole-genome amplification (WGA) that has proven efficient in the amplification of small amounts of DNA, including DNA from single cells. Compared with PCR-based WGA methods, MDA generates DNA with a higher molecular weight and shows better genome coverage. This protocol was developed for preimplantation genetic diagnosis, and details a method for performing single-cell MDA using the phi29 DNA polymerase. It can also be useful for the amplification of other minute quantities of DNA, such as from forensic material or microdissected tissue. The protocol includes the collection and lysis of single cells, and all materials and steps involved in the MDA reaction. The whole procedure takes 3 h and generates 1-2 µg of DNA from a single cell, which is suitable for multiple downstream applications, such as sequencing, short tandem repeat analysis or array comparative genomic hybridization.

10.
Multiple Displacement Amplification (MDA) of DNA using φ29 (phi29) DNA polymerase amplifies DNA several billion-fold, which has proved to be potentially very useful for evaluating genome information in a culture-independent manner. Whole genome sequencing using DNA from a single prokaryotic genome copy amplified by MDA has not yet been achieved due to the formation of chimeras and skewed amplification of genomic regions during the MDA step, which then precludes genome assembly. We addressed this issue by using 10 ng of genomic Vibrio cholerae DNA extracted within an agarose plug to ensure circularity as a starting point for MDA and then sequencing the amplified yield using the SOLiD platform. We successfully assembled the entire genome of V. cholerae strain LMA3984-4 (environmental O1 strain isolated in urban Amazonia) using a hybrid de novo assembly strategy. Using our method, only 178 out of 16,713 (1%) of contigs could not be inserted into either chromosome scaffold, and out of these 178, only 3 appeared to be chimeras. The other contigs seem to be the result of template-independent non-specific amplification during MDA, yielding spurious reads. Extraction of genomic DNA within an agarose plug in order to ensure circularity of the extracted genome might be key to minimizing amplification bias by MDA for WGS.

11.
Methods for haplotyping and DNA copy-number typing of single cells are paramount for studying genomic heterogeneity and enabling genetic diagnosis. Before analyzing the DNA of a single cell by microarray or next-generation sequencing, a whole-genome amplification (WGA) process is required, but it substantially distorts the frequency and composition of the cell’s alleles. As a consequence, haplotyping methods suffer from error-prone discrete SNP genotypes (AA, AB, BB) and DNA copy-number profiling remains difficult because true DNA copy-number aberrations have to be discriminated from WGA artifacts. Here, we developed a single-cell genome analysis method that reconstructs genome-wide haplotype architectures as well as the copy-number and segregational origin of those haplotypes by employing phased parental genotypes and deciphering WGA-distorted SNP B-allele fractions via a process we coin haplarithmisis. We demonstrate that the method can be applied as a generic method for preimplantation genetic diagnosis on single cells biopsied from human embryos, enabling diagnosis of disease alleles genome wide as well as numerical and structural chromosomal anomalies. Moreover, meiotic segregation errors can be distinguished from mitotic ones.

12.
Advances in both high-throughput sequencing and whole-genome amplification (WGA) protocols have allowed genomes to be sequenced from femtograms of DNA, for example from individual cells or from precious clinical and archived samples. Using the highly curated Caenorhabditis elegans genome as a reference, we have sequenced and identified errors and biases associated with Illumina library construction, library insert size, different WGA methods and genome features such as GC bias and simple repeat content. Detailed analysis of the reads from amplified libraries revealed characteristics suggesting that the majority of amplified fragment ends are identical but inverted versions of each other. Read coverage in amplified libraries is correlated with both tandem and inverted repeat content, while GC content only influences sequencing in long-insert libraries. Nevertheless, single nucleotide polymorphism (SNP) calls and assembly metrics from reads in amplified libraries show comparable results with unamplified libraries. To utilize the full potential of WGA to reveal the real biological interest, this article highlights the importance of recognizing additional sources of errors from amplified sequence reads and discusses the potential implications in downstream analyses.

13.
We developed and optimized a method using Chelex DNA extraction followed by whole genome amplification (WGA) to overcome problems conducting molecular genetic studies due to the limited amount of DNA obtainable from individual small organisms such as predatory mites. The DNA from a single mite, Phytoseiulus persimilis Athias-Henrot (Acari: Phytoseiidae), isolated in Chelex suspension was subjected to WGA. More than 1000-fold amplification of the DNA was achieved using as little as 0.03 ng genomic DNA template. The DNA obtained by the WGA was used for polymerase chain reaction followed by direct sequencing. From WGA DNA, nuclear DNA intergenic spacers ITS1 and ITS2 and a mitochondrial DNA 12S marker were tested in three different geographical populations of the predatory mite: California, the Netherlands, and Sicily. We found a total of four different alleles of the 12S in the Sicilian population, but no polymorphism was identified in the ITS marker. The combination of Chelex DNA extraction and WGA is thus shown to be a simple and robust technique for examining molecular markers for multiple loci by using individual mites. We conclude that the methods, Chelex extraction of DNA followed by WGA, provide a large quantity of DNA template that can be used for multiple PCR reactions useful for genetic studies requiring the genotypes of individual mites.

14.
Copy number variations (CNVs), a common genomic mutation associated with various diseases, are important in research and clinical applications. Whole genome amplification (WGA) and massively parallel sequencing have been applied to single-cell CNV analysis, which provides new insight for the fields of biology and medicine. However, the WGA-induced bias significantly limits sensitivity and specificity for CNV detection. Addressing these limitations, we developed a practical bioinformatic methodology for CNV detection at the single cell level using low coverage massively parallel sequencing. This method consists of GC correction for WGA-induced bias removal, a binary segmentation algorithm for locating CNV breakpoints, and dynamic threshold determination for final signal filtering. Afterwards, we evaluated our method with seven test samples using low coverage sequencing (4∼9.5%). Four single-cell samples from peripheral blood, whose karyotypes were confirmed by whole genome sequencing analysis, were acquired. Three other test samples derived from blastocysts whose karyotypes were confirmed by SNP-array analysis were also recruited. The detection results for CNVs larger than 1 Mb were highly consistent with confirmed results, reaching 99.63% sensitivity and 97.71% specificity at base-pair level. Our study demonstrates the potential to overcome WGA bias and to detect CNVs (>1 Mb) at the single cell level through low coverage massively parallel sequencing. It highlights the potential for CNV research on single cells or limited DNA samples and may prove to be a promising tool for research and clinical applications, such as pre-implantation genetic diagnosis/screening, fetal nucleated red blood cells research and cancer heterogeneity analysis.
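The abstract names GC correction as the first step of its CNV pipeline but does not specify the procedure. A simple and widely used variant, assumed here purely for illustration, divides each genomic bin's read count by the median count of bins with similar GC content, so that a copy-neutral bin ends up near 1.0 regardless of its GC fraction. The function name `gc_correct`, the stratum width, and the toy data are hypothetical; segmentation and thresholding are not shown.

```python
# Sketch of median-ratio GC correction for binned read counts.
# This is a generic technique chosen for illustration, not necessarily
# the correction used in the study described above.
from collections import defaultdict
from statistics import median
from typing import List, Sequence


def gc_correct(counts: Sequence[float], gc_fractions: Sequence[float],
               stratum_width: float = 0.05) -> List[float]:
    """Return each bin's count divided by the median count of its GC stratum."""
    # Strata are centred on multiples of stratum_width (0.40, 0.45, ...).
    keys = [round(gc / stratum_width) for gc in gc_fractions]
    strata = defaultdict(list)
    for key, count in zip(keys, counts):
        strata[key].append(count)
    stratum_median = {key: median(values) for key, values in strata.items()}
    # Guard against empty strata medians of zero to avoid division by zero.
    return [count / stratum_median[key] if stratum_median[key] > 0 else 0.0
            for key, count in zip(keys, counts)]


if __name__ == "__main__":
    # Low-GC bins amplify to ~100 reads, high-GC bins to ~50; the last
    # low-GC bin carries an extra copy (150 reads).
    counts = [100, 100, 100, 50, 50, 50, 150]
    gc = [0.40, 0.41, 0.39, 0.60, 0.61, 0.59, 0.40]
    print(gc_correct(counts, gc))
```

After correction, the high-GC bins no longer look like deletions relative to the low-GC bins, while the bin with 150 reads still stands out at a ratio of 1.5.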

15.
Population genomics is a useful tool to support integrated pest management as it can elucidate population dynamics, demography, and histories of invasion. Here, we use a restriction site‐associated DNA sequencing approach combined with whole‐genome amplification (WGA) to assess genomic population structure of a newly described pest of canola, the diminutive canola flower midge, Contarinia brassicola. Clustering analyses recovered little geographic structure across the main canola production region but differentiated several geographically disparate populations at edges of the agricultural zone. Given a lack of alternative hypotheses for this pattern, we suggest these data support alternative hosts for this species and thus our canola‐centric view of this midge as a pest has limited our understanding of its biology. These results speak to the need for increased surveying efforts across multiple habitats and other potential hosts within Brassicaceae to improve both our ecological and evolutionary knowledge of this species and contribute to effective management strategies. We additionally found that use of WGA prior to library preparation was an effective method for increasing DNA quantity of these small insects prior to restriction site‐associated DNA sequencing and had no discernible impact on genotyping consistency for population genetic analysis; WGA is therefore likely to be tractable for other similar studies that seek to randomly sample markers across the genome in small organisms.

16.
The objective of the present study was to develop an approach that could assess the chromosomal status and the mitochondrial DNA (mtDNA) content of oocytes and their corresponding polar bodies (PBs) with the goal of obtaining a comparative picture of the segregation process both for nuclear and mtDNA. After Whole Genome Amplification (WGA), sequencing of the whole mitochondrial genome was attempted to analyze the segregation of mutant and wild-type mtDNA during human meiosis. Three triads, composed of oocyte and corresponding PBs, were analyzed and their chromosome status was successfully assessed. The complete mitochondrial genome (mitogenome) was almost entirely sequenced in the oocytes (95.99% compared to 98.43% in blood), while the percentage of sequences obtained in the corresponding PB1 and PB2 was lower (69.70% and 69.04% respectively). The comparison with the mtDNA sequence in blood revealed no changes in the D-loop region for any of the cells of each triad. In the coding region, blood mtDNA and oocyte mtDNA sequences showed full correspondence, whereas all PBs had at least one change with respect to the blood-oocyte pairs. In all, 9 changes were found, either in PB1 or PB2: 4 in MT-ND5, 2 in MT-RNR2, and 1 each in MT-ATP8, MT-ND4, MT-CYTB. The full concordance between oocyte and blood in the 3 triads, and the relegation of changes to PBs, revealed the unexpected coexistence of different variants, giving a refined estimation of mitochondrial heteroplasmy. Should these findings be confirmed by additional data, an active mechanism could be postulated in the oocyte to preserve a condition of ‘normality’.

17.
Genomics, 2020, 112(1): 207-211
Viral sequence integrations in the human genome have been implicated in various human diseases. Viral integrations remain among the most challenging-to-detect structural changes of the human genome. No studies have systematically analyzed how molecular and bioinformatics factors affect the power (sensitivity) to detect viral integrations using high-throughput sequencing (HTS). We selected a wide range of molecular and bioinformatics factors covering genome sequence characteristics, HTS features, and viral integration detection. We designed a fast simulation-based framework to model the process of detecting variable viral integration events in the human genome. We then examined the associations of selected factors with viral integration detection power. We identified six factors that significantly affected viral integration detection power (P < 2 × 10^-16). The strongest factors associated with detection power included proportion of sample cells with clonal viral integrations (Pearson's ρ = 0.64), sequencing depth (ρ = 0.37), length of viral integration (ρ = 0.37), paired-end read insert size (ρ = 0.23), user-defined threshold (number of supporting reads) to claim successful identification of integrations (ρ = −0.19), and read length (when sequence volume was fixed) (ρ = −0.09). As the first tool of its kind, VIpower incorporates all these factors, which can be manipulated in concert with each other to optimize the detection power. This tool may be used to estimate viral integration detection power for various combinations of sequencing or analytic parameters. It may also be used to estimate the parameters required to achieve a specific power when designing new sequencing experiments.
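To make the direction of three of these effects concrete, the following toy model treats the number of reads supporting an integration junction as Poisson-distributed, with a mean equal to sequencing depth multiplied by the fraction of cells carrying the integration. This is a deliberately crude assumption for illustration only; it is not VIpower's simulation framework, it ignores insert size, read length and integration length, and the function name `detection_power` is hypothetical.

```python
# Toy Poisson model of viral-integration detection power.
# Assumption (not from the source): supporting reads ~ Poisson(depth * f),
# where f is the fraction of sample cells carrying the clonal integration.
import math


def detection_power(depth: float, clonal_fraction: float,
                    min_supporting_reads: int) -> float:
    """Probability of observing at least `min_supporting_reads` junction reads."""
    lam = depth * clonal_fraction  # expected number of supporting reads
    # P(X < threshold) for a Poisson variable with mean lam.
    p_below = sum(math.exp(-lam) * lam**k / math.factorial(k)
                  for k in range(min_supporting_reads))
    return 1.0 - p_below


if __name__ == "__main__":
    for depth, fraction, threshold in [(10, 0.1, 3), (30, 0.1, 3),
                                       (30, 0.5, 3), (30, 0.5, 10)]:
        power = detection_power(depth, fraction, threshold)
        print(f"depth={depth} fraction={fraction} "
              f"threshold={threshold} power={power:.3f}")
```

Even this simplified model reproduces the signs of the reported correlations: power rises with clonal fraction and depth and falls as the supporting-read threshold is raised.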

18.
Single-cell RNA sequencing is a powerful technique that continues to expand across various biological applications. However, incomplete 3′-UTR annotations can impede single-cell analysis resulting in genes that are partially or completely uncounted. Performing single-cell RNA sequencing with incomplete 3′-UTR annotations can hinder the identification of cell identities and gene expression patterns and lead to erroneous biological inferences. We demonstrate that performing single-cell isoform sequencing in tandem with single-cell RNA sequencing can rapidly improve 3′-UTR annotations. Using threespine stickleback fish (Gasterosteus aculeatus), we show that gene models resulting from a minimal embryonic single-cell isoform sequencing dataset retained 26.1% greater single-cell RNA sequencing reads than gene models from Ensembl alone. Furthermore, pooling our single-cell sequencing isoforms with a previously published adult bulk Iso-Seq dataset from stickleback, and merging the annotation with the Ensembl gene models, resulted in a marginal improvement (+0.8%) over the single-cell isoform sequencing only dataset. In addition, isoforms identified by single-cell isoform sequencing included thousands of new splicing variants. The improved gene models obtained using single-cell isoform sequencing led to successful identification of cell types and increased the reads identified of many genes in our single-cell RNA sequencing stickleback dataset. Our work illuminates single-cell isoform sequencing as a cost-effective and efficient mechanism to rapidly annotate genomes for single-cell RNA sequencing.

19.
In the last few years, dozens of studies have documented the detection of loci influenced by selection from genome scans in a wide range of non-model species. Many of those studies used amplified fragment length polymorphism (AFLP) markers, which became popular for being easily applicable to any organism. However, because they are anonymous markers, AFLPs impose many challenges for their isolation and identification. Most recent AFLP genome scans used capillary electrophoresis (CE), which adds even more obstacles to the isolation of bands with a specific size for sequencing. These caveats might explain the extremely low number of studies that moved from the detection of outlier AFLP markers to their actual isolation and characterization. We document our efforts to characterize a set of outlier AFLP markers from a previous genome scan with CE in ocellated lizards (Lacerta lepida). Seven outliers were successfully isolated, cloned and sequenced. Their sequences are noncoding and show internal indels or polymorphic repetitive elements (microsatellites). Three outliers were converted into codominant markers by using specific internal primers to sequence and screen population variability from undigested DNA. Amplification in closely related lizard species was also achieved, revealing remarkable interspecific conservation in outlier loci sequences. We stress the importance of following up AFLP genome scans to validate selection signatures of outlier loci, but also report the main challenges and pitfalls that may be faced during the process.

20.