首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Two-step PCR procedures are an efficient and well established way to generate amplicon libraries for NGS sequencing. However, there is a high risk of cross-contamination by carry-over of amplicons from first to second amplification rounds, potentially leading to severe misinterpretation of results. Here we describe a new method able to prevent and/or to identify carry-over contaminations by introducing the K-box, a series of three synergistically acting short sequence elements. Our K-boxes are composed of (i) K1 sequences for suppression of contaminations, (ii) K2 sequences for detection of possible residual contaminations and (iii) S sequences acting as separators to avoid amplification bias. In order to demonstrate the effectiveness of our method we analyzed two-step PCR NGS libraries derived from a multiplex PCR system for detection of T-cell receptor beta gene rearrangements. We used this system since it is of high clinical relevance and may be affected by very low amounts of contaminations. Spike-in contaminations are effectively blocked by the K-box even at high rates as demonstrated by ultra-deep sequencing of the amplicons. Thus, we recommend implementation of the K-box in two-step PCR-based NGS systems for research and diagnostic applications demanding high sensitivity and accuracy.  相似文献   

2.
3.
Massively parallel sequencing of 16S rRNA genes enables the comparison of terrestrial, aquatic, and host-associated microbial communities with sufficient sequencing depth for robust assessments of both alpha and beta diversity. Establishing standardized protocols for the analysis of microbial communities is dependent on increasing the reproducibility of PCR-based molecular surveys by minimizing sources of methodological bias. In this study, we tested the effects of template concentration, pooling of PCR amplicons, and sample preparation/interlane sequencing on the reproducibility associated with paired-end Illumina sequencing of bacterial 16S rRNA genes. Using DNA extracts from soil and fecal samples as templates, we sequenced pooled amplicons and individual reactions for both high (5- to 10-ng) and low (0.1-ng) template concentrations. In addition, all experimental manipulations were repeated on two separate days and sequenced on two different Illumina MiSeq lanes. Although within-sample sequence profiles were highly consistent, template concentration had a significant impact on sample profile variability for most samples. Pooling of multiple PCR amplicons, sample preparation, and interlane variability did not influence sample sequence data significantly. This systematic analysis underlines the importance of optimizing template concentration in order to minimize variability in microbial-community surveys and indicates that the practice of pooling multiple PCR amplicons prior to sequencing contributes proportionally less to reducing bias in 16S rRNA gene surveys with next-generation sequencing.  相似文献   

4.
Detection and enumeration of Cryptosporidium parvum in both treated and untreated waters are important to facilitate prevention of future cryptosporidiosis incidents. Immunomagnetic separation (IMS)-fluorescent antibody (FA) detection and IMS-PCR detection efficiencies were evaluated in two natural waters seeded with nominal seed doses of 5, 10, and 15 oocysts. IMS-FA detected oocysts at concentrations at or below the three nominal oocyst seed doses, illustrating that IMS-FA is sensitive enough to detect low oocyst numbers. However, the species of the oocysts could not be determined with this technique. IMS-PCR, targeting the 18S rRNA gene in this study, yielded positive amplification for 17 of the 18 seeded water samples, and the amplicons were subjected to restriction fragment length polymorphism digestion and DNA sequencing for species identification. Interestingly, the two unseeded, natural water samples were also PCR positive; one amplicon was the same base pair size as the C. parvum amplicon, and the other amplicon was larger. These two amplified products were determined to be derived from DNA of Cryptosporidium muris and a dinoflagellate. These IMS-PCR results illustrate that (i) IMS-PCR is able to detect low oocyst numbers in natural waters, (ii) PCR amplification alone is not confirmatory for detection of target DNA when environmental samples are used, (ii) PCR primers, especially those designed against the rRNA gene region, need to be evaluated for specificity with organisms closely related to the target organism, and (iv) environmental amplicons should be subjected to appropriate species-specific confirmatory techniques.  相似文献   

5.
Detection and enumeration of Cryptosporidium parvum in both treated and untreated waters are important to facilitate prevention of future cryptosporidiosis incidents. Immunomagnetic separation (IMS)-fluorescent antibody (FA) detection and IMS-PCR detection efficiencies were evaluated in two natural waters seeded with nominal seed doses of 5, 10, and 15 oocysts. IMS-FA detected oocysts at concentrations at or below the three nominal oocyst seed doses, illustrating that IMS-FA is sensitive enough to detect low oocyst numbers. However, the species of the oocysts could not be determined with this technique. IMS-PCR, targeting the 18S rRNA gene in this study, yielded positive amplification for 17 of the 18 seeded water samples, and the amplicons were subjected to restriction fragment length polymorphism digestion and DNA sequencing for species identification. Interestingly, the two unseeded, natural water samples were also PCR positive; one amplicon was the same base pair size as the C. parvum amplicon, and the other amplicon was larger. These two amplified products were determined to be derived from DNA of Cryptosporidium muris and a dinoflagellate. These IMS-PCR results illustrate that (i) IMS-PCR is able to detect low oocyst numbers in natural waters, (ii) PCR amplification alone is not confirmatory for detection of target DNA when environmental samples are used, (ii) PCR primers, especially those designed against the rRNA gene region, need to be evaluated for specificity with organisms closely related to the target organism, and (iv) environmental amplicons should be subjected to appropriate species-specific confirmatory techniques.  相似文献   

6.
Efficient methods for constructing 16S tag amplicon libraries for pyrosequencing are needed for the rapid and thorough screening of infectious bacterial diversity from host tissue samples. Here we have developed a double‐nested PCR methodology that generates 16S tag amplicon libraries from very small amounts of bacteria/host samples. This methodology was tested for 133 kidney samples from the lake whitefish Coregonus clupeaformis (Salmonidae) sampled in five different lake populations. The double‐nested PCR efficiency was compared with two other PCR strategies: single primer pair amplification and simple nested PCR. The double‐nested PCR was the only amplification strategy to provide highly specific amplification of bacterial DNA. The resulting 16S amplicon libraries were synthesized and pyrosequenced using 454 FLX technology to analyse the variation of pathogenic bacteria abundance. The proportion of the community sequenced was very high (Good’s coverage estimator; mean = 95.4%). Furthermore, there were no significant differences of sequence coverage among samples. Finally, the occurrence of chimeric amplicons was very low. Therefore, the double‐nested PCR approach provides a rapid, informative and cost‐effective method for screening fish immunobiomes and most likely applicable to other low‐density microbiomes as well.  相似文献   

7.
Although DNA metabarcoding is an attractive approach for monitoring biodiversity, it is often difficult to detect all the species present in a bulk sample. In particular, sequence recovery for a given species depends on its biomass and mitome copy number as well as the primer set employed for PCR. To examine these variables, we constructed a mock community of terrestrial arthropods comprised of 374 species. We used this community to examine how species recovery was impacted when amplicon pools were constructed in four ways. The first two protocols involved the construction of bulk DNA extracts from different body segments (Bulk Abdomen, Bulk Leg). The other protocols involved the production of DNA extracts from single legs which were then merged prior to PCR (Composite Leg) or PCR‐amplified separately (Single Leg) and then pooled. The amplicons generated by these four treatments were then sequenced on three platforms (Illumina MiSeq, Ion Torrent PGM and Ion Torrent S5). The choice of sequencing platform did not substantially influence species recovery, although the Miseq delivered the highest sequence quality. As expected, species recovery was most efficient from the Single Leg treatment because amplicon abundance varied little among taxa. Among the three treatments where PCR occurred after pooling, the Bulk Abdomen treatment produced a more uniform read abundance than the Bulk Leg or Composite Leg treatment. Primer choice also influenced species recovery and evenness. Our results reveal how variation in protocols can have substantial impacts on perceived diversity unless sequencing coverage is sufficient to reach an asymptote.  相似文献   

8.
Different protocols based on Illumina high-throughput DNA sequencing and denaturing gradient gel electrophoresis (DGGE)-cloning were developed and applied for investigating hot spring related samples. The study was focused on three target genes: archaeal and bacterial 16S rRNA and mcrA of methanogenic microflora. Shorter read lengths of the currently most popular technology of sequencing by Illumina do not allow analysis of the complete 16S rRNA region, or of longer gene fragments, as was the case of Sanger sequencing. Here, we demonstrate that there is no need for special indexed or tailed primer sets dedicated to short variable regions of 16S rRNA since the presented approach allows the analysis of complete bacterial 16S rRNA amplicons (V1–V9) and longer archaeal 16S rRNA and mcrA sequences. Sample augmented with transposon is represented by a set of approximately 300 bp long fragments that can be easily sequenced by Illumina MiSeq. Furthermore, a low proportion of chimeric sequences was observed. DGGE-cloning based strategies were performed combining semi-nested PCR, DGGE and clone library construction. Comparing both investigation methods, a certain degree of complementarity was observed confirming that the DGGE-cloning approach is not obsolete. Novel protocols were created for several types of laboratories, utilizing the traditional DGGE technique or using the most modern Illumina sequencing.  相似文献   

9.
Sampling the sequence of a relatively small fraction of the genome in large numbers of individuals is an important objective for population genetics and association genetics approaches. However, currently available ‘sequence capture’ methods either require expensive instrumentation or have problems dealing with high sample numbers and relatively small target sizes. We have developed Genome-Tagged Amplification (GTA) as a flexible PCR-based method for preparing pools of hundreds of amplicons from hundreds of samples for next generation sequencing. The method involves tagging of genomic DNA with barcode adapters at restriction sites, followed by PCR amplification from flanking DNA. It is freely scalable for both sample number and amplicon number and has no specialized equipment requirement. An optimized protocol is presented which provides a matrix of 96 × 192 combinations of samples x amplicons, corresponding to a complete 454 Titanium run. Initially, we used 454 sequencing; however, GTA could easily be adapted to Illumina sequencing platforms as read lengths have significantly increased in this system.  相似文献   

10.
PCR permits the exponential and sequence-specific amplification of DNA, even from minute starting quantities. PCR is a fundamental step in preparing DNA samples for high-throughput sequencing. However, there are errors associated with PCR-mediated amplification. Here we examine the effects of four important sources of error—bias, stochasticity, template switches and polymerase errors—on sequence representation in low-input next-generation sequencing libraries. We designed a pool of diverse PCR amplicons with a defined structure, and then used Illumina sequencing to search for signatures of each process. We further developed quantitative models for each process, and compared predictions of these models to our experimental data. We find that PCR stochasticity is the major force skewing sequence representation after amplification of a pool of unique DNA amplicons. Polymerase errors become very common in later cycles of PCR but have little impact on the overall sequence distribution as they are confined to small copy numbers. PCR template switches are rare and confined to low copy numbers. Our results provide a theoretical basis for removing distortions from high-throughput sequencing data. In addition, our findings on PCR stochasticity will have particular relevance to quantification of results from single cell sequencing, in which sequences are represented by only one or a few molecules.  相似文献   

11.
Analysis of microbial communities by high-throughput pyrosequencing of SSU rRNA gene PCR amplicons has transformed microbial ecology research and led to the observation that many communities contain a diverse assortment of rare taxa-a phenomenon termed the Rare Biosphere. Multiple studies have investigated the effect of pyrosequencing read quality on operational taxonomic unit (OTU) richness for contrived communities, yet there is limited information on the fidelity of community structure estimates obtained through this approach. Given that PCR biases are widely recognized, and further unknown biases may arise from the sequencing process itself, a priori assumptions about the neutrality of the data generation process are at best unvalidated. Furthermore, post-sequencing quality control algorithms have not been explicitly evaluated for the accuracy of recovered representative sequences and its impact on downstream analyses, reducing useful discussion on pyrosequencing reads to their diversity and abundances. Here we report on community structures and sequences recovered for in vitro-simulated communities consisting of twenty 16S rRNA gene clones tiered at known proportions. PCR amplicon libraries of the V3-V4 and V6 hypervariable regions from the in vitro-simulated communities were sequenced using the Roche 454 GS FLX Titanium platform. Commonly used quality control protocols resulted in the formation of OTUs with >1% abundance composed entirely of erroneous sequences, while over-aggressive clustering approaches obfuscated real, expected OTUs. The pyrosequencing process itself did not appear to impose significant biases on overall community structure estimates, although the detection limit for rare taxa may be affected by PCR amplicon size and quality control approach employed. Meanwhile, PCR biases associated with the initial amplicon generation may impose greater distortions in the observed community structure.  相似文献   

12.
13.
Aims:  The focus of this study was to identify a bacterial 16S rRNA gene sequence, unique to microbiota in the human gut, for use in development of a dependable PCR assay to detect human faecal pollution in water.
Methods and Results:  Suppression subtractive hybridization (SSH) and bioinformatics were used to identify a genetic marker, within the 16S rRNA gene of Faecalibacterium , for the detection of human faeces. DNA sequencing analysis demonstrated that a majority (16) of 74 clones of the SSH library contained insertion sequences identified as Faecalibacterium 16S rRNA genes . Human faeces-specific sequences were derived and six PCR primer sets designed and tested against faecal DNA samples from human and nonhuman sources. One PCR primer set, HFB-F3 and HFB-R5, was exclusively associated with human faeces. These primers generated a human faeces-specific amplicon of 399 bp from 60·2% of human faecal samples and 100% of sewage samples.
Conclusions:  The subject Faecalibacterium marker is specific for sewage.
Significance and Impact of the Study:  This study represents the initial report of a Faecalibacterium marker for human faeces, which may prove useful for microbial source tracking.  相似文献   

14.
Invertebrate biodiversity measured at mostly family level is widely used in biological monitoring programmes to assess anthropogenic impacts on ecosystems. However, next‐generation sequencing (NGS) could allow development of new more sensitive biomonitoring tools by allowing rapid species identification. This could be accelerated if archived invertebrate collections and environmental information from past programmes are used to understand species distributions and their environmental responses. In this study, we take archived macroinvertebrate samples from two sites collected on multiple occasions and test whether NGS can successfully detect species. Samples had been stored in 70% ethanol at room temperature for up to 12 years. Three amplicons ranging from 197 to 274 bps within the DNA barcode region were amplified from samples and compared to DNA barcoding libraries to identify species. We were able to amplify partial DNA barcodes from most samples, and species were often detected with multiple amplicons. However, some singletons and taxa poorly covered by DNA barcoding were missed. This suggests additional DNA barcodes will be required to fill ‘gaps’ in current DNA barcode libraries for aquatic macroinvertebrates and/or that it may not be possible to detect all taxa in a sample. Furthermore, older samples often detected fewer taxa and were less reliable for amplification, suggesting NGS is best used on samples within 8 years of collection. Nevertheless, many common taxa with existing DNA barcodes were reliably identified with NGS and were often present at sites across multiple years, showing the potential of NGS for detecting common and abundant species in archived material.  相似文献   

15.
Most studies involving next-generation amplicon sequencing of microbial communities from environmental studies lack replicates. DNA extraction and PCR effects on the variation of read abundances of operational taxonomic units generated from deep amplicon 454 pyrosequencing was investigated using soil samples from an agricultural field with diseased pea. One sample was extracted four times, and one of these samples was PCR amplified four times to obtain eight replicates in total. Results showed that species richness was consistent among replicates. Variation among dominant taxa was low across replicates, whereas rare operational taxonomic units showed higher variation among replicates. The results indicate that pooling of several extractions and PCR amplicons will decrease variation among samples.  相似文献   

16.
Rhodococcus coprophilus, a natural inhabitant of herbivore faeces, has been suggested as a good indicator of animal (as opposed to human) faecal contamination of aquatic environments. However, conventional detection methods limit its use for this as they require up to 21 days to obtain a result. In this paper an optimised method for extracting R. coprophilus DNA from faecal samples is described. PCR and 5'-nuclease (TaqMan) PCR methods were developed to allow the detection and enumeration of R. coprophilus in faecal samples within 2-3 days. Both PCR methods targeted the 16S rRNA gene, producing an amplicon of 443 bp which was specific for R. coprophilus. Sixty cells were required to produce an amplification product by conventional PCR, while as little as one cell was required for the TaqMan PCR method. The latter approach gave a linear quantitative response over at least four log units with both bacterial cells and DNA. Successful amplification by PCR was achieved using DNA extracted from cow, sheep, horse and deer faeces but was negative for samples from humans, pig, possum, duck and rabbit. These PCR methods enhance the feasibility of using R. coprophilus to distinguish faecal pollution of farmed herbivores from human pollution.  相似文献   

17.

Background

Massively parallel sequencing technology is revolutionizing approaches to genomic and genetic research. Since its advent, the scale and efficiency of Next-Generation Sequencing (NGS) has rapidly improved. In spite of this success, sequencing genomes or genomic regions with extremely biased base composition is still a great challenge to the currently available NGS platforms. The genomes of some important pathogenic organisms like Plasmodium falciparum (high AT content) and Mycobacterium tuberculosis (high GC content) display extremes of base composition. The standard library preparation procedures that employ PCR amplification have been shown to cause uneven read coverage particularly across AT and GC rich regions, leading to problems in genome assembly and variation analyses. Alternative library-preparation approaches that omit PCR amplification require large quantities of starting material and hence are not suitable for small amounts of DNA/RNA such as those from clinical isolates. We have developed and optimized library-preparation procedures suitable for low quantity starting material and tolerant to extremely high AT content sequences.

Results

We have used our optimized conditions in parallel with standard methods to prepare Illumina sequencing libraries from a non-clinical and a clinical isolate (containing ~53% host contamination). By analyzing and comparing the quality of sequence data generated, we show that our optimized conditions that involve a PCR additive (TMAC), produces amplified libraries with improved coverage of extremely AT-rich regions and reduced bias toward GC neutral templates.

Conclusion

We have developed a robust and optimized Next-Generation Sequencing library amplification method suitable for extremely AT-rich genomes. The new amplification conditions significantly reduce bias and retain the complexity of either extremes of base composition. This development will greatly benefit sequencing clinical samples that often require amplification due to low mass of DNA starting material.  相似文献   

18.
Rapid advancements in sequencing technologies along with falling costs present widespread opportunities for microbiome studies across a vast and diverse array of environments. These impressive technological developments have been accompanied by a considerable growth in the number of methodological variables, including sampling, storage, DNA extraction, primer pairs, sequencing technology, chemistry version, read length, insert size, and analysis pipelines, amongst others. This increase in variability threatens to compromise both the reproducibility and the comparability of studies conducted. Here we perform the first reported study comparing both amplicon and shotgun sequencing for the three leading next-generation sequencing technologies. These were applied to six human stool samples using Illumina HiSeq, MiSeq and Ion PGM shotgun sequencing, as well as amplicon sequencing across two variable 16S rRNA gene regions. Notably, we found that the factor responsible for the greatest variance in microbiota composition was the chosen methodology rather than the natural inter-individual variance, which is commonly one of the most significant drivers in microbiome studies. Amplicon sequencing suffered from this to a large extent, and this issue was particularly apparent when the 16S rRNA V1-V2 region amplicons were sequenced with MiSeq. Somewhat surprisingly, the choice of taxonomic binning software for shotgun sequences proved to be of crucial importance with even greater discriminatory power than sequencing technology and choice of amplicon. Optimal N50 assembly values for the HiSeq was obtained for 10 million reads per sample, whereas the applied MiSeq and PGM sequencing depths proved less sufficient for shotgun sequencing of stool samples. The latter technologies, on the other hand, provide a better basis for functional gene categorisation, possibly due to their longer read lengths. Hence, in addition to highlighting methodological biases, this study demonstrates the risks associated with comparing data generated using different strategies. We also recommend that laboratories with particular interests in certain microbes should optimise their protocols to accurately detect these taxa using different techniques.  相似文献   

19.
The methanogenic community in hydrothermally active sediments of Guaymas Basin (Gulf of California, Mexico) was analyzed by PCR amplification, cloning, and sequencing of methyl coenzyme M reductase (mcrA) and 16S rRNA genes. Members of the Methanomicrobiales and Methanosarcinales dominated the mcrA and 16S rRNA clone libraries from the upper 15 cm of the sediments. Within the H2/CO2- and formate-utilizing family Methanomicrobiales, two mcrA and 16S rRNA lineages were closely affiliated with cultured species of the genera Methanoculleus and Methanocorpusculum. The most frequently recovered mcrA PCR amplicons within the Methanomicrobiales did not branch with any cultured genera. Within the nutritionally versatile family Methanosarcinales, one 16S rRNA amplicon and most of the mcrA PCR amplicons were affiliated with the obligately acetate utilizing species Methanosaeta concilii. The mcrA clone libraries also included phylotypes related to the methyl-disproportionating genus Methanococcoides. However, two mcrA and two 16S rRNA lineages within the Methanosarcinales were unrelated to any cultured genus. Overall, the clone libraries indicate a diversified methanogen community that uses H2/CO2, formate, acetate, and methylated substrates. Phylogenetic affiliations of mcrA and 16S rRNA clones with thermophilic and nonthermophilic cultured isolates indicate a mixed mesophilic and thermophilic methanogen community in the surficial Guaymas sediments.  相似文献   

20.
Metabarcoding of environmental samples on second‐generation sequencing platforms has rapidly become a valuable tool for ecological studies. A fundamental assumption of this approach is the reliance on being able to track tagged amplicons back to the samples from which they originated. In this study, we address the problem of sequences in metabarcoding sequencing outputs with false combinations of used tags (tag jumps). Unless these sequences can be identified and excluded from downstream analyses, tag jumps creating sequences with false, but already used tag combinations, can cause incorrect assignment of sequences to samples and artificially inflate diversity. In this study, we document and investigate tag jumping in metabarcoding studies on Illumina sequencing platforms by amplifying mixed‐template extracts obtained from bat droppings and leech gut contents with tagged generic arthropod and mammal primers, respectively. We found that an average of 2.6% and 2.1% of sequences had tag combinations, which could be explained by tag jumping in the leech and bat diet study, respectively. We suggest that tag jumping can happen during blunt‐ending of pools of tagged amplicons during library build and as a consequence of chimera formation during bulk amplification of tagged amplicons during library index PCR. We argue that tag jumping and contamination between libraries represents a considerable challenge for Illumina‐based metabarcoding studies, and suggest measures to avoid false assignment of tag jumping‐derived sequences to samples.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号