首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 375 毫秒
1.
The potential for comparing microbial community population structures has been greatly enhanced by developments in next generation sequencing methods that can generate hundreds of thousands to millions of reads in a single run. Conversely, many microbial community comparisons have been published with no more than 1,000 sequences per sample. These studies have presented data on levels of shared operational taxonomic units (OTUs) between communities. Due to lack of coverage, that approach might compromise the conclusions about microbial diversity and the degree of difference between environments. In this study, we present data from recent studies that highlight this problem. Also, we analyzed datasets of 16 rRNA sequences with small and high sequence coverage from different environments to demonstrate that the level of sequencing effort used for analyzing microbial communities biases the results. We randomly sampled pyrosequencing-generated 16S rRNA gene libraries with increasing sequence effort. Sequences were used to calculate Good's coverage, the percentage of shared OTUs, and phylogenetic distance measures. Our data showed that simple counts of presence/absence of taxonomic unities do not reflect the real similarity in membership and structure of the bacterial communities and that community comparisons based on phylogenetic tests provide a way to test statistically significant differences between two or more environments without need an exhaustive sampling effort.  相似文献   

2.
Massively parallel high throughput sequencing technologies allow us to interrogate the microbial composition of biological samples at unprecedented resolution. The typical approach is to perform high-throughout sequencing of 16S rRNA genes, which are then taxonomically classified based on similarity to known sequences in existing databases. Current technologies cause a predicament though, because although they enable deep coverage of samples, they are limited in the length of sequence they can produce. As a result, high-throughout studies of microbial communities often do not sequence the entire 16S rRNA gene. The challenge is to obtain reliable representation of bacterial communities through taxonomic classification of short 16S rRNA gene sequences. In this study we explored properties of different study designs and developed specific recommendations for effective use of short-read sequencing technologies for the purpose of interrogating bacterial communities, with a focus on classification using naïve Bayesian classifiers. To assess precision and coverage of each design, we used a collection of ∼8,500 manually curated 16S rRNA gene sequences from cultured bacteria and a set of over one million bacterial 16S rRNA gene sequences retrieved from environmental samples, respectively. We also tested different configurations of taxonomic classification approaches using short read sequencing data, and provide recommendations for optimal choice of the relevant parameters. We conclude that with a judicious selection of the sequenced region and the corresponding choice of a suitable training set for taxonomic classification, it is possible to explore bacterial communities at great depth using current technologies, with only a minimal loss of taxonomic resolution.  相似文献   

3.
We constructed error-correcting DNA barcodes that allow one run of a massively parallel pyrosequencer to process up to 1,544 samples simultaneously. Using these barcodes we processed bacterial 16S rRNA gene sequences representing microbial communities in 286 environmental samples, corrected 92% of sample assignment errors, and thus characterized nearly as many 16S rRNA genes as have been sequenced to date by Sanger sequencing.  相似文献   

4.
Massively parallel sequencing of 16S rRNA genes enables the comparison of terrestrial, aquatic, and host-associated microbial communities with sufficient sequencing depth for robust assessments of both alpha and beta diversity. Establishing standardized protocols for the analysis of microbial communities is dependent on increasing the reproducibility of PCR-based molecular surveys by minimizing sources of methodological bias. In this study, we tested the effects of template concentration, pooling of PCR amplicons, and sample preparation/interlane sequencing on the reproducibility associated with paired-end Illumina sequencing of bacterial 16S rRNA genes. Using DNA extracts from soil and fecal samples as templates, we sequenced pooled amplicons and individual reactions for both high (5- to 10-ng) and low (0.1-ng) template concentrations. In addition, all experimental manipulations were repeated on two separate days and sequenced on two different Illumina MiSeq lanes. Although within-sample sequence profiles were highly consistent, template concentration had a significant impact on sample profile variability for most samples. Pooling of multiple PCR amplicons, sample preparation, and interlane variability did not influence sample sequence data significantly. This systematic analysis underlines the importance of optimizing template concentration in order to minimize variability in microbial-community surveys and indicates that the practice of pooling multiple PCR amplicons prior to sequencing contributes proportionally less to reducing bias in 16S rRNA gene surveys with next-generation sequencing.  相似文献   

5.
Illumina-based analysis of microbial community diversity   总被引:4,自引:0,他引:4  
Microbes commonly exist in milieus of varying complexity and diversity. Although cultivation-based techniques have been unable to accurately capture the true diversity within microbial communities, these deficiencies have been overcome by applying molecular approaches that target the universally conserved 16S ribosomal RNA gene. The recent application of 454 pyrosequencing to simultaneously sequence thousands of 16S rDNA sequences (pyrotags) has revolutionized the characterization of complex microbial communities. To date, studies based on 454 pyrotags have dominated the field, but sequencing platforms that generate many more sequence reads at much lower costs have been developed. Here, we use the Illumina sequencing platform to design a strategy for 16S amplicon analysis (iTags), and assess its generality, practicality and potential complications. We fabricated and sequenced paired-end libraries of amplified hyper-variable 16S rDNA fragments from sets of samples that varied in their contents, ranging from a single bacterium to highly complex communities. We adopted an approach that allowed us to evaluate several potential sources of errors, including sequencing artifacts, amplification biases, non-corresponding paired-end reads and mistakes in taxonomic classification. By considering each source of error, we delineate ways to make biologically relevant and robust conclusions from the millions of sequencing reads that can be readily generated by this technology.  相似文献   

6.
Environmental DNA sequencing is the gold standard to reveal microbial community structures. In most applications, a one-fragment PCR approach is applied to amplify a taxonomic marker gene, usually a hypervariable region of the 16S rRNA gene. We used a new reverse complement (RC)-PCR-based assay that amplifies seven out of the nine hypervariable regions of the 16S rRNA gene, to interrogate bacterial communities in sediment samples collected from different coastal marine sites with an impact gradient. In parallel, we employed a traditional one-fragment analysis of the hypervariable V3–V4 region to investigate whether the RC-PCR reveals more of the ‘unseen’ diversity obtained by the one-fragment approach. As a benchmark for the full deck of diversity, we subjected the samples to PCR-free metagenomic sequencing. None of the two PCR-based approaches recorded the full taxonomic repertoire obtained from the metagenomics datasets. However, the RC-PCR approach detected 2.8 times more bacterial genera compared to the near-saturation sequenced V3–V4 samples. RC-PCR is an ideal compromise between the standard one-fragment approach and metagenomics sequencing and may guide future environmental sequencing studies, in which bacterial diversity is a central subject.  相似文献   

7.
Microbial communities host unparalleled taxonomic diversity. Adequate characterization of environmental and host-associated samples remains a challenge for microbiologists, despite the advent of 16S rRNA gene sequencing. In order to increase the depth of sampling for diverse bacterial communities, we developed a method for sequencing and assembling millions of paired-end reads from the 16S rRNA gene (spanning the V3 region; ~200 nucleotides) by using an Illumina genome analyzer. To confirm reproducibility and to identify a suitable computational pipeline for data analysis, sequence libraries were prepared in duplicate for both a defined mixture of DNAs from known cultured bacterial isolates (>1 million postassembly sequences) and an Arctic tundra soil sample (>6 million postassembly sequences). The Illumina 16S rRNA gene libraries represent a substantial increase in number of sequences over all extant next-generation sequencing approaches (e.g., 454 pyrosequencing), while the assembly of paired-end 125-base reads offers a methodological advantage by incorporating an initial quality control step for each 16S rRNA gene sequence. This method incorporates indexed primers to enable the characterization of multiple microbial communities in a single flow cell lane, may be modified readily to target other variable regions or genes, and demonstrates unprecedented and economical access to DNAs from organisms that exist at low relative abundances.  相似文献   

8.
The deep sequencing of 16S rRNA genes amplified by universal primers has revolutionized our understanding of microbial communities by allowing the characterization of the diversity of the uncultured majority. However, some universal primers also amplify eukaryotic rRNA genes, leading to a decrease in the efficiency of sequencing of prokaryotic 16S rRNA genes with possible mischaracterization of the diversity in the microbial community. In this study, we compared 16S rRNA gene sequences from genome-sequenced strains and identified candidates for non-degenerate universal primers that could be used for the amplification of prokaryotic 16S rRNA genes. The 50 identified candidates were investigated to calculate their coverage for prokaryotic and eukaryotic rRNA genes, including those from uncultured taxa and eukaryotic organelles, and a novel universal primer set, 342F-806R, covering many prokaryotic, but not eukaryotic, rRNA genes was identified. This primer set was validated by the amplification of 16S rRNA genes from a soil metagenomic sample and subsequent pyrosequencing using the Roche 454 platform. The same sample was also used for pyrosequencing of the amplicons by employing a commonly used primer set, 338F-533R, and for shotgun metagenomic sequencing using the Illumina platform. Our comparison of the taxonomic compositions inferred by the three sequencing experiments indicated that the non-degenerate 342F-806R primer set can characterize the taxonomic composition of the microbial community without substantial bias, and is highly expected to be applicable to the analysis of a wide variety of microbial communities.  相似文献   

9.
【背景】在过去的十几年里,基于核糖体RNA基因的扩增子测序技术被广泛用于各种生态系统中微生物群落的多样性检测。扩增子测序的使用极大地促进了土壤、水体、空气等环境中微生物生态的相关研究。【目的】随着高通量测序技术的不断发展和参考数据库的不断更新,针对不同的环境样本的引物选择和改进仍然需要更深入的校验。【方法】本文收集了目前在微生物群落研究中被广泛采用的标记基因扩增通用引物,包括16S rRNA基因扩增常用的8对通用引物和2对古菌引物、9对真菌转录间隔区(internal transcribed spacer,ITS)基因扩增引物,以及18S rRNA基因扩增的4对真核微生物通用引物和1对真菌特异性引物。这些引物中包括了地球微生物组计划(Earth Microbiome Project,EMP)推荐的2对16S rRNA基因扩增引物、1对ITS1基因扩增引物和1对18S rRNA基因扩增引物。采用最近更新的标准数据库对这些引物进行了覆盖度和特异性评价。【结果】EMP推荐的引物依然具有较高的覆盖度,而其他引物在覆盖度及对特定环境或类群的特异性上也各有特点。此外,最近有研究对这些通用引物进行了一些改进,而我们也发现,一个碱基的变化都可能会导致评价结果或扩增产物发生明显变化,简并碱基的引入既可以覆盖更多的物种,但同时也会在一定程度上降低关注物种的特异性。【结论】研究结果为微生态研究中标记基因的引物选择提供了一个广泛的指导,但在关注具体科学问题时,引物的选择仍需数据指导与实验尝试。  相似文献   

10.
16S rRNA amplicon analysis and shotgun metagenome sequencing are two main culture-independent strategies to explore the genetic landscape of various microbial communities. Recently, numerous studies have employed these two approaches together, but downstream data analyses were performed separately, which always generated incongruent or conflict signals on both taxonomic and functional classifications. Here we propose a novel approach, RiboFR-Seq (Ribosomal RNA gene flanking region sequencing), for capturing both ribosomal RNA variable regions and their flanking protein-coding genes simultaneously. Through extensive testing on clonal bacterial strain, salivary microbiome and bacterial epibionts of marine kelp, we demonstrated that RiboFR-Seq could detect the vast majority of bacteria not only in well-studied microbiomes but also in novel communities with limited reference genomes. Combined with classical amplicon sequencing and shotgun metagenome sequencing, RiboFR-Seq can link the annotations of 16S rRNA and metagenomic contigs to make a consensus classification. By recognizing almost all 16S rRNA copies, the RiboFR-seq approach can effectively reduce the taxonomic abundance bias resulted from 16S rRNA copy number variation. We believe that RiboFR-Seq, which provides an integrated view of 16S rRNA profiles and metagenomes, will help us better understand diverse microbial communities.  相似文献   

11.
The high throughput and cost-effectiveness afforded by short-read sequencing technologies, in principle, enable researchers to perform 16S rRNA profiling of complex microbial communities at unprecedented depth and resolution. Existing Illumina sequencing protocols are, however, limited by the fraction of the 16S rRNA gene that is interrogated and therefore limit the resolution and quality of the profiling. To address this, we present the design of a novel protocol for shotgun Illumina sequencing of the bacterial 16S rRNA gene, optimized to amplify more than 90% of sequences in the Greengenes database and with the ability to distinguish nearly twice as many species-level OTUs compared to existing protocols. Using several in silico and experimental datasets, we demonstrate that despite the presence of multiple variable and conserved regions, the resulting shotgun sequences can be used to accurately quantify the constituents of complex microbial communities. The reconstruction of a significant fraction of the 16S rRNA gene also enabled high precision (>90%) in species-level identification thereby opening up potential application of this approach for clinical microbial characterization.  相似文献   

12.
Taxonomic marker gene studies, such as the 16S rRNA gene, have been used to successfully explore microbial diversity in a variety of marine, terrestrial, and host environments. For some of these environments long term sampling programs are beginning to build a historical record of microbial community structure. Although these 16S rRNA gene datasets do not intrinsically provide information on microbial metabolism or ecosystem function, this information can be developed by identifying metabolisms associated with related, phenotyped strains. Here we introduce the concept of metabolic inference; the systematic prediction of metabolism from phylogeny, and describe a complete pipeline for predicting the metabolic pathways likely to be found in a collection of 16S rRNA gene phylotypes. This framework includes a mechanism for assigning confidence to each metabolic inference that is based on a novel method for evaluating genomic plasticity. We applied this framework to 16S rRNA gene libraries from the West Antarctic Peninsula marine environment, including surface and deep summer samples and surface winter samples. Using statistical methods commonly applied to community ecology data we found that metabolic structure differed between summer surface and winter and deep samples, comparable to an analysis of community structure by 16S rRNA gene phylotypes. While taxonomic variance between samples was primarily driven by low abundance taxa, metabolic variance was attributable to both high and low abundance pathways. This suggests that clades with a high degree of functional redundancy can occupy distinct adjacent niches. Overall our findings demonstrate that inferred metabolism can be used in place of taxonomy to describe the structure of microbial communities. Coupling metabolic inference with targeted metagenomics and an improved collection of completed genomes could be a powerful way to analyze microbial communities in a high-throughput manner that provides direct access to metabolic and ecosystem function.  相似文献   

13.
Fan L  McElroy K  Thomas T 《PloS one》2012,7(6):e39948
Direct sequencing of environmental DNA (metagenomics) has a great potential for describing the 16S rRNA gene diversity of microbial communities. However current approaches using this 16S rRNA gene information to describe community diversity suffer from low taxonomic resolution or chimera problems. Here we describe a new strategy that involves stringent assembly and data filtering to reconstruct full-length 16S rRNA genes from metagenomicpyrosequencing data. Simulations showed that reconstructed 16S rRNA genes provided a true picture of the community diversity, had minimal rates of chimera formation and gave taxonomic resolution down to genus level. The strategy was furthermore compared to PCR-based methods to determine the microbial diversity in two marine sponges. This showed that about 30% of the abundant phylotypes reconstructed from metagenomic data failed to be amplified by PCR. Our approach is readily applicable to existing metagenomic datasets and is expected to lead to the discovery of new microbial phylotypes.  相似文献   

14.
15.
Toward a census of bacteria in soil   总被引:2,自引:0,他引:2  
For more than a century, microbiologists have sought to determine the species richness of bacteria in soil, but the extreme complexity and unknown structure of soil microbial communities have obscured the answer. We developed a statistical model that makes the problem of estimating richness statistically accessible by evaluating the characteristics of samples drawn from simulated communities with parametric community distributions. We identified simulated communities with rank-abundance distributions that followed a truncated lognormal distribution whose samples resembled the structure of 16S rRNA gene sequence collections made using Alaskan and Minnesotan soils. The simulated communities constructed based on the distribution of 16S rRNA gene sequences sampled from the Alaskan and Minnesotan soils had a richness of 5,000 and 2,000 operational taxonomic units (OTUs), respectively, where an OTU represents a collection of sequences not more than 3% distant from each other. To sample each of these OTUs in the Alaskan 16S rRNA gene library at least twice, 480,000 sequences would be required; however, to estimate the richness of the simulated communities using nonparametric richness estimators would require only 18,000 sequences. Quantifying the richness of complex environments such as soil is an important step in building an ecological framework. We have shown that generating sufficient sequence data to do so requires less sequencing effort than completely sequencing a bacterial genome.  相似文献   

16.
The recent introduction of massively parallel pyrosequencers allows rapid, inexpensive analysis of microbial community composition using 16S ribosomal RNA (rRNA) sequences. However, a major challenge is to design a workflow so that taxonomic information can be accurately and rapidly assigned to each read, so that the composition of each community can be linked back to likely ecological roles played by members of each species, genus, family or phylum. Here, we use three large 16S rRNA datasets to test whether taxonomic information based on the full-length sequences can be recaptured by short reads that simulate the pyrosequencer outputs. We find that different taxonomic assignment methods vary radically in their ability to recapture the taxonomic information in full-length 16S rRNA sequences: most methods are sensitive to the region of the 16S rRNA gene that is targeted for sequencing, but many combinations of methods and rRNA regions produce consistent and accurate results. To process large datasets of partial 16S rRNA sequences obtained from surveys of various microbial communities, including those from human body habitats, we recommend the use of Greengenes or RDP classifier with fragments of at least 250 bases, starting from one of the primers R357, R534, R798, F343 or F517.  相似文献   

17.
MetaMetaDB (http://mmdb.aori.u-tokyo.ac.jp/) is a database and analytic system for investigating microbial habitability, i.e., how a prokaryotic group can inhabit different environments. The interaction between prokaryotes and the environment is a key issue in microbiology because distinct prokaryotic communities maintain distinct ecosystems. Because 16S ribosomal RNA (rRNA) sequences play pivotal roles in identifying prokaryotic species, a system that comprehensively links diverse environments to 16S rRNA sequences of the inhabitant prokaryotes is necessary for the systematic understanding of the microbial habitability. However, existing databases are biased to culturable prokaryotes and exhibit limitations in the comprehensiveness of the data because most prokaryotes are unculturable. Recently, metagenomic and 16S rRNA amplicon sequencing approaches have generated abundant 16S rRNA sequence data that encompass unculturable prokaryotes across diverse environments; however, these data are usually buried in large databases and are difficult to access. In this study, we developed MetaMetaDB (Meta-Metagenomic DataBase), which comprehensively and compactly covers 16S rRNA sequences retrieved from public datasets. Using MetaMetaDB, users can quickly generate hypotheses regarding the types of environments a prokaryotic group may be adapted to. We anticipate that MetaMetaDB will improve our understanding of the diversity and evolution of prokaryotes.  相似文献   

18.
基于16S rRNA基因测序分析微生物群落多样性   总被引:6,自引:1,他引:5       下载免费PDF全文
微生物群落多样性的研究对于挖掘微生物资源,探索微生物群落功能,阐明微生物群落与生境间的关系具有重要意义。随着宏基因组概念的提出以及测序技术的快速发展,16S rRNA基因测序在微生物群落多样性的研究中已被广泛应用。文中系统地介绍了16S rRNA基因测序分析流程中的四个重要环节,包括测序平台与扩增区的选择、测序数据预处理以及多样性分析方法,就其面临的问题与挑战进行了探讨并对未来的研究方向进行了展望,以期为微生物群落多样性相关研究提供参考。  相似文献   

19.
Pyrosequencing technology allows us to characterize microbial communities using 16S ribosomal RNA (rRNA) sequences orders of magnitude faster and more cheaply than has previously been possible. However, results from different studies using pyrosequencing and traditional sequencing are often difficult to compare, because amplicons covering different regions of the rRNA might yield different conclusions. We used sequences from over 200 globally dispersed environments to test whether studies that used similar primers clustered together mistakenly, without regard to environment. We then tested whether primer choice affects sequence-based community analyses using UniFrac, our recently-developed method for comparing microbial communities. We performed three tests of primer effects. We tested whether different simulated amplicons generated the same UniFrac clustering results as near-full-length sequences for three recent large-scale studies of microbial communities in the mouse and human gut, and the Guerrero Negro microbial mat. We then repeated this analysis for short sequences (100-, 150-, 200- and 250-base reads) resembling those produced by pyrosequencing. The results show that sequencing effort is best focused on gathering more short sequences rather than fewer longer ones, provided that the primers are chosen wisely, and that community comparison methods such as UniFrac are surprisingly robust to variation in the region sequenced.  相似文献   

20.
We describe a rapid oligonucleotide probe design strategy based on subtractive hybridization which yields probes for 16S rRNA or rRNA genes of individual members of microbial communities that are specific within the context of those communities. This strategy circumvents the need to sequence many similar or identical clones of dominant members of a community. Radioactively labeled subfragments of a cloned 16S rRNA gene sequence for which a probe is required (target) were hybridized with biotinylated total 16S ribosomal DNA (rDNA) amplified from the microbial community, and the hybrids formed were subsequently discarded. The remaining enriched fragments were used to screen a library consisting of cloned subfragments of the target sequence by colony hybridization in order to identify the variable regions of the 16S rRNA gene with the required specificity. The sequencing of random clones in one 16S rDNA library demonstrated that only those clones with 100% sequence identity with the probe fragment were detected by it. Moreover, sequencing of other, randomly selected, probe-positive clones revealed 100% sequence identity with the probe. Probes developed in this way tended to correspond to more variable regions of the 16S rRNA if the target sequences were similar to the sequences of other clones in the library and to less variable regions if the target sequences were phylogenetically isolated within the clone library. Although the absolute specificity of the latter probes, as assessed by comparison with available database sequences, was lower than the absolute specificity of the probes from the more variable regions, they were specific within the context of the environmental samples from which they were derived.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号