首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 500 毫秒
1.
Metagenomics holds the promise of greatly advancing the study of diversity in natural communities, but novel theoretical and methodological approaches must first be developed and adjusted for these data sets. We evaluated widely used macroecological metrics of taxonomic diversity on a simulated set of metagenomic samples, using phylogenetically meaningful protein-coding genes as ecological proxies. To our knowledge, this is the first approach of this kind to evaluate taxonomic diversity metrics derived from metagenomic data sets. We demonstrate that abundance matrices derived from protein-coding marker genes reproduce more faithfully the structure of the original community than those derived from SSU-rRNA gene. We also found that the most commonly used diversity metrics are biased estimators of community structure and differ significantly from their corresponding real parameters and that these biases are most likely caused by insufficient sampling and differences in community phylogenetic composition. Our results suggest that the ranking of samples using multidimensional metrics makes a good qualitative alternative for contrasting community structure and that these comparisons can be greatly improved with the incorporation of metrics for both community structure and phylogenetic diversity. These findings will help to achieve a standardized framework for community diversity comparisons derived from metagenomic data sets.  相似文献   

2.
The microbial taxonomic composition of the three domains of life in two coastal plankton samples was assessed by random total community metagenomic sequencing and PCR-based rDNA amplicon deep-sequencing in order to compare the resulting diversity and investigate possible limitations and complementarities of each method. The various universal primer sets, used to amplify different hypervariable rDNA regions, revealed the same major high-level taxonomic groups in Bacteria and unicellular Eukaryota, and showed a scarce Archaea apparent richness. However, significant differences were found between the different primer sets (p-value < 0.05, with the Kolmogorov–Smirnov test), regarding both operational taxonomic unit (OTU) richness and relative abundance of the major high-level taxonomic groups detected. Based on the metagenomic approach, the phylum Bacteroidetes dominated the prokaryotic community, followed by Proteobacteria, while the detected eukaryotic unicellular taxa belonged to the groups of Alveolata, Fungi, Chlorophyta, Stramenopiles and Phaeophyceae. These groups were found to carry genes typically found in microbial communities, which are linked to DNA, RNA and protein metabolism and the synthesis of nucleotides, amino acids, carbohydrates and vitamins. Although our findings suggest that the total community metagenomic approach can provide a more comprehensive picture of the planktonic microbial community structure, a number of issues associated with this approach emerged. These issues include the still relatively high cost compared to amplicon sequencing, the possible low coverage of the full marine diversity, the insufficiency of databases for other gene markers than the small subunit gene, and the bias towards bacterial sequences because of their higher abundance relative to eukaryotes in marine environments.  相似文献   

3.
With the continual improvement in high‐throughput sequencing technology and constant updates to fungal reference databases, the use of amplicon‐based DNA markers as a tool to reveal fungal diversity and composition in various ecosystems has become feasible. However, both primer selection and the experimental procedure require meticulous verification. Here, we computationally and experimentally evaluated the accuracy and specificity of three widely used or newly designed internal transcribed spacer (ITS) primer sets (ITS1F/ITS2, gITS7/ITS4 and 5.8S‐Fun/ITS4‐Fun). In silico evaluation revealed that primer coverage varied at different taxonomic levels due to differences in degeneracy and the location of primer sets. Using even and staggered mock community standards, we identified different proportions of chimeric and mismatch reads generated by different primer sets, as well as great variation in species abundances, suggesting that primer selection would affect the results of amplicon‐based metabarcoding studies. Choosing proofreading and high‐fidelity polymerase (KAPA HiFi) could significantly reduce the percentage of chimeric and mismatch sequences, further reducing inflation of operational taxonomic units. Moreover, for two types of environmental fungal communities, plant endophytic and soil fungi, it was demonstrated that the three primer sets could not reach a consensus on fungal community composition or diversity, and that primer selection, not experimental treatment, determines observed soil fungal community diversity and composition. Future DNA marker surveys should pay greater attention to potential primer effects and improve the experimental scheme to increase credibility and accuracy.  相似文献   

4.

Background

The 16S rRNA gene-based amplicon sequencing analysis is widely used to determine the taxonomic composition of microbial communities. Once the taxonomic composition of each community is obtained, evolutionary relationships among taxa are inferred by a phylogenetic tree. Thus, the combined representation of taxonomic composition and phylogenetic relationships among taxa is a powerful method for understanding microbial community structure; however, applying phylogenetic tree-based representation with information on the abundance of thousands or more taxa in each community is a difficult task. For this purpose, we previously developed the tool VITCOMIC (VIsualization tool for Taxonomic COmpositions of MIcrobial Community), which is based on the genome-sequenced microbes’ phylogenetic information. Here, we introduce VITCOMIC2, which incorporates substantive improvements over VITCOMIC that were necessary to address several issues associated with 16S rRNA gene-based analysis of microbial communities.

Results

We developed VITCOMIC2 to provide (i) sequence identity searches against broad reference taxa including uncultured taxa; (ii) normalization of 16S rRNA gene copy number differences among taxa; (iii) rapid sequence identity searches by applying the graphics processing unit-based sequence identity search tool CLAST; (iv) accurate taxonomic composition inference and nearly full-length 16S rRNA gene sequence reconstructions for metagenomic shotgun sequencing; and (v) an interactive user interface for simultaneous representation of the taxonomic composition of microbial communities and phylogenetic relationships among taxa. We validated the accuracy of processes (ii) and (iv) by using metagenomic shotgun sequencing data from a mock microbial community.

Conclusions

The improvements incorporated into VITCOMIC2 enable users to acquire an intuitive understanding of microbial community composition based on the 16S rRNA gene sequence data obtained from both metagenomic shotgun and amplicon sequencing.
  相似文献   

5.
Phylogenetic diversity--patterns of phylogenetic relatedness among organisms in ecological communities--provides important insights into the mechanisms underlying community assembly. Studies that measure phylogenetic diversity in microbial communities have primarily been limited to a single marker gene approach, using the small subunit of the rRNA gene (SSU-rRNA) to quantify phylogenetic relationships among microbial taxa. In this study, we present an approach for inferring phylogenetic relationships among microorganisms based on the random metagenomic sequencing of DNA fragments. To overcome challenges caused by the fragmentary nature of metagenomic data, we leveraged fully sequenced bacterial genomes as a scaffold to enable inference of phylogenetic relationships among metagenomic sequences from multiple phylogenetic marker gene families. The resulting metagenomic phylogeny can be used to quantify the phylogenetic diversity of microbial communities based on metagenomic data sets. We applied this method to understand patterns of microbial phylogenetic diversity and community assembly along an oceanic depth gradient, and compared our findings to previous studies of this gradient using SSU-rRNA gene and metagenomic analyses. Bacterial phylogenetic diversity was highest at intermediate depths beneath the ocean surface, whereas taxonomic diversity (diversity measured by binning sequences into taxonomically similar groups) showed no relationship with depth. Phylogenetic diversity estimates based on the SSU-rRNA gene and the multi-gene metagenomic phylogeny were broadly concordant, suggesting that our approach will be applicable to other metagenomic data sets for which corresponding SSU-rRNA gene sequences are unavailable. Our approach opens up the possibility of using metagenomic data to study microbial diversity in a phylogenetic context.  相似文献   

6.
Fan L  McElroy K  Thomas T 《PloS one》2012,7(6):e39948
Direct sequencing of environmental DNA (metagenomics) has a great potential for describing the 16S rRNA gene diversity of microbial communities. However current approaches using this 16S rRNA gene information to describe community diversity suffer from low taxonomic resolution or chimera problems. Here we describe a new strategy that involves stringent assembly and data filtering to reconstruct full-length 16S rRNA genes from metagenomicpyrosequencing data. Simulations showed that reconstructed 16S rRNA genes provided a true picture of the community diversity, had minimal rates of chimera formation and gave taxonomic resolution down to genus level. The strategy was furthermore compared to PCR-based methods to determine the microbial diversity in two marine sponges. This showed that about 30% of the abundant phylotypes reconstructed from metagenomic data failed to be amplified by PCR. Our approach is readily applicable to existing metagenomic datasets and is expected to lead to the discovery of new microbial phylotypes.  相似文献   

7.
Next‐generation sequencing technologies give access to large sets of data, which are extremely useful in the study of microbial diversity based on 16S rRNA gene. However, the production of such large data sets is not only marred by technical biases and sequencing noise but also increases computation time and disc space use. To improve the accuracy of OTU predictions and overcome both computations, storage and noise issues, recent studies and tools suggested removing all single reads and low abundant OTUs, considering them as noise. Although the effect of applying an OTU abundance threshold on α‐ and β‐diversity has been well documented, the consequences of removing single reads have been poorly studied. Here, we test the effect of singleton read filtering (SRF) on microbial community composition using in silico simulated data sets as well as sequencing data from synthetic and real communities displaying different levels of diversity and abundance profiles. Scalability to large data sets is also assessed using a complete MiSeq run. We show that SRF drastically reduces the chimera content and computational time, enabling the analysis of a complete MiSeq run in just a few minutes. Moreover, SRF accurately determines the actual community diversity: the differences in α‐ and β‐community diversity obtained with SRF and standard procedures are much smaller than the intrinsic variability of technical and biological replicates.  相似文献   

8.
9.
Next-generation DNA sequencing (NGS) approaches are rapidly surpassing Sanger sequencing for characterizing the diversity of natural microbial communities. Despite this rapid transition, few comparisons exist between Sanger sequences and the generally much shorter reads of NGS. Operational taxonomic units (OTUs) derived from full-length (Sanger sequencing) and pyrotag (454 sequencing of the V9 hypervariable region) sequences of 18S rRNA genes from 10 global samples were analyzed in order to compare the resulting protistan community structures and species richness. Pyrotag OTUs called at 98% sequence similarity yielded numbers of OTUs that were similar overall to those for full-length sequences when the latter were called at 97% similarity. Singleton OTUs strongly influenced estimates of species richness but not the higher-level taxonomic composition of the community. The pyrotag and full-length sequence data sets had slightly different taxonomic compositions of rhizarians, stramenopiles, cryptophytes, and haptophytes, but the two data sets had similarly high compositions of alveolates. Pyrotag-based OTUs were often derived from sequences that mapped to multiple full-length OTUs at 100% similarity. Thus, pyrotags sequenced from a single hypervariable region might not be appropriate for establishing protistan species-level OTUs. However, nonmetric multidimensional scaling plots constructed with the two data sets yielded similar clusters, indicating that beta diversity analysis results were similar for the Sanger and NGS sequences. Short pyrotag sequences can provide holistic assessments of protistan communities, although care must be taken in interpreting the results. The longer reads (>500 bp) that are now becoming available through NGS should provide powerful tools for assessing the diversity of microbial eukaryotic assemblages.  相似文献   

10.
16S rRNA amplicon analysis and shotgun metagenome sequencing are two main culture-independent strategies to explore the genetic landscape of various microbial communities. Recently, numerous studies have employed these two approaches together, but downstream data analyses were performed separately, which always generated incongruent or conflict signals on both taxonomic and functional classifications. Here we propose a novel approach, RiboFR-Seq (Ribosomal RNA gene flanking region sequencing), for capturing both ribosomal RNA variable regions and their flanking protein-coding genes simultaneously. Through extensive testing on clonal bacterial strain, salivary microbiome and bacterial epibionts of marine kelp, we demonstrated that RiboFR-Seq could detect the vast majority of bacteria not only in well-studied microbiomes but also in novel communities with limited reference genomes. Combined with classical amplicon sequencing and shotgun metagenome sequencing, RiboFR-Seq can link the annotations of 16S rRNA and metagenomic contigs to make a consensus classification. By recognizing almost all 16S rRNA copies, the RiboFR-seq approach can effectively reduce the taxonomic abundance bias resulted from 16S rRNA copy number variation. We believe that RiboFR-Seq, which provides an integrated view of 16S rRNA profiles and metagenomes, will help us better understand diverse microbial communities.  相似文献   

11.
Prevailing 16S rRNA gene-amplicon methods for characterizing the bacterial microbiome of wildlife are economical, but result in coarse taxonomic classifications, are subject to primer and 16S copy number biases, and do not allow for direct estimation of microbiome functional potential. While deep shotgun metagenomic sequencing can overcome many of these limitations, it is prohibitively expensive for large sample sets. Here we evaluated the ability of shallow shotgun metagenomic sequencing to characterize taxonomic and functional patterns in the faecal microbiome of a model population of feral horses (Sable Island, Canada). Since 2007, this unmanaged population has been the subject of an individual-based, long-term ecological study. Using deep shotgun metagenomic sequencing, we determined the sequencing depth required to accurately characterize the horse microbiome. In comparing conventional vs. high-throughput shotgun metagenomic library preparation techniques, we validate the use of more cost-effective laboratory methods. Finally, we characterize similarities between 16S amplicon and shallow shotgun characterization of the microbiome, and demonstrate that the latter recapitulates biological patterns first described in a published amplicon data set. Unlike for amplicon data, we further demonstrate how shallow shotgun metagenomic data provide useful insights regarding microbiome functional potential which support previously hypothesized diet effects in this study system.  相似文献   

12.
Next‐generation sequencing technologies have provided unprecedented insights into fungal diversity and ecology. However, intrinsic biases and insufficient quality control in next‐generation methods can lead to difficult‐to‐detect errors in estimating fungal community richness, distributions and composition. The aim of this study was to examine how tissue storage prior to DNA extraction, primer design and various quality‐control approaches commonly used in 454 amplicon pyrosequencing might influence ecological inferences in studies of endophytic and endolichenic fungi. We first contrast 454 data sets generated contemporaneously from subsets of the same plant and lichen tissues that were stored in CTAB buffer, dried in silica gel or freshly frozen prior to DNA extraction. We show that storage in silica gel markedly limits the recovery of sequence data and yields a small fraction of the diversity observed by the other two methods. Using lichen mycobiont sequences as internal positive controls, we next show that despite careful filtering of raw reads and utilization of current best‐practice OTU clustering methods, homopolymer errors in sequences representing rare taxa artificially increased estimates of richness c. 15‐fold in a model data set. Third, we show that inferences regarding endolichenic diversity can be improved using a novel primer that reduces amplification of the mycobiont. Together, our results provide a rationale for selecting tissue treatment regimes prior to DNA extraction, demonstrate the efficacy of reducing mycobiont amplification in studies of the fungal microbiomes of lichen thalli and highlight the difficulties in differentiating true information about fungal biodiversity from methodological artefacts.  相似文献   

13.
The deep sequencing of 16S rRNA genes amplified by universal primers has revolutionized our understanding of microbial communities by allowing the characterization of the diversity of the uncultured majority. However, some universal primers also amplify eukaryotic rRNA genes, leading to a decrease in the efficiency of sequencing of prokaryotic 16S rRNA genes with possible mischaracterization of the diversity in the microbial community. In this study, we compared 16S rRNA gene sequences from genome-sequenced strains and identified candidates for non-degenerate universal primers that could be used for the amplification of prokaryotic 16S rRNA genes. The 50 identified candidates were investigated to calculate their coverage for prokaryotic and eukaryotic rRNA genes, including those from uncultured taxa and eukaryotic organelles, and a novel universal primer set, 342F-806R, covering many prokaryotic, but not eukaryotic, rRNA genes was identified. This primer set was validated by the amplification of 16S rRNA genes from a soil metagenomic sample and subsequent pyrosequencing using the Roche 454 platform. The same sample was also used for pyrosequencing of the amplicons by employing a commonly used primer set, 338F-533R, and for shotgun metagenomic sequencing using the Illumina platform. Our comparison of the taxonomic compositions inferred by the three sequencing experiments indicated that the non-degenerate 342F-806R primer set can characterize the taxonomic composition of the microbial community without substantial bias, and is highly expected to be applicable to the analysis of a wide variety of microbial communities.  相似文献   

14.
Metagenomic shotgun sequencing data can identify microbes populating a microbial community and their proportions, but existing taxonomic profiling methods are inefficient for increasingly large data sets. We present an approach that uses clade-specific marker genes to unambiguously assign reads to microbial clades more accurately and >50× faster than current approaches. We validated our metagenomic phylogenetic analysis tool, MetaPhlAn, on terabases of short reads and provide the largest metagenomic profiling to date of the human gut. It can be accessed at http://huttenhower.sph.harvard.edu/metaphlan/.  相似文献   

15.
High‐throughput sequencing approaches have enabled characterizations of the community composition of numerous gut microbial communities, which in turn has enhanced interest in their diversity and functional relationships in different groups of vertebrates. Although fishes represent the greatest taxonomic and ecological diversity of vertebrates, our understanding of their gut microbiota and its functional significance has lagged well behind that of terrestrial vertebrates. In order to highlight emerging issues, we provide an overview of research on fish gut microbiotas and the biology of their hosts. We conclude that microbial community composition must be viewed within an informed context of host ecology and physiology, and that this is of particular importance with respect to research planning and sampling design.  相似文献   

16.
Microbial community samples can be efficiently surveyed in high throughput by sequencing markers such as the 16S ribosomal RNA gene. Often, a collection of samples is then selected for subsequent metagenomic, metabolomic or other follow-up. Two-stage study design has long been used in ecology but has not yet been studied in-depth for high-throughput microbial community investigations. To avoid ad hoc sample selection, we developed and validated several purposive sample selection methods for two-stage studies (that is, biological criteria) targeting differing types of microbial communities. These methods select follow-up samples from large community surveys, with criteria including samples typical of the initially surveyed population, targeting specific microbial clades or rare species, maximizing diversity, representing extreme or deviant communities, or identifying communities distinct or discriminating among environment or host phenotypes. The accuracies of each sampling technique and their influences on the characteristics of the resulting selected microbial community were evaluated using both simulated and experimental data. Specifically, all criteria were able to identify samples whose properties were accurately retained in 318 paired 16S amplicon and whole-community metagenomic (follow-up) samples from the Human Microbiome Project. Some selection criteria resulted in follow-up samples that were strongly non-representative of the original survey population; diversity maximization particularly undersampled community configurations. Only selection of intentionally representative samples minimized differences in the selected sample set from the original microbial survey. An implementation is provided as the microPITA (Microbiomes: Picking Interesting Taxa for Analysis) software for two-stage study design of microbial communities.  相似文献   

17.
Deep sequencing of PCR amplicon libraries facilitates the detection of low‐abundance populations in environmental DNA surveys of complex microbial communities. At the same time, deep sequencing can lead to overestimates of microbial diversity through the generation of low‐frequency, error‐prone reads. Even with sequencing error rates below 0.005 per nucleotide position, the common method of generating operational taxonomic units (OTUs) by multiple sequence alignment and complete‐linkage clustering significantly increases the number of predicted OTUs and inflates richness estimates. We show that a 2% single‐linkage preclustering methodology followed by an average‐linkage clustering based on pairwise alignments more accurately predicts expected OTUs in both single and pooled template preparations of known taxonomic composition. This new clustering method can reduce the OTU richness in environmental samples by as much as 30–60% but does not reduce the fraction of OTUs in long‐tailed rank abundance curves that defines the rare biosphere.  相似文献   

18.
Endolithic (rock-dwelling) microbial communities are ubiquitous in hyper-arid deserts around the world and the last resort for life under extreme aridity. These communities are excellent models to explore biotic and abiotic drivers of diversity because they are of low complexity. Using high-throughput amplicon and metagenome sequencing, combined with X-ray computed tomography, we investigated how water availability and substrate architecture modulated the taxonomic and functional composition of gypsum endolithic communities in the Atacama Desert, Chile. We found that communities inhabiting gypsum rocks with a more fragmented substrate architecture had higher taxonomic and functional diversity, despite having less water available. This effect was tightly linked with community connectedness and likely the result of niche differentiation. Gypsum communities were functionally similar, yet adapted to their unique micro-habitats by modulating their carbon and energy acquisition strategies and their growth modalities. Reconstructed population genomes showed that these endolithic microbial populations encoded potential pathways for anoxygenic phototrophy and atmospheric hydrogen oxidation as supplemental energy sources.  相似文献   

19.
DNA sequencing has become an integrated part of microbial ecology, and taxonomic marker genes such as the SSU and LSU rRNA are frequently used to assess community structure. One solution for taxonomic community analysis based on shotgun metagenomic data is the Metaxa2 software, which can extract and classify sequence fragments belonging to the rRNA genes. This paper describes the Metaxa2 Diversity Tools, a set of new open-source software programs that extends the capabilities of the Metaxa2 software. These tools allow for better handling of data from multiple samples, improved species classifications, rarefaction analysis accounting for unclassified entries, and determination of significant differences in community composition of different samples. We demonstrate the performance of the software tools on rRNA data extracted from different shotgun metagenomes, and find the tools to streamline and improve the assessments of community diversity, particularly for samples from environments for which few reference genomes are available. Finally, we establish that our resampling algorithm for determining community dissimilarity is robust to differences in coverage depth, suggesting that it forms a complement to multidimensional visualization approaches for finding differences between communities. The Metaxa2 Diversity Tools are included in recent versions (2.1 and later) of Metaxa2 (http://microbiology.se/software/metaxa2/) and facilitate implementation of Metaxa2 within software pipelines for taxonomic analysis of environmental communities.  相似文献   

20.
Human associated microbial communities exert tremendous influence over human health and disease. With modern metagenomic sequencing methods it is now possible to follow the relative abundance of microbes in a community over time. These microbial communities exhibit rich ecological dynamics and an important goal of microbial ecology is to infer the ecological interactions between species directly from sequence data. Any algorithm for inferring ecological interactions must overcome three major obstacles: 1) a correlation between the abundances of two species does not imply that those species are interacting, 2) the sum constraint on the relative abundances obtained from metagenomic studies makes it difficult to infer the parameters in timeseries models, and 3) errors due to experimental uncertainty, or mis-assignment of sequencing reads into operational taxonomic units, bias inferences of species interactions due to a statistical problem called “errors-in-variables”. Here we introduce an approach, Learning Interactions from MIcrobial Time Series (LIMITS), that overcomes these obstacles. LIMITS uses sparse linear regression with boostrap aggregation to infer a discrete-time Lotka-Volterra model for microbial dynamics. We tested LIMITS on synthetic data and showed that it could reliably infer the topology of the inter-species ecological interactions. We then used LIMITS to characterize the species interactions in the gut microbiomes of two individuals and found that the interaction networks varied significantly between individuals. Furthermore, we found that the interaction networks of the two individuals are dominated by distinct “keystone species”, Bacteroides fragilis and Bacteroided stercosis, that have a disproportionate influence on the structure of the gut microbiome even though they are only found in moderate abundance. Based on our results, we hypothesize that the abundances of certain keystone species may be responsible for individuality in the human gut microbiome.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号