首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Massively parallel high throughput sequencing technologies allow us to interrogate the microbial composition of biological samples at unprecedented resolution. The typical approach is to perform high-throughout sequencing of 16S rRNA genes, which are then taxonomically classified based on similarity to known sequences in existing databases. Current technologies cause a predicament though, because although they enable deep coverage of samples, they are limited in the length of sequence they can produce. As a result, high-throughout studies of microbial communities often do not sequence the entire 16S rRNA gene. The challenge is to obtain reliable representation of bacterial communities through taxonomic classification of short 16S rRNA gene sequences. In this study we explored properties of different study designs and developed specific recommendations for effective use of short-read sequencing technologies for the purpose of interrogating bacterial communities, with a focus on classification using naïve Bayesian classifiers. To assess precision and coverage of each design, we used a collection of ∼8,500 manually curated 16S rRNA gene sequences from cultured bacteria and a set of over one million bacterial 16S rRNA gene sequences retrieved from environmental samples, respectively. We also tested different configurations of taxonomic classification approaches using short read sequencing data, and provide recommendations for optimal choice of the relevant parameters. We conclude that with a judicious selection of the sequenced region and the corresponding choice of a suitable training set for taxonomic classification, it is possible to explore bacterial communities at great depth using current technologies, with only a minimal loss of taxonomic resolution.  相似文献   

2.
Exploitation of microbial wealth, of which almost 95% or more is still unexplored, is a growing need. The taxonomic placements of a new isolate based on phenotypic characteristics are now being supported by information preserved in the 16S rRNA gene. However, the analysis of 16S rDNA sequences retrieved from metagenome, by the available bioinformatics tools, is subject to limitations. In this study, the occurrences of nucleotide features in 16S rDNA sequences have been used to ascertain the taxonomic placement of organisms. The tetra- and penta-nucleotide features were extracted from the training data set of the 16S rDNA sequence, and was subjected to an artificial neural network (ANN) based tool known as self-organizing map (SOM), which helped in visualization of unsupervised classification. For selection of significant features, principal component analysis (PCA) or curvilinear component analysis (CCA) was applied. The SOM along with these techniques could discriminate the sample sequences with more than 90% accuracy, highlighting the relevance of features. To ascertain the confidence level in the developed classification approach, the test data set was specifically evaluated for Thiobacillus, with Acidiphilium, Paracocus and Starkeya, which are taxonomically reassigned. The evaluation proved the excellent generalization capability of the developed tool. The topology of genera in SOM supported the conventional chemo-biochemical classification reported in the Bergey manual.  相似文献   

3.
AIMS: To identify Bacillus species and related genera by fingerprinting based on ribosomal RNA gene restriction patterns; to compare ribosomal RNA gene restriction patterns-based phylogenetic trees with trees based on 16S rRNA gene sequences; to evaluate the usefulness of ribosomal RNA gene restriction patterns as a taxonomic tool for the classification of Bacillus species and related genera. METHODS AND RESULTS: Seventy-eight bacterial species which include 42 Bacillus species, 31 species from five newly created Bacillus-related genera, and five species from five phenotypically related genera were tested. A total of 77 distinct 16S rRNA gene hybridization banding patterns were obtained. The dendrogram resulting from UPGMA analysis showed three distinct main genetic clusters at the 75% banding pattern similarity. A total of 77 distinct 23S and 5S rRNA genes hybridization banding patterns were obtained, and the dendrogram showed four distinct genetic clusters at the 75% banding pattern similarity. A third dendrogram was constructed using a combination of the data from the 16S rRNA gene fingerprinting and the 23S and 5S rRNA genes fingerprinting. It revealed three distinct main phylogenetic clusters at the 75% banding pattern similarity. CONCLUSIONS: The Bacillus species along with the species from related genera were identified successfully and differentiated by ribosomal RNA gene restriction patterns, and most were distributed with no apparent order in various clusters on each of the three dendrograms. SIGNIFICANCE AND IMPACT OF THE STUDY: Our data indicate that ribosomal RNA gene restriction patterns can be used to reconstruct the phylogeny of the Bacillus species and derived-genera that approximates, but does not duplicate, phylogenies based on 16S rRNA gene sequences.  相似文献   

4.
cpcHID操纵子序列用于钝顶节旋藻品系分类与鉴定的研究   总被引:4,自引:0,他引:4  
克隆并测定7株钝顶节旋藻品系的cpcHID操纵子序列,以及16SrRNA和16S-23SrRNA转录单元内间隔区(ITS)序列,进一步通过生物信息学和分子系统学等研究发现:(1)7株品系的cpcHID序列,以及16SrRNA和ITS序列具有很高的相似性。(2)基于7株品系cpcHID序列的GC含量绝对偏差平均值、碱基变异率和遗传距离系数普遍比基于16SrRNA和ITS序列的大。(3)基于cpcHID序列的分类结果与基于16SrRNA和ITS序列的十分相近。因此,cpcHID可作为节旋藻等蓝细菌分类与鉴定的一种新的分子标记,特别是以其丰富的信息量而在品系水平的分类鉴定中占有优势。  相似文献   

5.
The Ribosomal Database Project (RDP) Classifier, a na?ve Bayesian classifier, can rapidly and accurately classify bacterial 16S rRNA sequences into the new higher-order taxonomy proposed in Bergey's Taxonomic Outline of the Prokaryotes (2nd ed., release 5.0, Springer-Verlag, New York, NY, 2004). It provides taxonomic assignments from domain to genus, with confidence estimates for each assignment. The majority of classifications (98%) were of high estimated confidence (> or = 95%) and high accuracy (98%). In addition to being tested with the corpus of 5,014 type strain sequences from Bergey's outline, the RDP Classifier was tested with a corpus of 23,095 rRNA sequences as assigned by the NCBI into their alternative higher-order taxonomy. The results from leave-one-out testing on both corpora show that the overall accuracies at all levels of confidence for near-full-length and 400-base segments were 89% or above down to the genus level, and the majority of the classification errors appear to be due to anomalies in the current taxonomies. For shorter rRNA segments, such as those that might be generated by pyrosequencing, the error rate varied greatly over the length of the 16S rRNA gene, with segments around the V2 and V4 variable regions giving the lowest error rates. The RDP Classifier is suitable both for the analysis of single rRNA sequences and for the analysis of libraries of thousands of sequences. Another related tool, RDP Library Compare, was developed to facilitate microbial-community comparison based on 16S rRNA gene sequence libraries. It combines the RDP Classifier with a statistical test to flag taxa differentially represented between samples. The RDP Classifier and RDP Library Compare are available online at http://rdp.cme.msu.edu/.  相似文献   

6.
AIMS: In a bioterrorism event a rapid tool is needed to identify relevant dangerous bacteria. The aim of the study was to assess the usefulness of partial 16S rRNA gene sequence analysis and the suitability of diverse databases for identifying dangerous bacterial pathogens. METHODS AND RESULTS: For rapid identification purposes a 500-bp fragment of the 16S rRNA gene of 28 isolates comprising Bacillus anthracis, Brucella melitensis, Burkholderia mallei, Burkholderia pseudomallei, Francisella tularensis, Yersinia pestis, and eight genus-related and unrelated control strains was amplified and sequenced. The obtained sequence data were submitted to three public and two commercial sequence databases for species identification. The most frequent reason for incorrect identification was the lack of the respective 16S rRNA gene sequences in the database. CONCLUSIONS: Sequence analysis of a 500-bp 16S rDNA fragment allows the rapid identification of dangerous bacterial species. However, for discrimination of closely related species sequencing of the entire 16S rRNA gene, additional sequencing of the 23S rRNA gene or sequencing of the 16S-23S rRNA intergenic spacer is essential. SIGNIFICANCE AND IMPACT OF THE STUDY: This work provides comprehensive information on the suitability of partial 16S rDNA analysis and diverse databases for rapid and accurate identification of dangerous bacterial pathogens.  相似文献   

7.
Discordant phylogenies within the rrn loci of Rhizobia   总被引:9,自引:0,他引:9       下载免费PDF全文
It is evident from complete genome sequencing results that lateral gene transfer and recombination are essential components in the evolutionary process of bacterial genomes. Since this has important implications for bacterial systematics, the primary objective of this study was to compare estimated evolutionary relationships among a representative set of alpha-Proteobacteria by sequencing analysis of three loci within their rrn operons. Tree topologies generated with 16S rRNA gene sequences were significantly different from corresponding trees assembled with 23S rRNA gene and internally transcribed space region sequences. Besides the incongruence in tree topologies, evidence that distinct segments along the 16S rRNA gene sequences of bacteria currently classified within the genera Bradyrhizobium, Mesorhizobium and Sinorhizobium have a reticulate evolutionary history was also obtained. Our data have important implications for bacterial taxonomy, because currently most taxonomic decisions are based on comparative 16S rRNA gene sequence analysis. Since phylogenetic placement based on 16S rRNA gene sequence divergence perhaps is questionable, we suggest that the proposals of bacterial nomenclature or changes in their taxonomy that have been made may not necessarily be warranted. Accordingly, a more conservative approach should be taken in the future, in which taxonomic decisions are based on the analysis of a wider variety of loci and comparative analytical methods are used to estimate phylogenetic relationships among the genomes under consideration.  相似文献   

8.
Taxonomic classification of the thousands–millions of 16S rRNA gene sequences generated in microbiome studies is often achieved using a naïve Bayesian classifier (for example, the Ribosomal Database Project II (RDP) classifier), due to favorable trade-offs among automation, speed and accuracy. The resulting classification depends on the reference sequences and taxonomic hierarchy used to train the model; although the influence of primer sets and classification algorithms have been explored in detail, the influence of training set has not been characterized. We compared classification results obtained using three different publicly available databases as training sets, applied to five different bacterial 16S rRNA gene pyrosequencing data sets generated (from human body, mouse gut, python gut, soil and anaerobic digester samples). We observed numerous advantages to using the largest, most diverse training set available, that we constructed from the Greengenes (GG) bacterial/archaeal 16S rRNA gene sequence database and the latest GG taxonomy. Phylogenetic clusters of previously unclassified experimental sequences were identified with notable improvements (for example, 50% reduction in reads unclassified at the phylum level in mouse gut, soil and anaerobic digester samples), especially for phylotypes belonging to specific phyla (Tenericutes, Chloroflexi, Synergistetes and Candidate phyla TM6, TM7). Trimming the reference sequences to the primer region resulted in systematic improvements in classification depth, and greatest gains at higher confidence thresholds. Phylotypes unclassified at the genus level represented a greater proportion of the total community variation than classified operational taxonomic units in mouse gut and anaerobic digester samples, underscoring the need for greater diversity in existing reference databases.  相似文献   

9.
10.
The ribosomal rRNA genes are widely used as genetic markers for taxonomic identification of microbes. Particularly the small subunit (SSU; 16S/18S) rRNA gene is frequently used for species‐ or genus‐level identification, but also the large subunit (LSU; 23S/28S) rRNA gene is employed in taxonomic assignment. The metaxa software tool is a popular utility for extracting partial rRNA sequences from large sequencing data sets and assigning them to an archaeal, bacterial, nuclear eukaryote, mitochondrial or chloroplast origin. This study describes a comprehensive update to metaxa – metaxa 2 – that extends the capabilities of the tool, introducing support for the LSU rRNA gene, a greatly improved classifier allowing classification down to genus or species level, as well as enhanced support for short‐read (100 bp) and paired‐end sequences, among other changes. The performance of metaxa 2 was compared to other commonly used taxonomic classifiers, showing that metaxa 2 often outperforms previous methods in terms of making correct predictions while maintaining a low misclassification rate. metaxa 2 is freely available from http://microbiology.se/software/metaxa2/ .  相似文献   

11.
We studied the effect of ectomycorrhizal fungi on bacterial communities colonizing roots of Douglas fir (Pseudotsuga menziesii). Mycorrhizal tips were cleaned of soil and separated based on gross morphological characteristics. Sequencing of the internal transcribed spacers of the nuclear rRNA gene cluster indicated that the majority of the tips were colonized by fungi in the Russulaceae, with the genera Russula and Lactarius comprising 70% of the tips. Because coamplification of organellar 16S rRNA genes can interfere with bacterial community analysis of root tips, we developed and tested a new primer pair that permits amplification of bacterial 16S rRNA genes but discriminates more effectively against organellar sequences than commonly used bacterial primer sets. We then used terminal restriction fragment length polymorphism (T-RFLP) and sequence analysis of the 16S rRNA gene to examine differences in bacterial communities associated with the mycorrhizal tips. Cluster analysis of T-RFLP profiles indicated that there were different bacterial communities among the root tips; however, the communities did not seem to be affected by the taxonomic identity of the ectomycorrhizal fungi. Terminal restriction fragment profiling and sequencing of cloned partial 16S rRNA genes indicated that most bacteria on the ectomycorrhizal tips were related to the Alphaproteobacteria and the Bacteroidetes group.  相似文献   

12.
13.
Rickettsiella grylli is an intracellular bacterial pathogen of aquatic and terrestrial arthropods. Previous determination of its 16S rRNA-encoding sequence has led to the taxonomic classification of the genus Rickettsiella in the class Gammaproteobacteria, order Legionellales, family Coxiellaceae, i.e. in close vicinity to vertebrate pathogenic bacteria of the genera Coxiella and Legionella. Here we use the additional information available from the recently published first whole genome sequence from this genus to evaluate critically the taxonomic classification of R. grylli beyond the 16S rRNA gene level. Using phylogenetic reconstruction, together with significance testing on a data basis defined by a core set of 211 previously identified families of protein-encoding genes, together with a reanalysis of 16S rRNA gene data, the present study firmly corroborates the assignment of this species to both the class Gammaproteobacteria and the order Legionellales. However, the results obtained from concatenated and single protein, single protein-encoding gene, and 16S rRNA gene data demonstrate a similar phylogenetic distance of R. grylli to both the Coxiellaceae and the Legionellaceae and are, therefore, inconsistent with its current family-level classification. Consequently, a respective reorganization of the order Legionellales is proposed.  相似文献   

14.
Conventional classification of the species in the family Mycoplasmataceae is mainly based on phenotypic criteria, which are complicated, can be difficult to measure, and have the potential to be hampered by phenotypic deviations among the isolates. The number of biochemical reactions suitable for phenotypic characterization of the Mycoplasmataceae is also very limited and therefore the strategy for the final identification of the Mycoplasmataceae species is based on comparative serological results. However, serological testing of the Mycoplasmataceae species requires a performance panel of hyperimmune sera which contains anti-serum to each known species of the family, a high level of technical expertise, and can only be properly performed by mycoplasma-reference laboratories. In addition, the existence of uncultivated and fastidious Mycoplasmataceae species/isolates in clinical materials significantly complicates, or even makes impossible, the application of conventional bacteriological tests. The analysis of available genetic markers is an additional approach for the primary identification and phylogenetic classification of cultivable species and uncultivable or fastidious organisms in standard microbiological laboratories. The partial nucleotide sequences of the RNA polymerase β-subunit gene (rpoB) and the 16S-23S rRNA intergenic transcribed spacer (ITS) were determined for all known type strains and the available non-type strains of the Mycoplasmataceae species. In addition to the available 16S rRNA gene data, the ITS and rpoB sequences were used to infer phylogenetic relationships among these species and to enable identification of the Mycoplasmataceae isolates to the species level. The comparison of the ITS and rpoB phylogenetic trees with the 16S rRNA reference phylogenetic tree revealed a similar clustering patterns for the Mycoplasmataceae species, with minor discrepancies for a few species that demonstrated higher divergence of their ITS and rpoB in comparison to their neighbor species. Overall, our results demonstrated that the ITS and rpoB gene could be useful complementary phylogenetic markers to infer phylogenetic relationships among the Mycoplasmataceae species and provide useful background information for the choice of appropriate metabolic and serological tests for the final classification of isolates. In summary, three-target sequence analysis, which includes the ITS, rpoB, and 16S rRNA genes, was demonstrated to be a reliable and useful taxonomic tool for the species differentiation within the family Mycoplasmataceae based on their phylogenetic relatedness and pairwise sequence similarities. We believe that this approach might also become a valuable tool for routine analysis and primary identification of new isolates in medical and veterinary microbiological laboratories.  相似文献   

15.
The phylogenetic diversity of the bacterial and archaeal community in the water and sediments of three large lakes of the Wadi An Natrun was investigated using 16S rRNA clone libraries. The bacterial community was diverse: 769 clones formed 345 operational taxonomic units (OTUs) defined at 99% 16S rRNA sequence identity. The bacterial community in both the water and sediments of the lakes was dominated by clones affiliated with the low G + C Gram-type-positive group, alpha-proteobacteria, and Bacteroidetes, (11-39, 11-30, and 10-37% of OTUs observed, respectively), patterns that have been observed in previously described alkaline, athalassohaline systems. However, a relatively high proportion of Firmicutess-related clones in the water of the lakes and alpha-proteobacteria in the sediments was observed. The bacterial community composition of the water and sediment of the same lake and of different lakes was significantly different (p < 0.05). Operational taxonomic units related to the gamma-proteobacteria were more abundant in the sediment of Lake Fazda, whereas the sediment of Lake UmRisha was dominated by members of the delta-proteobacteria. The proportion of gamma-proteobacterial and Bacteroidetes-affiliated OTUs were predominant in the water of Lake UmRisha and differed significantly from other lake waters (chi-squared analysis, p < or = 0.01). The more oxygenated and dilute nature of Lake Hamra was reflected in its microbial community composition, with the abundance of Bacillales sequences in the water, the absence of Halanaerobiales, Clostridiales, and Archaea in the water, and the presence of representatives of more phyla such as the Actinobacteria, Spirochaetes, and Verrucomicrobia. The archaeal community composition appeared less diverse: 589 clones resulted in 198 OTUs defined at 99% 16S rRNA sequence identity, and all sequences fell into the phylum Euryarchaeota. Phylogenetic analysis showed that many of the sequences were distantly related (83-90% 16S rRNA sequence identity) to cultured and uncultured archaea, with many clones forming clusters that branched deeply within the Euryarchaeota. Forty-two and 53% of the bacterial and archaeal clones had less than 90% 16S rRNA sequence identity to previously described sequences. This indicates that the water and sediments of the Wadi An Natrun harbor a unique and novel prokaryotic diversity that is different from what has been described among other alkaline, athalassohaline lakes.  相似文献   

16.
Obtaining full-length 16S rRNA gene sequences is important for generating accurate taxonomy assignments of bacteria, which normally is realized via clone library construction. However, the application of clone library has been hindered due to its limitations in sample throughput and in capturing minor populations (<1?% of total microorganisms). To overcome these limitations, a new strategy, two-step denaturing gradient gel electrophoresis (2S-DGGE), is developed to obtain full-length 16S rRNA gene sequences. 2S-DGGE can compare microbial communities based on its first-round DGGE profiles and generate partial 16S rRNA gene sequences (8-534?bp, Escherichia coli numbering). Then, strain-specific primers can be designed based on sequence information of bacteria of interest to PCR amplify their remaining 16S rRNA gene sequences (515-1541?bps, E. coli numbering). The second-round DGGE can confirm DNA sequence purity of these PCR products. Finally, the full-length 16S rRNA gene sequences can be obtained through combining the two partial DNA sequences. By employing 2S-DGGE, taxonomies of a group of dehalogenating bacteria have been assigned based on their full-length 16S rRNA gene sequences, several of which existed in dehalogenating microcosms as minor populations. In all, 2S-DGGE can be utilized as a medium throughput method for taxonomic identification of interested/minor populations from single or multiple microbial consortia.  相似文献   

17.
Operational taxonomic units (OTUs) are conventionally defined at a phylogenetic distance (0.03—species, 0.05—genus, 0.10—family) based on full-length 16S rRNA gene sequences. However, partial sequences (700 bp or shorter) have been used in most studies. This discord may affect analysis of diversity and species richness because sequence divergence is not distributed evenly along the 16S rRNA gene. In this study, we compared a set each of bacterial and archaeal 16S rRNA gene sequences of nearly full length with multiple sets of different partial 16S rRNA gene sequences derived therefrom (approximately 440-700 bp), at conventional and alternative distance levels. Our objective was to identify partial sequence region(s) and distance level(s) that allow more accurate phylogenetic analysis of partial 16S rRNA genes. Our results showed that no partial sequence region could estimate OTU richness or define OTUs as reliably as nearly full-length genes. However, the V1-V4 regions can provide more accurate estimates than others. For analysis of archaea, we recommend the V1-V3 and the V4-V7 regions and clustering of species-level OTUs at 0.03 and 0.02 distances, respectively. For analysis of bacteria, the V1-V3 and the V1-V4 regions should be targeted, with species-level OTUs being clustered at 0.04 distance in both cases.  相似文献   

18.
The new release of the All-Species Living Tree Project (LTP) represents an important step forward in the reconstruction of 16S rRNA gene phylogenies, since we not only provide an updated set of type strain sequences until December 2020, but also a series of improvements that increase the quality of the database. An improved universal alignment has been introduced that is implemented in the ARB format. In addition, all low-quality sequences present in the previous releases have been substituted by new entries with higher quality, many of them as a result of whole genome sequencing. Altogether, the improvements in the dataset and 16S rRNA sequence alignment allowed us to reconstruct robust phylogenies. The trees made available through this current LTP release feature the best topologies currently achievable. The given nomenclature and taxonomic hierarchy reflect all the changes available up to December 2020. The aim is to regularly update the validly published nomenclatural classification changes and new taxa proposals. The new release can be found at the following URL: https://imedea.uib-csic.es/mmg/ltp/.  相似文献   

19.
rpoB sequence analysis as a novel basis for bacterial identification   总被引:12,自引:0,他引:12  
Comparison of the sequences of conserved genes, most commonly those encoding 16S rRNA, is used for bacterial genotypic identification. Among some taxa, such as the Enterobacteriaceae, variation within this gene does not allow confident species identification. We investigated the usefulness of RNA polymerase beta-subunit encoding gene ( rpoB  ) sequences as an alternative tool for universal bacterial genotypic identification. We generated a database of partial rpoB for 14 Enterobacteriaceae species and then assessed the intra- and interspecies divergence between the rpoB and the 16S rRNA genes by pairwise comparisons. We found that levels of divergence between the rpoB sequences of different strains were markedly higher than those between their 16S rRNA genes. This higher discriminatory power was further confirmed by assigning 20 blindly selected clinical isolates to the correct enteric species on the basis of rpoB sequence comparison. Comparison of rpoB sequences from Enterobacteriaceae was also used as the basis for their phylogenetic analysis and demonstrated the genus Klebsiella to be polyphyletic. The trees obtained with rpoB were more compatible with the currently accepted classification of Enterobacteriaceae than those obtained with 16S rRNA. These data indicate that rpoB is a powerful identification tool, which may be useful for universal bacterial identification.  相似文献   

20.
As an evolutionary marker, 23S ribosomal RNA (rRNA) offers more diagnostic sequence stretches and greater sequence variation than 16S rRNA. However, 23S rRNA is still not as widely used. Based on 80 metagenome samples from the Global Ocean Sampling (GOS) Expedition, the usefulness and taxonomic resolution of 23S rRNA were compared to those of 16S rRNA. Since 23S rRNA is approximately twice as large as 16S rRNA, twice as many 23S rRNA gene fragments were retrieved from the GOS reads than 16S rRNA gene fragments, with 23S rRNA gene fragments being generally about 100 bp longer. Datasets for 16S and 23S rRNA sequences revealed similar relative abundances for major marine bacterial and archaeal taxa. However, 16S rRNA sequences had a better taxonomic resolution due to their significantly larger reference database.Reevaluation of the specificity of previously published PCR amplification primers and group specific fluorescence in situ hybridization probes on this metagenomic set of non-amplified 23S rRNA sequences revealed that out of 16 primers investigated, only two had more than 90% target group coverage. Evaluations of two probes, BET42a and GAM42a, were in accordance with previous evaluations, with a discrepancy in the target group coverage of the GAM42a probe when evaluated against the GOS metagenomic dataset.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号