Similar literature
1.
A new method for detecting chimeras and other anomalies within 16S rRNA sequence records is presented. Using this method, we screened 1,399 sequences from 19 phyla, as defined by the Ribosomal Database Project, release 9, update 22, and found 5.0% to harbor substantial errors. Of these, 64.3% were obvious chimeras, 14.3% were unidentified sequencing errors, and 21.4% were highly degenerate. In all, 11 phyla contained obvious chimeras, accounting for 0.8 to 11% of the records for these phyla. Many chimeras (43.1%) were formed from parental sequences belonging to different phyla. While most comprised two fragments, 13.7% were composed of at least three fragments, often from three different sources. A separate analysis of the Bacteroidetes phylum (2,739 sequences) also revealed 5.8% of records to be anomalous, of which 65.4% were apparently chimeric. Overall, we conclude that, as a conservative estimate, 1 in every 20 public database records is likely to be corrupt. Our results support concerns recently expressed over the quality of the public repositories. With 16S rRNA sequence data increasingly playing a dominant role in bacterial systematics and environmental biodiversity studies, it is vital that steps be taken to improve screening of sequences prior to submission. To this end, we have implemented our method as a program with a simple-to-use graphical user interface that is capable of running on a range of computer platforms. The program is called Pintail, is released under the open-source GNU General Public License, and is freely available from our website at http://www.cardiff.ac.uk/biosi/research/biosoft/.

2.
Libraries of 16S rRNA genes provide insight into the membership of microbial communities. Statistical methods help to determine whether differences in library composition are artifacts of sampling or are due to underlying differences in the communities from which they are derived. To contribute to a growing statistical framework for comparing 16S rRNA libraries, we present a computer program, ∫-LIBSHUFF, which calculates the integral form of the Cramér-von Mises statistic. This implementation builds upon the LIBSHUFF program, which uses an approximation of the statistic, and makes a number of modifications that improve precision and accuracy. Once ∫-LIBSHUFF calculates the P values, when pairwise comparisons are tested at the 0.05 level, the probability of falsely identifying a significant P value is 0.098 for a study with two libraries, 0.265 for three libraries, and 0.460 for four libraries. The potential negative effects of making the multiple pairwise comparisons necessitate correcting for the increased likelihood that differences between treatments are due to chance and do not reflect biological differences. Using ∫-LIBSHUFF, we found that previously published 16S rRNA gene libraries constructed from Scottish and Wisconsin soils contained different bacterial lineages. We also analyzed the published libraries constructed for the zebrafish gut microflora and found statistically significant changes in the community during development of the host. These analyses illustrate the power of ∫-LIBSHUFF to detect differences between communities, providing the basis for ecological inference about the association of soil productivity or host gene expression and microbial community composition.
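
The false-positive probabilities quoted in this abstract (0.098, 0.265, and 0.460) are consistent with treating every ordered pair of libraries as an independent test at the 0.05 level, giving n(n-1) comparisons for n libraries. The Python sketch below reproduces that arithmetic. The function names and the independence assumption are illustrative and are not taken from the ∫-LIBSHUFF program itself; the Bonferroni adjustment is shown only as one common correction, since the abstract does not say which correction the authors recommend.

```python
def familywise_error(num_libraries: int, alpha: float = 0.05) -> float:
    """Probability of at least one false positive across all ordered
    pairwise comparisons, assuming the tests are independent."""
    # Each library is compared against every other one in both
    # directions (X vs. Y and Y vs. X), hence n * (n - 1) tests.
    comparisons = num_libraries * (num_libraries - 1)
    return 1.0 - (1.0 - alpha) ** comparisons


def bonferroni_alpha(num_libraries: int, alpha: float = 0.05) -> float:
    """Per-comparison significance level under a Bonferroni correction
    (an assumed choice of correction, used here for illustration)."""
    comparisons = num_libraries * (num_libraries - 1)
    return alpha / comparisons


if __name__ == "__main__":
    for n in (2, 3, 4):
        print(n, familywise_error(n), bonferroni_alpha(n))
```

For two, three, and four libraries the computed family-wise rates are approximately 0.0975, 0.2649, and 0.4596, matching the rounded values in the abstract.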

3.
The contribution of PCR artifacts to 16S rRNA gene sequence diversity from a complex bacterioplankton sample was estimated. Taq DNA polymerase errors were found to be the dominant sequence artifact but could be constrained by clustering the sequences into 99% sequence similarity groups. Other artifacts (chimeras and heteroduplex molecules) were significantly reduced by employing modified amplification protocols. Surprisingly, no skew in sequence types was detected in the two libraries constructed from PCR products amplified for different numbers of cycles. Recommendations for modification of amplification protocols and for reporting diversity estimates at 99% sequence similarity as a standard are given.

6.
High-throughput culturing (HTC) methods that rely on dilution to extinction in very-low-nutrient media were used to obtain bacterial isolates from Crater Lake, Oregon. 16S rRNA sequence determination and phylogenetic reconstruction were used to determine the potential ecological significance of isolated bacteria, both in Crater Lake and globally. Fifty-five Crater Lake isolates yielded 16 different 16S rRNA gene sequences. Thirty of 55 (55%) Crater Lake isolates had 16S rRNA gene sequences with 97% or greater similarity to sequences recovered previously from Crater Lake 16S rRNA gene clone libraries. Furthermore, 36 of 55 (65%) Crater Lake isolates were found to be members of widely distributed freshwater groups. These results confirm that HTC is a significant improvement over traditional isolation techniques that tend to enrich for microorganisms that do not predominate in their environment and rarely correlate with 16S rRNA gene clone library sequences. Although all isolates were obtained under dark, heterotrophic growth conditions, 2 of the 16 different groups showed evidence of photosynthetic capability as assessed by the presence of puf operon sequences, suggesting that photoheterotrophy may be a significant process in this oligotrophic, freshwater habitat.

8.
SUMMARY: Bellerophon is a program for detecting chimeric sequences in multiple sequence datasets by an adaptation of partial treeing analysis. Bellerophon was specifically developed to detect 16S rRNA gene chimeras in PCR-clone libraries of environmental samples but can be applied to other nucleotide sequence alignments. AVAILABILITY: Bellerophon is available as an interactive web server at http://foo.maths.uq.edu.au/~huber/bellerophon.pl

9.
The bacterioneuston is defined as the community of bacteria present within the neuston or sea surface microlayer. Bacteria within this layer were sampled using a membrane filter technique, and bacterial diversity was compared with that in the underlying pelagic coastal seawater using molecular ecological techniques. 16S rRNA gene libraries of approximately 500 clones were constructed from both the bacterioneuston and the pelagic water samples, and representative clones from each library were sequenced for comparison of bacterial diversity. The bacterioneuston was found to have a significantly lower bacterial diversity than the pelagic seawater, with only nine clone types (ecotaxa) as opposed to 46 ecotaxa in the pelagic seawater library. Surprisingly, the bacterioneuston clone library was dominated by 16S rRNA gene sequences affiliated to two groups of organisms: Vibrio spp., which accounted for over 68% of clones, and Pseudoalteromonas spp., which accounted for 21% of the library. The dominance of these two 16S rRNA gene sequence types within the bacterioneuston clone library was confirmed in a subsequent gene probing experiment. 16S rRNA gene probes specific for these groups of bacteria were designed and used to probe new libraries of 1000 clones from both the bacterioneuston and pelagic seawater DNA samples. This revealed that 57% of clones from the bacterioneuston library hybridized to a Vibrio sp.-specific 16S rRNA gene probe and 32% hybridized to a Pseudoalteromonas sp.-specific 16S rRNA gene probe. In contrast, in the pelagic seawater library only 13% and 8% of 16S rRNA gene clones hybridized to the Vibrio sp. and Pseudoalteromonas sp. probes, respectively. Results from this study suggest that the bacterioneuston contains a distinct population of bacteria and warrants further detailed study at the molecular level.

10.
Molecular diversity of archaea in the rumen of Jinnan cattle (total citations: 2; self-citations: 0; citations by others: 2)
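Three pairs of archaea-specific primers were used to amplify rumen archaeal 16S rRNA genes, and a separate clone library was constructed from each amplification to study the diversity of rumen archaea in Jinnan cattle. One hundred clones were picked at random from each library. In the library built with primers Arch f364/1386, the clones fell into four groups, similar to four Methanobrevibacter strains: 1Y (61%), SM9 (23%), NT7 (14%), and AK-87 (2%). In the library built with primers 1Af/1100Ar, the clones fell into two groups, similar to Methanobacterium aarhusense (72%) and Methanosphaera stadtmanae DSM 3091 (28%). The library built with primers Met86F/Met1340R gave the most complete picture of the archaeal community: in addition to the four Methanobrevibacter strains above (47%, 26%, 11%, and 3%, respectively), it contained Methanomicrobium mobile (2%), sequences resembling Methanobacterium aarhusense (1%) and Methanosphaera stadtmanae (3%), and 7% unmatched sequences. Phylogenetic analysis showed that these clones belong to five branches: Methanobrevibacter, Methanobacterium, Methanosphaera, Methanomicrobium, and unknown Euryarchaeota. Twenty-five groups of unknown sequences belonged to the Euryarchaeota, suggesting that the rumen harbors a large number of unknown methanogens.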

11.
Samples of the sponge Haliclona simulans were collected from Irish waters and subjected to a culture-independent analysis to determine the microbial, polyketide synthase (PKS) and non-ribosomal peptide synthase (NRPS) diversity. 16S rRNA gene libraries were prepared from total sponge, bacterial enriched sponge and seawater samples. Eight phyla from the Bacteria were detected in the sponge by phylogenetic analyses of the 16S rRNA gene libraries. The most abundant phylum in the total sponge library was the Proteobacteria (86%), with the majority of these clones being from the γ-Proteobacteria (77%); two groups of clones were dominant and together made up 69% of the total. Both of these groups were related to other sponge-derived microbes and comprised novel genera. Within the other bacterial phyla, groups of clones representing novel candidate genera within the phyla Verrucomicrobia and Lentisphaerae were also found. Selective enrichment of the bacterial component of the sponge prior to 16S rRNA gene analysis resulted in a 16S rRNA gene library dominated by a novel genus of δ-Proteobacteria, most closely related to the Bdellovibrio. The potential for the sponge microbiota to produce secondary metabolites was also analysed by polymerase chain reaction amplification of PKS and NRPS genes. While no NRPS sequences were isolated, seven ketosynthase (KS) sequences were obtained from the sponge metagenome. Analyses of these clones revealed a diverse collection of PKS sequences which were most closely affiliated with PKS from members of the Cyanobacteria, Myxobacteria and Dinoflagellata.

12.
Using 16S rRNA gene sequence analyses, we investigated the bacterial diversity of winter bacterioplankton of two eutrophic Siberian reservoirs. These reservoirs show similarity in phytoplankton community composition in spring and autumn but tend to differ in summer in exhibiting cyanobacterial bloom. Forty-eight unique partial 16S rRNA gene sequences retrieved from two libraries were mostly affiliated with the class Actinobacteria, the β subdivision of the class Proteobacteria, and the phylum Cytophaga-Flavobacterium-Bacteroides. The clone library of the pond exhibiting summer cyanobacterial bloom showed more diversity in sequence composition. A significant number of bacterial 16S rRNA gene clones were closely related to freshwater bacteria previously found in different aquatic ecosystems. This finding confirms the assumption that some bacterial clades are globally distributed.

13.
Haloarchaea are the dominant microbial flora in hypersaline waters with near-saturating salt levels. The haloarchaeal diversity of an Australian saltern crystallizer pond was examined by use of a library of PCR-amplified 16S rRNA genes and by cultivation. High viable counts (10^6 CFU/ml) were obtained on solid media. Long incubation times (≥8 weeks) appeared to be more important than the medium composition for maximizing viable counts and diversity. Of 66 isolates examined, all belonged to the family Halobacteriaceae, including members related to species of the genera Haloferax, Halorubrum, and Natronomonas. In addition, isolates belonging to a novel group (the ADL group), previously detected only as 16S rRNA genes in an Antarctic hypersaline lake (Deep Lake), were cultivated for the first time. The 16S rRNA gene library identified the following five main groups: Halorubrum groups 1 and 2 (49%), the SHOW (square haloarchaea of Walsby) group (33%), the ADL group (16%), and the Natronomonas group (2%). There were two significant differences between the organisms detected in cultivation and 16S rRNA sequence results. Firstly, Haloferax spp. were frequently isolated on plates (15% of all isolates) but were not detected in the 16S rRNA sequences. Control experiments indicated that a bias against Haloferax sequences in the generation of the 16S rRNA gene library was unlikely, suggesting that Haloferax spp. readily form colonies, even though they were not a dominant group. Secondly, while the 16S rRNA gene library identified the SHOW group as a major component of the microbial community, no isolates of this group were obtained. This inability to culture members of the SHOW group remains an outstanding problem in studying the ecology of hypersaline environments.

14.

Background

Ribosomal 16S DNA sequences are an essential tool for identifying and classifying microbes. High-throughput DNA sequencing now makes it economically possible to produce very large datasets of 16S rDNA sequences in short time periods, necessitating new computer tools for analyses. Here we describe FastGroup, a Java program designed to dereplicate libraries of 16S rDNA sequences. By dereplication we mean to: 1) compare all the sequences in a data set to each other, 2) group similar sequences together, and 3) output a representative sequence from each group. In this way, duplicate sequences are removed from a library.

Results

FastGroup was tested using a library of single-pass, bacterial 16S rDNA sequences cloned from coral-associated bacteria. We found that the optimal strategy for dereplicating these sequences was to: 1) trim ambiguous bases from the 5' end of the sequences and all sequence 3' of the conserved Bact517 site, 2) match the sequences from the 3' end, and 3) group sequences ≥97% identical to each other.

Conclusions

The FastGroup program simplifies the dereplication of 16S rDNA sequence libraries and prepares the raw sequences for subsequent analyses.

15.
Tuz Lake is an inland thalassohaline water body located in central Anatolia that contributes to 60% of the total salt production in Turkey per year. The microbiota inhabiting this lake has been studied by FISH, denaturing gradient gel electrophoresis of PCR-amplified fragments of 16S rRNA genes, and 16S rRNA gene clone library analysis. Total cell counts per milliliter (1.38 × 10^7) were in the range of the values normally found for hypersaline environments. The proportion of Bacteria to Archaea in the community detectable by FISH was one to three. 16S rRNA gene clone libraries indicated that the archaeal assemblage was dominated by members of the Square Haloarchaea of the Walsby group, although some other groups were also found. Bacteria were dominated by members of the Bacteroidetes, including Salinibacter ruber-related phylotypes. Because members of Bacteroidetes are widely present in different hypersaline environments, a phylogenetic analysis of 16S rRNA gene sequences from Bacteroidetes retrieved from these environments was carried out in order to ascertain whether they formed a unique cluster. Sequences retrieved from Tuz Lake and a group of sequences from other hypersaline environments clustered together in a branch that could be considered as the 'halophilic branch' within the Bacteroidetes phylum.

16.
Molecular diversity of rumen archaeal populations from bovine rumen fluid incubated with or without condensed tannins was investigated using 16S rRNA gene libraries. The predominant order of rumen archaea in the 16S rRNA gene libraries of the control and condensed tannins treatment was found to belong to a novel group of rumen archaea that is distantly related to the order Thermoplasmatales, with 59.5% (15 phylotypes) and 81.43% (21 phylotypes) of the total clones from the control and treatment clone libraries, respectively. The 16S rRNA gene library of the control was found to have higher proportions of methanogens from the orders Methanomicrobiales (32%) and Methanobacteriales (8.5%) as compared to those found in the condensed tannins treatment clone library in both orders (16.88% and 1.68% respectively). The phylotype distributed in the order Methanosarcinales was only found in the control clone library. The study indicated that condensed tannins could alter the diversity of bovine rumen methanogens.

17.
Agricultural activities have produced well-documented changes in the Florida Everglades, including establishment of a gradient in phosphorus concentrations in Water Conservation Area 2A (WCA-2A) of the northern Everglades. An effect of increased phosphorus concentrations is increased methanogenesis in the eutrophic regions compared to the oligotrophic regions of WCA-2A. The goal of this study was to identify relationships between eutrophication and composition and activity of methanogenic assemblages in WCA-2A soils. Distributions of two genes associated with methanogens were characterized in soils taken from WCA-2A: the archaeal 16S rRNA gene and the methyl coenzyme M reductase gene. The richness of methanogen phylotypes was greater in eutrophic than in oligotrophic sites, and sequences related to previously cultivated and uncultivated methanogens were found. A preferential selection for the order Methanomicrobiales was observed in mcrA clone libraries, suggesting primer bias for this group. A greater diversity within the Methanomicrobiales was observed in mcrA clone libraries than in 16S rRNA gene libraries. 16S rRNA phylogenetic analyses revealed a dominance of clones related to Methanosaeta spp., an acetoclastic methanogen dominant in environments with low acetate concentrations. A significant number of clones were related to Methanomicrobiales, an order characterized by species utilizing hydrogen and formate as methanogenic substrates. No representatives of the orders Methanobacteriales and Methanococcales were found in any 16S rRNA clone library, although some Methanobacteriales were found in mcrA libraries. Hydrogenotrophs are the dominant methanogens in WCA-2A, and acetoclastic methanogen genotypes that proliferate in low acetate concentrations outnumber those that typically dominate in higher acetate concentrations.

18.
Reverse complementary DNA sequences (sequences that are inadvertently given backwards with all purines and pyrimidines transposed) can affect sequence analysis detrimentally unless taken into account. We present an open-source, high-throughput software tool, v-revcomp (http://www.cmde.science.ubc.ca/mohn/software.html), to detect and reorient reverse complementary entries of the small-subunit rRNA (16S) gene from sequencing datasets, particularly from environmental sources. The software supports sequence lengths ranging from full length down to the short reads that are characteristic of next-generation sequencing technologies. We evaluated the reliability of v-revcomp by screening all 406 781 16S sequences deposited in release 102 of the curated SILVA database and demonstrated that the tool has a detection accuracy of virtually 100%. We subsequently used v-revcomp to analyse 1 171 646 16S sequences deposited in the International Nucleotide Sequence Databases and found that about 1% of these user-submitted sequences were reverse complementary. In addition, a nontrivial proportion of the entries were otherwise anomalous, including reverse complementary chimeras, sequences associated with wrong taxa, nonribosomal genes, sequences of poor quality or otherwise erroneous sequences without a reasonable match to any other entry in the database. Thus, v-revcomp is highly efficient in detecting and reorienting reverse complementary 16S sequences of almost any length and can be used to detect various sequence anomalies.
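
To make the term concrete, the reorientation step described in this abstract amounts to taking the reverse complement of a sequence. The Python sketch below shows only that operation; it is not v-revcomp's code, the function name is hypothetical, and the harder problem the tool addresses (deciding which entries are reversed in the first place) is not covered here.

```python
# Translation table for standard bases; U (RNA) is mapped to A as well.
# Any other symbol, such as the ambiguity code N, is left unchanged.
_COMPLEMENT = str.maketrans("ACGTUacgtu", "TGCAAtgcaa")


def reverse_complement(sequence: str) -> str:
    """Return the reverse complement of a nucleotide sequence."""
    # Complement every base, then reverse the order of the string.
    return sequence.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    print(reverse_complement("ATGC"))  # prints GCAT
```

Applying the function twice to a DNA sequence returns the original string, which is a convenient sanity check when reorienting entries in bulk.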

19.
A 16S rRNA gene database (http://greengenes.lbl.gov) addresses limitations of public repositories by providing chimera screening, standard alignment, and taxonomic classification using multiple published taxonomies. It was found that there is incongruent taxonomic nomenclature among curators even at the phylum level. Putative chimeras were identified in 3% of environmental sequences and in 0.2% of records derived from isolates. Environmental sequences were classified into 100 phylum-level lineages in the Archaea and Bacteria.

20.
A precise phylogenetic identity of the Defluviicoccus-related glycogen-accumulating organisms (GAO) observed after FISH probing in a novel activated sludge process removing phosphorus was sought with the aim of exploring the phylogenetic diversity of this important group. These organisms, whose sequences were not revealed in previously generated community-wide 16S rRNA gene clone libraries, were identified using flow cytometry cell sorting of FISH-positive cells. Sequencing of a 16S rRNA gene clone library created from this sorted population identified the Defluviicoccus-related GAO as being highly related to previously identified GAO from enhanced biological phosphorus removal systems, despite a marked environmental difference between the two systems.
