首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Taxonomy of Cyanobacteria, the oldest phototrophic prokaryotes, is problematic for many years due to their simple morphology, high variability and adaptability to diverse ecological niches. After introduction of the polyphasic approach, which is based on the combination of several criteria (molecular sequencing, morphological and ecological), the whole classification system of these organisms is subject to reorganization. The aim of this study was to evaluate whether the outer membrane efflux protein (OMEP) sequences can be used as a molecular marker for resolving the phylogeny and taxonomic status of closely related cyanobacteria. We have performed phylogenetic analyses based on the amino acid sequences of the OMEP and the DNA sequences of the 16S rRNA gene from 86 cyanobacterial species/strains with completely sequenced genomes. Phylogenetic trees based on the OMEP showed that most of the cyanobacterial species/strains belonging to different genera are clustered in separate clades supported by high bootstrap values. Comparing the OMEP trees with the 16S rDNA tree clearly showed that the OMEP is more suitable marker in resolving phylogenetic relationships within Cyanobacteria at generic and species level.  相似文献   

2.
3.
16S rRNA gene analysis is the most convenient and robust method for microbiome studies. Inaccurate taxonomic assignment of bacterial strains could have deleterious effects as all downstream analyses rely heavily on the accurate assessment of microbial taxonomy. The use of mock communities to check the reliability of the results has been suggested. However, often the mock communities used in most of the studies represent only a small fraction of taxa and are used mostly as validation of sequencing run to estimate sequencing artifacts. Moreover, a large number of databases and tools available for classification and taxonomic assignment of the 16S rRNA gene make it challenging to select the best-suited method for a particular dataset. In the present study, we used authentic and validly published 16S rRNA gene type strain sequences (full length, V3-V4 region) and analyzed them using a widely used QIIME pipeline along with different parameters of OTU clustering and QIIME compatible databases. Data Analysis Measures (DAM) revealed a high discrepancy in ratifying the taxonomy at different taxonomic hierarchies. Beta diversity analysis showed clear segregation of different DAMs. Limited differences were observed in reference data set analysis using partial (V3-V4) and full-length 16S rRNA gene sequences, which signify the reliability of partial 16S rRNA gene sequences in microbiome studies. Our analysis also highlights common discrepancies observed at various taxonomic levels using various methods and databases.  相似文献   

4.
Discordant phylogenies within the rrn loci of Rhizobia   总被引:9,自引:0,他引:9       下载免费PDF全文
It is evident from complete genome sequencing results that lateral gene transfer and recombination are essential components in the evolutionary process of bacterial genomes. Since this has important implications for bacterial systematics, the primary objective of this study was to compare estimated evolutionary relationships among a representative set of alpha-Proteobacteria by sequencing analysis of three loci within their rrn operons. Tree topologies generated with 16S rRNA gene sequences were significantly different from corresponding trees assembled with 23S rRNA gene and internally transcribed space region sequences. Besides the incongruence in tree topologies, evidence that distinct segments along the 16S rRNA gene sequences of bacteria currently classified within the genera Bradyrhizobium, Mesorhizobium and Sinorhizobium have a reticulate evolutionary history was also obtained. Our data have important implications for bacterial taxonomy, because currently most taxonomic decisions are based on comparative 16S rRNA gene sequence analysis. Since phylogenetic placement based on 16S rRNA gene sequence divergence perhaps is questionable, we suggest that the proposals of bacterial nomenclature or changes in their taxonomy that have been made may not necessarily be warranted. Accordingly, a more conservative approach should be taken in the future, in which taxonomic decisions are based on the analysis of a wider variety of loci and comparative analytical methods are used to estimate phylogenetic relationships among the genomes under consideration.  相似文献   

5.
Several characteristics of the 16S rRNA gene, such as its essential function, ubiquity, and evolutionary properties, have allowed it to become the most commonly used molecular marker in microbial ecology. However, one fact that has been overlooked is that multiple copies of this gene are often present in a given bacterium. These intragenomic copies can differ in sequence, leading to identification of multiple ribotypes for a single organism. To evaluate the impact of such intragenomic heterogeneity on the performance of the 16S rRNA gene as a molecular marker, we compared its phylogenetic and evolutionary characteristics to those of the single-copy gene rpoB. Full-length gene sequences and gene fragments commonly used for denaturing gradient gel electrophoresis were compared at various taxonomic levels. Heterogeneity found between intragenomic 16S rRNA gene copies was concentrated in specific regions of rRNA secondary structure. Such "heterogeneity hot spots" occurred within all gene fragments commonly used in molecular microbial ecology. This intragenomic heterogeneity influenced 16S rRNA gene tree topology, phylogenetic resolution, and operational taxonomic unit estimates at the species level or below. rpoB provided comparable phylogenetic resolution to that of the 16S rRNA gene at all taxonomic levels, except between closely related organisms (species and subspecies levels), for which it provided better resolution. This is particularly relevant in the context of a growing number of studies focusing on subspecies diversity, in which single-copy protein-encoding genes such as rpoB could complement the information provided by the 16S rRNA gene.  相似文献   

6.
Several characteristics of the 16S rRNA gene, such as its essential function, ubiquity, and evolutionary properties, have allowed it to become the most commonly used molecular marker in microbial ecology. However, one fact that has been overlooked is that multiple copies of this gene are often present in a given bacterium. These intragenomic copies can differ in sequence, leading to identification of multiple ribotypes for a single organism. To evaluate the impact of such intragenomic heterogeneity on the performance of the 16S rRNA gene as a molecular marker, we compared its phylogenetic and evolutionary characteristics to those of the single-copy gene rpoB. Full-length gene sequences and gene fragments commonly used for denaturing gradient gel electrophoresis were compared at various taxonomic levels. Heterogeneity found between intragenomic 16S rRNA gene copies was concentrated in specific regions of rRNA secondary structure. Such “heterogeneity hot spots” occurred within all gene fragments commonly used in molecular microbial ecology. This intragenomic heterogeneity influenced 16S rRNA gene tree topology, phylogenetic resolution, and operational taxonomic unit estimates at the species level or below. rpoB provided comparable phylogenetic resolution to that of the 16S rRNA gene at all taxonomic levels, except between closely related organisms (species and subspecies levels), for which it provided better resolution. This is particularly relevant in the context of a growing number of studies focusing on subspecies diversity, in which single-copy protein-encoding genes such as rpoB could complement the information provided by the 16S rRNA gene.  相似文献   

7.
Culture-independent DNA fingerprints are commonly used to assess the diversity of a microbial community. However, relating species composition to community profiles produced by community fingerprint methods is not straightforward. Terminal restriction fragment length polymorphism (T-RFLP) is a community fingerprint method in which phylogenetic assignments may be inferred from the terminal restriction fragment (T-RF) sizes through the use of web-based resources that predict T-RF sizes for known bacteria. The process quickly becomes computationally intensive due to the need to analyze profiles produced by multiple restriction digests and the complexity of profiles generated by natural microbial communities. A web-based tool is described here that rapidly generates phylogenetic assignments from submitted community T-RFLP profiles based on a database of fragments produced by known 16S rRNA gene sequences. Users have the option of submitting a customized database generated from unpublished sequences or from a gene other than the 16S rRNA gene. This phylogenetic assignment tool allows users to employ T-RFLP to simultaneously analyze microbial community diversity and species composition. An analysis of the variability of bacterial species composition throughout the water column in a humic lake was carried out to demonstrate the functionality of the phylogenetic assignment tool. This method was validated by comparing the results generated by this program with results from a 16S rRNA gene clone library.  相似文献   

8.
Culture-independent DNA fingerprints are commonly used to assess the diversity of a microbial community. However, relating species composition to community profiles produced by community fingerprint methods is not straightforward. Terminal restriction fragment length polymorphism (T-RFLP) is a community fingerprint method in which phylogenetic assignments may be inferred from the terminal restriction fragment (T-RF) sizes through the use of web-based resources that predict T-RF sizes for known bacteria. The process quickly becomes computationally intensive due to the need to analyze profiles produced by multiple restriction digests and the complexity of profiles generated by natural microbial communities. A web-based tool is described here that rapidly generates phylogenetic assignments from submitted community T-RFLP profiles based on a database of fragments produced by known 16S rRNA gene sequences. Users have the option of submitting a customized database generated from unpublished sequences or from a gene other than the 16S rRNA gene. This phylogenetic assignment tool allows users to employ T-RFLP to simultaneously analyze microbial community diversity and species composition. An analysis of the variability of bacterial species composition throughout the water column in a humic lake was carried out to demonstrate the functionality of the phylogenetic assignment tool. This method was validated by comparing the results generated by this program with results from a 16S rRNA gene clone library.  相似文献   

9.

Background

An important task in a metagenomic analysis is the assignment of taxonomic labels to sequences in a sample. Most widely used methods for taxonomy assignment compare a sequence in the sample to a database of known sequences. Many approaches use the best BLAST hit(s) to assign the taxonomic label. However, it is known that the best BLAST hit may not always correspond to the best taxonomic match. An alternative approach involves phylogenetic methods, which take into account alignments and a model of evolution in order to more accurately define the taxonomic origin of sequences. Similarity-search based methods typically run faster than phylogenetic methods and work well when the organisms in the sample are well represented in the database. In contrast, phylogenetic methods have the capability to identify new organisms in a sample but are computationally quite expensive.

Results

We propose a two-step approach for metagenomic taxon identification; i.e., use a rapid method that accurately classifies sequences using a reference database (this is a filtering step) and then use a more complex phylogenetic method for the sequences that were unclassified in the previous step. In this work, we explore whether and when using top BLAST hit(s) yields a correct taxonomic label. We develop a method to detect outliers among BLAST hits in order to separate the phylogenetically most closely related matches from matches to sequences from more distantly related organisms. We used modified BILD (Bayesian Integral Log-Odds) scores, a multiple-alignment scoring function, to define the outliers within a subset of top BLAST hits and assign taxonomic labels. We compared the accuracy of our method to the RDP classifier and show that our method yields fewer misclassifications while properly classifying organisms that are not present in the database. Finally, we evaluated the use of our method as a pre-processing step before more expensive phylogenetic analyses (in our case TIPP) in the context of real 16S rRNA datasets.

Conclusion

Our experiments make a good case for using a two-step approach for accurate taxonomic assignment. We show that our method can be used as a filtering step before using phylogenetic methods and provides a way to interpret BLAST results using more information than provided by E-values and bit-scores alone.
  相似文献   

10.
For the first time, the cyanobacterial diversity from microbial mats in lakes of Eastern Antarctica was investigated using microscopic and molecular approaches. The present study assessed the biogeographical distribution of cyanobacteria in Antarctica. Five samples were taken from four lakes spanning a range of different ecological environments in Larsemann Hills, Vestfold Hills and Rauer Islands to evaluate the influence of lake characteristics on the cyanobacterial diversity. Seventeen morphospecies and 28 16S rRNA gene-based operational taxonomic units belonging to the Oscillatoriales, Nostocales and Chroococcales were identified. The internal transcribed spacer was evaluated to complement the 16S rRNA gene data and showed similar but more clear-cut tendencies. The molecular approach suggested that potential Antarctic endemic species, including a previously undiscovered diversity, are more abundant than has been estimated by morphological methods. Moreover, operational taxonomic units, also found outside Antarctica, were more widespread over the continent than potential endemics. The cyanobacterial diversity of the most saline lakes was found to differ from the others, and correlations between the sampling depth and the cyanobacterial communities can also be drawn. Comparison with database sequences illustrated the ubiquity of several cyanobacterial operational taxonomic units and their remarkable range of tolerance to harsh environmental conditions.  相似文献   

11.
Cyanobacteria are important primary producers, and many are able to fix atmospheric nitrogen playing a key role in the marine environment. However, not much is known about the diversity of cyanobacteria in Portuguese marine waters. This paper describes the diversity of 60 strains isolated from benthic habitats in 9 sites (intertidal zones) on the Portuguese South and West coasts. The strains were characterized by a morphological study (light and electron microscopy) and by a molecular characterization (partial 16S rRNA, nifH, nifK, mcyA, mcyE/ndaF, sxtI genes). The morphological analyses revealed 35 morphotypes (15 genera and 16 species) belonging to 4 cyanobacterial Orders/Subsections. The dominant groups among the isolates were the Oscillatoriales. There is a broad congruence between morphological and molecular assignments. The 16S rRNA gene sequences of 9 strains have less than 97% similarity compared to the sequences in the databases, revealing novel cyanobacterial diversity. Phylogenetic analysis, based on partial 16S rRNA gene sequences showed at least 12 clusters. One-third of the isolates are potential N(2)-fixers, as they exhibit heterocysts or the presence of nif genes was demonstrated by PCR. Additionally, no conventional freshwater toxins genes were detected by PCR screening.  相似文献   

12.
Next-generation sequencing technologies have led to recognition of a so-called ‘rare biosphere''. These microbial operational taxonomic units (OTUs) are defined by low relative abundance and may be specifically adapted to maintaining low population sizes. We hypothesized that mining of low-abundance next-generation 16S ribosomal RNA (rRNA) gene data would lead to the discovery of novel phylogenetic diversity, reflecting microorganisms not yet discovered by previous sampling efforts. Here, we test this hypothesis by combining molecular and bioinformatic approaches for targeted retrieval of phylogenetic novelty within rare biosphere OTUs. We combined BLASTN network analysis, phylogenetics and targeted primer design to amplify 16S rRNA gene sequences from unique potential bacterial lineages, comprising part of the rare biosphere from a multi-million sequence data set from an Arctic tundra soil sample. Demonstrating the feasibility of the protocol developed here, three of seven recovered phylogenetic lineages represented extremely divergent taxonomic entities. These divergent target sequences correspond to (a) a previously unknown lineage within the BRC1 candidate phylum, (b) a sister group to the early diverging and currently recognized monospecific Cyanobacteria Gloeobacter, a genus containing multiple plesiomorphic traits and (c) a highly divergent lineage phylogenetically resolved within mitochondria. A comparison to twelve next-generation data sets from additional soils suggested persistent low-abundance distributions of these novel 16S rRNA genes. The results demonstrate this sequence analysis and retrieval pipeline as applicable for exploring underrepresented phylogenetic novelty and recovering taxa that may represent significant steps in bacterial evolution.  相似文献   

13.
【背景】对于环境样品中氨氧化古菌(Ammonia-oxidizing archaea,AOA)多样性的研究,利用amoA功能基因作为分子标记会比16SrRNA基因有更强的特异性和更高的分辨率,能更准确地反映环境样品中氨氧化古菌的种群结构和分布特征。然而,目前对amoA基因扩增子高通量测序的分析存在两大限制因素:一是缺乏相应的amoA基因参考数据库;二是AOA amoA基因在种水平上的相似性阈值未知,分析过程中没有明确的划分种水平操作分类单元(Operational taxonomic unit,OTU)的阈值。【目的】构建基于amoA功能基因序列分析氨氧化古菌多样性的方法,为基于高通量测序的功能微生物多样性分析提供参考。【方法】基于目前已通过分离纯化或富集培养获得的34株氨氧化古菌及功能基因数据库中收录的环境样品amoA基因序列,构建氨氧化古菌amoA基因参考数据库。通过菌株间两两比对获得的amoA基因相似度与16SrRNA基因相似度的相关性分析,确定amoA基因在种水平上的相似性阈值。基于MOTHUR软件平台,利用建立的参考数据库和确定的阈值对南海一个垂直水体剖面样品的amoA基因序列进行多样性分析。【结果】构建了含有26 091条序列信息的古菌amoA基因参考数据库,确定了89%作为分析过程中古菌amoA基因划分种水平OTU的阈值,对南海水体样品氨氧化古菌的多样性分析结果很好地显示了南海不同深度水层水体中氨氧化古菌的种群结构和系统发育关系,有效揭示了南海氨氧化古菌的垂直分布差异。【结论】建立了基于amoA基因高通量测序的氨氧化古菌多样性分析方法,此方法可以有效分析环境样品中氨氧化古菌的多样性。  相似文献   

14.
Reference phylogenies are crucial for providing a taxonomic framework for interpretation of marker gene and metagenomic surveys, which continue to reveal novel species at a remarkable rate. Greengenes is a dedicated full-length 16S rRNA gene database that provides users with a curated taxonomy based on de novo tree inference. We developed a ‘taxonomy to tree'' approach for transferring group names from an existing taxonomy to a tree topology, and used it to apply the Greengenes, National Center for Biotechnology Information (NCBI) and cyanoDB (Cyanobacteria only) taxonomies to a de novo tree comprising 408 315 sequences. We also incorporated explicit rank information provided by the NCBI taxonomy to group names (by prefixing rank designations) for better user orientation and classification consistency. The resulting merged taxonomy improved the classification of 75% of the sequences by one or more ranks relative to the original NCBI taxonomy with the most pronounced improvements occurring in under-classified environmental sequences. We also assessed candidate phyla (divisions) currently defined by NCBI and present recommendations for consolidation of 34 redundantly named groups. All intermediate results from the pipeline, which includes tree inference, jackknifing and transfer of a donor taxonomy to a recipient tree (tax2tree) are available for download. The improved Greengenes taxonomy should provide important infrastructure for a wide range of megasequencing projects studying ecosystems on scales ranging from our own bodies (the Human Microbiome Project) to the entire planet (the Earth Microbiome Project). The implementation of the software can be obtained from http://sourceforge.net/projects/tax2tree/.  相似文献   

15.
Kang YJ  Cheng J  Mei LJ  Hu J  Piao Z  Yin SX 《Mikrobiologiia》2010,79(5):664-671
The use of 16S rRNA gene has been a "golden" method to determine the diversity of microbial communities in environmental samples, phylogenetic relationships of prokaryotes and taxonomic position of newly isolated organisms. However due to the presence of multiple heterogeneous 16S rRNA gene copies in many strains, the interpretation of microbial ecology via 16S rRNA sequences is complicated. Purpose of present paper is to demonstrate the extent to which the multiple heterogeneous 16S rRNA gene copies affect RFLP patterns and DGG E profiles by using the genome database. In present genome database, there are 782 bacterial strains in total whose genomes have been completely sequenced and annotated. Among the total strains, 639 strains (82%) possess multiple 16S rRNA gene copies, 415 strains (53%) whose multiple copies are heterogeneous in sequences as revealed by alignment, 236 strains (30%) whose multiple copies show different restrict patterns by CSP61 + Hinfl, MspI + Rsal or HhaI as analyzed in silico. Polymorphisms of the multiple copies in certain strains were further characterized by G + C% and phylogentic distances based on the sequences of V3 region, which are linked to DGGE patters. Polymorphisms of a few strains were shown as examples. Using artificial communities, it is demonstrated that the presence of multiple heterogeneous 16S rRNA gene copies potentially leads to over-estimation of the diversity of a community. It is suggested that care must be taken when interpreting 16S rRNA-based RFLP and DGGE data and profiling an environmental community.  相似文献   

16.
为了更好地了解拟诺卡氏菌属(Nocardiopsis)各物种间的系统发育关系,该属现有有效描述种的gyrB,sodrpoB基因的部分序列被测定,结合16S rRNA基因,对拟诺卡氏菌属进行了系统发育重建。研究发现拟诺卡氏菌属gyrB,sodrpoB基因的平均相似性分别为87.7%、87.3%和94.1%,而16S rRNA基因的平均相似性则达到96.65%,3个看家基因均比16S rRNA具有更高的分歧度。比较基于不同基因的系统树发现,由gyrB基因得到的系统树拓扑结构与16S rRNA得到的结构在亚群上基本一致。因此,gyrB基因在拟诺卡氏菌属的系统分类上比16S rRNA基因更具优越性。  相似文献   

17.
It is generally accepted that the plastids arose from a cyanobacterial ancestor, but the exact phylogenetic relationships between cyanobacteria and plastids are still controversial. Most studies based on partial 16S rRNA sequences suggested a relatively late origin of plastids within the cyanobacterial divergence. In order to clarify the exact relationship and divergence order of cyanobacteria and plastids, we studied their phylogeny on the basis of nearly complete 16S rRNA gene sequences. The data set comprised 15 strains of cyanobacteria from different morphological groups, 1 prochlorophyte, and plastids belonging to 8 species of plants and 12 species of diverse algae. This set included three cyanobacterial sequences determined in this study. This is the most comprehensive set of complete cyanobacterial and plastidial 16S rRNA sequences used so far. Phylogenetic trees were constructed using neighbor joining and maximum parsimony, and the reliability of the tree topologies was tested by different methods. Our results suggest an early origin of plastids within the cyanobacterial divergence, preceded only by the divergence of two cyanobacterial genera, Gloeobacter and Pseudanabaena.   相似文献   

18.
The recent introduction of massively parallel pyrosequencers allows rapid, inexpensive analysis of microbial community composition using 16S ribosomal RNA (rRNA) sequences. However, a major challenge is to design a workflow so that taxonomic information can be accurately and rapidly assigned to each read, so that the composition of each community can be linked back to likely ecological roles played by members of each species, genus, family or phylum. Here, we use three large 16S rRNA datasets to test whether taxonomic information based on the full-length sequences can be recaptured by short reads that simulate the pyrosequencer outputs. We find that different taxonomic assignment methods vary radically in their ability to recapture the taxonomic information in full-length 16S rRNA sequences: most methods are sensitive to the region of the 16S rRNA gene that is targeted for sequencing, but many combinations of methods and rRNA regions produce consistent and accurate results. To process large datasets of partial 16S rRNA sequences obtained from surveys of various microbial communities, including those from human body habitats, we recommend the use of Greengenes or RDP classifier with fragments of at least 250 bases, starting from one of the primers R357, R534, R798, F343 or F517.  相似文献   

19.
Li  Renhui  Carmichael  Wayne W.  Liu  Yongding  Watanabe  Makoto M. 《Hydrobiologia》2000,438(1-3):99-105
The taxonomy of Aphanizomenon flos-aquae strain NH-5, a producer of cyanotoxins, was re-evaluated by comparison with six other Aphanizomenon strains using morphological characteristics and 16S rRNA gene sequences. Strain NH-5 was concluded to be improperly identified as Aph. flos-aquae based upon (1) lack of bundle formation in the trichomes, (2) location of akinetes next to heterocytes, (3) lower similarities (less than 97.5%) in the 16S rRNA gene sequences relative to Aph. flos-aquae strains, and (4) comparison within a phylogenetic tree constructed from 16S rRNA gene sequences. The Aphanizomenon strains investigated in this study are classified to four morphological groups as described by the classical taxonomy of Komárek & Kovácik (1989). This classification was supported from the phylogenetic results of 16S rRNA gene sequences. This study also discusses the generic boundaries between Aphanizomenon and Anabaena.  相似文献   

20.

Background

In environmental sequencing studies, fungi can be identified based on nucleic acid sequences, using either highly variable sequences as species barcodes or conserved sequences containing a high-quality phylogenetic signal. For the latter, identification relies on phylogenetic analyses and the adoption of the phylogenetic species concept.Such analysis requires that the reference sequences are well identified and deposited in public-access databases. However, many entries in the public sequence databases are problematic in terms of quality and reliability and these data require screening to ensure correct phylogenetic interpretation.

Methods and Principal Findings

To facilitate phylogenetic inferences and phylogenetic assignment, we introduce a fungal sequence database. The database PHYMYCO-DB comprises fungal sequences from GenBank that have been filtered to satisfy stringent sequence quality criteria. For the first release, two widely used molecular taxonomic markers were chosen: the nuclear SSU rRNA and EF1-α gene sequences. Following the automatic extraction and filtration, a manual curation is performed to remove problematic sequences while preserving relevant sequences useful for phylogenetic studies. As a result of curation, ∼20% of the automatically filtered sequences have been removed from the database. To demonstrate how PHYMYCO-DB can be employed, we test a set of environmental Chytridiomycota sequences obtained from deep sea samples.

Conclusion

PHYMYCO-DB offers the tools necessary to: (i) extract high quality fungal sequences for each of the 5 fungal phyla, at all taxonomic levels, (ii) extract already performed alignments, to act as ‘reference alignments’, (iii) launch alignments of personal sequences along with stored data. A total of 9120 SSU rRNA and 672 EF1-α high-quality fungal sequences are now available.The PHYMYCO-DB is accessible through the URL http://phymycodb.genouest.org/.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号