A frequent step in metagenomic data analysis comprises the assembly of the sequenced reads. Many assembly tools targeting data from next-generation sequencing (NGS) technologies have been published in recent years, but these assemblers have not been designed for or tested in the multi-genome scenarios that characterize metagenomic studies. Here we provide a critical assessment of current de novo short-read assembly tools in multi-genome scenarios using complex simulated metagenomic data. With this approach we tested the fidelity of different assemblers in metagenomic studies, demonstrating that even under the simplest compositions the number of chimeric contigs involving different species is noticeable. We further showed that the assembly process reduces the accuracy of the functional classification of the metagenomic data and that these errors can be overcome by raising the coverage of the studied metagenome. The results presented here highlight the particular difficulties that de novo genome assemblers face in multi-genome scenarios, demonstrating that these difficulties, which often compromise the functional classification of the analyzed data, can be overcome with a high sequencing effort.

6.

Effects of short read quality and quantity on a de novo vertebrate transcriptome assembly

Garcia TI Shen Y Catchen J Amores A Schartl M Postlethwait J Walter RB 《Comparative biochemistry and physiology. Toxicology & pharmacology : CBP》2012,155(1):95-101

For many researchers, next-generation sequencing data holds the key to answering a category of questions previously unassailable. One of the important and challenging steps in achieving these goals is accurately assembling the massive quantity of short sequencing reads into full nucleic acid sequences. For research groups working with non-model or wild systems, short-read assembly can pose a significant challenge due to the lack of pre-existing EST or genome reference libraries. While many publications describe the overall process of sequencing and assembly, few address the topic of how many and what types of reads are best for assembly. The goal of this project was to use real-world data to explore the effects of read quantity and short-read quality scores on the resulting de novo assemblies. Using several samples of short reads of various sizes and qualities, we produced many assemblies in an automated manner. We observe how the properties of read length, read quality, and read quantity affect the resulting assemblies and provide some general recommendations based on our real-world data set.

7.

Evaluation of de novo transcriptome assemblies from RNA-Seq data

Bo Li Nathanael Fillmore Yongsheng Bai Mike Collins James A Thomson Ron Stewart Colin N Dewey 《Genome biology》2014,15(12)

8.

Comparative study of de novo assembly and genome-guided assembly strategies for transcriptome reconstruction based on RNA-Seq

BingXin Lu ZhenBing Zeng TieLiu Shi 《Science China Life Sciences》2013,56(2):143-155

9.

A pipeline for the de novo assembly of the Themira biloba (Sepsidae: Diptera) transcriptome using a multiple k-mer length approach

Dacotah Melicher Alex S Torson Ian Dworkin Julia H Bowsher 《BMC genomics》2014,15(1)

10.

Letting the data speak for themselves: a fully Bayesian approach to transcriptome assembly

Marcel H Schulz 《Genome biology》2014,15(10)

11.

Characterization of common carp transcriptome: sequencing, de novo assembly, annotation and comparative genomics

Ji P Liu G Xu J Wang X Li J Zhao Z Zhang X Zhang Y Xu P Sun X 《PloS one》2012,7(4):e35152

12.

Evolutionary insights from de novo transcriptome assembly and SNP discovery in California white oaks

Shawn J. Cokus Paul F. Gugger Victoria L. Sork 《BMC genomics》2015,16(1)

13.

Evaluating de Bruijn Graph Assemblers on 454 Transcriptomic Data

Xianwen Ren Tao Liu Jie Dong Lilian Sun Jian Yang Yafang Zhu Qi Jin 《PloS one》2012,7(12)

14.

Drosophila parthenogenesis: a model for de novo centrosome assembly

Riparbelli MG Callaini G 《Developmental biology》2003,260(2):298-313

The Drosophila egg contains all the components required to properly execute the early mitotic divisions but is unable to assemble a functional centrosome without a sperm-provided basal body. We show that 65% of unfertilized eggs obtained from a laboratory strain of Drosophila mercatorum can spontaneously assemble a number of cytoplasmic asters after activation, most of them duplicating in a cell cycle-dependent manner. Such asters are formed by a polarized array of microtubules that have their Asp-associated minus-ends converging at a main focus, where centrioles and typical centrosomal antigens are found. Aster assembly is spatially restricted to the anterior region of the oocyte. When fertilized, the parthenogenetic egg forms the poles of the gonomeric spindle by using the sperm-provided basal body, despite the presence within the same cytoplasm of maternal centrosomes. Thirty-five percent of parthenogenetic eggs and all unfertilized and fertilized eggs from the sibling bisexually reproducing D. mercatorum strain do not contain cytoplasmic asters. Thus, the Drosophila eggs have the potential for de novo formation of functional centrosomes independent of preexisting centrioles, but some control mechanisms preventing their spontaneous assembly must exist. We speculate that the release of the block preventing centrosome self-assembly could be a landmark for ensuring parthenogenetic reproduction.

15.

Efficient de novo assembly of single-cell bacterial genomes from short-read data sets

Chitsaz H Yee-Greenbaum JL Tesler G Lombardo MJ Dupont CL Badger JH Novotny M Rusch DB Fraser LJ Gormley NA Schulz-Trieglaff O Smith GP Evers DJ Pevzner PA Lasken RS 《Nature biotechnology》2011,29(10):915-921

Whole genome amplification by the multiple displacement amplification (MDA) method allows sequencing of DNA from single cells of bacteria that cannot be cultured. Assembling a genome is challenging, however, because MDA generates highly nonuniform coverage of the genome. Here we describe an algorithm tailored for short-read data from single cells that improves assembly through the use of a progressively increasing coverage cutoff. Assembly of reads from single Escherichia coli and Staphylococcus aureus cells captures >91% of genes within contigs, approaching the 95% captured from an assembly based on many E. coli cells. We apply this method to assemble a genome from a single cell of an uncultivated SAR324 clade of Deltaproteobacteria, a cosmopolitan bacterial lineage in the global ocean. Metabolic reconstruction suggests that SAR324 is aerobic, motile and chemotaxic. Our approach enables acquisition of genome assemblies for individual uncultivated bacteria using only short reads, providing cell-specific genetic information absent from metagenomic studies.

16.

A practical comparison of de novo genome assembly software tools for next-generation sequencing technologies

Zhang W Chen J Yang Y Tang Y Shang J Shen B 《PloS one》2011,6(3):e17915

The advent of next-generation sequencing technologies has been accompanied by the development of many whole-genome sequence assembly methods and software tools, especially for de novo fragment assembly. Because little is known about the applicability and performance of these tools, choosing a suitable assembler is a difficult task. Here, we provide information on the adaptability of each program and, above all, compare the performance of eight distinct tools against eight groups of simulated datasets from the Solexa sequencing platform. Considering computational time, maximum random access memory (RAM) occupancy, assembly accuracy and integrity, our study indicates that string-based assemblers and overlap-layout-consensus (OLC) assemblers are well suited for very short reads and for longer reads of small genomes, respectively. For large datasets of more than a hundred million short reads, De Bruijn graph-based assemblers would be more appropriate. In terms of software implementation, string-based assemblers are superior to graph-based ones, among which SOAPdenovo is complicated by the need to create a configuration file. Our comparison study will assist researchers in selecting a well-suited assembler and offers essential information for the improvement of existing assemblers or the development of novel assemblers.

17.

IsoLasso: a LASSO regression approach to RNA-Seq based transcriptome assembly

Li W Feng J Jiang T 《Journal of computational biology》2011,18(11):1693-1707

18.

Analysis of de novo sequencing and transcriptome assembly and lignocellulolytic enzymes gene expression of Coriolopsis gallica HTC

Yuehong Chen Qinghua Cao Xiang Tao Huanhuan Shao Kun Zhang Yizheng Zhang 《Bioscience, biotechnology, and biochemistry》2017,81(3):460-468

19.

Assisted assembly: how to improve a de novo genome assembly by using related species

Sante Gnerre Eric S Lander Kerstin Lindblad-Toh David B Jaffe 《Genome biology》2009,10(8):R88-9

We describe a new assembly algorithm, where a genome assembly with low sequence coverage, either throughout the genome or locally, due to cloning bias, is considerably improved through an assisting process via a related genome. We show that the information provided by aligning the whole-genome shotgun reads of the target against a reference genome can be used to substantially improve the quality of the resulting assembly.

20.

RNA-seq analysis of Rubus idaeus cv. Nova: transcriptome sequencing and de novo assembly for subsequent functional genomics approaches

Tae Kyung Hyun Sarah Lee Dhinesh Kumar Yeonggil Rim Ritesh Kumar Sang Yeol Lee Choong Hwan Lee Jae-Yean Kim 《Plant cell reports》2014,33(10):1617-1628
