期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Evaluation of genomic high-throughput sequencing data generated on Illumina HiSeq and Genome Analyzer systems

Minoche AE Dohm JC Himmelbauer H 《Genome biology》2011,12(11):R112-15

Background

The generation and analysis of high-throughput sequencing data are becoming a major component of many studies in molecular biology and medical research. Illumina's Genome Analyzer (GA) and HiSeq instruments are currently the most widely used sequencing devices. Here, we comprehensively evaluate properties of genomic HiSeq and GAIIx data derived from two plant genomes and one virus, with read lengths of 95 to 150 bases.

Results

We provide quantifications and evidence for GC bias, error rates, error sequence context, effects of quality filtering, and the reliability of quality values. By combining different filtering criteria we reduced error rates 7-fold at the expense of discarding 12.5% of alignable bases. While overall error rates are low in HiSeq data we observed regions of accumulated wrong base calls. Only 3% of all error positions accounted for 24.7% of all substitution errors. Analyzing the forward and reverse strands separately revealed error rates of up to 18.7%. Insertions and deletions occurred at very low rates on average but increased to up to 2% in homopolymers. A positive correlation between read coverage and GC content was found depending on the GC content range.

Conclusions

The errors and biases we report have implications for the use and the interpretation of Illumina sequencing data. GAIIx and HiSeq data sets show slightly different error profiles. Quality filtering is essential to minimize downstream analysis artifacts. Supporting previous recommendations, the strand-specificity provides a criterion to distinguish sequencing errors from low abundance polymorphisms. 相似文献

2.

Multiplex sequencing of bacterial artificial chromosomes for assembling complex plant genomes

下载免费PDF全文

Sebastian Beier Axel Himmelbach Thomas Schmutzer Marius Felder Stefan Taudien Klaus F. X. Mayer Matthias Platzer Nils Stein Uwe Scholz Martin Mascher 《Plant biotechnology journal》2016,14(7):1511-1522

Hierarchical shotgun sequencing remains the method of choice for assembling high‐quality reference sequences of complex plant genomes. The efficient exploitation of current high‐throughput technologies and powerful computational facilities for large‐insert clone sequencing necessitates the sequencing and assembly of a large number of clones in parallel. We developed a multiplexed pipeline for shotgun sequencing and assembling individual bacterial artificial chromosomes (BACs) using the Illumina sequencing platform. We illustrate our approach by sequencing 668 barley BACs (Hordeum vulgare L.) in a single Illumina HiSeq 2000 lane. Using a newly designed parallelized computational pipeline, we obtained sequence assemblies of individual BACs that consist, on average, of eight sequence scaffolds and represent >98% of the genomic inserts. Our BAC assemblies are clearly superior to a whole‐genome shotgun assembly regarding contiguity, completeness and the representation of the gene space. Our methods may be employed to rapidly obtain high‐quality assemblies of a large number of clones to assemble map‐based reference sequences of plant and animal species with complex genomes by sequencing along a minimum tiling path. 相似文献

3.

Elucidating and mining the Tulipa and Lilium transcriptomes

Natalia M. Moreno-Pachon Hendrika A. C. F. Leeggangers Harm Nijveen Edouard Severing Henk Hilhorst Richard G. H. Immink 《Plant molecular biology》2016,90(3):249-265

相似文献

4.

Comparative transcriptomics uncovers differences in photoautotrophic versus photoheterotrophic modes of nutrition in relation to secondary metabolites biosynthesis in <Emphasis Type="Italic">Swertia chirayita</Emphasis>

Tarun Pal Jibesh Kumar Padhan Pawan Kumar Hemant Sood Rajinder S. Chauhan 《Molecular biology reports》2018,45(2):77-98

相似文献

5.

Towards standardization of RNA quality assessment using user-independent classifiers of microcapillary electrophoresis traces

Imbeaud S Graudens E Boulanger V Barlet X Zaborski P Eveno E Mueller O Schroeder A Auffray C 《Nucleic acids research》2005,33(6):e56

相似文献

6.

A low-cost library construction protocol and data analysis pipeline for Illumina-based strand-specific multiplex RNA-seq 总被引：1，自引：0，他引：1

Wang L Si Y Dedow LK Shao Y Liu P Brutnell TP 《PloS one》2011,6(10):e26426

相似文献

7.

SNP Discovery in the Transcriptome of White Pacific Shrimp Litopenaeus vannamei by Next Generation Sequencing

Yang Yu Jiankai Wei Xiaojun Zhang Jingwen Liu Chengzhang Liu Fuhua Li Jianhai Xiang 《PloS one》2014,9(1)

相似文献

8.

Assessment of microRNA differential expression and detection in multiplexed small RNA sequencing data

Joshua D. Campbell Gang Liu Lingqi Luo Ji Xiao Joseph Gerrein Brenda Juan-Guardela John Tedrow Yuriy O. Alekseyev Ivana V. Yang Mick Correll Mark Geraci John Quackenbush Frank Sciurba David A. Schwartz Naftali Kaminski W. Evan Johnson Stefano Monti Avrum Spira Jennifer Beane Marc E. Lenburg 《RNA (New York, N.Y.)》2015,21(2):164-171

相似文献

9.

De novo transcriptome characterization of Vitis vinifera cv. Corvina unveils varietal diversity

Luca Venturini Alberto Ferrarini Sara Zenoni Giovanni Battista Tornielli Marianna Fasoli Silvia Dal Santo Andrea Minio Genny Buson Paola Tononi Elisa Debora Zago Gianpiero Zamperin Diana Bellin Mario Pezzotti Massimo Delledonne 《BMC genomics》2013,14(1):1-13

相似文献

10.

Evaluation of genomic high-throughput sequencing data generated on Illumina HiSeq and Genome Analyzer systems

André E Minoche Juliane C Dohm Heinz Himmelbauer 《Genome biology》2012,12(11):R112

Background

The generation and analysis of high-throughput sequencing data are becoming a major component of many studies in molecular biology and medical research. Illumina's Genome Analyzer (GA) and HiSeq instruments are currently the most widely used sequencing devices. Here, we comprehensively evaluate properties of genomic HiSeq and GAIIx data derived from two plant genomes and one virus, with read lengths of 95 to 150 bases. 相似文献

11.

A Modified RNA-Seq Approach for Whole Genome Sequencing of RNA Viruses from Faecal and Blood Samples

Elizabeth M. Batty T. H. Nicholas Wong Amy Trebes Karène Argoud Moustafa Attar David Buck Camilla L. C. Ip Tanya Golubchik Madeleine Cule Rory Bowden Charis Manganis Paul Klenerman Eleanor Barnes A. Sarah Walker David H. Wyllie Daniel J. Wilson Kate E. Dingle Tim E. A. Peto Derrick W. Crook Paolo Piazza 《PloS one》2013,8(6)

To date, very large scale sequencing of many clinically important RNA viruses has been complicated by their high population molecular variation, which creates challenges for polymerase chain reaction and sequencing primer design. Many RNA viruses are also difficult or currently not possible to culture, severely limiting the amount and purity of available starting material. Here, we describe a simple, novel, high-throughput approach to Norovirus and Hepatitis C virus whole genome sequence determination based on RNA shotgun sequencing (also known as RNA-Seq). We demonstrate the effectiveness of this method by sequencing three Norovirus samples from faeces and two Hepatitis C virus samples from blood, on an Illumina MiSeq benchtop sequencer. More than 97% of reference genomes were recovered. Compared with Sanger sequencing, our method had no nucleotide differences in 14,019 nucleotides (nt) for Noroviruses (from a total of 2 Norovirus genomes obtained with Sanger sequencing), and 8 variants in 9,542 nt for Hepatitis C virus (1 variant per 1,193 nt). The three Norovirus samples had 2, 3, and 2 distinct positions called as heterozygous, while the two Hepatitis C virus samples had 117 and 131 positions called as heterozygous. To confirm that our sample and library preparation could be scaled to true high-throughput, we prepared and sequenced an additional 77 Norovirus samples in a single batch on an Illumina HiSeq 2000 sequencer, recovering >90% of the reference genome in all but one sample. No discrepancies were observed across 118,757 nt compared between Sanger and our custom RNA-Seq method in 16 samples. By generating viral genomic sequences that are not biased by primer-specific amplification or enrichment, this method offers the prospect of large-scale, affordable studies of RNA viruses which could be adapted to routine diagnostic laboratory workflows in the near future, with the potential to directly characterize within-host viral diversity. 相似文献

12.

Prenatal detection of aneuploidy and imbalanced chromosomal arrangements by massively parallel sequencing

Dan S Chen F Choy KW Jiang F Lin J Xuan Z Wang W Chen S Li X Jiang H Leung TY Lau TK Su Y Zhang W Zhang X 《PloS one》2012,7(2):e27835

Fetal chromosomal abnormalities are the most common reasons for invasive prenatal testing. Currently, G-band karyotyping and several molecular genetic methods have been established for diagnosis of chromosomal abnormalities. Although these testing methods are highly reliable, the major limitation remains restricted resolutions or can only achieve limited coverage on the human genome at one time. The massively parallel sequencing (MPS) technologies which can reach single base pair resolution allows detection of genome-wide intragenic deletions and duplication challenging karyotyping and microarrays as the tool for prenatal diagnosis. Here we reported a novel and robust MPS-based method to detect aneuploidy and imbalanced chromosomal arrangements in amniotic fluid (AF) samples. We sequenced 62 AF samples on Illumina GAIIx platform and with averagely 0.01× whole genome sequencing data we detected 13 samples with numerical chromosomal abnormalities by z-test. With up to 2× whole genome sequencing data we were able to detect microdeletion/microduplication (ranged from 1.4 Mb to 37.3 Mb of 5 samples from chorionic villus sampling (CVS) using SeqSeq algorithm. Our work demonstrated MPS is a robust and accurate approach to detect aneuploidy and imbalanced chromosomal arrangements in prenatal samples. 相似文献

13.

Inexpensive Multiplexed Library Preparation for Megabase-Sized Genomes

Michael Baym Sergey Kryazhimskiy Tami D. Lieberman Hattie Chung Michael M. Desai Roy Kishony 《PloS one》2015,10(5)

Whole-genome sequencing has become an indispensible tool of modern biology. However, the cost of sample preparation relative to the cost of sequencing remains high, especially for small genomes where the former is dominant. Here we present a protocol for rapid and inexpensive preparation of hundreds of multiplexed genomic libraries for Illumina sequencing. By carrying out the Nextera tagmentation reaction in small volumes, replacing costly reagents with cheaper equivalents, and omitting unnecessary steps, we achieve a cost of library preparation of $8 per sample, approximately 6 times cheaper than the standard Nextera XT protocol. Furthermore, our procedure takes less than 5 hours for 96 samples. Several hundred samples can then be pooled on the same HiSeq lane via custom barcodes. Our method will be useful for re-sequencing of microbial or viral genomes, including those from evolution experiments, genetic screens, and environmental samples, as well as for other sequencing applications including large amplicon, open chromosome, artificial chromosomes, and RNA sequencing. 相似文献

14.

The Fast Changing Landscape of Sequencing Technologies and Their Impact on Microbial Genome Assemblies and Annotation 总被引：1，自引：0，他引：1

Konstantinos Mavromatis Miriam L. Land Thomas S. Brettin Daniel J. Quest Alex Copeland Alicia Clum Lynne Goodwin Tanja Woyke Alla Lapidus Hans Peter Klenk Robert W. Cottingham Nikos C. Kyrpides 《PloS one》2012,7(12)

Background

The emergence of next generation sequencing (NGS) has provided the means for rapid and high throughput sequencing and data generation at low cost, while concomitantly creating a new set of challenges. The number of available assembled microbial genomes continues to grow rapidly and their quality reflects the quality of the sequencing technology used, but also of the analysis software employed for assembly and annotation.

Methodology/Principal Findings

In this work, we have explored the quality of the microbial draft genomes across various sequencing technologies. We have compared the draft and finished assemblies of 133 microbial genomes sequenced at the Department of Energy-Joint Genome Institute and finished at the Los Alamos National Laboratory using a variety of combinations of sequencing technologies, reflecting the transition of the institute from Sanger-based sequencing platforms to NGS platforms. The quality of the public assemblies and of the associated gene annotations was evaluated using various metrics. Results obtained with the different sequencing technologies, as well as their effects on downstream processes, were analyzed. Our results demonstrate that the Illumina HiSeq 2000 sequencing system, the primary sequencing technology currently used for de novo genome sequencing and assembly at JGI, has various advantages in terms of total sequence throughput and cost, but it also introduces challenges for the downstream analyses. In all cases assembly results although on average are of high quality, need to be viewed critically and consider sources of errors in them prior to analysis.

Conclusion

These data follow the evolution of microbial sequencing and downstream processing at the JGI from draft genome sequences with large gaps corresponding to missing genes of significant biological role to assemblies with multiple small gaps (Illumina) and finally to assemblies that generate almost complete genomes (Illumina+PacBio). 相似文献

15.

Genotyping‐in‐Thousands by sequencing (GT‐seq): A cost effective SNP genotyping method based on custom amplicon sequencing

下载免费PDF全文

Nathan R. Campbell Stephanie A. Harmon Shawn R. Narum 《Molecular ecology resources》2015,15(4):855-867

相似文献

16.

A simple strand-specific RNA-Seq library preparation protocol combining the Illumina TruSeq RNA and the dUTP methods

Sultan M Dökel S Amstislavskiy V Wuttig D Sültmann H Lehrach H Yaspo ML 《Biochemical and biophysical research communications》2012,422(4):643-646

相似文献

17.

七星瓢虫滞育关联基因的转录组学分析

下载免费PDF全文

齐晓阳任小云安涛陈红印黄建张礼生《环境昆虫学报》2016,(2):238-248

对七星瓢虫正常发育、滞育以及滞育解除的雌成虫进行RNA测序,并对筛选出来的滞育相关基因进行KEGG通路富集分析,从分子水平解析七星瓢虫滞育发机理。本研究以正常发育产卵、滞育30 d以及滞育贮存30 d后解除产卵的七星瓢虫雌成虫为研究对象,分别抽提RNA,合成c DNA,构建c DNA文库,文库检测合格后在Illummina Hiseq 2500测序仪上进行双向测序。根据测序结果,共获取unigene 82820个。采用两两比较法对正常发育组和滞育组、滞育组和滞育解除组进行差异表达分析,分别获得差异表达基因3501个和1427个。深入分析两组比对结果,将在滞育组上调且滞育解除组下调的unigene定义为滞育关联基因,共有443个基因为滞育关联基因。应用KEGG KAAS在线pathway比对分析工具对滞育关联基因进行通路富集分析,结果发现这些基因主要集中在碳水化合物代谢、脂质代谢以及信号转导等途径中。相似文献

18.

Intra-genomic rRNA gene variability of Nassellaria and Spumellaria (Rhizaria,Radiolaria) assessed by Sanger,MinION and Illumina sequencing

Miguel M. Sandin Sarah Romac Fabrice Not 《Environmental microbiology》2022,24(7):2979-2993

Ribosomal RNA (rRNA) genes are known to be valuable markers for the barcoding of eukaryotic life and its phylogenetic classification at various taxonomic levels. The large-scale exploration of environmental microbial diversity through metabarcoding approaches has been focused mainly on the V4 and V9 regions of the 18S rRNA gene. The accurate interpretation of such environmental surveys is hampered by technical (e.g. PCR and sequencing errors) and biological biases (e.g. intra-genomic variability). Here we explored the intra-genomic diversity of Nassellaria and Spumellaria specimens (Radiolaria) by comparing Sanger sequencing with Illumina and Oxford Nanopore Technologies (MinION). Our analysis determined that intra-genomic variability of Nassellaria and Spumellaria is generally low, yet some Spumellaria specimens showed two different copies of the V4 with <97% similarity. Of the different sequencing methods, Illumina showed the highest number of contaminations (i.e. environmental DNA, cross-contamination, tag-jumping), revealed by its high sequencing depth; and MinION showed the highest sequencing rate error (~14%). Yet the long reads produced by MinION (~2900 bp) allowed accurate phylogenetic reconstruction studies. These results highlight the requirement for a careful interpretation of Illumina-based metabarcoding studies, in particular regarding low abundant amplicons, and open future perspectives towards full-length rDNA environmental metabarcoding surveys. 相似文献

19.

Scalable transcriptome preparation for massive parallel sequencing

Stranneheim H Werne B Sherwood E Lundeberg J 《PloS one》2011,6(7):e21910

Background

The tremendous output of massive parallel sequencing technologies requires automated robust and scalable sample preparation methods to fully exploit the new sequence capacity.

Methodology

In this study, a method for automated library preparation of RNA prior to massively parallel sequencing is presented. The automated protocol uses precipitation onto carboxylic acid paramagnetic beads for purification and size selection of both RNA and DNA. The automated sample preparation was compared to the standard manual sample preparation.

Conclusion/Significance

The automated procedure was used to generate libraries for gene expression profiling on the Illumina HiSeq 2000 platform with the capacity of 12 samples per preparation with a significantly improved throughput compared to the standard manual preparation. The data analysis shows consistent gene expression profiles in terms of sensitivity and quantification of gene expression between the two library preparation methods. 相似文献

20.

A novel post hoc method for detecting index switching finds no evidence for increased switching on the Illumina HiSeq X

下载免费PDF全文

Gregory L. Owens Marco Todesco Emily B. M. Drummond Sam Yeaman Loren H. Rieseberg 《Molecular ecology resources》2018,18(1):169-175

相似文献