首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
2.
3.
4.
Next generation pyrosequencing of high G + C content genomes still poses problems to automated sequencing and assembly processes which necessitates cost and time intensive manual work in order to finish such genomes completely. The sequencing of the high G + C actinomycete Actinoplanes sp. SE50/110 was performed with standard pyrosequencing technology (454 Life Sciences) and revealed a high number of gaps. The reasons for the introduction of gaps were analyzed on a previously known 41 kb long DNA reference sequence from Actinoplanes sp. SE50/110, hosting the acarbose biosynthesis gene cluster. Mapping of the sequencing results on the reference gene cluster sequence revealed a fragmentation into 30 contiguous sequences of different lengths. The gaps between these sequences were characterized by extremely low read coverage which strongly correlated with the G + C content in the gap regions in a negative manner. Furthermore, the gap-sequences contained strong stem-loop structures which hindered the amplification of these sequences during the emulsion PCR. Being significantly underrepresented or absent in the subsequent sequencing process, these sequences lead to weakly or uncovered genomic regions which forces the assembly algorithm to output multiple contiguous sequences instead of one finished genome. However, by applying a different pyrosequencing protocol, it was possible to sequence the complete acarbose biosynthesis gene cluster. The changes to the protocol include longer read length and addition of chemicals to the amplification chemistry, which reduces the self-annealing of DNA fragments during the amplification process and enables the complete reconstruction of high G + C content genomes without manual intervention.  相似文献   

5.
6.
Second generation sequencing has been widely used to sequence whole genomes. Though various paired-end sequencing methods have been developed to construct the long scaffold from contigs derived from shotgun sequencing, the classical paired-end sequencing of the Bacteria Artificial Chromosome (BAC) or fosmid libraries by the Sanger method still plays an important role in genome assembly. However, sequencing libraries with the Sanger method is expensive and time-consuming. Here we report a new strategy to sequence the paired-ends of genomic libraries with parallel pyrosequencing, using a Chinese amphioxus (Branchiostoma belcheri) BAC library as an example. In total, approximately 12,670 non-redundant paired-end sequences were generated. Mapping them to the primary scaffolds of Chinese amphioxus, we obtained 413 ultra-scaffolds from 1,182 primary scaffolds, and the N50 scaffold length was increased approximately 55 kb, which is about a 10% improvement. We provide a universal and cost-effective method for sequencing the ultra-long paired-ends of genomic libraries. This method can be very easily implemented in other second generation sequencing platforms.  相似文献   

7.
Next-generation sequencing (NGS) opens up exciting possibilities for improving our knowledge of environmental microbial diversity, allowing rapid and cost-effective identification of both cultivated and uncultivated microorganisms. However, library preparation, sequencing, and analysis of the results can provide inaccurate representations of the studied community compositions. Therefore, all these steps need to be taken into account carefully. Here we evaluated the effects of DNA extraction methods, targeted 16S rRNA hypervariable regions, and sample origins on the diverse microbes detected by 454 pyrosequencing in marine cold seep and hydrothermal vent sediments. To assign the reads with enough taxonomic precision, we built a database with about 2,500 sequences from Archaea and Bacteria from deep-sea marine sediments, affiliated according to reference publications in the field. Thanks to statistical and diversity analyses as well as inference of operational taxonomic unit (OTU) networks, we show that (i) while DNA extraction methods do not seem to affect the results for some samples, they can lead to dramatic changes for others; and (ii) the choice of amplification and sequencing primers also considerably affects the microbial community detected in the samples. Thereby, very different proportions of pyrosequencing reads were obtained for some microbial lineages, such as the archaeal ANME-1, ANME-2c, and MBG-D and deltaproteobacterial subgroups. This work clearly indicates that the results from sequencing-based analyses, such as pyrosequencing, should be interpreted very carefully. Therefore, the combination of NGS with complementary approaches, such as fluorescence in situ hybridization (FISH)/catalyzed reporter deposition (CARD)-FISH or quantitative PCR (Q-PCR), would be desirable to gain a more comprehensive picture of environmental microbial communities.  相似文献   

8.

Background  

Plastid genome sequence information is vital to several disciplines in plant biology, including phylogenetics and molecular biology. The past five years have witnessed a dramatic increase in the number of completely sequenced plastid genomes, fuelled largely by advances in conventional Sanger sequencing technology. Here we report a further significant reduction in time and cost for plastid genome sequencing through the successful use of a newly available pyrosequencing platform, the Genome Sequencer 20 (GS 20) System (454 Life Sciences Corporation), to rapidly and accurately sequence the whole plastid genomes of the basal eudicot angiosperms Nandina domestica (Berberidaceae) and Platanus occidentalis (Platanaceae).  相似文献   

9.

Background

16S rRNA gene pyrosequencing approach has revolutionized studies in microbial ecology. While primer selection and short read length can affect the resulting microbial community profile, little is known about the influence of pyrosequencing methods on the sequencing throughput and the outcome of microbial community analyses. The aim of this study is to compare differences in output, ease, and cost among three different amplicon pyrosequencing methods for the Roche/454 Titanium platform

Methodology/Principal Findings

The following three pyrosequencing methods for 16S rRNA genes were selected in this study: Method-1 (standard method) is the recommended method for bi-directional sequencing using the LIB-A kit; Method-2 is a new option designed in this study for unidirectional sequencing with the LIB-A kit; and Method-3 uses the LIB-L kit for unidirectional sequencing. In our comparison among these three methods using 10 different environmental samples, Method-2 and Method-3 produced 1.5–1.6 times more useable reads than the standard method (Method-1), after quality-based trimming, and did not compromise the outcome of microbial community analyses. Specifically, Method-3 is the most cost-effective unidirectional amplicon sequencing method as it provided the most reads and required the least effort in consumables management.

Conclusions

Our findings clearly demonstrated that alternative pyrosequencing methods for 16S rRNA genes could drastically affect sequencing output (e.g. number of reads before and after trimming) but have little effect on the outcomes of microbial community analysis. This finding is important for both researchers and sequencing facilities utilizing 16S rRNA gene pyrosequencing for microbial ecological studies.  相似文献   

10.
合成生物学和基因组测序技术的快速发展使挖掘和高效合成天然产物进入了一个全新的时代。由于多数原始菌株生长缓慢、难以培养及遗传改造困难等问题,导致天然产物生物合成基因簇的激活和高效表达受到严重制约。基于此,将原始菌株来源的基因簇转移到操作简便、遗传背景清晰的模式宿主中进行异源表达成为天然产物发现和产量提高的一种有效手段。其中,基因簇的克隆与编辑是实现天然产物异源表达的一个主要限速步骤。CRISPR/Cas技术的应用极大地提高了大型基因簇克隆和编辑的效率,有效促进了微生物来源新药的发现。本文针对基于CRISPR/Cas开发的基因簇克隆和编辑技术进行了系统梳理和全面总结,探讨相关技术在天然产物挖掘和高效合成中的应用及其重要意义。  相似文献   

11.
Zhao F  Zhao F  Li T  Bryant DA 《Nucleic acids research》2008,36(10):3455-3462
Gap closing is considered one of the most challenging and time-consuming tasks in bacterial genome sequencing projects, especially with the emergence of new sequencing technologies, such as pyrosequencing, which may result in large amounts of data without the benefit of large insert libraries for contig scaffolding. We propose a novel algorithm to align contigs with more than one reference genome at a time. This approach can successfully overcome the limitations of low degrees of conserved gene order for the reference and target genomes. A pheromone trail-based genetic algorithm (PGA) was used to search globally for the optimal placement for each contig. Extensive testing on simulated and real data sets shows that PGA significantly outperforms previous methods, especially when assembling genomes that are only moderately related. An extended version of PGA can predict additional candidate connections for each contig and can thus increase the likelihood of identifying the correct arrangement of each contig. The software and test data sets can be accessed at http://sourceforge.net/projects/pga4genomics/.  相似文献   

12.
Accuracy and quality of massively parallel DNA pyrosequencing   总被引:13,自引:0,他引:13  

Background  

Massively parallel pyrosequencing systems have increased the efficiency of DNA sequencing, although the published per-base accuracy of a Roche GS20 is only 96%. In genome projects, highly redundant consensus assemblies can compensate for sequencing errors. In contrast, studies of microbial diversity that catalogue differences between PCR amplicons of ribosomal RNA genes (rDNA) or other conserved gene families cannot take advantage of consensus assemblies to detect and minimize incorrect base calls.  相似文献   

13.
The Human Genome Project stimulated the development of efficient strategies and relevant hardware for complete genome sequencing. The comparative genomic approach extends the possibilities of using the sequencing data to identify new genes or conserved regulatory regions by means of nucleotide sequence alignment of the particular regions of the mouse and human genomes, or to trace the evolutionary events resulting in the genome structure of modern mammals. The review focuses on the use of new molecular cytogenetic methods along with computer-aided analysis of the genomes in vertebrates. Several factors hindering data analysis are considered. The currently available information on gene evolution rate inferred from comparative genomic data is presented. The origin and evolution of the genomes of several species are discussed.  相似文献   

14.
The Human Genome Project stimulated the development of efficient strategies and relevant hardware for complete genome sequencing. The comparative genomic approach extends the possibilities of using the sequencing data to identify new genes or conserved regulatory regions by means of nucleotide sequence alignment of the particular regions of the mouse and human genomes, or to trace the evolutionary events resulting in the genome structure of modern mammals. The review focuses on the use of new molecular cytogenetic methods along with computer-aided analysis of the genomes in vertebrates. Several factors hindering data analysis are considered. The currently available information on gene evolution rate inferred from comparative genomic data is presented. The origin and evolution of the genomes of several species are discussed.  相似文献   

15.

Background

Bacterial viruses (phages) play a critical role in shaping microbial populations as they influence both host mortality and horizontal gene transfer. As such, they have a significant impact on local and global ecosystem function and human health. Despite their importance, little is known about the genomic diversity harbored in phages, as methods to capture complete phage genomes have been hampered by the lack of knowledge about the target genomes, and difficulties in generating sufficient quantities of genomic DNA for sequencing. Of the approximately 550 phage genomes currently available in the public domain, fewer than 5% are marine phage.

Methodology/Principal Findings

To advance the study of phage biology through comparative genomic approaches we used marine cyanophage as a model system. We compared DNA preparation methodologies (DNA extraction directly from either phage lysates or CsCl purified phage particles), and sequencing strategies that utilize either Sanger sequencing of a linker amplification shotgun library (LASL) or of a whole genome shotgun library (WGSL), or 454 pyrosequencing methods. We demonstrate that genomic DNA sample preparation directly from a phage lysate, combined with 454 pyrosequencing, is best suited for phage genome sequencing at scale, as this method is capable of capturing complete continuous genomes with high accuracy. In addition, we describe an automated annotation informatics pipeline that delivers high-quality annotation and yields few false positives and negatives in ORF calling.

Conclusions/Significance

These DNA preparation, sequencing and annotation strategies enable a high-throughput approach to the burgeoning field of phage genomics.  相似文献   

16.
454 pyrosequencing, a massively parallel sequencing (MPS) technology, is often used to study HIV genetic variation. However, the substantial mismatch error rate of the PCR required to prepare HIV-containing samples for pyrosequencing has limited the detection of rare variants within viral populations to those present above ~1%. To improve detection of rare variants, we varied PCR enzymes and conditions to identify those that combined high sensitivity with a low error rate. Substitution errors were found to vary up to 3-fold between the different enzymes tested. The sensitivity of each enzyme, which impacts the number of templates amplified for pyrosequencing, was shown to vary, although not consistently across genes and different samples. We also describe an amplicon-based method to improve the consistency of read coverage over stretches of the HIV-1 genome. Twenty-two primers were designed to amplify 11 overlapping amplicons in the HIV-1 clade B gag-pol and env gp120 coding regions to encompass 4.7 kb of the viral genome per sample at sensitivities as low as 0.01-0.2%.  相似文献   

17.
18.
Oscypek is a traditional Polish scalded-smoked cheese, with a protected-designation-of-origin (PDO) status, manufactured from raw sheep's milk without starter cultures in the Tatra Mountains region of Poland. This study was undertaken in order to gain insight into the microbiota that develops and evolves during the manufacture and ripening stages of Oscypek. To this end, we made use of both culturing and the culture-independent methods of PCR followed by denaturing gradient gel electrophoresis (PCR-DGGE) and pyrosequencing of 16S rRNA gene amplicons. The culture-dependent technique and PCR-DGGE fingerprinting detected the predominant microorganisms in traditional Oscypek, whereas the next-generation sequencing technique (454 pyrosequencing) revealed greater bacterial diversity. Besides members of the most abundant bacterial genera in dairy products, e.g., Lactococcus, Lactobacillus, Leuconostoc, Streptococcus, and Enterococcus, identified by all three methods, other, subdominant bacteria belonging to the families Bifidobacteriaceae and Moraxellaceae (mostly Enhydrobacter), as well as various minor bacteria, were identified by pyrosequencing. The presence of bifidobacterial sequences in a cheese system is reported for the first time. In addition to bacteria, a great diversity of yeast species was demonstrated in Oscypek by the PCR-DGGE method. Culturing methods enabled the determination of a number of viable microorganisms from different microbial groups and their isolation for potential future applications in specific cheese starter cultures.  相似文献   

19.
结核分枝杆菌基因组学与基因组进化   总被引:1,自引:0,他引:1  
在后基因组时代,特别是在新的测序理论和设备大发展的背景下,一些重大传染性致病微生物基因组序列正在被逐一测定,并且随后的基因功能注释,蛋白质三维结构重建等工作也正在开展,以期对致病微生物的生物学特性、诊断策略和治疗方法等有突破性的认识.作为对人类健康一直存在严重威胁的结核分枝杆菌,其基因组在进化中所发生的各种遗传事件对其生物学性质、致病能力和抗药性等各方面有重要作用.本文旨在阐述结核分枝杆菌的起源及其基因组特征,论述其基因组进化的研究进展.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号