首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
This is a time of unprecedented transition in DNA sequencing technologies. Next-generation sequencing (NGS) clearly holds promise for fast and cost-effective generation of multilocus sequence data for phylogeography and phylogenetics. However, the focus on non-model organisms, in addition to uncertainty about which sample preparation methods and analyses are appropriate for different research questions and evolutionary timescales, have contributed to a lag in the application of NGS to these fields. Here, we outline some of the major obstacles specific to the application of NGS to phylogeography and phylogenetics, including the focus on non-model organisms, the necessity of obtaining orthologous loci in a cost-effective manner, and the predominate use of gene trees in these fields. We describe the most promising methods of sample preparation that address these challenges. Methods that reduce the genome by restriction digest and manual size selection are most appropriate for studies at the intraspecific level, whereas methods that target specific genomic regions (i.e., target enrichment or sequence capture) have wider applicability from the population level to deep-level phylogenomics. Additionally, we give an overview of how to analyze NGS data to arrive at data sets applicable to the standard toolkit of phylogeography and phylogenetics, including initial data processing to alignment and genotype calling (both SNPs and loci involving many SNPs). Even though whole-genome sequencing is likely to become affordable rather soon, because phylogeography and phylogenetics rely on analysis of hundreds of individuals in many cases, methods that reduce the genome to a subset of loci should remain more cost-effective for some time to come.  相似文献   

2.
3.
Next-generation sequencing (NGS) has caused a revolution in biology. NGS requires the preparation of libraries in which (fragments of) DNA or RNA molecules are fused with adapters followed by PCR amplification and sequencing. It is evident that robust library preparation methods that produce a representative, non-biased source of nucleic acid material from the genome under investigation are of crucial importance. Nevertheless, it has become clear that NGS libraries for all types of applications contain biases that compromise the quality of NGS datasets and can lead to their erroneous interpretation. A detailed knowledge of the nature of these biases will be essential for a careful interpretation of NGS data on the one hand and will help to find ways to improve library quality or to develop bioinformatics tools to compensate for the bias on the other hand. In this review we discuss the literature on bias in the most common NGS library preparation protocols, both for DNA sequencing (DNA-seq) as well as for RNA sequencing (RNA-seq). Strikingly, almost all steps of the various protocols have been reported to introduce bias, especially in the case of RNA-seq, which is technically more challenging than DNA-seq. For each type of bias we discuss methods for improvement with a view to providing some useful advice to the researcher who wishes to convert any kind of raw nucleic acid into an NGS library.  相似文献   

4.
Next-generation sequencing (NGS) is emerging as a powerful tool for elucidating genetic information for a wide range of applications. Unfortunately, the surging popularity of NGS has not yet been accompanied by an improvement in automated techniques for preparing formatted sequencing libraries. To address this challenge, we have developed a prototype microfluidic system for preparing sequencer-ready DNA libraries for analysis by Illumina sequencing. Our system combines droplet-based digital microfluidic (DMF) sample handling with peripheral modules to create a fully-integrated, sample-in library-out platform. In this report, we use our automated system to prepare NGS libraries from samples of human and bacterial genomic DNA. E. coli libraries prepared on-device from 5 ng of total DNA yielded excellent sequence coverage over the entire bacterial genome, with >99% alignment to the reference genome, even genome coverage, and good quality scores. Furthermore, we produced a de novo assembly on a previously unsequenced multi-drug resistant Klebsiella pneumoniae strain BAA-2146 (KpnNDM). The new method described here is fast, robust, scalable, and automated. Our device for library preparation will assist in the integration of NGS technology into a wide variety of laboratories, including small research laboratories and clinical laboratories.  相似文献   

5.
二代测序技术的涌现推动了基因组学研究,特别是在疾病相关的遗传变异研究中发挥了重要作用.虽然大多数遗传变异类型都可以借助于各种二代测序分析工具进行检测,但是仍然存在局限性,比如短串联重复序列的长度变异.许多遗传疾病是由短串联重复序列的长度扩张导致的,尤其是亨廷顿病等多种神经系统疾病.然而,现在几乎没有工具能够利用二代测序检测长度大于测序读长的短串联重复序列变异.为了突破这一限制,我们开发了一个全新的方法,该方法基于双末端二代测序辨识短串联重复序列长度变异,并可估计其扩张长度,将其应用于一项基于全外显子组测序的运动神经元疾病临床研究中,成功地鉴定出致病的短串联重复序列长度扩张.该方法首次原创性地利用测序读长覆盖深度特征来解决短串联重复序列变异检测问题,在人类遗传疾病研究中具有广泛的应用价值,并且对于其他二代测序分析方法的开发具有启发性意义.  相似文献   

6.
Next-generation sequencing technologies (NGS) have revolutionized biological research by significantly increasing data generation while simultaneously decreasing the time to data output. For many ecologists and evolutionary biologists, the research opportunities afforded by NGS are substantial; even for taxa lacking genomic resources, large-scale genome-level questions can now be addressed, opening up many new avenues of research. While rapid and massive sequencing afforded by NGS increases the scope and scale of many research objectives, whole genome sequencing is often unwarranted and unnecessarily complex for specific research questions. Recently developed targeted sequence enrichment, coupled with NGS, represents a beneficial strategy for enhancing data generation to answer questions in ecology and evolutionary biology. This marriage of technologies offers researchers a simple method to isolate and analyze a few to hundreds, or even thousands, of genes or genomic regions from few to many samples in a relatively efficient and effective manner. These strategies can be applied to questions at both the infra- and interspecific levels, including those involving parentage, gene flow, divergence, phylogenetics, reticulate evolution, and many more. Here we provide a brief overview of targeted sequence enrichment, and emphasize the power of this technology to increase our ability to address a wide range of questions of interest to ecologists and evolutionary biologists, particularly for those working with taxa for which few genomic resources are available.  相似文献   

7.
Knowledge and better understanding of functions of the microbial community are pivotal for crop management. This study was conducted to study bacterial structures including Acidovorax species community structures and diversity from the watermelon cultivated soils in different regions of South Korea. In this study, soil samples were collected from watermelon cultivation areas from various places of South Korea and microbiome analysis was performed to analyze bacterial communities including Acidovorax species community. Next generation sequencing (NGS) was performed by extracting genomic DNA from 92 soil samples from 8 different provinces using a fast genomic DNA extraction kit. NGS data analysis results revealed that, total, 39,367 operational taxonomic unit (OTU), were obtained. NGS data results revealed that, most dominant phylum in all the soil samples was Proteobacteria (37.3%). In addition, most abundant genus was Acidobacterium (1.8%) in all the samples. In order to analyze species diversity among the collected soil samples, OTUs, community diversity, and Shannon index were measured. Shannon (9.297) and inverse Simpson (0.996) were found to have the highest diversity scores in the greenhouse soil sample of Gyeonggi-do province (GG4). Results from NGS sequencing suggest that, most of the soil samples consists of similar trend of bacterial community and diversity. Environmental factors play a key role in shaping the bacterial community and diversity. In order to address this statement, further correlation analysis between soil physical and chemical parameters with dominant bacterial community will be carried out to observe their interactions.  相似文献   

8.
9.
Next-generation sequencing(NGS) technologies have made high-throughput sequencing available to medium- and small-size laboratories, culminating in a tidal wave of genomic information. The quantity of sequenced bacterial genomes has not only brought excitement to the field of genomics but also heightened expectations that NGS would boost antibacterial discovery and vaccine development. Although many possible drug and vaccine targets have been discovered, the success rate of genome-based analysis has remained below expectations. Furthermore, NGS has had consequences for genome quality, resulting in an exponential increase in draft(partial data) genome deposits in public databases. If no further interests are expressed for a particular bacterial genome, it is more likely that the sequencing of its genome will be limited to a draft stage, and the painstaking tasks of completing the sequencing of its genome and annotation will not be undertaken. It is important to know what is lost when we settle for a draft genome and to determine the "scientific value" of a newly sequenced genome. This review addresses the expected impact of newly sequenced genomes on antibacterial discovery and vaccinology. Also, it discusses the factors that could be leading to the increase in the number of draft deposits and the consequent loss of relevant biological information.  相似文献   

10.
11.
12.

Background  

Next-generation sequencing (NGS) offers a unique opportunity for high-throughput genomics and has potential to replace Sanger sequencing in many fields, including de-novo sequencing, re-sequencing, meta-genomics, and characterisation of infectious pathogens, such as viral quasispecies. Although methodologies and software for whole genome assembly and genome variation analysis have been developed and refined for NGS data, reconstructing a viral quasispecies using NGS data remains a challenge. This application would be useful for analysing intra-host evolutionary pathways in relation to immune responses and antiretroviral therapy exposures. Here we introduce a set of formulae for the combinatorial analysis of a quasispecies, given a NGS re-sequencing experiment and an algorithm for quasispecies reconstruction. We require that sequenced fragments are aligned against a reference genome, and that the reference genome is partitioned into a set of sliding windows (amplicons). The reconstruction algorithm is based on combinations of multinomial distributions and is designed to minimise the reconstruction of false variants, called in-silico recombinants.  相似文献   

13.
The genotyping of highly polymorphic multigene families across many individuals used to be a particularly challenging task because of methodological limitations associated with traditional approaches. Next‐generation sequencing (NGS) can overcome most of these limitations, and it is increasingly being applied in population genetic studies of multigene families. Here, we critically review NGS bioinformatic approaches that have been used to genotype the major histocompatibility complex (MHC) immune genes, and we discuss how the significant advances made in this field are applicable to population genetic studies of gene families. Increasingly, approaches are introduced that apply thresholds of sequencing depth and sequence similarity to separate alleles from methodological artefacts. We explain why these approaches are particularly sensitive to methodological biases by violating fundamental genotyping assumptions. An alternative strategy that utilizes ultra‐deep sequencing (hundreds to thousands of sequences per amplicon) to reconstruct genotypes and applies statistical methods on the sequencing depth to separate alleles from artefacts appears to be more robust. Importantly, the ‘degree of change’ (DOC) method avoids using arbitrary cut‐off thresholds by looking for statistical boundaries between the sequencing depth for alleles and artefacts, and hence, it is entirely repeatable across studies. Although the advances made in generating NGS data are still far ahead of our ability to perform reliable processing, analysis and interpretation, the community is developing statistically rigorous protocols that will allow us to address novel questions in evolution, ecology and genetics of multigene families. Future developments in third‐generation single molecule sequencing may potentially help overcome problems that still persist in de novo multigene amplicon genotyping when using current second‐generation sequencing approaches.  相似文献   

14.
The recent increase in accessibility and scale of genetic data available through next-generation sequencing (NGS) technology has transformed biological inquiry. As a direct result, the application and analysis of NGS data has quickly become an important skill for future scientists. However, the steep learning curve for applying NGS technology to biological questions, including the complexity of sample preparation for sequencing and the analysis of large data sets, are deterrents to the integration of NGS into undergraduate education. Here, we present a course-based undergraduate research experience (CURE) designed to aid in overcoming these limitations through NGS investigations of prokaryotic diversity. Specifically, we use 16S rRNA sequencing to explore patterns of diversity stemming from student-directed hypothesis development. This CURE addresses three learning objectives: (1) it provides a forum for experimental design hypothesis generation, (2) it introduces modern genomic tools through a hands-on experience generating an NGS data-set, and (3) it provides students with an introductory experience in bioinformatics.  相似文献   

15.
16.
Even though next-generation sequencing (NGS) has now become the predominant state-of-the-art technique for genotyping populations, amplified fragment length polymorphism (AFLP) DNA fingerprinting is still a relevant method, thanks to its versatility, cost-effectiveness, independence of prior sequence information and broad applicability. Even though the number of AFLP studies reached its peak in 2012, it is still applied extensively for phylogenetic analysis, genotyping or identifying non-model species, which often feature complex and large genomes. For these purposes, tools continue to be developed for designing AFLP studies, scoring AFLPs or handling AFLP data. Moreover, AFLP studies embrace the NGS technology; for example, the whole-genome sequence of model species is used to design more efficient AFLP studies for non-model species. Conversely, in complexity reduction of polymorphic sequences and restriction site-associated DNA sequencing studies, polymorphisms are often found to be present in many restriction sites, which can still be studied as AFLPs. We discuss the latest advances in AFLP-based studies in the era of NGS and anticipate that AFLP will remain a relevant method in the near future, even for species with a known genome, owing to its many promising new features such as methylation-sensitive-AFLP. Here, we also present an optimized pipeline for converting AFLP markers into single-locus markers, which can be applied in both traditional AFLP and NGS studies.  相似文献   

17.
《Biotechnology advances》2019,37(8):107450
Conventional Sanger Sequencing for authentication of herbal products is difficult since they are mixture of herbs with fragmented DNA. Next-generation sequencing (NGS) techniques give massive parallelization of sequencing reaction to generate multiple reads with various read length, thus different components in herbal products with fragmented DNA can be identified. NGS is especially suitable for animal derived products with the lack of effective markers for chemical analysis. Currently, second generation sequencing such as Illumina Sequencing and Ion Torrent Sequencing, and third generation sequencing such as PacBio Sequencing and Nanopore Sequencing are representative NGS platforms. The constructed library is first sequenced to obtain a pool of genomic data, followed by bioinformatics analysis and comparison with DNA database. NGS also facilitates the determination of contaminant which is essential for quality control regulation in Good Manufacturing Practice (GMP) factory. In this article, we provide an overview on NGS, summarize the cases on the use of NGS to identify herbal products, discuss the key technological challenges and provide perspectives on future directions for authentication and quality control of herbal products.  相似文献   

18.
19.
Next-generation sequencing (NGS) has revolutionized genetics and enabled the accurate identification of many genetic variants across many genomes. However, detection of biologically important low-frequency variants within genetically heterogeneous populations remains challenging, because they are difficult to distinguish from intrinsic NGS sequencing error rates. Approaches to overcome these limitations are essential to detect rare mutations in large cohorts, virus or microbial populations, mitochondria heteroplasmy, and other heterogeneous mixtures such as tumors. Modifications in library preparation can overcome some of these limitations, but are experimentally challenging and restricted to skilled biologists. This paper describes a novel quality filtering and base pruning pipeline, called Complex Heterogeneous Overlapped Paired-End Reads (CHOPER), designed to detect sequence variants in a complex population with high sequence similarity derived from All-Codon-Scanning (ACS) mutagenesis. A novel fast alignment algorithm, designed for the specified application, has O(n) time complexity. CHOPER was applied to a p53 cancer mutant reactivation study derived from ACS mutagenesis. Relative to error filtering based on Phred quality scores, CHOPER improved accuracy by about 13% while discarding only half as many bases. These results are a step toward extending the power of NGS to the analysis of genetically heterogeneous populations.  相似文献   

20.
KS Lee  RN Kim  BH Yoon  DS Kim  SH Choi  DW Kim  SH Nam  A Kim  A Kang  KH Park  JE Jung  SH Chae  HS Park 《Bioinformation》2012,8(11):532-534
Recently, next generation sequencing (NGS) technologies have led to a revolutionary increase in sequencing speed and costefficacy. Consequently, a vast number of contigs from many recently sequenced bacterial genomes remain to be accurately mapped and annotated, requiring the development of more convenient bioinformatics programs. In this paper, we present a newly developed web-based bioinformatics program, Bacterial Genome Mapper, which is suitable for mapping and annotating contigs that have been assembled from bacterial genome sequence raw data. By constructing a multiple alignment map between target contig sequences and two reference bacterial genome sequences, this program also provides very useful comparative genomics analysis of draft bacterial genomes. AVAILABILITY: The database is available for free at http://mbgm.kribb.re.kr.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号