Similar articles
 20 similar articles found (search time: 15 ms)
1.
High‐throughput sequencing (HTS) technologies generate millions of sequence reads from DNA/RNA molecules rapidly and cost‐effectively, enabling single‐investigator laboratories to address a variety of ‘omics’ questions in nonmodel organisms, fundamentally changing the way genomic approaches are used to advance biological research. One major challenge posed by HTS is the complexity and difficulty of data quality control (QC). While QC issues associated with sample isolation, library preparation and sequencing are well known and protocols for their handling are widely available, the QC of the actual sequence reads generated by HTS is often overlooked. HTS‐generated sequence reads can contain various errors, biases and artefacts whose identification and amelioration can greatly impact subsequent data analysis. However, a systematic survey on QC procedures for HTS data is still lacking. In this review, we begin by presenting standard ‘health check‐up’ QC procedures recommended for HTS data sets and establishing what ‘healthy’ HTS data look like. We next proceed by classifying errors, biases and artefacts present in HTS data into three major types of ‘pathologies’, discussing their causes and symptoms and illustrating with examples their diagnosis and impact on downstream analyses. We conclude this review by offering examples of successful ‘treatment’ protocols and recommendations on standard practices and treatment options. Notwithstanding the speed with which HTS technologies – and consequently their pathologies – change, we argue that careful QC of HTS data is an important – yet often neglected – aspect of their application in molecular ecology, and lay the groundwork for developing an HTS data QC ‘best practices’ guide.
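As a rough illustration of the kind of per‐read ‘health check’ the review describes, the sketch below parses FASTQ text and flags reads by mean Phred quality and proportion of ambiguous (N) bases. It assumes four‐line FASTQ records with Phred+33 quality encoding; the function names and both thresholds are hypothetical choices for this example and are not taken from the review.

```python
from io import StringIO


def parse_fastq(handle):
    """Yield (read_id, sequence, quality_string) from four-line FASTQ records."""
    while True:
        header = handle.readline().strip()
        if not header:  # end of input
            break
        seq = handle.readline().strip()
        handle.readline()  # skip the '+' separator line
        qual = handle.readline().strip()
        yield header[1:], seq, qual


def mean_phred(qual, offset=33):
    """Mean Phred score of a quality string (assumes Phred+33 encoding)."""
    return sum(ord(c) - offset for c in qual) / len(qual)


def passes_qc(seq, qual, min_mean_q=20.0, max_n_fraction=0.05):
    """Illustrative thresholds only; real cut-offs depend on platform and study."""
    n_fraction = seq.upper().count("N") / len(seq)
    return mean_phred(qual) >= min_mean_q and n_fraction <= max_n_fraction


# Two toy reads: one high quality, one low quality with ambiguous bases.
example = "@read1\nACGTACGT\n+\nIIIIIIII\n@read2\nACGNNNGT\n+\n########\n"
kept = [rid for rid, seq, qual in parse_fastq(StringIO(example))
        if passes_qc(seq, qual)]
print(kept)
```

Only the first toy read survives here; in practice such a filter would be one step among several (adapter trimming, duplicate and contaminant checks) rather than a complete QC procedure.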

2.
To enable rapid selection of traits in marker‐assisted breeding, markers must be technically simple, low‐cost, high‐throughput and randomly distributed in a genome. We developed such a technology, designated as Multiplex Restriction Amplicon Sequencing (MRASeq), which reduces genome complexity by polymerase chain reaction (PCR) amplification of amplicons flanked by restriction sites. The first PCR primers contain restriction site sequences at 3’‐ends, preceded by 6‐10 bases of specific or degenerate nucleotide sequences and then by a unique M13‐tail sequence which serves as a binding site for a second PCR that adds sequencing primers and barcodes to allow sample multiplexing for sequencing. The sequences of restriction sites and adjacent nucleotides can be altered to suit different species. Physical mapping of MRASeq SNPs from a biparental population of allohexaploid wheat (Triticum aestivum L.) showed a random distribution of SNPs across the genome. MRASeq generated thousands of SNPs from a wheat biparental population and natural populations of wheat and barley (Hordeum vulgare L.). This novel, next‐generation sequencing‐based genotyping platform can be used for linkage mapping to screen quantitative trait loci (QTL), background selection in breeding and many other genetics and breeding applications of various species.

3.
High‐throughput sequencing (HTS) of PCR amplicons is becoming the method of choice to sequence one or several targeted loci for phylogenetic and DNA barcoding studies. Although the development of HTS has allowed rapid generation of massive amounts of DNA sequence data, preparing amplicons for HTS remains a rate‐limiting step. For example, HTS platforms require platform‐specific adapter sequences to be present at the 5′ and 3′ ends of the DNA fragment to be sequenced. In addition, short multiplex identifier (MID) tags are typically added to allow multiple samples to be pooled in a single HTS run. Existing methods to incorporate HTS adapters and MID tags into PCR amplicons are either inefficient, requiring multiple enzymatic reactions and clean‐up steps, or costly when applied to multiple samples or loci (fusion primers). We describe a method to amplify a target locus and add HTS adapters and MID tags via a linker sequence using a single PCR. We demonstrate our approach by generating reference sequence data for two mitochondrial loci (COI and 16S) for a diverse suite of insect taxa. Our approach provides a flexible, cost‐effective and efficient method to prepare amplicons for HTS.

4.
Dietary changes linked to the availability of anthropogenic food resources can have complex implications for species and ecosystems, especially when species are in decline. Here, we use recently developed primers targeting the ITS2 region of plants to characterize diet from faecal samples of four UK columbids, with particular focus on the European turtle dove (Streptopelia turtur), a rapidly declining obligate granivore. We examine dietary overlap between species (potential competition), associations with body condition in turtle doves and spatiotemporal variation in diet. We identified 143 taxonomic units, of which we classified 55% to species, another 34% to genus and the remaining 11% to family. We found significant dietary overlap between all columbid species, with the highest between turtle doves and stock doves (Columba oenas), then between turtle doves and woodpigeons (Columba palumbus). The lowest overlap was between woodpigeons and collared doves (Streptopelia decaocto). We show considerable change in columbid diets compared to previous studies, probably reflecting opportunistic foraging behaviour by columbids within a highly anthropogenically modified landscape, although our data for non‐turtle doves should be considered preliminary. Nestling turtle doves in better condition had a higher dietary proportion of taxonomic units from natural arable plant species and a lower proportion of taxonomic units from anthropogenic food resources such as garden bird seed mixes and brassicas. This suggests that breeding ground conservation strategies for turtle doves should include provision of anthropogenic seeds for adults early in the breeding season, coupled with habitat rich in accessible seeds from arable plants once chicks have hatched.

5.
Recent advances in high‐throughput sequencing library preparation and subgenomic enrichment methods have opened new avenues for population genetics and phylogenetics of nonmodel organisms. To multiplex large numbers of indexed samples while sequencing predominantly orthologous, targeted regions of the genome, we propose modifications to an existing, in‐solution capture that utilizes PCR products as target probes to enrich library pools for the genomic subset of interest. The sequence capture using PCR‐generated probes (SCPP) protocol requires no specialized equipment, is highly flexible and significantly reduces experimental costs for projects where a modest scale of genetic data is optimal (25–100 genomic loci). Our alterations enable application of this method across a wider phylogenetic range of taxa and result in higher capture efficiencies and coverage at each locus. Efficient and consistent capture over multiple SCPP experiments and at various phylogenetic distances is demonstrated, extending the utility of this method to both phylogeographic and phylogenomic studies.

6.
The microbiome associated with brown planthopper (BPH) plays an important role in mediating host health and fitness. Characterization of the microbial community and its structure is a prerequisite for understanding the intricate symbiotic relationships between microbes and the host insect. Here, we investigated the bacterial and fungal communities of BPH at different developmental stages using high‐throughput amplicon sequencing. Our results revealed that both the bacterial and fungal communities were diverse and dynamic during BPH development. The bacterial communities were generally richer than the fungal communities at each developmental stage. At 97% similarity, 19 phyla and 278 genera of bacteria were annotated, while five fungal phyla comprising 80 genera were assigned. The highest species richness for the bacterial communities was detected in the nymphal stage. The taxonomic diversity of the fungal communities in female adults was generally higher than at other developmental stages. The dominant bacterial and fungal phyla at every developmental stage were Proteobacteria and Ascomycota, respectively. A significantly lower abundance of the bacterial genus Acinetobacter was recorded in the egg stage than in other developmental stages, while the dominant fungal genus Wallemia was more abundant in the nymph and adult stages than in the egg stage. Additionally, the microbial composition differed between male and female adults, suggesting that the microbial communities in BPH were gender‐dependent. Overall, our study enriches our knowledge of the microbial communities associated with BPH and will provide clues for developing potential biocontrol techniques against this rice pest.

7.
8.
Microalgae in the division Haptophyta play key roles in the marine ecosystem and in global biogeochemical processes. Despite their ecological importance, knowledge on seasonal dynamics, community composition and abundance at the species level is limited due to their small cell size and few morphological features visible under the light microscope. Here, we present unique data on haptophyte seasonal diversity and dynamics from two annual cycles, with the taxonomic resolution and sampling depth obtained with high‐throughput sequencing. From outer Oslofjorden, S Norway, nano‐ and picoplanktonic samples were collected monthly for 2 years, and the haptophytes targeted by amplification of RNA/cDNA with Haptophyta‐specific 18S rDNA V4 primers. We obtained 156 operational taxonomic units (OTUs) from c. 400 000 reads generated by 454 pyrosequencing, after rigorous bioinformatic filtering and clustering at 99.5%. Most OTUs represented uncultured and/or not yet 18S rDNA‐sequenced species. Haptophyte OTU richness and community composition exhibited high temporal variation and significant yearly periodicity. Richness was highest in September–October (autumn) and lowest in April–May (spring). Some taxa were detected all year, such as Chrysochromulina simplex, Emiliania huxleyi and Phaeocystis cordata, whereas most calcifying coccolithophores only appeared from summer to early winter. We also revealed the seasonal dynamics of OTUs representing putative novel classes (clades HAP‐3–5) or orders (clades D, E, F). Season, light and temperature accounted for 29% of the variation in OTU composition. Residual variation may be related to biotic factors, such as competition and viral infection. This study provides new, in‐depth knowledge on seasonal diversity and dynamics of haptophytes in North Atlantic coastal waters.

9.
DNA analysis of predator faeces using high‐throughput amplicon sequencing (HTS) enhances our understanding of predator–prey interactions. However, conclusions drawn from this technique are constrained by biases that occur in multiple steps of the HTS workflow. To better characterize insectivorous animal diets, we used DNA from a diverse set of arthropods to assess PCR biases of commonly used and novel primer pairs for the mitochondrial gene cytochrome c oxidase subunit 1 (COI). We compared diversity recovered from HTS of bat guano samples using a commonly used primer pair “ZBJ” to results using the novel primer pair “ANML.” To parameterize our bioinformatics pipeline, we created an arthropod mock community consisting of single‐copy (cloned) COI sequences. To examine biases associated with both PCR and HTS, mock community members were combined in equimolar amounts both pre‐ and post‐PCR. We validated our system using guano from bats fed known diets and using composite samples of morphologically identified insects collected in pitfall traps. In PCR tests, the ANML primer pair amplified 58 of 59 arthropod taxa (98%), whereas ZBJ amplified 24–40 of 59 taxa (41%–68%). Furthermore, in an HTS comparison of field‐collected samples, the ANML primers detected nearly fourfold more arthropod taxa than the ZBJ primers. The additional arthropods detected include medically and economically relevant insect groups such as mosquitoes. Results revealed biases at both the PCR and sequencing levels, demonstrating the pitfalls associated with using HTS read numbers as proxies for abundance. The use of an arthropod mock community allowed for improved bioinformatics pipeline parameterization.
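The amplification success rates quoted in this abstract can be checked with simple arithmetic. The short sketch below recomputes them from the reported counts (58, 24 and 40 of 59 taxa); the helper name is hypothetical and only the counts come from the abstract.

```python
def percent_amplified(n_amplified, n_total):
    """Percentage of taxa in the test panel that a primer pair amplified."""
    return 100.0 * n_amplified / n_total


total_taxa = 59  # arthropod taxa tested, as reported in the abstract
anml = percent_amplified(58, total_taxa)
zbj_low = percent_amplified(24, total_taxa)
zbj_high = percent_amplified(40, total_taxa)

print(f"ANML: {anml:.0f}%  ZBJ: {zbj_low:.0f}%-{zbj_high:.0f}%")
```

Rounded to whole percentages, the results agree with the 98% and 41%–68% given in the text.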

10.
High‐throughput sequencing methods have become a routine analysis tool in environmental sciences as well as in the public and private sectors. These methods provide vast amounts of data, which need to be analysed in several steps. Although the bioinformatic analysis can be performed using several public tools, many analytical pipelines allow too few options for the optimal analysis of more complicated or customized designs. Here, we introduce PipeCraft, a flexible and handy bioinformatics pipeline with a user‐friendly graphical interface that links several public tools for analysing amplicon sequencing data. Users are able to customize the pipeline by selecting the most suitable tools and options to process raw sequences from Illumina, Pacific Biosciences, Ion Torrent and Roche 454 sequencing platforms. We described the design and options of PipeCraft and evaluated its performance by analysing the data sets from three different sequencing platforms. We demonstrated that PipeCraft is able to process large data sets within 24 hr. The graphical user interface and the automated links between various bioinformatics tools enable easy customization of the workflow. All analytical steps and options are recorded in log files and are easily traceable.

11.
Next‐generation sequencing (NGS) technologies are revolutionizing the fields of biology and medicine as powerful tools for amplicon sequencing (AS). Using combinations of primers and barcodes, it is possible to sequence targeted genomic regions with deep coverage for hundreds, even thousands, of individuals in a single experiment. This is extremely valuable for the genotyping of gene families in which locus‐specific primers are often difficult to design, such as the major histocompatibility complex (MHC). The utility of AS is, however, limited by the high intrinsic sequencing error rates of NGS technologies and other sources of error such as polymerase amplification or chimera formation. Correcting these errors requires extensive bioinformatic post‐processing of NGS data. Amplicon Sequence Assignment (amplisas) is a tool that performs analysis of AS results in a simple and efficient way, while offering customization options for advanced users. amplisas is designed as a three‐step pipeline consisting of (i) read demultiplexing, (ii) unique sequence clustering and (iii) erroneous sequence filtering. Allele sequences and frequencies are retrieved in Excel spreadsheet format, making them easy to interpret. amplisas performance has been successfully benchmarked against previously published genotyped MHC data sets obtained with various NGS technologies.
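The three steps named in this abstract can be sketched in a few lines. The code below is a deliberately simplified illustration and not the amplisas implementation: it assumes exact barcode prefixes, collapses only identical sequences (no error‐aware clustering) and filters on a single per‐sample frequency threshold. All function names, barcodes, reads and the 0.2 threshold are hypothetical.

```python
from collections import Counter, defaultdict


def demultiplex(reads, barcodes):
    """Step (i): assign reads to samples by exact barcode prefix, then trim it."""
    by_sample = defaultdict(list)
    for read in reads:
        for barcode, sample in barcodes.items():
            if read.startswith(barcode):
                by_sample[sample].append(read[len(barcode):])
                break  # a read belongs to at most one sample
    return by_sample


def cluster_unique(sequences):
    """Step (ii): collapse identical sequences and count their depth."""
    return Counter(sequences)


def filter_variants(counts, min_frequency):
    """Step (iii): keep variants at or above a per-sample frequency threshold."""
    total = sum(counts.values())
    return {seq: n / total for seq, n in counts.items()
            if n / total >= min_frequency}


# Toy data: sample s1 has two real variants plus one rare, likely erroneous read.
barcodes = {"AAA": "s1", "CCC": "s2"}
reads = (["AAAGATTACA"] * 5 + ["AAAGATTTCA"] * 4
         + ["AAAGATAACA"] + ["CCCGGGTTT"] * 2)

alleles = {sample: filter_variants(cluster_unique(seqs), min_frequency=0.2)
           for sample, seqs in demultiplex(reads, barcodes).items()}
print(alleles)
```

A fixed frequency cut‐off is the crudest possible filter; it is used here only to make the three‐stage structure visible.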

12.
Characterization of highly duplicated genes, such as genes of the major histocompatibility complex (MHC), where multiple loci often co‐amplify, has until recently been hindered by insufficient read depths per amplicon. Here, we used ultra‐deep Illumina sequencing to resolve genotypes at exon 3 of MHC class I genes in the sedge warbler (Acrocephalus schoenobaenus). We sequenced 24 individuals in two replicates and used these data, as well as a simulated data set, to test the effect of amplicon coverage (range: 500–20 000 reads per amplicon) on the repeatability of genotyping using four different genotyping approaches. A third replicate employed unique barcoding to assess the extent of tag jumping, that is, swapping of individual tag identifiers, which may confound genotyping. The reliability of MHC genotyping increased with coverage and approached or exceeded 90% within‐method repeatability of allele calling at coverages of >5000 reads per amplicon. We found generally high agreement between genotyping methods, especially at high coverages. High reliability of the tested genotyping approaches was further supported by our analysis of the simulated data set, although the genotyping approach relying primarily on replication of variants in independent amplicons proved sensitive to repeatable errors. According to the most repeatable genotyping method, the number of co‐amplifying variants per individual ranged from 19 to 42. Tag jumping was detectable, but at such low frequencies that it did not affect the reliability of genotyping. We thus demonstrate that gene families with many co‐amplifying genes can be reliably genotyped using HTS, provided that there is sufficient per amplicon coverage.

13.
Metabarcoding has been used in a range of ecological applications such as taxonomic assignment, dietary analysis and the analysis of environmental DNA. However, after a decade of use in these applications there is little consensus on the extent to which the proportions of reads generated correspond to the original proportions of species in a community. To quantify our current understanding, we conducted a structured review and meta‐analysis. The analysis suggests that a weak quantitative relationship may exist between the biomass and sequences produced (slope = 0.52 ± 0.34, p < 0.01), albeit with a large degree of uncertainty. None of the tested moderators (sequencing platform type, the number of species used in a trial or the source of DNA) were able to explain the variance. Our current understanding of the factors affecting the quantitative performance of metabarcoding is still limited: additional research is required before metabarcoding can be confidently utilized for quantitative applications. Until then, we advocate the inclusion of mock communities when metabarcoding as this facilitates direct assessment of the quantitative ability of any given study.

14.
15.
16.
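Characterization of soil bacterial communities in typical wetlands of Poyang Lake based on high‐throughput sequencing   Total citations: 15; self‐citations: 0; citations by others: 15
Wang Peng, Chen Bo, Zhang Hua. Acta Ecologica Sinica (《生态学报》), 2017, 37(5): 1650-1658
High‐throughput sequencing was used to characterize the soil bacterial communities of typical wetlands of Poyang Lake. The sequencing results showed that bacterial community richness and diversity in soils under different vegetation followed the same order: Carex zone, Carex‐Phalaris zone, Phragmites zone, mudflat zone, Artemisia zone. Along the gradient from the lake surface to the upland slope, soils at neighbouring positions had more similar bacterial community structures: the communities of the Carex‐Phalaris, Carex and Phragmites zones were similar to one another, whereas those of the mudflat and Artemisia zones differed markedly. Proteobacteria (30.0%) was the phylum with the highest mean relative abundance in the wetland soils, followed by Acidobacteria (16.7%) and Chloroflexi (16.5%); the relative abundance of most phyla showed a trend along the gradient from the lake surface to the slope. Nitrospira was the most abundant genus. Among the soil chemical properties, total phosphorus, ammonium nitrogen and organic matter content were most strongly correlated with the bacterial communities of the Poyang Lake wetlands. These results indicate that soil bacterial communities under different vegetation types in the Poyang Lake wetlands differ in structure but change in a regular pattern from the lake surface to the slope.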

17.
Establishing the sex of individuals in wild systems can be challenging and often requires genetic testing. Genotyping‐by‐sequencing (GBS) and other reduced‐representation DNA sequencing (RRS) protocols (e.g., RADseq, ddRAD) have enabled the analysis of genetic data on an unprecedented scale. Here, we present a novel approach for the discovery and statistical validation of sex‐specific loci in GBS data sets. We used GBS to genotype 166 New Zealand fur seals (NZFS, Arctocephalus forsteri) of known sex. We retained monomorphic loci as potential sex‐specific markers in the locus discovery phase. We then used (i) a sex‐specific locus threshold (SSLT) to identify significantly male‐specific loci within our data set; and (ii) a significant sex‐assignment threshold (SSAT) to confidently assign sex in silico, based on the presence or absence of significantly male‐specific loci, to individuals in our data set treated as unknowns (98.9% accuracy for females; 95.8% for males, estimated via cross‐validation). Furthermore, we assigned sex to 86 individuals of true unknown sex using our SSAT and assessed the effect of SSLT adjustments on these assignments. From 90 verified sex‐specific loci, we developed a panel of three sex‐specific PCR primers that we used to ascertain sex independently of our GBS data, which we show amplify reliably in at least two other pinniped species. Using monomorphic loci normally discarded from large SNP data sets is an effective way to identify robust sex‐linked markers for nonmodel species. Our novel pipeline can be used to identify and statistically validate monomorphic and polymorphic sex‐specific markers across a range of species and RRS data sets.
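The presence/absence logic behind this approach can be illustrated with a small sketch. It is not the authors' pipeline: the two cut‐offs below are simple stand‐ins for the statistically derived SSLT and SSAT, and every name, locus label and threshold value is hypothetical.

```python
def male_specific_loci(presence, sex, min_male_fraction=0.8):
    """Return loci seen in no known female and in most known males.

    presence: dict mapping individual -> set of loci with sequence reads
    sex:      dict mapping individual -> "M" or "F" (known sex)
    """
    males = [i for i, s in sex.items() if s == "M"]
    females = [i for i, s in sex.items() if s == "F"]
    all_loci = set().union(*presence.values())
    selected = set()
    for locus in all_loci:
        in_males = sum(locus in presence[i] for i in males)
        in_females = sum(locus in presence[i] for i in females)
        if in_females == 0 and in_males / len(males) >= min_male_fraction:
            selected.add(locus)
    return selected


def assign_sex(loci_present, marker_loci, min_markers=2):
    """Call an individual male if enough male-specific loci are present."""
    hits = len(loci_present & marker_loci)
    return "M" if hits >= min_markers else "F"


# Toy training set of known-sex individuals.
presence = {
    "m1": {"L1", "L2", "L3", "L4"},
    "m2": {"L1", "L2", "L3"},
    "f1": {"L1", "L4"},
    "f2": {"L1"},
}
sex = {"m1": "M", "m2": "M", "f1": "F", "f2": "F"}

markers = male_specific_loci(presence, sex)
print(sorted(markers), assign_sex({"L1", "L2", "L3"}, markers))
```

In real data, absence of a locus can also reflect low coverage, which is one reason a statistical threshold rather than a fixed count would be needed.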

18.
Effective clone selection is a crucial step toward developing a robust mammalian cell culture production platform. Currently, clone selection is done by culturing cells in well plates and picking the highest producers. Ideally, clone selection should be done in a stirred tank bioreactor as this would best replicate the eventual production environment. The actual number of clones selected for future evaluation in bioreactors at bench‐scale is limited by the scale‐up and operational costs involved. This study describes the application of miniaturized stirred high‐throughput bioreactors (35 mL working volume; HTBRs) with noninvasive optical sensors for clone screening and selection. We investigated a method for testing several subclones simultaneously in a stirred environment using our high‐throughput bioreactors (up to 12 clones per HTBR run) and compared it with a traditional well plate selection approach. Importantly, it was found that selecting clones solely on the basis of results from stationary well plate cultures risks missing higher‐producing clones. Our approach suggests that choosing a clone after analyzing its performance in a stirred bioreactor environment is an improved method for clone selection. © 2010 American Institute of Chemical Engineers Biotechnol. Prog., 2010

19.
The present study aimed to estimate the clinical performance of non‐invasive prenatal testing (NIPT) based on high‐throughput sequencing for the detection of foetal chromosomal deletions and duplications. A total of 6348 pregnant women receiving NIPT based on high‐throughput sequencing were included in our study. All conceived naturally, and none had twins, triplets or other multiple pregnancies. Individuals showing abnormalities in NIPT received invasive ultrasound‐guided amniocentesis for chromosomal karyotype and microarray analysis at 18‐24 weeks of pregnancy. Detection results of foetal chromosomal deletions and duplications were compared between the high‐throughput sequencing method and chromosomal karyotype and microarray analysis. Thirty‐eight individuals were identified as carrying 51 chromosomal deletions/duplications via high‐throughput sequencing. In subsequent chromosomal karyotype and microarray analysis, 34 subchromosomal deletions/duplications were identified in 26 pregnant women. The observed deletions and duplications ranged from 1.05 to 17.98 Mb. Detection accuracy for these deletions and duplications was 66.7%. Twenty‐one deletions and duplications were found to be correlated with known abnormalities. NIPT based on high‐throughput sequencing is able to identify foetal chromosomal deletions and duplications, but its sensitivity and specificity were not explored. Further progress should be made to reduce false‐positive results.
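The abstract does not say how the 66.7% figure was derived, but it is consistent with the 34 confirmed deletions/duplications divided by the 51 called by sequencing. The sketch below recomputes that proportion, and the corresponding per‐woman proportion, from the counts given in the text; treating the figure as a confirmed fraction of calls is an assumption, not something the abstract states.

```python
# Counts reported in the abstract.
calls_by_nipt = 51       # deletions/duplications called by sequencing
calls_confirmed = 34     # confirmed by karyotype and microarray analysis
women_flagged = 38       # women with at least one call
women_confirmed = 26     # women with at least one confirmed call

# Assumed interpretation: proportion of NIPT calls that were confirmed.
confirmed_fraction = calls_confirmed / calls_by_nipt
women_fraction = women_confirmed / women_flagged

print(f"per call: {confirmed_fraction:.1%}, per woman: {women_fraction:.1%}")
```

Because only screen‐positive pregnancies were followed up invasively, these counts cannot yield sensitivity or specificity, which matches the limitation the abstract itself notes.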

20.
Wild crop relatives represent a source of novel alleles for crop genetic improvement. Screening biodiversity for useful or diverse gene homologues has often been based upon the amplification of targeted genes using available sequence information to design primers that amplify the target gene region across species. The crucial requirement of this approach is the presence of sequences with sufficient conservation across species to allow for the design of universal primers. This approach is often not successful with diverse organisms or highly variable genes. Massively parallel sequencing (MPS) can quickly produce large amounts of sequence data and provides a viable option for characterizing homologues of known genes in poorly described genomes. MPS of genomic DNA was used to obtain species‐specific sequence information for 18 rice genes related to domestication characteristics in a wild relative of rice, Microlaena stipoides. Species‐specific primers were available for 16 genes compared with 12 genes using the universal primer method. The use of species‐specific primers had the potential to cover 92% of the sequence of these genes, while traditional universal primers could only be designed to cover 80%. A total of 24 species‐specific primer pairs were used to amplify gene homologues, and 11 primer pairs were successful in capturing six gene homologues. The 23 million 36‐base pair (bp) paired‐end reads, equating to an average of 2X genome coverage, facilitated the successful amplification and sequencing of six target gene homologues, illustrating an important approach to the discovery of useful genes in wild crop relatives.
