首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
A goal of many environmental DNA barcoding studies is to infer quantitative information about relative abundances of different taxa based on sequence read proportions generated by high‐throughput sequencing. However, potential biases associated with this approach are only beginning to be examined. We sequenced DNA amplified from faeces (scats) of captive harbour seals (Phoca vitulina) to investigate whether sequence counts could be used to quantify the seals’ diet. Seals were fed fish in fixed proportions, a chordate‐specific mitochondrial 16S marker was amplified from scat DNA and amplicons sequenced using an Ion Torrent PGM?. For a given set of bioinformatic parameters, there was generally low variability between scat samples in proportions of prey species sequences recovered. However, proportions varied substantially depending on sequencing direction, level of quality filtering (due to differences in sequence quality between species) and minimum read length considered. Short primer tags used to identify individual samples also influenced species proportions. In addition, there were complex interactions between factors; for example, the effect of quality filtering was influenced by the primer tag and sequencing direction. Resequencing of a subset of samples revealed some, but not all, biases were consistent between runs. Less stringent data filtering (based on quality scores or read length) generally produced more consistent proportional data, but overall proportions of sequences were very different than dietary mass proportions, indicating additional technical or biological biases are present. Our findings highlight that quantitative interpretations of sequence proportions generated via high‐throughput sequencing will require careful experimental design and thoughtful data analysis.  相似文献   

2.
Different regions of the bacterial 16S rRNA gene evolve at different evolutionary rates. The scientific outcome of short read sequencing studies therefore alters with the gene region sequenced. We wanted to gain insight in the impact of primer choice on the outcome of short read sequencing efforts. All the unknowns associated with sequencing data, i.e. primer coverage rate, phylogeny, OTU-richness and taxonomic assignment, were therefore implemented in one study for ten well established universal primers (338f/r, 518f/r, 799f/r, 926f/r and 1062f/r) targeting dispersed regions of the bacterial 16S rRNA gene. All analyses were performed on nearly full length and in silico generated short read sequence libraries containing 1175 sequences that were carefully chosen as to present a representative substitute of the SILVA SSU database. The 518f and 799r primers, targeting the V4 region of the 16S rRNA gene, were found to be particularly suited for short read sequencing studies, while the primer 1062r, targeting V6, seemed to be least reliable. Our results will assist scientists in considering whether the best option for their study is to select the most informative primer, or the primer that excludes interferences by host-organelle DNA. The methodology followed can be extrapolated to other primers, allowing their evaluation prior to the experiment.  相似文献   

3.
Microbial community profiling using 16S rRNA gene sequences requires accurate taxonomy assignments. ‘Universal'' primers target conserved sequences and amplify sequences from many taxa, but they provide variable coverage of different environments, and regions of the rRNA gene differ in taxonomic informativeness—especially when high-throughput short-read sequencing technologies (for example, 454 and Illumina) are used. We introduce a new evaluation procedure that provides an improved measure of expected taxonomic precision when classifying environmental sequence reads from a given primer. Applying this measure to thousands of combinations of primers and read lengths, simulating single-ended and paired-end sequencing, reveals that these choices greatly affect taxonomic informativeness. The most informative sequence region may differ by environment, partly due to variable coverage of different environments in reference databases. Using our Rtax method of classifying paired-end reads, we found that paired-end sequencing provides substantial benefit in some environments including human gut, but not in others. Optimal primer choice for short reads totaling 96 nt provides 82–100% of the confident genus classifications available from longer reads.  相似文献   

4.
5.
The centromeric alpha-satellite DNA subfamilies from chromosomes 13 and 21 are almost identical in sequence. So far it has proven difficult to discriminate between sequence variations in the chromosome 13 and 21 alpha-satellite regions using in situ techniques. To analyze whether the method of modified single-color and double-color PRINS could be used to detect single nucleotide polymorphisms within this region, we used previously published primers D13Z and D21Z that differ in the terminal 3'-nucleotide and an additionally constructed primer "D13/21-test" lacking the final nucleotide at the 3' end. The results show that a one-base pair mismatch at the 3' end is sufficient to be detected by PRINS. Surprisingly, only about 35% of our samples exhibited the expected combination of two chromosomes 13 specifically labeled with only primer D13Z and two chromosomes 21 specifically labeled with only primer D21Z. The rest of the samples showed a polymorphic distribution of the target sequence for the primers, therefore these primers are not suited for routine detection of chromosomes 13 and 21 during interphase. Our data indicate that an interchromosomal exchange of alpha-satellite DNA takes place between chromosomes 13 and 21, possibly due to a concerted evolution process.  相似文献   

6.
We have designed a computer program which rapidly scans nucleic acid sequences to select all possible pairs of oligonucleotides suitable for use as primers to direct efficient DNA amplification by the polymerase chain reaction. This program is based on a set of rules which define in generic terms both the sequence composition of the primers and the amplified region of DNA. These rules (1) enhance primer-to-target sequence hybridization avidity at critical 3'-end extension initiation sites, (2) facilitate attainment of full length extension during the 72 degrees C phase, by minimizing generation of incomplete or nonspecific product and (3) limit primer losses occurring from primer-self or primer-primer homologies. Three examples of primer sets chosen by the program that correctly amplified the target regions starting from RNA are shown. This program should facilitate the rapid selection of effective and specific primers from long gene sequences while providing a flexible choice of various primers to focus study on particular regions of interest.  相似文献   

7.
比较了一些影响荧光终止法PCR循环测序反应的因素。实验结果显示在Beckman CEQ2000自动测序仪上,可读序列长度随着pUC18模板量增加而逐渐增多,当模板量达到125ng时DNA可读序列最长,以后随着pUC18量增加测序长度逐渐下降。当引物量是1μl时,其测序结果比用0.5μl引物时好。在同样模板量情况下,10μl反应体积比5μl反应体积可读序列长。  相似文献   

8.
PRIMO is a computer program that designs walking primers for large-scale DNA sequencing projects. Oligonucleotide primers are predicted automatically, using quality information associated with each base call, eliminating the need for manually viewing the sequence traces or inspecting contig assemblies to determine appropriate locations for primer design. This allows PRIMO to run in batch mode on an arbitrarily large number of templates. For shotgun sequencing, PRIMO reads assembled sequence contigs with corresponding base quality statistics and automatically designs walking primers as needed to extend and join contigs, or improve their overall quality. In the opposite extreme of single-pass or completely directed sequencing, PRIMO reads the unassembled sequence for each template and designs walking primers for extending each read. If the base-calling software does not provide base quality statistics, PRIMO assigns its own measure of base quality determined by the shapes of individual peaks in the trace data for each template. In this way, PRIMO can be used in the finishing stages of a shotgun sequencing project, in sequencing by directed primer walking, or in some intermediate strategy. The code is written in ANSI C and maintained in two versions: one for the Macintosh and the other for UNIX.  相似文献   

9.
Wild crop relatives represent a source of novel alleles for crop genetic improvement. Screening biodiversity for useful or diverse gene homologues has often been based upon the amplification of targeted genes using available sequence information to design primers that amplify the target gene region across species. The crucial requirement of this approach is the presence of sequences with sufficient conservation across species to allow for the design of universal primers. This approach is often not successful with diverse organisms or highly variable genes. Massively parallel sequencing (MPS) can quickly produce large amounts of sequence data and provides a viable option for characterizing homologues of known genes in poorly described genomes. MPS of genomic DNA was used to obtain species‐specific sequence information for 18 rice genes related to domestication characteristics in a wild relative of rice, Microlaena stipoides. Species‐specific primers were available for 16 genes compared with 12 genes using the universal primer method. The use of species‐specific primers had the potential to cover 92% of the sequence of these genes, while traditional universal primers could only be designed to cover 80%. A total of 24 species‐specific primer pairs were used to amplify gene homologues, and 11 primer pairs were successful in capturing six gene homologues. The 23 million, 36‐base pair (bp) paired end reads, equated to an average of 2X genome coverage, facilitated the successful amplification and sequencing of six target gene homologues, illustrating an important approach to the discovery of useful genes in wild crop relatives.  相似文献   

10.
Using a synthetic oligonucleotide primer complementary to human prostate-specific antigen mRNA, we found that an additional sequence possibly similar to human glandular kallikrein-1 could be read by a primer-extension sequencing technique. We were able to confirm the identity of that additional sequence with another oligonucleotide primer complementary to a specific region of the human glandular kallikrein-1 mRNA sequence. Northern blot analysis with 2 oligonucleotide probes respectively specific for prostate-specific antigen and human glandular kallikrein-1 mRNAs showed that the length of both mRNAs was similar at 1.5 kb. The level of human glandular kallikrein-1 mRNA relative to that of prostate-specific antigen could be estimated as approx. 10-20%. This study constitutes the first evidence that the human glandular kallikrein-1 gene is expressed at a high level in a human tissue.  相似文献   

11.
Li J  He S  Zhang L  Hu Y  Yang F  Ma L  Huang J  Li L 《Protoplasma》2012,249(1):207-215
Some reports have shown that nucleolar organizer regions are located at the telomeric region and have a structural connection with telomeres at the cellular level in many organisms. In this study, we found that all 45S ribosomal DNA (rDNA) signals were located at telomeric regions on the chromosomes in Chrysanthemum segetum L., and the 45S rDNA showed distinct signal patterns on different metaphase chromosome spreads. The bicolor fluorescence in situ hybridization experiment on the extended fibers revealed that telomere repeats were structurally connected with or interspersed into rDNA sequences. The close cytological structure relation between rDNA and telomere sequences led us to use PCR with combinations of the telomere primer and the rDNA primer to obtain some fragments, which were flanked by different rDNA and telomere primer sequences. One representative clone CHS2 contains closely connected rDNA and telomere sequences, suggesting that the telomere sequence invaded into the conserved rDNA sequence. In addition, the sequences of some PCR clones were flanked by the single telomeric primer sequence or the rDNA primer sequence. These results suggested that homologous recombination occurred between tandem repeat units of rDNA sequences or telomere repeats at the chromosome terminus.  相似文献   

12.
鸻形目(Charadriiformes),全世界约有384个物种,分属于19科94属,种类繁多、分布广泛,是研究迁徙和觅食行为的良好材料。近年来,线粒体基因组的研究快速发展,由于样品难以收集,缺乏系统的测序策略,鸻形目鸟类线粒体基因组的研究相对滞后。引物设计对聚合酶链式反应(PCR)至关重要,一个成功的PCR实验依赖于高质量的特异性引物,本研究拟设计一套用于鸻形目鸟类线粒体基因组扩增的通用引物。对GenBank中现有的鸻形目物种线粒体基因组进行多重比对,发现若干个保守性区域,本研究在该区域设计13对扩增鸻形目鸟类线粒体基因组的通用引物,扩增的目的片段长度均在1.5 kb左右。我们选取4个物种,即灰头麦鸡(Vanellus cinereus)、丘鹬(Scolopax rusticola)、白腰草鹬(Tringa ochropus)和针尾沙锥(Gallinago stenura),进行PCR扩增验证,设计的引物在4个物种中均能顺利扩增、测序,该引物在鸻形目鸟类线粒体基因组扩增中的具有普遍适用性。本研究设计的13对通用引物在扩增鸻形目物种线粒体全基因组中具有较强的应用价值,将为鸻形目的系统发育关系、种群遗传学和生物地理学研究提供珍贵的资源。  相似文献   

13.
14.
Porter TM  Golding GB 《PloS one》2012,7(4):e35749
Nuclear large subunit ribosomal DNA is widely used in fungal phylogenetics and to an increasing extent also amplicon-based environmental sequencing. The relatively short reads produced by next-generation sequencing, however, makes primer choice and sequence error important variables for obtaining accurate taxonomic classifications. In this simulation study we tested the performance of three classification methods: 1) a similarity-based method (BLAST + Metagenomic Analyzer, MEGAN); 2) a composition-based method (Ribosomal Database Project na?ve bayesian classifier, NBC); and, 3) a phylogeny-based method (Statistical Assignment Package, SAP). We also tested the effects of sequence length, primer choice, and sequence error on classification accuracy and perceived community composition. Using a leave-one-out cross validation approach, results for classifications to the genus rank were as follows: BLAST + MEGAN had the lowest error rate and was particularly robust to sequence error; SAP accuracy was highest when long LSU query sequences were classified; and, NBC runs significantly faster than the other tested methods. All methods performed poorly with the shortest 50-100 bp sequences. Increasing simulated sequence error reduced classification accuracy. Community shifts were detected due to sequence error and primer selection even though there was no change in the underlying community composition. Short read datasets from individual primers, as well as pooled datasets, appear to only approximate the true community composition. We hope this work informs investigators of some of the factors that affect the quality and interpretation of their environmental gene surveys.  相似文献   

15.
Glutathione synthetase (gshB) has previously been reported to confer tolerance to acidic soil condition in Rhizobium species. Cloning the gene coding for this enzyme necessitates the designing of proper primer sets which in turn depends on the identification of high quality sequence similarity in multiple global alignments. In this experiment, a group of homologous gene sequences related to gshB gene (accession no: gi-86355669:327589-328536) of Rhizobium etli CFN 42, were extracted from NCBI nucleotide sequence databases using BLASTN and were analyzed for designing degenerate primers. However, the T-coffee multiple global alignment results did not show any block of conserved region for the above sequence set to design the primers. Therefore, we attempted to identify the location of common motif region based on multiple local alignments employing the MEME algorithm supported with MAST and Primer3. The results revealed some common motif regions that enabled us to design the primer sets for related gshB gene sequences. The result will be validated in wet lab.  相似文献   

16.
This study surveyed strategies of sequencing primer selection and evaluated primer performance in automated DNA sequencing. We asked participants to relate their preferred primer design strategies to identify primer characteristics that are considered most important in sequencing primer design. The participants preferred primers of 18-24 nucleotides (nt), 39%-58% G + C, a melting temperature (Tm) of 53 degrees-65 degrees C with a 1-2 nt 3' GC clamp, hairpin stems of less than 2-3 bp, homopolymeric runs of less than 4-5 nt, primer dimers of less than 3-4 bp and secondary priming sites of less than 3-4 bp. We provided a 300-bp test sequence and asked participants to submit sequences of 1-3 optimal sequencing primers. Submitted primers ranged from 17-24 nt and largely conformed to the preferred parameters. Submitted primers were distributed across the test sequence, although some sites were disfavored. Surprisingly, approximately 45% of the primers were selected "manually", more than by any software package. Each of 69 submitted and 95 control primers, distributed at 3-bp intervals across the test sequence, were synthesized, purified and tested using a Model 377 PRISM DNA Sequencer with dichlororhodamine dye terminator reagents (dRhodamine dye terminators). Approximately half of the control primers were also tested using rhodamine dye terminator reagents ("old" rhodamine dye terminators). The results indicated that primer physico-chemical characteristics thought to have a strong impact on sequencing performance had surprisingly little effect. Thus, primers with high or low percent G + C or Tm, strong secondary priming scores or long 3' homopolymeric stretches yielded excellent sequences with the dRhodamine dye terminator reagents, although these characteristics had a stronger effect when the old rhodamine reagents were used. The old rhodamine reagents gave sequences with a similar average read length, but the number of errors and ambiguities or "N's" was consistently higher. Moreover, the effects of the primer physico-chemical characteristics were also more evident with the old rhodamine dyes. We conclude that under optimal sequencing conditions with highly pure template and primer, many of the commonly applied primer design parameters are dispensable, particularly when using one of the new generation of sequencing reagents such as the dichlororhodamine dye terminators.  相似文献   

17.
Tamarillo (Solanum betaceum Bosh) is a plant from the highland tropics of great importance in the fruit agroindustry of Colombia. In recent years, there has been a dramatic decrease in the local tamarillo fruit production as a result of an emerging disease caused by the potyvirus Tamarillo leaf malformation virus (TaLMV). Symptoms of TaLMV infection include rugose mosaics, severe leaf deformation, defoliation and reduction in plant longevity. In this work, we present a second genome sequence of TaLMV obtained from a tamarillo-growing region in northern Antioquia (Colombia). This genome was compared with a previously sequenced isolate from La Union in eastern Antioquia; this information was used to design specific primers targeting the CP and CI regions of the TaLMV. These primer sets will be useful in future epidemiological studies, seed certification schemes and genetic improvement programmes using real-time RT-PCR (RT-qPCR) and/or RT-PCR.  相似文献   

18.
The development and screening of microsatellite markers have been accelerated by next‐generation sequencing (NGS) technology and in particular GS‐FLX pyro‐sequencing (454). More recent platforms such as the PGM semiconductor sequencer (Ion Torrent) offer potential benefits such as dramatic reductions in cost, but to date have not been well utilized. Here, we critically compare the advantages and disadvantages of microsatellite development using PGM semiconductor sequencing and GS‐FLX pyro‐sequencing for two gymnosperm (a conifer and a cycad) and one angiosperm species. We show that these NGS platforms differ in the quantity of returned sequence data, unique microsatellite data and primer design opportunities, mostly consistent with the differences in read length. The strength of the PGM lies in the large amount of data generated at a comparatively lower cost and time. The strength of GS‐FLX lies in the return of longer average length sequences and therefore greater flexibility in producing markers with variable product length, due to longer flanking regions, which is ideal for capillary multiplexing. These differences need to be considered when choosing a NGS method for microsatellite discovery. However, the ongoing improvement in read lengths of the NGS platforms will reduce the disadvantage of the current short read lengths, particularly for the PGM platform, allowing greater flexibility in primer design coupled with the power of a larger number of sequences.  相似文献   

19.
Summary DNA amplification fingerprinting (DAF) is the enzymatic amplification of arbitrary stretches of DNA which is directed by very short oligonucleotide primers of arbitrary sequence to generate complex but characteristic DNA fingerprints. To determine the contribution of primer sequence and length to the fingerprint pattern and the effect of primer-template mismatches, DNA was amplified from several sources using sequence-related primers. Primers of varying length, constructed by removing nucleotides from the 5 terminus, produced unique patterns only when primers were 8 nucleotides or fewer in length. Larger primers produced either identical or related fingerprints, depending on the sequence. Single base changes within this first 8-nucleotide region of the primer significantly altered the spectrum of amplification products, especially at the 3 terminus. Increasing annealing temperatures from 15° to 70° C during amplification did not shift the boundary of the 8-nucleotide region, but reduced the amplification ability of shorter primers. Our observations define a 3-terminal oligonucleotide domain that is at least 8 bases in length and largely conditions amplification, but that is modulated by sequences beyond it. Our results indicate that only a fraction of template annealing sites are efficiently amplified during DAF. A model is proposed in which a single primer preferentially amplifies certain products due to competition for annealing sites between primer and terminal hairpin loop structures of the template.  相似文献   

20.
Highly conserved regions are attractive targets for detection and quantitation by PCR, but designing species-specific primer sets can be difficult. Ultimately, almost all primer sets are designed based upon literature searches in public domain databases, such as the National Center for Biotechnology Information (NCBI). Prudence suggests that the researcher needs to evaluate as many sequences as available for designing species-specific PCR primers. In this report, we aligned 11, 9, and 16 DNA sequences entered for Stachybotrys spp. rRNA, tri5, and β-tubulin regions, respectively. Although we were able to align and determine consensus primer sets for the 9 tri5 and the 16 β-tubulin sequences, there was no consensus sequence that could be derived from alignment of the 11 rRNA sequences. However, by judicious clustering of the sequences that aligned well, we were able to design three sets of primers for the rRNA region of S. chartarum. The two primer sets for tri5 and β-tubulin produced satisfactory PCR results for all four strains of S. chartarum used in this study whereas only one rRNA primer set of three produced similar satisfactory results. Ultimately, we were able to show that rRNA copy number is approximately 2-log greater than for tri5 and β-tubulin in the four strains of S. chartarum tested.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号