首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 0 毫秒
1.
Many genetic epidemiology resources have collected dried blood spots (predominantly as Guthrie Cards) as an economical and efficient means of archiving sources of DNA, conferring great value to genetic screening methods that are compatible with this medium. We applied Hi-Plex to screen the breast cancer predisposition gene PALB2 in 93 Guthrie Card-derived DNA specimens previously characterized for PALB2 genetic variants via DNA derived from lymphoblastoid cell lines, whole blood, and buffy coat. Of the 93 archival Guthrie Card-derived DNAs, 92 (99%) were processed successfully and sequenced using approximately half of a MiSeq run. From these 92 DNAs, all 59 known variants were detected and no false-positive variant calls were yielded. Fully 98.13% of amplicons (5417/5520) were represented within 15-fold of the median coverage (2786 reads), and 99.98% of amplicons (5519/5520) were represented at a depth of 10 read-pairs or greater. With Hi-Plex, we show for the first time that a High-Plex amplicon-based massively parallel sequencing (MPS) system can be applied effectively to DNA prepared from dried blood spot archival specimens and, as such, can dramatically increase the scopes of both method and resource.  相似文献   

2.
Conservation and management of natural populations requires accurate and inexpensive genotyping methods. Traditional microsatellite, or simple sequence repeat (SSR), marker analysis remains a popular genotyping method because of the comparatively low cost of marker development, ease of analysis and high power of genotype discrimination. With the availability of massively parallel sequencing (MPS), it is now possible to sequence microsatellite-enriched genomic libraries in multiplex pools. To test this approach, we prepared seven microsatellite-enriched, barcoded genomic libraries from diverse taxa (two conifer trees, five birds) and sequenced these on one lane of the Illumina Genome Analyzer using paired-end 80-bp reads. In this experiment, we screened 6.1 million sequences and identified 356,958 unique microreads that contained di- or trinucleotide microsatellites. Examination of four species shows that our conversion rate from raw sequences to polymorphic markers compares favourably to Sanger- and 454-based methods. The advantage of multiplexed MPS is that the staggering capacity of modern microread sequencing is spread across many libraries; this reduces sample preparation and sequencing costs to less than $400 (USD) per species. This price is sufficiently low that microsatellite libraries could be prepared and sequenced for all 1373 organisms listed as 'threatened' and 'endangered' in the United States for under $0.5 M (USD).  相似文献   

3.
Although pioneering sequencing projects have shed light on the boxer and poodle genomes, a number of challenges need to be met before the sequencing and annotation of the dog genome can be considered complete. Here, we present the DNA sequence of the Jindo dog genome, sequenced to 45-fold average coverage using Illumina massively parallel sequencing technology. A comparison of the sequence to the reference boxer genome led to the identification of 4 675 437 single nucleotide polymorphisms (SNPs, including 3 346 058 novel SNPs), 71 642 indels and 8131 structural variations. Of these, 339 non-synonymous SNPs and 3 indels are located within coding sequences (CDS). In particular, 3 non-synonymous SNPs and a 26-bp deletion occur in the TCOF1 locus, implying that the difference observed in cranial facial morphology between Jindo and boxer dogs might be influenced by those variations. Through the annotation of the Jindo olfactory receptor gene family, we found 2 unique olfactory receptor genes and 236 olfactory receptor genes harbouring non-synonymous homozygous SNPs that are likely to affect smelling capability. In addition, we determined the DNA sequence of the Jindo dog mitochondrial genome and identified Jindo dog-specific mtDNA genotypes. This Jindo genome data upgrade our understanding of dog genomic architecture and will be a very valuable resource for investigating not only dog genetics and genomics but also human and dog disease genetics and comparative genomics.  相似文献   

4.

Background

Assembling genes from next-generation sequencing data is not only time consuming but computationally difficult, particularly for taxa without a closely related reference genome. Assembling even a draft genome using de novo approaches can take days, even on a powerful computer, and these assemblies typically require data from a variety of genomic libraries. Here we describe software that will alleviate these issues by rapidly assembling genes from distantly related taxa using a single library of paired-end reads: aTRAM, automated Target Restricted Assembly Method. The aTRAM pipeline uses a reference sequence, BLAST, and an iterative approach to target and locally assemble the genes of interest.

Results

Our results demonstrate that aTRAM rapidly assembles genes across distantly related taxa. In comparative tests with a closely related taxon, aTRAM assembled the same sequence as reference-based and de novo approaches taking on average < 1 min per gene. As a test case with divergent sequences, we assembled >1,000 genes from six taxa ranging from 25 – 110 million years divergent from the reference taxon. The gene recovery was between 97 – 99% from each taxon.

Conclusions

aTRAM can quickly assemble genes across distantly-related taxa, obviating the need for draft genome assembly of all taxa of interest. Because aTRAM uses a targeted approach, loci can be assembled in minutes depending on the size of the target. Our results suggest that this software will be useful in rapidly assembling genes for phylogenomic projects covering a wide taxonomic range, as well as other applications. The software is freely available http://www.github.com/juliema/aTRAM.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-015-0515-2) contains supplementary material, which is available to authorized users.  相似文献   

5.
Bisulfite sequencing is widely used for analysis of DNA methylation status (i.e., 5-methylcytosine [5mC] vs. cytosine [C]) in CpG-rich or other loci in genomic DNA (gDNA). Such methods typically involve reaction of gDNA with bisulfite followed by polymerase chain reaction (PCR) amplification of specific regions of interest that, overall, converts C→T (thymine) and 5mC→C and then capillary sequencing to measure C versus T composition at CpG sites. Massively parallel sequencing by oligonucleotide ligation and detection (SOLiD) has recently enabled relatively low-cost whole genome sequencing, and it would be highly desirable to apply such massively parallel sequencing to bisulfite-converted whole genomes to determine DNA methylation status of an entire genome, which has heretofore not been reported. As an initial step toward achieving this goal, we have extended our ongoing interest in improving bisulfite conversion sample preparation to include a human genome-wide fragment library for SOliD. The current article features novel use of formamide denaturant during bisulfite conversion of a suitably constructed library directly in a band slice from polyacryamide gel electrophoresis (PAGE). To validate this new protocol for 5mC-protected fragment library conversion, which we refer to as Bis-PAGE, capillary-based size analysis and Sanger sequencing were carried out for individual amplicons derived from single-molecule PCR (smPCR) of randomly selected library fragments. smPCR/Capillary Sanger sequencing of approximately 200 amplicons unambiguously demonstrated greater than 99% C→T conversion. All of these approximately 200 Sanger sequences were analyzed with a previously published web-accessible bioinformatics tool (methBLAST) for mapping to human chromosomes, the results of which indicated random distribution of analyzed fragments across all chromosomes. Although these particular Bis-PAGE conversion and quality control methods were exemplified in the context of a fragment library for SOLiD, the concepts can be generalized to include other genome-wide library constructions intended for DNA methylation analysis by alternative high-throughput or massively parallelized methods that are currently available.  相似文献   

6.

Background

Generation of long (>5 Kb) DNA sequencing reads provides an approach for interrogation of complex regions in the human genome. Currently, large-insert whole genome sequencing (WGS) technologies from Pacific Biosciences (PacBio) enable analysis of chromosomal structural variations (SVs), but the cost to achieve the required sequence coverage across the entire human genome is high.

Results

We developed a method (termed PacBio-LITS) that combines oligonucleotide-based DNA target-capture enrichment technologies with PacBio large-insert library preparation to facilitate SV studies at specific chromosomal regions. PacBio-LITS provides deep sequence coverage at the specified sites at substantially reduced cost compared with PacBio WGS. The efficacy of PacBio-LITS is illustrated by delineating the breakpoint junctions of low copy repeat (LCR)-associated complex structural rearrangements on chr17p11.2 in patients diagnosed with Potocki–Lupski syndrome (PTLS; MIM#610883). We successfully identified previously determined breakpoint junctions in three PTLS cases, and also were able to discover novel junctions in repetitive sequences, including LCR-mediated breakpoints. The new information has enabled us to propose mechanisms for formation of these structural variants.

Conclusions

The new method leverages the cost efficiency of targeted capture-sequencing as well as the mappability and scaffolding capabilities of long sequencing reads generated by the PacBio platform. It is therefore suitable for studying complex SVs, especially those involving LCRs, inversions, and the generation of chimeric Alu elements at the breakpoints. Other genomic research applications, such as haplotype phasing and small insertion and deletion validation could also benefit from this technology.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1370-2) contains supplementary material, which is available to authorized users.  相似文献   

7.
Bacterial assemblages from subsurface (100 m depth), meso- (200-1000 m depth) and bathy-pelagic (below 1000 m depth) zones at 10 stations along a North Atlantic Ocean transect from 60°N to 5°S were characterized using massively parallel pyrotag sequencing of the V6 region of the 16S rRNA gene (V6 pyrotags). In a dataset of more than 830,000 pyrotags, we identified 10,780 OTUs of which 52% were singletons. The singletons accounted for less than 2% of the OTU abundance, whereas the 100 and 1000 most abundant OTUs represented 80% and 96% respectively of all recovered OTUs. Non-metric Multi-Dimensional Scaling and Canonical Correspondence Analysis of all the OTUs excluding the singletons revealed a clear clustering of the bacterial communities according to the water masses. More than 80% of the 1000 most abundant OTUs corresponded to Proteobacteria of which 55% were Alphaproteobacteria, mostly composed of the SAR11 cluster. Gammaproteobacteria increased with depth and included a relatively large number of OTUs belonging to Alteromonadales and Oceanospirillales. The bathypelagic zone showed higher taxonomic evenness than the overlying waters, albeit bacterial diversity was remarkably variable. Both abundant and low-abundance OTUs were responsible for the distinct bacterial communities characterizing the major deep-water masses. Taken together, our results reveal that deep-water masses act as bio-oceanographic islands for bacterioplankton leading to water mass-specific bacterial communities in the deep waters of the Atlantic.  相似文献   

8.
Chan ER  Hester J  Kalady M  Xiao H  Li X  Serre D 《Genomics》2011,98(4):253-259
Deep sequencing of the 16S rRNA gene provides a comprehensive view of bacterial communities in a particular environment and has expanded our ability to study the impact of the microflora on human health and disease. Current analysis methods rely on comparisons of the sequences generated with an expanding but limited set of annotated 16S rRNA sequences or phylogenic clustering of sequences based on arbitrary similarity cutoffs. We describe a novel approach to characterize bacterial composition using deep sequencing of 16S rRNA gene. Our method defines operational taxonomic units based on phylogenetic tree reconstruction and dynamic clustering of sequences using solely sequencing data. These OTUs can be used to identify differences in bacteria abundance between environments. This approach can perform better than previous phylogenetic methods and will significantly improve our understanding of the microfloral role on human diseases by providing a comprehensive analysis of the microbial composition from various bacterial communities.  相似文献   

9.
10.
Massively parallel signature sequencing (MPSS) is one of the newest tools available for conducting in-depth expression profiling. MPSS is an open-ended platform that analyses the level of expression of virtually all genes in a sample by counting the number of individual mRNA molecules produced from each gene. There is no requirement that genes be identified and characterised prior to conducting an experiment. MPSS has a routine sensitivity at a level of a few molecules of mRNA per cell, and the datasets are in a digital format that simplifies the management and analysis of the data. Therefore, of the various microarray and non-microarray technologies currently available, MPSS provides many advantages for generating the type of complete datasets that will help to facilitate hypothesis-driven experiments in the era of digital biology.  相似文献   

11.
We present a systematic assessment of RainDrop BS-seq, a novel method for large-scale, targeted bisulfite sequencing using microdroplet-based PCR amplification coupled with next-generation sequencing. We compared DNA methylation levels at 498 target loci (1001 PCR amplicons) in human whole blood, osteosarcoma cells and an archived tumor tissue sample. We assessed the ability of RainDrop BS-seq to accurately measure DNA methylation over a range of DNA quantities (from 10 to 1500 ng), both with and without whole-genome amplification (WGA) following bisulfite conversion. DNA methylation profiles generated using at least 100 ng correlated well (median R = 0.92) with those generated on Illumina Infinium HumanMethylation450 BeadChips, currently the platform of choice for epigenome-wide association studies (EWAS). WGA allowed for testing of samples with a starting DNA amount of 10 and 50 ng, although a reduced correlation was observed (median R = 0.79). We conclude that RainDrop BS-seq is suitable for measuring DNA methylation levels using nanogram quantities of DNA, and can be used to study candidate epigenetic biomarker loci in an accurate and high-throughput manner, paving the way for its application to routine clinical diagnostics.  相似文献   

12.
《Epigenetics》2013,8(5):678-684
We present a systematic assessment of RainDrop BS-seq, a novel method for large-scale, targeted bisulfite sequencing using microdroplet-based PCR amplification coupled with next-generation sequencing. We compared DNA methylation levels at 498 target loci (1001 PCR amplicons) in human whole blood, osteosarcoma cells and an archived tumor tissue sample. We assessed the ability of RainDrop BS-seq to accurately measure DNA methylation over a range of DNA quantities (from 10 to 1500 ng), both with and without whole-genome amplification (WGA) following bisulfite conversion. DNA methylation profiles generated using at least 100 ng correlated well (median R = 0.92) with those generated on Illumina Infinium HumanMethylation450 BeadChips, currently the platform of choice for epigenome-wide association studies (EWAS). WGA allowed for testing of samples with a starting DNA amount of 10 and 50 ng, although a reduced correlation was observed (median R = 0.79). We conclude that RainDrop BS-seq is suitable for measuring DNA methylation levels using nanogram quantities of DNA, and can be used to study candidate epigenetic biomarker loci in an accurate and high-throughput manner, paving the way for its application to routine clinical diagnostics.  相似文献   

13.
Mondal K  Shetty AC  Patel V  Cutler DJ  Zwick ME 《Genomics》2011,98(4):260-265
We used a RainDance Technologies (RDT) expanded content library to enrich the human X chromosome exome (2.5 Mb) from 26 male samples followed by Illumina sequencing. Our multiplex primer library covered 98.05% of the human X chromosome exome in a single tube with 11,845 different PCR amplicons. Illumina sequencing of 24 male samples showed coverage for 97% of the targeted sequences. Sequence from 2 HapMap samples confirmed missing data rates of 2–3% at sites successfully typed by the HapMap project, with an accuracy of at least ~ 99.5% as compared to reported HapMap genotypes. Our demonstration that a RDT expanded content library can efficiently enrich and enable the routine sequencing of the human X chromosome exome suggests a wide variety of potential research and clinical applications for this platform.  相似文献   

14.
建立环介导间接PCR检测体系,为分子诊断提供一种新的检测工具。以质粒pUC18的核苷酸序列为模板,设计两条特异性探针,采用常规PCR技术将特异性探针标记于大豆Lectin基因的左右两端充当报告基因,此标记的报告基因与待检的pUC18质粒经杂交和缺口补平后形成一环状DNA分子,然后采用反向PCR技术扩增报告基因,建立针对pUC18质粒的环介导间接PCR检测方法。结果表明,该检测方法的检测底限为0.32 pg/μL,与常规PCR相当,并且与其他质粒和动物DNA检测无交叉反应,是一种简单、快速、灵敏、特异的PCR检测方法。  相似文献   

15.

Background

Usually, next generation sequencing (NGS) technology has the property of ultra-high throughput but the read length is remarkably short compared to conventional Sanger sequencing. Paired-end NGS could computationally extend the read length but with a lot of practical inconvenience because of the inherent gaps. Now that Illumina paired-end sequencing has the ability of read both ends from 600 bp or even 800 bp DNA fragments, how to fill in the gaps between paired ends to produce accurate long reads is intriguing but challenging.

Results

We have developed a new technology, referred to as pseudo-Sanger (PS) sequencing. It tries to fill in the gaps between paired ends and could generate near error-free sequences equivalent to the conventional Sanger reads in length but with the high throughput of the Next Generation Sequencing. The major novelty of PS method lies on that the gap filling is based on local assembly of paired-end reads which have overlaps with at either end. Thus, we are able to fill in the gaps in repetitive genomic region correctly. The PS sequencing starts with short reads from NGS platforms, using a series of paired-end libraries of stepwise decreasing insert sizes. A computational method is introduced to transform these special paired-end reads into long and near error-free PS sequences, which correspond in length to those with the largest insert sizes. The PS construction has 3 advantages over untransformed reads: gap filling, error correction and heterozygote tolerance. Among the many applications of the PS construction is de novo genome assembly, which we tested in this study. Assembly of PS reads from a non-isogenic strain of Drosophila melanogaster yields an N50 contig of 190 kb, a 5 fold improvement over the existing de novo assembly methods and a 3 fold advantage over the assembly of long reads from 454 sequencing.

Conclusions

Our method generated near error-free long reads from NGS paired-end sequencing. We demonstrated that de novo assembly could benefit a lot from these Sanger-like reads. Besides, the characteristic of the long reads could be applied to such applications as structural variations detection and metagenomics.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-14-711) contains supplementary material, which is available to authorized users.  相似文献   

16.
Pseudogenes are frequently encountered noncoding sequences with a high sequence similarity to their protein-coding paralogue. For this reason, their presence is often considered troublesome in molecular diagnostics. In pseudoxanthoma elasticum(PXE), a disease predominantly caused by mutations in ATPbinding cassette family C member 6(ABCC6), the presence of two pseudogenes complicates the analysis of sequence data. With whole-exome sequencing(WES) becoming the standard of care in molecular diagnostics, we wanted to evaluate whether this technique is as reliable as gene-specific targeted enrichment analysis for the analysis of ABCC6. We established a PCR-based targeted enrichment and next-generation sequencing testing approach and demonstrated that the ABCC6-specific enrichment combined with the applied mapping algorithm overcomes the complication of ABCC6 pseudogene aspecificities, contrary to WES. We propose a time-and cost-efficient diagnostic strategy for comprehensive and accurate molecular genetic testing of PXE, which is highly automatable.  相似文献   

17.
Massively parallel sequencing of cell-free, maternal plasma DNA was recently demonstrated to be a safe and effective screening method for fetal chromosomal aneuploidies. Here, we report an improved sequencing method achieving significantly increased throughput and decreased cost by replacing laborious sequencing library preparation steps with PCR employing a single primer pair designed to amplify a discrete subset of repeated regions. Using this approach, samples containing as little as 4% trisomy 21 DNA could be readily distinguished from euploid samples.  相似文献   

18.
A new approach for optically sequencing ensembles of single DNA molecules using DNA polymerase to mediate the consecutive incorporation of fluorochrome-labeled nucleotides into an array of large single DNA molecules is presented. The approach utilizes cycles of labeled fluorochrome addition, detection to count incorporations, and bleaching to reset the counter. These additions are imaged and analyzed to estimate the number of labeled additions and to correlate them on a per-locus basis along DNA backbones. Initial studies used precisely labeled polymerase chain reaction products to aid the development and validation of simple models of fluorochrome point spread functions within the imaging system. In complementary studies, nucleotides labeled with the fluorochrome R110 were incorporated into surface-elongated lambda DNA, and fluorescent signals corresponding to the addition of R110-dUTP were counted and assigned precise loci along DNA backbones. The labeled DNAs were then subjected to photobleaching and to a second cycle of addition of R110-labeled nucleotides-a second round of additions was correlated with the first to establish strings of addition histories among the ensemble of largely double-stranded templates. These results confirm the basic operational validity of this approach and point the way to the development of a practical system for optical sequencing.  相似文献   

19.
The armoured dinoflagellate Peridinium is widely distributed in freshwater environments worldwide and contains a large number of species. Their identity, however, has remained elusive, since the small cells tend to be morphologically similar. To help resolve this, a sequence-based diagnostics for uncultured Peridinium cells from field samples was applied, using single-cell PCR and direct DNA sequencing of the PCR products. Single cells were isolated randomly from field samples, and PCR successfully amplified the target rDNA regions from the crude lysates. Phylogenetic trees showed that all the cells were strongly grouped into the same clade (> 99% bootstrap value), including the previously identified P. bipes f. occultatum, and apparently separated from relatives such as P. cinctum, P. volzii and P. willei. All 17 isolates were genotypically identified as P. bipes f. occultatum, based on over than 99% of sequence similarities, and the organism was responsible for water blooms at different seasons in Korean waters. The sequence-based typing could clearly resolve P. bipes f. occultatum from the various Peridinium cells, and that the method is accurate and more labor-saving than the conventional method to monitor Peridinium species. This protocol may be useful for the application of molecular tools to uncultured Peridinium cells.  相似文献   

20.
The field of phylogeography has long since realized the need and utility of incorporating nuclear DNA (nDNA) sequences into analyses. However, the use of nDNA sequence data, at the population level, has been hindered by technical laboratory difficulty, sequencing costs, and problematic analytical methods dealing with genotypic sequence data, especially in non-model organisms. Here, we present a method utilizing the 454 GS-FLX Titanium pyrosequencing platform with the capacity to simultaneously sequence two species of sea star (Meridiastra calcar and Parvulastra exigua) at five different nDNA loci across 16 different populations of 20 individuals each per species. We compare results from 3 populations with traditional Sanger sequencing based methods, and demonstrate that this next-generation sequencing platform is more time and cost effective and more sensitive to rare variants than Sanger based sequencing. A crucial advantage is that the high coverage of clonally amplified sequences simplifies haplotype determination, even in highly polymorphic species. This targeted next-generation approach can greatly increase the use of nDNA sequence loci in phylogeographic and population genetic studies by mitigating many of the time, cost, and analytical issues associated with highly polymorphic, diploid sequence markers.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号