Found 20 similar articles (search time: 15 ms)
1.
Meredith L. Carpenter Jason D. Buenrostro Cristina Valdiosera Hannes Schroeder Morten E. Allentoft Martin Sikora Morten Rasmussen Simon Gravel Sonia Guillén Georgi Nekhrizov Krasimir Leshtakov Diana Dimitrova Nikola Theodossiev Davide Pettener Donata Luiselli Karla Sandoval Andrés Moreno-Estrada Yingrui Li Jun Wang M. Thomas P. Gilbert Eske Willerslev William J. Greenleaf Carlos D. Bustamante 《American journal of human genetics》2013
2.
Ligation Bias in Illumina Next-Generation DNA Libraries: Implications for Sequencing Ancient Genomes
Andaine Seguin-Orlando Mikkel Schubert Joel Clary Julia Stagegaard Maria T. Alberdi José Luis Prado Alfredo Prieto Eske Willerslev Ludovic Orlando 《PloS one》2013,8(10)
Ancient DNA extracts consist of a mixture of endogenous molecules and contaminant DNA templates, often originating from environmental microbes. These two populations of templates exhibit different chemical characteristics, with the former showing depurination and cytosine deamination by-products, resulting from post-mortem DNA damage. Such chemical modifications can interfere with the molecular tools used for building second-generation DNA libraries, and limit our ability to fully characterize the true complexity of ancient DNA extracts. In this study, we first use fresh DNA extracts to demonstrate that library preparation based on adapter ligation at AT-overhangs is biased against DNA templates starting with thymine residues, in contrast to blunt-end adapter ligation. We observe the same bias on fresh DNA extracts sheared on Bioruptor, Covaris and nebulizers. This contradicts previous reports suggesting that this bias could originate from the methods used for shearing DNA. This also suggests that AT-overhang adapter ligation efficiency is affected in a sequence-dependent manner and results in an uneven representation of different genomic contexts. We then show how this bias could affect the base composition of ancient DNA libraries prepared following AT-overhang ligation, mainly by limiting the ability to ligate DNA templates starting with thymines and therefore deaminated cytosines. This results in particular nucleotide misincorporation damage patterns, deviating from the signature generally expected for authenticating ancient sequence data. Consequently, we show that models adequate for estimating post-mortem DNA damage levels must be robust to the molecular tools used for building ancient DNA libraries.
3.
Targeted DNA enrichment coupled with next generation sequencing has been increasingly used for interrogation of select sub-genomic regions at high depth of coverage in a cost effective manner. Specificity measured by on-target efficiency is a key performance metric for target enrichment. Non-specific capture leads to off-target reads, resulting in waste of sequencing throughput on irrelevant regions. Microdroplet-PCR allows simultaneous amplification of up to thousands of regions in the genome and is among the most commonly used strategies for target enrichment. Here we show that carryover of single-stranded template genomic DNA from microdroplet-PCR constitutes a major contributing factor for off-target reads in the resultant libraries. Moreover, treatment of microdroplet-PCR enrichment products with a nuclease specific to single-stranded DNA alleviates off-target load and improves enrichment specificity. We propose that nuclease treatment of enrichment products should be incorporated in the workflow of targeted sequencing using microdroplet-PCR for target capture. These findings may have a broad impact on other PCR based applications for which removal of template DNA is beneficial.
4.
Jesper Buchhave Poulsen Francesco Lescai Jakob Grove Marie Bækvad-Hansen Michael Christiansen Christian Munch Hagen Julian Maller Christine Stevens Shenting Li Qibin Li Jihua Sun Jun Wang Merete Nordentoft Thomas Mears Werge Preben Bo Mortensen Anders Dupont Børglum Mark Daly David Michael Hougaard Jonas Bybjerg-Grauholm Mads Vilhelm Hollegaard 《PloS one》2016,11(4)
Stored neonatal dried blood spot (DBS) samples from neonatal screening programmes are a valuable diagnostic and research resource. Combined with information from national health registries they can be used in population-based studies of genetic diseases. DNA extracted from neonatal DBSs can be amplified to obtain micrograms of an otherwise limited resource, referred to as whole-genome amplified DNA (wgaDNA). Here we investigate the robustness of exome sequencing of wgaDNA of neonatal DBS samples. We conducted three pilot studies of seven, eight and seven subjects, respectively. For each subject we analysed a neonatal DBS sample and corresponding adult whole-blood (WB) reference sample. Different DNA sample types were prepared for each of the subjects. Pilot 1: wgaDNA of 2x3.2 mm neonatal DBSs (DBS_2x3.2) and raw DNA extract of the WB reference sample (WB_ref). Pilot 2: DBS_2x3.2, WB_ref and a WB_ref replica sharing DNA extract with the WB_ref sample. Pilot 3: DBS_2x3.2, WB_ref, wgaDNA of 2x1.6 mm neonatal DBSs and wgaDNA of the WB reference sample. Following sequencing and data analysis, we compared pairwise variant calls to obtain a measure of similarity, the concordance rate. Concordance rates were slightly lower when comparing DBS vs WB sample types than for any two WB sample types of the same subject before filtering of the variant calls. The overall concordance rates were dependent on the variant type, with SNPs performing best. Post-filtering, the comparisons of DBS vs WB and WB vs WB sample types yielded similar concordance rates, with values close to 100%. WgaDNA of neonatal DBS samples performs with great accuracy and efficiency in exome sequencing. The wgaDNA performed similarly to the matched high-quality reference (whole-blood DNA), based on concordance rates calculated from variant calls. No differences were observed substituting 2x3.2 with 2x1.6 mm discs, allowing for additional reduction of sample material in future projects.
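The abstract above compares pairwise variant calls but does not say how the concordance rate is computed. As a rough illustration only, the sketch below assumes one common choice: the share of variant calls found in both samples out of all calls found in either sample. The function name `concordance_rate`, the `(chromosome, position, ref, alt)` tuple layout, and the toy calls are all hypothetical and are not taken from the study.

```python
def concordance_rate(calls_a, calls_b):
    """Fraction of variant calls shared by two samples.

    ASSUMPTION: concordance is defined here as |A and B| / |A or B|.
    The abstract does not give the study's own definition.
    """
    a, b = set(calls_a), set(calls_b)
    union = a | b
    if not union:
        raise ValueError("no variant calls in either sample")
    return len(a & b) / len(union)


# Hypothetical toy calls: (chromosome, position, ref allele, alt allele).
dbs_calls = [("1", 100, "A", "G"), ("1", 250, "C", "T"), ("2", 40, "G", "A")]
wb_calls = [("1", 250, "C", "T"), ("2", 40, "G", "A"), ("3", 7, "T", "C")]

rate = concordance_rate(dbs_calls, wb_calls)
```

With these toy inputs two of the four distinct calls are shared, so the rate is 0.5. Other definitions (for example, dividing by the calls in the reference sample only) would give different numbers.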
5.
6.
Kechin A. A. Dymova M. A. Tishkin A. A. Grushin S. P. Dashkovskiy P. K. Filipenko M. L. 《Russian Journal of Genetics》2019,55(12):1499-1505
Sheep were one of the first animals to be domesticated. The history of sheep domestication and their widespread distribution dates to about ten thousand years ago,...
7.
8.
9.
Meetha P. Gould Colleen M. Bosworth Sarah McMahon Sneha Grandhi Brian T. Grimberg Thomas LaFramboise 《PloS one》2015,10(10)
Recent advances in sequencing technology allow for accurate detection of mitochondrial sequence variants, even those in low abundance at heteroplasmic sites. Considerable sequencing cost savings can be achieved by enriching samples for mitochondrial (relative to nuclear) DNA. Reduction in nuclear DNA (nDNA) content can also help to avoid false positive variants resulting from nuclear mitochondrial sequences (numts). We isolate intact mitochondrial organelles from both human cell lines and blood components using two separate methods: a magnetic bead binding protocol and differential centrifugation. DNA is extracted and further enriched for mitochondrial DNA (mtDNA) by an enzyme digest. Only 1 ng of the purified DNA is necessary for library preparation and next generation sequence (NGS) analysis. Enrichment methods are assessed and compared using mtDNA (versus nDNA) content as a metric, measured by using real-time quantitative PCR and NGS read analysis. Among the various strategies examined, the optimal is differential centrifugation isolation followed by exonuclease digest. This strategy yields >35% mtDNA reads in blood and cell lines, which corresponds to hundreds-fold enrichment over baseline. The strategy also avoids false variant calls that, as we show, can be induced by the long-range PCR approaches that are the current standard in enrichment procedures. This optimization procedure allows mtDNA enrichment for efficient and accurate massively parallel sequencing, enabling NGS from samples with small amounts of starting material. This will decrease costs by increasing the number of samples that may be multiplexed, ultimately facilitating efforts to better understand mitochondria-related diseases.
10.
Zhenzhen Kong Peipei Zhao Haibing Liu Xiang Yu Yanyan Qin Zhaoliang Su Shengjun Wang Huaxi Xu Jianguo Chen 《PloS one》2016,11(3)
Staphylococcus aureus is a globally disseminated drug-resistant bacterial species. It remains a leading cause of hospital-acquired infection, primarily among immunocompromised patients. In 2012, the Affiliated People’s Hospital of Jiangsu University experienced a putative outbreak of methicillin-resistant S. aureus (MRSA) that affected 12 patients in the Neurosurgery Department. In this study, whole-genome sequencing (WGS) was used to gain insight into the epidemiology of the outbreak caused by MRSA, and traditional bacterial genotyping approaches were also applied to provide supportive evidence for WGS. We sequenced the DNA from 6 isolates associated with the outbreak. Phylogenetic analysis was performed by comparing single-nucleotide polymorphisms (SNPs) in the core genome of 6 isolates in the present study and another 3 referenced isolates from GenBank. Of the 6 MRSA sequences in the current study, 5 belonged to the same group, clustering with T0131, while the other one clustered closely with TW20. All of the isolates were identified as ST239-SCCmecIII clones. Whole-genome analysis revealed that four of the outbreak isolates were more tightly clustered into a group and SA13002 together with SA13009 were distinct from the outbreak strains, which were considered non-outbreak strains. Based on the sequencing results, the antibiotic-resistance gene status (present or absent) was almost perfectly concordant with the results of phenotypic susceptibility testing. Various toxin genes were also analyzed successfully. Our analysis demonstrates that using traditional molecular methods and WGS can facilitate the identification of outbreaks and help to control nosocomial transmission.
11.
12.
13.
Telomeres are the ends of eukaryotic chromosomes, consisting of consecutive short repeats that protect chromosome ends from degradation. Telomeres shorten with each cell division, leading to replicative cell senescence. Deregulation of telomere length homeostasis is associated with the development of various age-related diseases and cancers. A number of experimental techniques exist for telomere length measurement; however, until recently, the absence of tools for extracting telomere lengths from high-throughput sequencing data has significantly obscured the association of telomere length with molecular processes in normal and diseased conditions. We have developed Computel, a program in R for computing mean telomere length from whole-genome next-generation sequencing data. Computel is open source, and is freely available at https://github.com/lilit-nersisyan/computel. It utilizes a short-read alignment-based approach and integrates various popular tools for sequencing data analysis. We validated it with synthetic and experimental data, and compared its performance with the previously available software. The results have shown that Computel outperforms existing software in accuracy, independence of results from sequencing conditions, stability against inherent sequencing errors, and better ability to distinguish pure telomeric sequences from interstitial telomeric repeats. By providing a highly reliable methodology for determining telomere lengths from whole-genome sequencing data, Computel should help to elucidate the role of telomeres in cellular health and disease.
14.
Mutant screens have proven powerful for genetic dissection of a myriad of biological processes, but subsequent identification and isolation of the causative mutations are usually complex and time consuming. We have made the process easier by establishing a novel strategy that employs whole-genome sequencing to simultaneously map and identify mutations without the need for any prior genetic mapping.
The challenges posed by the identification of a causal mutation in a mutant of interest have in effect restricted the use of forward genetics to those organisms benefiting from a solid genetic toolbox. Whole-genome sequencing (WGS) is promising to revolutionize the way phenotypic traits are assigned to genes. However, current strategies to identify causal mutations using WGS require first the identification of an approximate genomic location containing the mutation of interest (Sarin et al. 2008; Smith et al. 2008; Srivatsan et al. 2008; Blumenstiel et al. 2009; Irvine et al. 2009). This is because genomes contain many natural sequence variations (Denver et al. 2004; Hillier et al. 2008; Sarin et al. 2010), which, along with mutagen-induced ones, complicate the identification of the causal mutation when an approximate genomic location has not been previously identified. Mapping has previously been achieved with time-consuming and laborious techniques that, in addition, rely on an organism's single-nucleotide polymorphism (SNP) map and established variant strains. For example, traditional SNP-based mapping (Wicks et al. 2001; Davis et al. 2005) has previously been used in Caenorhabditis elegans to narrow down the genomic region containing the mutation of interest, prior to conducting WGS (Sarin et al. 2008). In Arabidopsis, simultaneous SNP mapping and mutation identification has been achieved with WGS, but this requires the generation of a mapping population of up to 500 F2 progeny to identify only one allele (Schneeberger et al. 2009).
This is a challenging prospect for many model systems. Indeed, if the mutant phenotype is subtle, the isolation of such numbers of recombinants is very tedious. Furthermore, it is not applicable in those organisms where a mapping population cannot be generated, simply because of a lack of intercrossable variants or because of life cycles (parasitic organisms, for example) that would make it extremely difficult to follow and isolate many recombinant individuals.
Here, we describe a strategy to simultaneously and rapidly locate and identify multiple mutations from a mutagenesis screen with WGS that circumvents these limitations. This powerful and straightforward method directly uses mutagen-induced nucleotide changes that are linked to the causal mutation to identify its specific genomic location, thus negating the construction of genetic mapping populations and subsequent mapping.
Treatment of organisms with a chemical mutagen induces nucleotide changes throughout the genome. Following mutagenesis, backcrossing or outcrossing of the mutagenized organism to unmutagenized counterparts is performed to eliminate mutagen-induced mutations (Figure 1A; supporting information, File S2). The phenotype-causing mutation remains as only backcrossed individuals showing the phenotype of interest are retained. In addition, mutagen-induced nucleotide changes that are genetically linked to the causal mutation and physically surround it on the chromosome will remain, in contrast to unlinked nucleotide changes (Figure 1A). As a result of this genetic linkage, a high-density cluster of typical mutagen-induced variants is visualized from sequence data obtained by WGS, which is positioned around the causal mutation.
By locating such high-density regions, one maps the approximate genomic location of the causal mutation and subsequently identifies the affected gene within this region.
Figure 1. Mapping mutations on the basis of density of mutagen-induced DNA damage across the genome. (A) Visual representation of our WGS cloning strategy. Mutagen treatment induces point mutations throughout the genome (red asterisks). Backcrossing to the original unmutated parent strain removes much of the mutagen-induced nucleotide changes except for the causal mutation (green asterisk) and those genetically linked to it. WGS sequencing can be used to detect canonical mutagen-induced point mutations, thus revealing a physical position for the causal mutation. Shared background variants (yellow crosses) are filtered out from WGS data by comparing the sequences of mutants sequenced side-by-side, revealing a high-density variant cluster in only one genomic region. Importantly, genomic sequences of mutants derived from the same starting strain must be compared, to allow subtraction of nucleotide variants that are common to this particular strain, through sequence comparison. (B) Physical map of total nucleotide variations per megabase across the genome compared to the wild-type reference genome for each mutant (fp6, fp9, and fp12) after WGS. (C) After sequence quality filtering, subtraction of common variants between the 3 mutants, and filtering out noncanonical EMS nucleotide changes, high-density variant peaks are obtained in one genomic location for each mutant (red boxes). Steps 1 and 3 are essential for clear visualization of the high-density peaks whereas step 2 improves visualization. (D) Close-up of variants on chromosome III for fp6. Within this peak we identified only 6 candidate mutations that could potentially affect a protein sequence. We confirmed that the missense mutation in egl-5 was the causal mutation (Figure S2).
For fp9 and fp12 we identified only 10 (9 missense and 1 3′-UTR) and 4 (2 premature stop and 2 missense) candidate mutations, respectively, within each mutant's EMS-based mapped region. Thus, our method consistently allowed precise mapping in 3 different mutants to a region small enough to contain only a handful of candidate mutations.
As a proof-of-principle, we simultaneously mapped and sequenced the causal mutations of multiple C. elegans mutants isolated from an EMS mutagenesis screen using this strategy. The mutagenesis screen itself was undertaken to identify genes that controlled the reprogramming of a single cell called Y into another cell called PDA during C. elegans development (Jarriault et al. 2008). After EMS treatment, three distinct mutant alleles (fp6, fp9, and fp12) were backcrossed to the original unmutagenized strain 4–6×. It is important to note that a backcrossing or outcrossing step is necessary for the analysis of mutants obtained from all mutagenesis screens, irrespective of the type of mutant identification strategy used or the type of mutagen or organism used (and, as such, does not represent an extra step introduced by our method). The mutants then underwent WGS side-by-side (Table S1, Table S2, Figure S1, and File S2). After alignment to the wild-type N2 reference genome using MAQgene software (Bigelow et al. 2009), the sequencing data obtained for each mutant were compared, and we subtracted common nucleotide variants that were shared between at least two of our three mutants (File S1). These shared variants, which are very unlikely to be either the causal mutation or EMS-induced mutations from the screen itself, represent strain differences between the N2 used to generate the reference genome and the PS3662 strain used here for mutagenesis. Note that this step eliminated ∼2000 point mutations as potential candidates for our causal mutation.
This result strongly emphasizes the advantage of conducting WGS on two or more mutants side-by-side, as reference genomes may contain many nucleotide variations when compared to organisms sequenced from the laboratory (Denver et al. 2004; Hillier et al. 2008; Sarin et al. 2010; this study) and as such would confound mutation identification.
To identify EMS-induced changes linked to the causal mutation and expose its location, we looked only at variants that matched the canonical EMS-induced G/C > A/T transitions (Drake and Baltz 1976), revealing localized peaks of high-density variation on a single chromosome for each mutant (Figure 1, B and C). These peaks correspond to regions of high mutagen-induced damage that were not removed during backcrossing and therefore are most likely genetically linked to the causal mutation. We therefore focused our attention on these physical regions to identify candidate mutations within them. We localized fp6 to a 4.29-Mb region on chromosome III, fp9 to a 7.11-Mb region on chromosome X, and fp12 to a 1.28-Mb region on a different part of chromosome X (Figure 1C).
As a proof of principle, we further examined the nucleotide changes present in the interval to which fp6 was linked. Taking into consideration all variant types (point mutations and indels), we identified only six candidate mutations that potentially affected a gene's function (Figure 1D and Table S3). One of these, affecting the egl-5/hox gene, lies almost perfectly in the middle of the predicted EMS-based mapped region. We confirmed the existence of the mutation in egl-5 by manual resequencing. Both egl-5 targeted RNAi and noncomplementation with the egl-5(n945) null allele confirmed that fp6 affected egl-5 and caused the Y-to-PDA reprogramming defect (Figure S2). fp9 and fp12 each map to distinct regions on chromosome X that also contain only a handful of candidate mutations (10 and 4, respectively) (Figure 1C).
Thus, our method consistently allowed precise mapping in 3 different mutants to a region small enough to contain only a handful of candidate mutations and subsequent identification of the causal mutation.
We calculated that comparison of WGS data for only two mutants of the same mutagenesis screen is sufficient to localize and sequence the causal mutation (Table S4). Thirteen times sequence coverage has been found to be sufficient to identify a mutation in a pre-SNP mapped C. elegans mutant (Shen et al. 2008). Here, we tested the sequence coverage necessary to perform simultaneous mapping and mutant identification using our strategy and found that 13× was more than enough (Table S4). In addition, by performing longer reads and/or paired-end sequencing, our method can be scaled up to bigger genomes or allow multiple mutant sequencing on each flow cell lane [e.g., using multiplex WGS (Cronn et al. 2008)]. Furthermore, because direct sequence comparison is ultimately made between two mutants sequenced side-by-side, the quality of an organism's reference genome (which is used only for alignment purposes) does not have a bearing on the mapping or mutant identification outcome. Moreover, recent advances in de novo alignment of short reads generated from next generation sequencing platforms (Li et al. 2010; Nowrousian et al. 2010; Webb and Rosenthal 2010; Young et al. 2010) suggest that a reference genome may not even be required to perform mutagen-based mapping and mutant identification with WGS. We predict that technical advances in these areas will make it possible to perform mutagenesis screens on any nonsequenced and genetically uncharacterized organism and use our strategy to quickly identify the causal mutation of an interesting mutant.
We found that all of the minimal requirements tested here were more than adequate to use our mapping strategy. Therefore, it is possible that fewer backcrosses and less sequencing coverage may suffice than is shown here. For example, for genomes with a similar size to C. elegans (∼100 Mb), this method can easily be scaled up by sequencing eight mutants per flow cell. As for any WGS experiments, total cost depends on genome size.
By eliminating any prior work except for back/outcrossing, a necessary step for any mutant characterization, our simple and quick strategy provides a significant saving of time and labor as the time needed to map and identify a candidate causal mutation is trimmed down to the sequencing time (currently 7 days) and sequence analysis time (<1 day, see
TABLE 1
Summary of WGS cloning strategy
Parameter | Conditions used | Minimal requirements tested |
---|---|---|
Backcrossing | 4–6× | 4× enough |
No. of mutants sequenced | 3 | 2 enough |
Sequencing of mutant | 2× flow cell lanes, paired-end reads (57mer) | 1× flow cell lane enough, single-end reads (57mer) enough |
Average sequence coverage | 52.2–55.3× | 13.6× enough |
Advantages | ||
Any SNP or genetic map information is not necessary | ||
No prior wet lab work necessary: generation of a recombinant mapping population is not necessary | ||
Multiple alleles identified at once | ||
Amenable to scaling up: can be equally used for bigger genomes | ||
Fast: 7 days sequencing, 12 hr MAQGene alignment, and 1 hr mapping | ||
Modest sequence coverage requirements limit cost | ||
Reference genome sequence quality is not important and may not even be necessary | ||
Very straightforward without any specialized software | ||
Requirement | ||
Species must be amenable to mutagenesis and backcrossing | ||
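The filtering steps described in this entry (subtract variants shared between mutants, keep only canonical EMS G/C > A/T transitions, then look for the densest region) can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' pipeline: the function names, the `(chromosome, position, ref, alt)` tuple layout, the 1-Mb bin size and the toy variants are all hypothetical, and the sequence quality filtering step is omitted.

```python
from collections import Counter

# Canonical EMS-induced transitions (G/C > A/T): G>A and C>T.
EMS_TRANSITIONS = {("G", "A"), ("C", "T")}


def private_variants(variants_by_mutant):
    """Drop variants seen in two or more mutants (likely strain background)."""
    seen = Counter(v for vs in variants_by_mutant.values() for v in set(vs))
    return {
        mutant: [v for v in vs if seen[v] == 1]
        for mutant, vs in variants_by_mutant.items()
    }


def ems_density(variants, bin_size=1_000_000):
    """Count canonical EMS transitions per (chromosome, bin index)."""
    density = Counter()
    for chrom, pos, ref, alt in variants:
        if (ref, alt) in EMS_TRANSITIONS:
            density[(chrom, pos // bin_size)] += 1
    return density


def densest_bin(variants, bin_size=1_000_000):
    """Return ((chromosome, bin index), count) for the densest bin, or None."""
    density = ems_density(variants, bin_size)
    if not density:
        return None
    return max(density.items(), key=lambda item: item[1])


# Hypothetical toy data; allele names are borrowed from the text only as labels.
toy = {
    "fp6": [("III", 100, "G", "A"), ("III", 200_000, "C", "T"),
            ("III", 900_000, "G", "A"), ("X", 5, "A", "G"), ("I", 10, "G", "A")],
    "fp9": [("I", 10, "G", "A"), ("X", 2_500_000, "C", "T")],
    "fp12": [("X", 5, "A", "G"), ("X", 9_000_000, "G", "A")],
}
private = private_variants(toy)
peak = densest_bin(private["fp6"])
```

In the toy data, the two variants that `fp6` shares with another mutant are subtracted, leaving three EMS-type changes that all fall in the first megabase of chromosome III, so `peak` is `(("III", 0), 3)`. A real analysis would also need quality filtering and a choice of bin size suited to the genome.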
15.
Stephen J. Murphy John C. Cheville Shabnam Zarei Sarah H. Johnson Robert A. Sikkink Farhad Kosari Andrew L. Feldman Bruce W. Eckloff R. Jeffrey Karnes George Vasmatzis 《DNA research》2012,19(5):395-406
High-throughput next-generation sequencing provides a revolutionary platform to unravel the precise DNA aberrations concealed within subgroups of tumour cells. However, in many instances, the limited number of cells makes the application of this technology in tumour heterogeneity studies a challenge. In order to address these limitations, we present a novel methodology to partner laser capture microdissection (LCM) with sequencing platforms, through a whole-genome amplification (WGA) protocol performed in situ directly on LCM engrafted cells. We further adapted current Illumina mate pair (MP) sequencing protocols to the input of WGA DNA and used this technology to investigate large genomic rearrangements in adjacent Gleason Pattern 3 and 4 prostate tumours separately collected by LCM. Sequencing data predicted genome coverage and depths similar to unamplified genomic DNA, with limited repetition and bias predicted in WGA protocols. Mapping algorithms developed in our laboratory predicted high-confidence rearrangements and selected events each demonstrated the predicted fusion junctions upon validation. Rearrangements were additionally confirmed in unamplified tissue and evaluated in adjacent benign-appearing tissues. A detailed understanding of gene fusions that characterize cancer will be critical in the development of biomarkers to predict the clinical outcome. The described methodology provides a mechanism of efficiently defining these events in limited pure populations of tumour tissue, aiding in the derivation of genomic aberrations that initiate cancer and drive cancer progression.
16.
Christopher M. Watson Mohammed El-Asrag David A. Parry Joanne E. Morgan Clare V. Logan Ian M. Carr Eamonn Sheridan Ruth Charlton Colin A. Johnson Graham Taylor Carmel Toomes Martin McKibbin Chris F. Inglehearn Manir Ali 《PloS one》2014,9(8)
Purpose
Retinal dystrophies are genetically heterogeneous, resulting from mutations in over 200 genes. Prior to the development of massively parallel sequencing, comprehensive genetic screening was unobtainable for most patients. Identifying the causative genetic mutation facilitates genetic counselling, carrier testing and prenatal/pre-implantation diagnosis, and often leads to a clearer prognosis. In addition, in a proportion of cases, when the mutation is known treatment can be optimised and patients are eligible for enrolment into clinical trials for gene-specific therapies.
Methods
Patient genomic DNA was sheared, tagged and pooled in batches of four samples, prior to targeted capture and next generation sequencing. The enrichment reagent was designed against genes listed on the RetNet database (July 2010). Sequence data were aligned to the human genome and variants were filtered to identify potential pathogenic mutations. These were confirmed by Sanger sequencing.
Results
Molecular analysis of 20 DNAs from retinal dystrophy patients identified likely pathogenic mutations in 12 cases, many of them known and/or confirmed by segregation. These included previously described mutations in ABCA4 (c.6088C>T,p.R2030*; c.5882G>A,p.G1961E), BBS2 (c.1895G>C,p.R632P), GUCY2D (c.2512C>T,p.R838C), PROM1 (c.1117C>T,p.R373C), RDH12 (c.601T>C,p.C201R; c.506G>A,p.R169Q), RPGRIP1 (c.3565C>T,p.R1189*) and SPATA7 (c.253C>T,p.R85*) and new mutations in ABCA4 (c.3328+1G>C), CRB1 (c.2832_2842+23del), RP2 (c.884-1G>T) and USH2A (c.12874A>G,p.N4292D).
Conclusions
Tagging and pooling DNA prior to targeted capture of known retinal dystrophy genes identified mutations in 60% of cases. This relatively high success rate may reflect enrichment for consanguineous cases in the local Yorkshire population, and the use of multiplex families. Nevertheless this is a promising high throughput approach to retinal dystrophy diagnostics.
17.
In many crop species, DNA fingerprinting is required for the precise identification of cultivars to protect the rights of breeders. Many families of retrotransposons have multiple copies throughout the eukaryotic genome and their integrated copies are inherited genetically. Thus, their insertion polymorphisms among cultivars are useful for DNA fingerprinting. In this study, we conducted DNA fingerprinting based on the insertion polymorphisms of active retrotransposon families (Rtsp-1 and LIb) in sweet potato. Using 38 cultivars, we identified 2,024 insertion sites in the two families with an Illumina MiSeq sequencing platform. Of these insertion sites, 91.4% appeared to be polymorphic among the cultivars and 376 cultivar-specific insertion sites were identified, which were converted directly into cultivar-specific sequence-characterized amplified region (SCAR) markers. A phylogenetic tree was constructed using these insertion sites, which corresponded well with known pedigree information, thereby indicating their suitability for genetic diversity studies. Thus, the genome-wide comparative analysis of active retrotransposon insertion sites using the bench-top MiSeq sequencing platform is highly effective for DNA fingerprinting without any requirement for whole genome sequence information. This approach may facilitate the development of a practical polymerase chain reaction-based cultivar diagnostic system and could also be applied to the determination of genetic relationships.
18.
Elena Castellanos-Rizaldos Coren A. Milbury Elli Karatza Clark C. Chen G. Mike Makrigiorgos Anne Merewood 《PloS one》2014,9(4)
Aberrant hypo-methylation of DNA is evident in a range of human diseases including cancer and diabetes. Development of sensitive assays capable of detecting traces of un-methylated DNA within methylated samples can be useful in several situations. Here we describe a new approach, fast-COLD-MS-PCR, which preferentially amplifies un-methylated DNA sequences. By employing an appropriate denaturation temperature during PCR of bisulfite-converted DNA, fast-COLD-MS-PCR enriches un-methylated DNA and enables differential melting analysis or bisulfite sequencing. Using methylation on the MGMT gene promoter as a model, it is shown that serial dilutions of controlled methylation samples lead to the reliable sequencing of un-methylated sequences down to 0.05% un-methylated-to-methylated DNA. Screening of clinical glioma tumor and infant blood samples demonstrated that the degree of enrichment of un-methylated over methylated DNA can be modulated by the choice of denaturation temperature, providing a convenient method for analysis of partially methylated DNA or for revealing and sequencing traces of un-methylated DNA. Fast-COLD-MS-PCR can be useful for the detection of loss of methylation/imprinting in cancer, diabetes or diet-related methylation changes.
19.