期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Sequence-based genotyping for marker discovery and co-dominant scoring in germplasm and populations 总被引：1，自引：0，他引：1

Truong HT Ramos AM Yalcin F de Ruiter M van der Poel HJ Huvenaars KH Hogers RC van Enckevort LJ Janssen A van Orsouw NJ van Eijk MJ 《PloS one》2012,7(5):e37565

Conventional marker-based genotyping platforms are widely available, but not without their limitations. In this context, we developed Sequence-Based Genotyping (SBG), a technology for simultaneous marker discovery and co-dominant scoring, using next-generation sequencing. SBG offers users several advantages including a generic sample preparation method, a highly robust genome complexity reduction strategy to facilitate de novo marker discovery across entire genomes, and a uniform bioinformatics workflow strategy to achieve genotyping goals tailored to individual species, regardless of the availability of a reference sequence. The most distinguishing features of this technology are the ability to genotype any population structure, regardless whether parental data is included, and the ability to co-dominantly score SNP markers segregating in populations. To demonstrate the capabilities of SBG, we performed marker discovery and genotyping in Arabidopsis thaliana and lettuce, two plant species of diverse genetic complexity and backgrounds. Initially we obtained 1,409 SNPs for arabidopsis, and 5,583 SNPs for lettuce. Further filtering of the SNP dataset produced over 1,000 high quality SNP markers for each species. We obtained a genotyping rate of 201.2 genotypes/SNP and 58.3 genotypes/SNP for arabidopsis (n?=?222 samples) and lettuce (n?=?87 samples), respectively. Linkage mapping using these SNPs resulted in stable map configurations. We have therefore shown that the SBG approach presented provides users with the utmost flexibility in garnering high quality markers that can be directly used for genotyping and downstream applications. Until advances and costs will allow for routine whole-genome sequencing of populations, we expect that sequence-based genotyping technologies such as SBG will be essential for genotyping of model and non-model genomes alike. 相似文献

2.

Novel tools for conservation genomics: comparing two high-throughput approaches for SNP discovery in the transcriptome of the European hake

Milano I Babbucci M Panitz F Ogden R Nielsen RO Taylor MI Helyar SJ Carvalho GR Espiñeira M Atanassova M Tinti F Maes GE Patarnello T;FishPopTrace Consortium Bargelloni L 《PloS one》2011,6(11):e28008

相似文献

3.

GStream: Improving SNP and CNV Coverage on Genome-Wide Association Studies

Arnald Alonso Sara Marsal Raül Tortosa Oriol Canela-Xandri Antonio Julià 《PloS one》2013,8(7)

We present GStream, a method that combines genome-wide SNP and CNV genotyping in the Illumina microarray platform with unprecedented accuracy. This new method outperforms previous well-established SNP genotyping software. More importantly, the CNV calling algorithm of GStream dramatically improves the results obtained by previous state-of-the-art methods and yields an accuracy that is close to that obtained by purely CNV-oriented technologies like Comparative Genomic Hybridization (CGH). We demonstrate the superior performance of GStream using microarray data generated from HapMap samples. Using the reference CNV calls generated by the 1000 Genomes Project (1KGP) and well-known studies on whole genome CNV characterization based either on CGH or genotyping microarray technologies, we show that GStream can increase the number of reliably detected variants up to 25% compared to previously developed methods. Furthermore, the increased genome coverage provided by GStream allows the discovery of CNVs in close linkage disequilibrium with SNPs, previously associated with disease risk in published Genome-Wide Association Studies (GWAS). These results could provide important insights into the biological mechanism underlying the detected disease risk association. With GStream, large-scale GWAS will not only benefit from the combined genotyping of SNPs and CNVs at an unprecedented accuracy, but will also take advantage of the computational efficiency of the method. 相似文献

4.

SNiPer-HD: improved genotype calling accuracy by an expectation-maximization algorithm for high-density SNP arrays 总被引：2，自引：0，他引：2

Hua J Craig DW Brun M Webster J Zismann V Tembe W Joshipura K Huentelman MJ Dougherty ER Stephan DA 《Bioinformatics (Oxford, England)》2007,23(1):57-63

MOTIVATION: The technology to genotype single nucleotide polymorphisms (SNPs) at extremely high densities provides for hypothesis-free genome-wide scans for common polymorphisms associated with complex disease. However, we find that some errors introduced by commonly employed genotyping algorithms may lead to inflation of false associations between markers and phenotype. RESULTS: We have developed a novel SNP genotype calling program, SNiPer-High Density (SNiPer-HD), for highly accurate genotype calling across hundreds of thousands of SNPs. The program employs an expectation-maximization (EM) algorithm with parameters based on a training sample set. The algorithm choice allows for highly accurate genotyping for most SNPs. Also, we introduce a quality control metric for each assayed SNP, such that poor-behaving SNPs can be filtered using a metric correlating to genotype class separation in the calling algorithm. SNiPer-HD is superior to the standard dynamic modeling algorithm and is complementary and non-redundant to other algorithms, such as BRLMM. Implementing multiple algorithms together may provide highly accurate genotyping calls, without inflation of false positives due to systematically miss-called SNPs. A reliable and accurate set of SNP genotypes for increasingly dense panels will eliminate some false association signals and false negative signals, allowing for rapid identification of disease susceptibility loci for complex traits. AVAILABILITY: SNiPer-HD is available at TGen's website: http://www.tgen.org/neurogenomics/data. 相似文献

5.

A multi-array multi-SNP genotyping algorithm for Affymetrix SNP microarrays

Xiao Y Segal MR Yang YH Yeh RF 《Bioinformatics (Oxford, England)》2007,23(12):1459-1467

MOTIVATION: Modern strategies for mapping disease loci require efficient genotyping of a large number of known polymorphic sites in the genome. The sensitive and high-throughput nature of hybridization-based DNA microarray technology provides an ideal platform for such an application by interrogating up to hundreds of thousands of single nucleotide polymorphisms (SNPs) in a single assay. Similar to the development of expression arrays, these genotyping arrays pose many data analytic challenges that are often platform specific. Affymetrix SNP arrays, e.g. use multiple sets of short oligonucleotide probes for each known SNP, and require effective statistical methods to combine these probe intensities in order to generate reliable and accurate genotype calls. RESULTS: We developed an integrated multi-SNP, multi-array genotype calling algorithm for Affymetrix SNP arrays, MAMS, that combines single-array multi-SNP (SAMS) and multi-array, single-SNP (MASS) calls to improve the accuracy of genotype calls, without the need for training data or computation-intensive normalization procedures as in other multi-array methods. The algorithm uses resampling techniques and model-based clustering to derive single array based genotype calls, which are subsequently refined by competitive genotype calls based on (MASS) clustering. The resampling scheme caps computation for single-array analysis and hence is readily scalable, important in view of expanding numbers of SNPs per array. The MASS update is designed to improve calls for atypical SNPs, harboring allele-imbalanced binding affinities, that are difficult to genotype without information from other arrays. Using a publicly available data set of HapMap samples from Affymetrix, and independent calls by alternative genotyping methods from the HapMap project, we show that our approach performs competitively to existing methods. AVAILABILITY: R functions are available upon request from the authors. 相似文献

6.

High concordance of bovine single nucleotide polymorphism genotypes generated using two independent genotyping strategies

Magee DA Berkowicz EW Sikora KM Sweeney T Kenny DA Kelly AK Evans RD Wickham BW Bradley DG Spillane C MacHugh DE 《Animal biotechnology》2010,21(4):257-262

Single nucleotide polymorphisms (SNPs) represent the most common form of DNA sequence variation in mammalian livestock genomes. While the past decade has witnessed major advances in SNP genotyping technologies, genotyping errors caused, in part, by the biochemistry underlying the genotyping platform used, can occur. These errors can distort project results and conclusions and can result in incorrect decisions in animal management and breeding programs; hence, SNP genotype calls must be accurate and reliable. In this study, 263 Bos spp. samples were genotyped commercially for a total of 16 SNPs. Of the total possible 4,208 SNP genotypes, 4,179 SNP genotypes were generated, yielding a genotype call rate of 99.31% (standard deviation?±?0.93%). Between 110 and 263 samples were subsequently re-genotyped by us for all 16 markers using a custom-designed SNP genotyping platform, and of the possible 3,819 genotypes a total of 3,768 genotypes were generated (98.70% genotype call rate, SD?±?1.89%). A total of 3,744 duplicate genotypes were generated for both genotyping platforms, and comparison of the genotype calls for both methods revealed 3,741 concordant SNP genotype call rates (99.92% SNP genotype concordance rate). These data indicate that both genotyping methods used can provide livestock geneticists with reliable, reproducible SNP genotypic data for in-depth statistical analysis. 相似文献

7.

Optimization of the genotyping‐by‐sequencing strategy for population genomic analysis in conifers

下载免费PDF全文

Jin Pan Baosheng Wang Zhi‐Yong Pei Wei Zhao Jie Gao Jian‐Feng Mao Xiao‐Ru Wang 《Molecular ecology resources》2015,15(4):711-722

Flexibility and low cost make genotyping‐by‐sequencing (GBS) an ideal tool for population genomic studies of nonmodel species. However, to utilize the potential of the method fully, many parameters affecting library quality and single nucleotide polymorphism (SNP) discovery require optimization, especially for conifer genomes with a high repetitive DNA content. In this study, we explored strategies for effective GBS analysis in pine species. We constructed GBS libraries using HpaII, PstI and EcoRI‐MseI digestions with different multiplexing levels and examined the effect of restriction enzymes on library complexity and the impact of sequencing depth and size selection of restriction fragments on sequence coverage bias. We tested and compared UNEAK, Stacks and GATK pipelines for the GBS data, and then developed a reference‐free SNP calling strategy for haploid pine genomes. Our GBS procedure proved to be effective in SNP discovery, producing 7000–11 000 and 14 751 SNPs within and among three pine species, respectively, from a PstI library. This investigation provides guidance for the design and analysis of GBS experiments, particularly for organisms for which genomic information is lacking. 相似文献

8.

Application of high-resolution DNA melting for genotyping in lepidopteran non-model species: Ostrinia furnacalis (Crambidae)

Li F Niu B Huang Y Meng Z 《PloS one》2012,7(1):e29664

Development of an ideal marker system facilitates a better understanding of the genetic diversity in lepidopteran non-model organisms, which have abundant species, but relatively limited genomic resources. Single nucleotide polymorphisms (SNPs) discovered within single-copy genes have proved to be desired markers, but SNP genotyping by current techniques remain laborious and expensive. High resolution melting (HRM) curve analysis represents a simple, rapid and inexpensive genotyping method that is primarily confined to clinical and diagnostic studies. In this study, we evaluated the potential of HRM analysis for SNP genotyping in the lepidopteran non-model species Ostrinia furnacalis (Crambidae). Small amplicon and unlabeled probe assays were developed for the SNPs, which were identified in 30 females of O. furnacalis from 3 different populations by our direct sequencing. Both assays were then applied to genotype 90 unknown female DNA by prior mixing with known wild-type DNA. The genotyping results were compared with those that were obtained using bi-directional sequencing analysis. Our results demonstrated the efficiency and reliability of the HRM assays. HRM has the potential to provide simple, cost-effective genotyping assays and facilitates genotyping studies in any non-model lepidopteran species of interest. 相似文献

9.

High Concordance of Bovine Single Nucleotide Polymorphism Genotypes Generated Using Two Independent Genotyping Strategies

D. A. Magee E. W. Berkowicz K. M. Sikora T. Sweeney D. A. Kenny A. K. Kelly 《Animal biotechnology》2013,24(4):257-262

Single nucleotide polymorphisms (SNPs) represent the most common form of DNA sequence variation in mammalian livestock genomes. While the past decade has witnessed major advances in SNP genotyping technologies, genotyping errors caused, in part, by the biochemistry underlying the genotyping platform used, can occur. These errors can distort project results and conclusions and can result in incorrect decisions in animal management and breeding programs; hence, SNP genotype calls must be accurate and reliable. In this study, 263 Bos spp. samples were genotyped commercially for a total of 16 SNPs. Of the total possible 4,208 SNP genotypes, 4,179 SNP genotypes were generated, yielding a genotype call rate of 99.31% (standard deviation ± 0.93%). Between 110 and 263 samples were subsequently re-genotyped by us for all 16 markers using a custom-designed SNP genotyping platform, and of the possible 3,819 genotypes a total of 3,768 genotypes were generated (98.70% genotype call rate, SD ± 1.89%). A total of 3,744 duplicate genotypes were generated for both genotyping platforms, and comparison of the genotype calls for both methods revealed 3,741 concordant SNP genotype call rates (99.92% SNP genotype concordance rate). These data indicate that both genotyping methods used can provide livestock geneticists with reliable, reproducible SNP genotypic data for in-depth statistical analysis. 相似文献

10.

Comparison of whole‐genome (13X) and capture (87X) resequencing methods for SNP and genotype callings

下载免费PDF全文

P. F. Roux S. Marthey A. Djari M. Moroldo D. Esquerré J. Estellé C. Klopp O. Demeure 《Animal genetics》2015,46(1):82-86

The number of polymorphisms identified with next‐generation sequencing approaches depends directly on the sequencing depth and therefore on the experimental cost. Although higher levels of depth ensure more sensitive and more specific SNP calls, economic constraints limit the increase of depth for whole‐genome resequencing (WGS). For this reason, capture resequencing is used for studies focusing on only some specific regions of the genome. However, several biases in capture resequencing are known to have a negative impact on the sensitivity of SNP detection. Within this framework, the aim of this study was to compare the accuracy of WGS and capture resequencing on SNP detection and genotype calling, which differ in terms of both sequencing depth and biases. Indeed, we have evaluated the SNP calling and genotyping accuracy in a WGS dataset (13X) and in a capture resequencing dataset (87X) performed on 11 individuals. The percentage of SNPs not identified due to a sevenfold sequencing depth decrease was estimated at 7.8% using a down‐sampling procedure on the capture sequencing dataset. A comparison of the 87X capture sequencing dataset with the WGS dataset revealed that capture‐related biases were leading with the loss of 5.2% of SNPs detected with WGS. Nevertheless, when considering the SNPs detected by both approaches, capture sequencing appears to achieve far better SNP genotyping, with about 4.4% of the WGS genotypes that can be considered as erroneous and even 10% focusing on heterozygous genotypes. In conclusion, WGS and capture deep sequencing can be considered equivalent strategies for SNP detection, as the rate of SNPs not identified because of a low sequencing depth in the former is quite similar to SNPs missed because of method biases of the latter. On the other hand, capture deep sequencing clearly appears more adapted for studies requiring great accuracy in genotyping. 相似文献

11.

SNP Discovery in European Anchovy (Engraulis encrasicolus,L) by High-Throughput Transcriptome and Genome Sequencing

Iratxe Montes Darrell Conklin Aitor Albaina Simon Creer Gary R. Carvalho María Santos Andone Estonba 《PloS one》2013,8(8)

相似文献

12.

ALG: automated genotype calling of Luminex assays

Bourgey M Lariviere M Richer C Sinnett D 《PloS one》2011,6(5):e19368

Single nucleotide polymorphisms (SNPs) are the most commonly used polymorphic markers in genetics studies. Among the different platforms for SNP genotyping, Luminex is one of the less exploited mainly due to the lack of a robust (semi-automated and replicable) freely available genotype calling software. Here we describe a clustering algorithm that provides automated SNP calls for Luminex genotyping assays. We genotyped 3 SNPs in a cohort of 330 childhood leukemia patients, 200 parents of patient and 325 healthy individuals and used the Automated Luminex Genotyping (ALG) algorithm for SNP calling. ALG genotypes were called twice to test for reproducibility and were compared to sequencing data to test for accuracy. Globally, this analysis demonstrates the accuracy (99.6%) of the method, its reproducibility (99.8%) and the low level of no genotyping calls (3.4%). The high efficiency of the method proves that ALG is a suitable alternative to the current commercial software. ALG is semi-automated, and provides numerical measures of confidence for each SNP called, as well as an effective graphical plot. Moreover ALG can be used either through a graphical user interface, requiring no specific informatics knowledge, or through command line with access to the open source code. The ALG software has been implemented in R and is freely available for non-commercial use either at http://alg.sourceforge.net or by request to mathieu.bourgey@umontreal.ca. 相似文献

13.

SNPs by AFLP (SBA): a rapid SNP isolation strategy for non-model organisms 总被引：7，自引：0，他引：7

下载免费PDF全文

Nicod JC Largiadèr CR 《Nucleic acids research》2003,31(5):e19

Despite the great potential of single nucleotide polymorphism (SNP) markers in evolutionary studies, in particular for inferring population genetic parameters, SNP analysis has almost exclusively been limited to humans and ‘genomic model’ organisms, due to the lack of available sequence data in non-model organisms. Here, we describe a rapid and cost effective method to isolate candidate SNPs in non-model organisms. This SNP isolation strategy consists basically in the direct sequencing of amplified fragment length polymorphism bands. In a first application of this method, 10 unique DNA fragments that contained 24 SNPs were discovered in 11.11 kb of sequenced genomic DNA of a non-model species, the brown trout (Salmo trutta). 相似文献

14.

A comprehensive framework for detecting copy number variants from single nucleotide polymorphism data: ‘rCNV’, a versatile r package for paralogue and CNV detection

Piyal Karunarathne Qiujie Zhou Klaus Schliep Pascal Milesi 《Molecular ecology resources》2023,23(8):1772-1789

Recent studies have highlighted the significant role of copy number variants (CNVs) in phenotypic diversity, environmental adaptation and species divergence across eukaryotes. The presence of CNVs also has the potential to introduce genotyping biases, which can pose challenges to accurate population and quantitative genetic analyses. However, detecting CNVs in genomes, particularly in non-model organisms, presents a formidable challenge. To address this issue, we have developed a statistical framework and an accompanying r software package that leverage allelic-read depth from single nucleotide polymorphism (SNP) data for accurate CNV detection. Our framework capitalises on two key principles. First, it exploits the distribution of allelic-read depth ratios in heterozygotes for individual SNPs by comparing it against an expected distribution based on binomial sampling. Second, it identifies SNPs exhibiting an apparent excess of heterozygotes under Hardy–Weinberg equilibrium. By employing multiple statistical tests, our method not only enhances sensitivity to sampling effects but also effectively addresses reference biases, resulting in optimised SNP classification. Our framework is compatible with various NGS technologies (e.g. RADseq, Exome-capture). This versatility enables CNV calling from genomes of diverse complexities. To streamline the analysis process, we have implemented our framework in the user-friendly r package ‘rCNV’, which automates the entire workflow seamlessly. We trained our models using simulated data and validated their performance on four datasets derived from different sequencing technologies, including RADseq (Chinook salmon—Oncorhynchus tshawytscha), Rapture (American lobster—Homarus americanus), Exome-capture (Norway spruce—Picea abies) and WGS (Malaria mosquito—Anopheles gambiae). 相似文献

15.

SNP Discovery Using Next Generation Transcriptomic Sequencing in Atlantic Herring (Clupea harengus) 总被引：1，自引：0，他引：1

SJ Helyar MT Limborg D Bekkevold M Babbucci J van Houdt GE Maes L Bargelloni RO Nielsen MI Taylor R Ogden A Cariani GR Carvalho F Consortium F Panitz 《PloS one》2012,7(8):e42089

The introduction of Next Generation Sequencing (NGS) has revolutionised population genetics, providing studies of non-model species with unprecedented genomic coverage, allowing evolutionary biologists to address questions previously far beyond the reach of available resources. Furthermore, the simple mutation model of Single Nucleotide Polymorphisms (SNPs) permits cost-effective high-throughput genotyping in thousands of individuals simultaneously. Genomic resources are scarce for the Atlantic herring (Clupea harengus), a small pelagic species that sustains high revenue fisheries. This paper details the development of 578 SNPs using a combined NGS and high-throughput genotyping approach. Eight individuals covering the species distribution in the eastern Atlantic were bar-coded and multiplexed into a single cDNA library and sequenced using the 454 GS FLX platform. SNP discovery was performed by de novo sequence clustering and contig assembly, followed by the mapping of reads against consensus contig sequences. Selection of candidate SNPs for genotyping was conducted using an in silico approach. SNP validation and genotyping were performed simultaneously using an Illumina 1,536 GoldenGate assay. Although the conversion rate of candidate SNPs in the genotyping assay cannot be predicted in advance, this approach has the potential to maximise cost and time efficiencies by avoiding expensive and time-consuming laboratory stages of SNP validation. Additionally, the in silico approach leads to lower ascertainment bias in the resulting SNP panel as marker selection is based only on the ability to design primers and the predicted presence of intron-exon boundaries. Consequently SNPs with a wider spectrum of minor allele frequencies (MAFs) will be genotyped in the final panel. The genomic resources presented here represent a valuable multi-purpose resource for developing informative marker panels for population discrimination, microarray development and for population genomic studies in the wild. 相似文献

16.

Resequencing studies of nonmodel organisms using closely related reference genomes: optimal experimental designs and bioinformatics approaches for population genomics

B. Nevado S. E. Ramos‐Onsins M. Perez‐Enciso 《Molecular ecology》2014,23(7):1764-1779

Decreasing costs of next‐generation sequencing (NGS) experiments have made a wide range of genomic questions open for study with nonmodel organisms. However, experimental designs and analysis of NGS data from less well‐known species are challenging because of the lack of genomic resources. In this work, we investigate the performance of alternative experimental designs and bioinformatics approaches in estimating variability and neutrality tests based on the site‐frequency‐spectrum (SFS) from individual resequencing data. We pay particular attention to challenges faced in the study of nonmodel organisms, in particular the absence of a species‐specific reference genome, although phylogenetically close genomes are assumed to be available. We compare the performance of three alternative bioinformatics approaches – genotype calling, genotype–haplotype calling and direct estimation without calling genotypes. We find that relying on genotype calls provides biased estimates of population genetic statistics at low to moderate read depth (2–8×). Genotype–haplotype calling returns more accurate estimates irrespective of the divergence to the reference genome, but requires moderate depth (8–20×). Direct estimation without calling genotypes returns the most accurate estimates of variability and of most SFS tests investigated, including at low read depth (2–4×). Studies without species‐specific reference genome should thus aim for low read depth and avoid genotype calling whenever individual genotypes are not essential. Otherwise, aiming for moderate to high depth at the expense of number of individuals, and using genotype–haplotype calling, is recommended. 相似文献

17.

Reference-free detection of isolated SNPs

Raluca Uricaru Guillaume Rizk Vincent Lacroix Elsa Quillery Olivier Plantard Rayan Chikhi Claire Lemaitre Pierre Peterlongo 《Nucleic acids research》2015,43(2):e11

Detecting single nucleotide polymorphisms (SNPs) between genomes is becoming a routine task with next-generation sequencing. Generally, SNP detection methods use a reference genome. As non-model organisms are increasingly investigated, the need for reference-free methods has been amplified. Most of the existing reference-free methods have fundamental limitations: they can only call SNPs between exactly two datasets, and/or they require a prohibitive amount of computational resources. The method we propose, discoSnp, detects both heterozygous and homozygous isolated SNPs from any number of read datasets, without a reference genome, and with very low memory and time footprints (billions of reads can be analyzed with a standard desktop computer). To facilitate downstream genotyping analyses, discoSnp ranks predictions and outputs quality and coverage per allele. Compared to finding isolated SNPs using a state-of-the-art assembly and mapping approach, discoSnp requires significantly less computational resources, shows similar precision/recall values, and highly ranked predictions are less likely to be false positives. An experimental validation was conducted on an arthropod species (the tick Ixodes ricinus) on which de novo sequencing was performed. Among the predicted SNPs that were tested, 96% were successfully genotyped and truly exhibited polymorphism. 相似文献

18.

A beginners guide to SNP calling from high-throughput DNA-sequencing data

A Altmann P Weber D Bader M Preuß EB Binder B Müller-Myhsok 《Human genetics》2012,131(10):1541-1554

High-throughput DNA sequencing (HTS) is of increasing importance in the life sciences. One of its most prominent applications is the sequencing of whole genomes or targeted regions of the genome such as all exonic regions (i.e., the exome). Here, the objective is the identification of genetic variants such as single nucleotide polymorphisms (SNPs). The extraction of SNPs from the raw genetic sequences involves many processing steps and the application of a diverse set of tools. We review the essential building blocks for a pipeline that calls SNPs from raw HTS data. The pipeline includes quality control, mapping of short reads to the reference genome, visualization and post-processing of the alignment including base quality recalibration. The final steps of the pipeline include the SNP calling procedure along with filtering of SNP candidates. The steps of this pipeline are accompanied by an analysis of a publicly available whole-exome sequencing dataset. To this end, we employ several alignment programs and SNP calling routines for highlighting the fact that the choice of the tools significantly affects the final results. 相似文献

19.

The use of genotyping by sequencing in blackcurrant (Ribes nigrum): developing high-resolution linkage maps in species without reference genome sequences

Joanne Russell Christine Hackett Pete Hedley Hui Liu Linda Milne Micha Bayer David Marshall Linzi Jorgensen Sandra Gordon Rex Brennan 《Molecular breeding : new strategies in plant improvement》2014,33(4):835-849

A genotyping by sequencing (GbS) approach is reported in blackcurrant (Ribes nigrum L.) using a de novo read assembly method developed because of the current absence of a reference genome sequence for this species. A new approach to single nucleotide polymorphism (SNP) genotype calling is described, where individual genotypes for a large number of SNPs were characterised from the GbS counts using a novel method based on a functional regression of major and minor allele read counts. The high-quality GbS SNPs were combined with SNPs and simple sequence repeats generated from other technologies to develop a linkage map with increased marker density and improved genome coverage, containing up to 204 SNPs on each linkage group. SNPs of lower quality were then located on the map using quantitative trait locus (QTL) interval mapping of the proportion of the major allele. Two QTL each for 100-berry weight and Brix scores, measured over three years, were identified using the map. The use of this approach to identify and map a significant number of novel SNPs in a woody species with hitherto limited genomic resources may have generic application to other under-resourced and minor species in the development of cost-effective and efficient high-density genetic maps. 相似文献

20.

Discovery of single-nucleotide polymorphisms (SNPs) in the uncharacterized genome of the ascomycete Ophiognomonia clavigignenti-juglandacearum from 454 sequence data

Broders KD Woeste KE San Miguel PJ Westerman RP Boland GJ 《Molecular ecology resources》2011,11(4):693-702

The benefits from recent improvement in sequencing technologies, such as the Roche GS FLX (454) pyrosequencing, may be even more valuable in non-model organisms, such as many plant pathogenic fungi of economic importance. One application of this new sequencing technology is the rapid generation of genomic information to identify putative single-nucleotide polymorphisms (SNPs) to be used for population genetic, evolutionary, and phylogeographic studies on non-model organisms. The focus of this research was to sequence, assemble, discover and validate SNPs in a fungal genome using 454 pyrosequencing when no reference sequence is available. Genomic DNA from eight isolates of Ophiognomonia clavigignenti-juglandacearum was pooled in one region of a four-region sequencing run on a Roche 454 GS FLX. This yielded 71 million total bases comprising 217,000 reads, 80% of which collapsed into 16,125,754 bases in 30,339 contigs upon assembly. By aligning reads from multiple isolates, we detected 298 SNPs using Roche's GS Mapper. With no reference sequence available, however, it was difficult to distinguish true polymorphisms from sequencing error. Eagleview software was used to manually examine each contig that contained one or more putative SNPs, enabling us to discard all but 45 of the original 298 putative SNPs. Of those 45 SNPs, 13 were validated using standard Sanger sequencing. This research provides a valuable genetic resource for research into the genus Ophiognomonia, demonstrates a framework for the rapid and cost-effective discovery of SNP markers in non-model organisms and should prove especially useful in the case of asexual or clonal fungi with limited genetic variability. 相似文献