首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Advances in sequencing technology have led to a rapid rise in the genomic data available for plants, driving new insights into the evolution, domestication and improvement of crops. Single nucleotide polymorphisms (SNPs) are a major component of crop genomic diversity, and are invaluable as genetic markers in research and breeding programs. High‐throughput SNP arrays, or ‘SNP chips’, can generate reproducible sets of informative SNP markers and have been broadly adopted. Although there are many public repositories for sequencing data, which are routinely uploaded, there are no formal repositories for crop SNP array data. To make SNP array data more easily accessible, we have developed CropSNPdb ( http://snpdb.appliedbioinformatics.com.au ), a database for SNP array data produced by the Illumina Infinium? hexaploid bread wheat (Triticum aestivum) 90K and Brassica 60K arrays. We currently host SNPs from datasets covering 526 Brassica lines and 309 bread wheat lines, and provide search, download and upload utilities for users. CropSNPdb provides a useful repository for these data, which can be applied for a range of genomics and molecular crop‐breeding activities.  相似文献   

2.
H. Zhou  D. Li  W. Liu  N. Yang 《Animal genetics》2013,44(3):276-284
Copy number variation (CNV) is considered an important genetic variation, contributing to many economically important traits in the chicken. Although CNVs can be detected using a comparative genomic hybridization array, the high‐density SNP array has provided an alternative way to identify CNVs in the chicken. In the current study, a chicken 60K SNP BeadChip was used to identify CNVs in two distinct chicken genetic lines (White Leghorn and dwarf) using the penncnv program. A total of 209 CNV regions were identified, distributing on chromosomes 1–22 and 24–28 and encompassing 13.55 Mb (1.42%) of chicken autosomal genome area. Three of seven selected CNVs (73.2% individuals) were completely validated by quantitative PCR. To our knowledge, this is the first report in the chicken identifying CNVs using a SNP array. Identification of 190 new identified CNVs illustrates the feasibility of the chicken 60K SNP BeadChip to detect CNVs in the chicken, which lays a solid foundation for future analyses of associations of CNVs with economically important phenotypes in chickens.  相似文献   

3.
Domestic dogs share a wide range of important disease conditions with humans, including cancers, diabetes and epilepsy. Many of these conditions have similar or identical underlying pathologies to their human counterparts and thus dogs represent physiologically relevant natural models of human disorders. Comparative genomic approaches whereby disease genes can be identified in dog diseases and then mapped onto the human genome are now recognized as a valid method and are increasing in popularity. The majority of dog breeds have been created over the past few hundred years and, as a consequence, the dog genome is characterized by extensive linkage disequilibrium (LD), extending usually from hundreds of kilobases to several megabases within a breed, rather than tens of kilobases observed in the human genome. Genome‐wide canine SNP arrays have been developed, and increasing success of using these arrays to map disease loci in dogs is emerging. No equivalent of the human HapMap currently exists for different canine breeds, and the LD structure for such breeds is far less understood than for humans. This study is a dedicated large‐scale assessment of the functionalities (LD and SNP tagging performance) of canine genome‐wide SNP arrays in multiple domestic dog breeds. We have used genotype data from 18 breeds as well as wolves and coyotes genotyped by the Illumina 22K canine SNP array and Affymetrix 50K canine SNP array. As expected, high tagging performance was observed with most of the breeds using both Illumina and Affymetrix arrays when multi‐marker tagging was applied. In contrast, however, large differences in population structure, LD coverage and pairwise tagging performance were found between breeds, suggesting that study designs should be carefully assessed for individual breeds before undertaking genome‐wide association studies (GWAS).  相似文献   

4.
Single nucleotide polymorphisms (SNPs) are essential for identifying the genetic mechanisms of complex traits. In the present study, we applied genotyping by genome reducing and sequencing (GGRS) method to construct a 252-plex sequencing library for SNP discovery and genotyping in chicken. The library was successfully sequenced on an Illumina HiSeq 2500 sequencer with a paired-end pattern; approximately 400 million raw reads were generated, and an average of approximately 1.4 million good reads per sample were generated. A total of 91,767 SNPs were identified after strict filtering, and all of the 252 samples and all of the chromosomes were well represented. Compared with the Illumina 60K chicken SNP chip data, approximately 34,131 more SNPs were identified using GGRS, and a higher SNP density was found using GGRS, which could be beneficial for downstream analysis. Using the GGRS method, more than 3528 samples can be sequenced simultaneously, and the cost is reduced to $18 per sample. To the best of our knowledge, this study describes the first report of such highly multiplexed sequencing in chicken, indicating potential applications for genome-wide association and genomic selection in chicken.  相似文献   

5.
High-density SNP arrays for genome-wide assessment of allelic variation have made high resolution genetic characterization of crop germplasm feasible. A medium density array for apple, the IRSC 8K SNP array, has been successfully developed and used for screens of bi-parental populations. However, the number of robust and well-distributed markers contained on this array was not sufficient to perform genome-wide association analyses in wider germplasm sets, or Pedigree-Based Analysis at high precision, because of rapid decay of linkage disequilibrium. We describe the development of an Illumina Infinium array targeting 20K SNPs. The SNPs were predicted from re-sequencing data derived from the genomes of 13 Malus × domestica apple cultivars and one accession belonging to a crab apple species (M. micromalus). A pipeline for SNP selection was devised that avoided the pitfalls associated with the inclusion of paralogous sequence variants, supported the construction of robust multi-allelic SNP haploblocks and selected up to 11 entries within narrow genomic regions of ±5 kb, termed focal points (FPs). Broad genome coverage was attained by placing FPs at 1 cM intervals on a consensus genetic map, complementing them with FPs to enrich the ends of each of the chromosomes, and by bridging physical intervals greater than 400 Kbps. The selection also included ∼3.7K validated SNPs from the IRSC 8K array. The array has already been used in other studies where ∼15.8K SNP markers were mapped with an average of ∼6.8K SNPs per full-sib family. The newly developed array with its high density of polymorphic validated SNPs is expected to be of great utility for Pedigree-Based Analysis and Genomic Selection. It will also be a valuable tool to help dissect the genetic mechanisms controlling important fruit quality traits, and to aid the identification of marker-trait associations suitable for the application of Marker Assisted Selection in apple breeding programs.  相似文献   

6.
7.
High-throughput SNP genotyping is widely used for plant genetic studies. Recently, a RICE6K SNP array has been developed based on the Illumina Bead Array platform and Infinium SNP assay technology for genome-wide evaluation of allelic variations and breeding applications. In this study, the RICE6K SNP array was used to genotype a recombinant inbred line (RIL) population derived from the cross between the indica variety, Zhenshan 97, and the japonica variety, Xizang 2. A total of 3324 SNP markers of high quality were identified and were grouped into 1495 recombination bins in the RIL population. A high-density linkage map, consisting of the 1495 bins, was developed, covering 1591.2 cM and with average length ofl.1 cM per bin. Segregation distortions were observed in 24 regions of the 11 chromosomes in the RILs. One half of the distorted regions contained fertility genes that had been previously reported. A total of 23 QTLs were identified for yield. Seven QTLs were firstly detected in this study. The positive alleles from about half of the identified QTLs came from Zhenshan 97 and they had lower phenotypic values than Xizang 2. This indicated that favorable alleles for breeding were dispersed in both parents and pyramiding favorable alleles could develop elite lines. The size of the mapping population for QTL analysis using high throughput SNP genotyping platform is also discussed.  相似文献   

8.
《Genomics》2022,114(4):110426
High-throughput single nucleotide polymorphism (SNP) genotyping assays are powerful tools for genetic studies and genomic breeding applications for many species. Though large numbers of SNPs have been identified in sea cucumber (Apostichopus japonicus), but, as yet, no high-throughput genotyping platform is available for this species. In this study, we designed and developed a high-throughput 24 K SNP genotyping array named HaishenSNP24K for A. japonicus, based on the multi-objective-local optimization (MOLO) algorithm and HD-Marker genotyping method. The SNP array exhibited a relatively high genotyping call rate (> 96%), genotyping accuracy (>95%) and exhibited highly polymorphic in sea cucumber populations. In addition, we also assessed its application in genomic selection (GS). Deep neural networks (DNN) that can capture the complicated interactions of genes have been proposed as a promising tool in GS for SNP-based genomic prediction of complex traits in animal breeding. To overcome the problem of over-fitting when using the HaishenSNP24K array as high-dimensional DNN input, we developed minmax concave penalty (MCP) regularization for sparse deep neural networks (DNN-MCP) that finds an optimal sparse structure of a DNN by minimizing the square error subject to the non-convex penalty MCP on the parameters (weights and biases). Compared to two linear models, namely RR-GBLUP and Bayes B, and the nonlinear model DNN, DNN-MCP has greatly improved the genomic prediction ability for three quantitative traits (e.g., wet weight, dry weight and survival time) in the sea cucumber population. To the best of our knowledge, this is the first work to develop a high-throughput SNP array for A. japonicus and a new model DNN-MCP for genomic prediction of complex traits in GS. The present results provide evidence that supports the HaishenSNP24K array with DNN-MCP will be valuable for genetic studies and molecular breeding in A. japonicus.  相似文献   

9.
《Genomics》2021,113(4):2096-2107
SNP arrays are powerful tools for high-resolution studies of the genetic basis of complex traits, facilitating both selective breeding and population genomic research. The European seabass (Dicentrarchus labrax) and the gilthead seabream (Sparus aurata) are the two most important fish species for Mediterranean aquaculture. While selective breeding programmes increasingly underpin stock supply for this industry, genomic selection is not yet widespread. Genomic selection has major potential to expedite genetic gain, particularly for traits practically impossible to measure on selection candidates, such as disease resistance and fillet characteristics. The aim of our study was to design a combined-species 60 K SNP array for European seabass and gilthead seabream, and to test its performance on farmed and wild populations from numerous locations throughout the species range. To achieve this, high coverage Illumina whole-genome sequencing of pooled samples was performed for 24 populations of European seabass and 27 populations of gilthead seabream. This resulted in a database of ~20 million SNPs per species, which were then filtered to identify high-quality variants and create the final set for the development of the ‘MedFish’ SNP array. The array was then tested by genotyping a subset of the discovery populations, highlighting a high conversion rate to functioning polymorphic assays on the array (92% in seabass; 89% in seabream) and repeatability (99.4–99.7%). The platform interrogates ~30 K markers in each species, includes features such as SNPs previously shown to be associated with performance traits, and is enriched for SNPs predicted to have high functional effects on proteins. The array was demonstrated to be effective at detecting population structure across a wide range of fish populations from diverse geographical origins, and to examine the extent of haplotype sharing among Mediterranean farmed fish populations. In conclusion, the new MedFish array enables efficient and accurate high-throughput genotyping for genome-wide distributed SNPs for each fish species, and will facilitate stock management, population genomics approaches, and acceleration of selective breeding through genomic selection.  相似文献   

10.
High‐density SNP genotyping arrays can be designed for any species given sufficient sequence information of high quality. Two high‐density SNP arrays relying on the Infinium iSelect technology (Illumina) were designed for use in the conifer white spruce (Picea glauca). One array contained 7338 segregating SNPs representative of 2814 genes of various molecular functional classes for main uses in genetic association and population genetics studies. The other one contained 9559 segregating SNPs representative of 9543 genes for main uses in population genetics, linkage mapping of the genome and genomic prediction. The SNPs assayed were discovered from various sources of gene resequencing data. SNPs predicted from high‐quality sequences derived from genomic DNA reached a genotyping success rate of 64.7%. Nonsingleton in silico SNPs (i.e. a sequence polymorphism present in at least two reads) predicted from expressed sequenced tags obtained with the Roche 454 technology and Illumina GAII analyser resulted in a similar genotyping success rate of 71.6% when the deepest alignment was used and the most favourable SNP probe per gene was selected. A variable proportion of these SNPs was shared by other nordic and subtropical spruce species from North America and Europe. The number of shared SNPs was inversely proportional to phylogenetic divergence and standing genetic variation in the recipient species, but positively related to allele frequency in P. glauca natural populations. These validated SNP resources should open up new avenues for population genetics and comparative genetic mapping at a genomic scale in spruce species.  相似文献   

11.
12.
Improvements in living standards have resulted in consumers having higher expectations for chicken meat quality. This is particularly true in Asia, where there is high consumer preference for local breeds. Nothing is presently known about the effectiveness of using genomic selection (GS) strategies in chickens to genetically improve meat quality traits that cannot be measured in living potential parents. In this study, 724 Beijing‐You chickens were used as a training population; all were genotyped using Illumina 60K SNP chips, and intramuscular fat content in breast muscle (IMFbr) was measured. Birds in the GS line were selected based on genomic estimated breeding values, IMFbr being the sole trait. Genetic progress in one generation was compared to that from conventional family‐based selection, and both were evaluated against random‐bred controls. Results showed that relative to the random‐bred controls, IMF percentage was improved 9.62% using GS, comparable to the 10.38% improvement using family‐based selection. We quantified the effectiveness of GS when applied to a meat quality trait with low heritability in chickens. We plan to introduce custom SNP chips, appropriate for native chicken breeds in China, to assist in applying GS in local breeding and accelerate genetic gain.  相似文献   

13.
Accurate and efficient genome-wide detection of copy number variants (CNVs) is essential for understanding human genomic variation, genome-wide CNV association type studies, cytogenetics research and diagnostics, and independent validation of CNVs identified from sequencing based technologies. Numerous, array-based platforms for CNV detection exist utilizing array Comparative Genome Hybridization (aCGH), Single Nucleotide Polymorphism (SNP) genotyping or both. We have quantitatively assessed the abilities of twelve leading genome-wide CNV detection platforms to accurately detect Gold Standard sets of CNVs in the genome of HapMap CEU sample NA12878, and found significant differences in performance. The technologies analyzed were the NimbleGen 4.2 M, 2.1 M and 3×720 K Whole Genome and CNV focused arrays, the Agilent 1×1 M CGH and High Resolution and 2×400 K CNV and SNP+CGH arrays, the Illumina Human Omni1Quad array and the Affymetrix SNP 6.0 array. The Gold Standards used were a 1000 Genomes Project sequencing-based set of 3997 validated CNVs and an ultra high-resolution aCGH-based set of 756 validated CNVs. We found that sensitivity, total number, size range and breakpoint resolution of CNV calls were highest for CNV focused arrays. Our results are important for cost effective CNV detection and validation for both basic and clinical applications.  相似文献   

14.
Next-generation sequencing has prompted a surge of discovery of millions of genetic variants from vertebrate genomes. Besides applications in genetic association and linkage studies, a fraction of these variants will have functional consequences. This study describes detection and characterization of 15 million SNPs from chicken genome with the goal to predict variants with potential functional implications (pfVars) from both coding and non-coding regions. The study reports: 183K amino acid-altering SNPs of which 48% predicted as evolutionary intolerant, 13K splicing variants, 51K likely to alter RNA secondary structures, 500K within most conserved elements and 3K from non-coding RNAs. Regions of local fixation within commercial broiler and layer lines were investigated as potential selective sweeps using genome-wide SNP data. Relationships with phenotypes, if any, of the pfVars were explored by overlaying the sweep regions with known QTLs. Based on this, the candidate genes and/or causal mutations for a number of important traits are discussed. Although the fixed variants within sweep regions were enriched with non-coding SNPs, some non-synonymous-intolerant mutations reached fixation, suggesting their possible adaptive advantage. The results presented in this study are expected to have important implications for future genomic research to identify candidate causal mutations and in poultry breeding.  相似文献   

15.
Although a large number of single nucleotide polymorphism (SNP) markers covering the entire genome are needed to enable molecular breeding efforts such as genome wide association studies, fine mapping, genomic selection and marker-assisted selection in peach [Prunus persica (L.) Batsch] and related Prunus species, only a limited number of genetic markers, including simple sequence repeats (SSRs), have been available to date. To address this need, an international consortium (The International Peach SNP Consortium; IPSC) has pursued a coordinated effort to perform genome-scale SNP discovery in peach using next generation sequencing platforms to develop and characterize a high-throughput Illumina Infinium® SNP genotyping array platform. We performed whole genome re-sequencing of 56 peach breeding accessions using the Illumina and Roche/454 sequencing technologies. Polymorphism detection algorithms identified a total of 1,022,354 SNPs. Validation with the Illumina GoldenGate® assay was performed on a subset of the predicted SNPs, verifying ∼75% of genic (exonic and intronic) SNPs, whereas only about a third of intergenic SNPs were verified. Conservative filtering was applied to arrive at a set of 8,144 SNPs that were included on the IPSC peach SNP array v1, distributed over all eight peach chromosomes with an average spacing of 26.7 kb between SNPs. Use of this platform to screen a total of 709 accessions of peach in two separate evaluation panels identified a total of 6,869 (84.3%) polymorphic SNPs.The almost 7,000 SNPs verified as polymorphic through extensive empirical evaluation represent an excellent source of markers for future studies in genetic relatedness, genetic mapping, and dissecting the genetic architecture of complex agricultural traits. The IPSC peach SNP array v1 is commercially available and we expect that it will be used worldwide for genetic studies in peach and related stone fruit and nut species.  相似文献   

16.

Key message

Imputing genotypes from the 90K SNP chip to exome sequence in wheat was moderately accurate. We investigated the factors that affect imputation and propose several strategies to improve accuracy.

Abstract

Imputing genetic marker genotypes from low to high density has been proposed as a cost-effective strategy to increase the power of downstream analyses (e.g. genome-wide association studies and genomic prediction) for a given budget. However, imputation is often imperfect and its accuracy depends on several factors. Here, we investigate the effects of reference population selection algorithms, marker density and imputation algorithms (Beagle4 and FImpute) on the accuracy of imputation from low SNP density (9K array) to the Infinium 90K single-nucleotide polymorphism (SNP) array for a collection of 837 hexaploid wheat Watkins landrace accessions. Based on these results, we then used the best performing reference selection and imputation algorithms to investigate imputation from 90K to exome sequence for a collection of 246 globally diverse wheat accessions. Accession-to-nearest-entry and genomic relationship-based methods were the best performing selection algorithms, and FImpute resulted in higher accuracy and was more efficient than Beagle4. The accuracy of imputing exome capture SNPs was comparable to imputing from 9 to 90K at approximately 0.71. This relatively low imputation accuracy is in part due to inconsistency between 90K and exome sequence formats. We also found the accuracy of imputation could be substantially improved to 0.82 when choosing an equivalent number of exome SNP, instead of 90K SNPs on the existing array, as the lower density set. We present a number of recommendations to increase the accuracy of exome imputation.
  相似文献   

17.
Innovations in genomics have enabled the development of low-cost, high-resolution, single nucleotide polymorphism (SNP) genotyping arrays that accelerate breeding progress and support basic research in crop science. Here, we developed and validated the SoySNP618K array (618,888 SNPs) for the important crop soybean. The SNPs were selected from whole-genome resequencing data containing 2,214 diverse soybean accessions; 29.34% of the SNPs mapped to genic regions representing 86.85% of the 56,044 annotated high-confidence genes. Identity-by-state analyses of 318 soybeans revealed 17 redundant accessions, highlighting the potential of the SoySNP618K array in supporting gene bank management. The patterns of population stratification and genomic regions enriched through domestication were highly consistent with previous findings based on resequencing data, suggesting that the ascertainment bias in the SoySNP618K array was largely compensated for. Genome-wide association mapping in combination with reported quantitative trait loci enabled fine-mapping of genes known to influence flowering time, E2 and GmPRR3b, and of a new candidate gene, GmVIP5. Moreover, genomic prediction of flowering and maturity time in 502 recombinant inbred lines was highly accurate (>0.65). Thus, the SoySNP618K array is a valuable genomic tool that can be used to address many questions in applied breeding, germplasm management, and basic crop research.  相似文献   

18.
The advances in genotyping technology provide an opportunity to use genomic tools in crop breeding. As compared to field selections performed in conventional breeding programmes, genomics‐based genotype screen can potentially reduce number of breeding cycles and more precisely integrate target genes for particular traits into an ideal genetic background. We developed a whole‐genome single nucleotide polymorphism (SNP) array, RICE6K, based on Infinium technology, using representative SNPs selected from more than four million SNPs identified from resequencing data of more than 500 rice landraces. RICE6K contains 5102 SNP and insertion–deletion (InDel) markers, about 4500 of which were of high quality in the tested rice lines producing highly repeatable results. Forty‐five functional markers that are located inside 28 characterized genes of important traits can be detected using RICE6K. The SNP markers are evenly distributed on the 12 chromosomes of rice with the average density of 12 SNPs per 1 Mb and can provide information for polymorphisms between indica and japonica subspecies as well as varieties within indica and japonica groups. Application tests of RICE6K showed that the array is suitable for rice germplasm fingerprinting, genotyping bulked segregating pools, seed authenticity check and genetic background selection. These results suggest that RICE6K provides an efficient and reliable genotyping tool for rice genomic breeding.  相似文献   

19.
With the access to draft genome sequence assemblies and whole‐genome resequencing data from population samples, molecular ecology studies will be able to take truly genome‐wide approaches. This now applies to an avian model system in ecological and evolutionary research: Old World flycatchers of the genus Ficedula, for which we recently obtained a 1.1 Gb collared flycatcher genome assembly and identified 13 million single‐nucleotide polymorphism (SNP)s in population resequencing of this species and its sister species, pied flycatcher. Here, we developed a custom 50K Illumina iSelect flycatcher SNP array with markers covering 30 autosomes and the Z chromosome. Using a number of selection criteria for inclusion in the array, both genotyping success rate and polymorphism information content (mean marker heterozygosity = 0.41) were high. We used the array to assess linkage disequilibrium (LD) and hybridization in flycatchers. Linkage disequilibrium declined quickly to the background level at an average distance of 17 kb, but the extent of LD varied markedly within the genome and was more than 10‐fold higher in ‘genomic islands’ of differentiation than in the rest of the genome. Genetic ancestry analysis identified 33 F1 hybrids but no later‐generation hybrids from sympatric populations of collared flycatchers and pied flycatchers, contradicting earlier reports of backcrosses identified from much fewer number of markers. With an estimated divergence time as recently as <1 Ma, this suggests strong selection against F1 hybrids and unusually rapid evolution of reproductive incompatibility in an avian system.  相似文献   

20.
The Brassica napus 60K Illumina Infinium? SNP array has had huge international uptake in the rapeseed community due to the revolutionary speed of acquisition and ease of analysis of this high-throughput genotyping data, particularly when coupled with the newly available reference genome sequence. However, further utilization of this valuable resource can be optimized by better understanding the promises and pitfalls of SNP arrays. We outline how best to analyze Brassica SNP marker array data for diverse applications, including linkage and association mapping, genetic diversity and genomic introgression studies. We present data on which SNPs are locus-specific in winter, semi-winter and spring B. napus germplasm pools, rather than amplifying both an A-genome and a C-genome locus or multiple loci. Common issues that arise when analyzing array data will be discussed, particularly those unique to SNP markers and how to deal with these for practical applications in Brassica breeding applications.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号