首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Gender assignment errors are common in some animal species and lead to inaccuracies in downstream analyses. Procedures for detecting gender misassignment are available for array‐based SNP data but are still being developed for genotyping‐by‐sequencing (GBS) data. In this study, we describe a method for using GBS data to predict gender using X and Y chromosomal SNPs. From a set of 1286 X chromosomal and 23 Y chromosomal deer (Cervus sp.) SNPs discovered from GBS sequence reads, a prediction model was built using a training dataset of 422 Red deer and validated using a test dataset of 868 Red deer and Wapiti deer. Prediction was based on the proportion of heterozygous genotypes on the X chromosome and the proportion of non‐missing genotypes on the Y chromosome observed in each individual. The concordance between recorded gender and predicted gender was 98.6% in the training dataset and 99.3% in the test dataset. The model identified five individuals across both datasets with incorrect recorded gender and was unable to predict gender for another five individuals. Overall, our method predicted gender with a high degree of accuracy and could be used for quality control in gender assignment datasets or for assigning gender when unrecorded, provided a suitable reference genome is available.  相似文献   

2.
Blue catfish, Ictalurus furcatus, are valued in the United States as a trophy fishery for their capacity to reach large sizes, sometimes exceeding 45 kg. Additionally, blue catfish × channel catfish (I. punctatus) hybrid food fish production has recently increased the demand for blue catfish broodstock. However, there has been little study of the genetic impacts and interaction of farmed, introduced and stocked populations of blue catfish. We utilized genotyping‐by‐sequencing (GBS) to capture and genotype SNP markers on 190 individuals from five wild and domesticated populations (Mississippi River, Missouri, D&B, Rio Grande and Texas). Stringent filtering of SNP‐calling parameters resulted in 4275 SNP loci represented across all five populations. Population genetics and structure analyses revealed potential shared ancestry and admixture between populations. We utilized the Sequenom MassARRAY to validate two multiplex panels of SNPs selected from the GBS data. Selection criteria included SNPs shared between populations, SNPs specific to populations, number of reads per individual and number of individuals genotyped by GBS. Putative SNPs were validated in the discovery population and in two additional populations not used in the GBS analysis. A total of 64 SNPs were genotyped successfully in 191 individuals from nine populations. Our results should guide the development of highly informative, flexible genotyping multiplexes for blue catfish from the larger GBS SNP set as well as provide an example of a rapid, low‐cost approach to generate and genotype informative marker loci in aquatic species with minimal previous genetic information.  相似文献   

3.
In a de novo genotyping‐by‐sequencing (GBS) analysis of short, 64‐base tag‐level haplotypes in 4657 accessions of cultivated oat, we discovered 164741 tag‐level (TL) genetic variants containing 241224 SNPs. From this, the marker density of an oat consensus map was increased by the addition of more than 70000 loci. The mapped TL genotypes of a 635‐line diversity panel were used to infer chromosome‐level (CL) haplotype maps. These maps revealed differences in the number and size of haplotype blocks, as well as differences in haplotype diversity between chromosomes and subsets of the diversity panel. We then explored potential benefits of SNP vs. TL vs. CL GBS variants for mapping, high‐resolution genome analysis and genomic selection in oats. A combined genome‐wide association study (GWAS) of heading date from multiple locations using both TL haplotypes and individual SNP markers identified 184 significant associations. A comparative GWAS using TL haplotypes, CL haplotype blocks and their combinations demonstrated the superiority of using TL haplotype markers. Using a principal component‐based genome‐wide scan, genomic regions containing signatures of selection were identified. These regions may contain genes that are responsible for the local adaptation of oats to Northern American conditions. Genomic selection for heading date using TL haplotypes or SNP markers gave comparable and promising prediction accuracies of up to r = 0.74. Genomic selection carried out in an independent calibration and test population for heading date gave promising prediction accuracies that ranged between r = 0.42 and 0.67. In conclusion, TL haplotype GBS‐derived markers facilitate genome analysis and genomic selection in oat.  相似文献   

4.
Days open (DO), which is the interval from calving to conception, is an important trait related to reproductive performance in cattle. To identify quantitative trait loci for DO in Japanese Black cattle, we conducted a genome‐wide association study with 33 303 single nucleotide polymorphisms (SNPs) using 459 animals with extreme DO values selected from a larger group of 15 488 animals. We identified a SNP on bovine chromosome 2 (BTA2) that was associated with DO. After imputation using phased haplotype data inferred from 586 812 SNPs of 1041 Japanese Black cattle, six SNPs associated with DO were located in an 8.5‐kb region of high linkage disequilibrium on BTA2. These SNPs were located on the telomeric side at a distance of 177 kb from the parathyroid hormone 2 receptor (PTH2R) gene. The association was replicated in a sample of 1778 animals. In the replicated population, the frequency of the reduced‐DO allele (Q) was 0.63, and it accounted for 1.72% of the total genetic variance. The effect of a Q‐to‐q allele substitution on DO was a decrease of 3.74 days. The results suggest that the Q allele could serve as a marker in Japanese Black cattle to select animals with superior DO performance.  相似文献   

5.
6.
Single‐nucleotide polymorphisms (SNPs) are rapidly becoming the standard markers in population genomics studies; however, their use in nonmodel organisms is limited due to the lack of cost‐effective approaches to uncover genome‐wide variation, and the large number of individuals needed in the screening process to reduce ascertainment bias. To discover SNPs for population genomics studies in the fungal symbionts of the mountain pine beetle (MPB), we developed a road map to discover SNPs and to produce a genotyping platform. We undertook a whole‐genome sequencing approach of Leptographium longiclavatum in combination with available genomics resources of another MPB symbiont, Grosmannia clavigera. We sequenced 71 individuals pooled into four groups using the Illumina sequencing technology. We generated between 27 and 30 million reads of 75 bp that resulted in a total of 1, 181 contigs longer than 2 kb and an assembled genome size of 28.9 Mb (N50 = 48 kb, average depth = 125x). A total of 9052 proteins were annotated, and between 9531 and 17 266 SNPs were identified in the four pools. A subset of 206 genes (containing 574 SNPs, 11% false positives) was used to develop a genotyping platform for this species. Using this roadmap, we developed a genotyping assay with a total of 147 SNPs located in 121 genes using the Illumina® Sequenom iPLEX Gold. Our preliminary genotyping (success rate = 85%) of 304 individuals from 36 populations supports the utility of this approach for population genomics studies in other MPB fungal symbionts and other fungal nonmodel species.  相似文献   

7.
Research in evolutionary biology involving nonmodel organisms is rapidly shifting from using traditional molecular markers such as mtDNA and microsatellites to higher throughput SNP genotyping methodologies to address questions in population genetics, phylogenetics and genetic mapping. Restriction site associated DNA sequencing (RAD sequencing or RADseq) has become an established method for SNP genotyping on Illumina sequencing platforms. Here, we developed a protocol and adapters for double‐digest RAD sequencing for Ion Torrent (Life Technologies; Ion Proton, Ion PGM) semiconductor sequencing. We sequenced thirteen genomic libraries of three different nonmodel vertebrate species on Ion Proton with PI chips: Arctic charr Salvelinus alpinus, European whitefish Coregonus lavaretus and common lizard Zootoca vivipara. This resulted in ~962 million single‐end reads overall and a mean of ~74 million reads per library. We filtered the genomic data using Stacks, a bioinformatic tool to process RAD sequencing data. On average, we obtained ~11 000 polymorphic loci per library of 6–30 individuals. We validate our new method by technical and biological replication, by reconstructing phylogenetic relationships, and using a hybrid genetic cross to track genomic variants. Finally, we discuss the differences between using the different sequencing platforms in the context of RAD sequencing, assessing possible advantages and disadvantages. We show that our protocol can be used for Ion semiconductor sequencing platforms for the rapid and cost‐effective generation of variable and reproducible genetic markers.  相似文献   

8.
With the advent of next generation sequencing, new avenues have opened to study genomics in wild populations of non‐model species. Here, we describe a successful approach to a genome‐wide medium density Single Nucleotide Polymorphism (SNP) panel in a non‐model species, the house sparrow (Passer domesticus), through the development of a 10 K Illumina iSelect HD BeadChip. Genomic DNA and cDNA derived from six individuals were sequenced on a 454 GS FLX system and generated a total of 1.2 million sequences, in which SNPs were detected. As no reference genome exists for the house sparrow, we used the zebra finch (Taeniopygia guttata) reference genome to determine the most likely position of each SNP. The 10 000 SNPs on the SNP‐chip were selected to be distributed evenly across 31 chromosomes, giving on average one SNP per 100 000 bp. The SNP‐chip was screened across 1968 individual house sparrows from four island populations. Of the original 10 000 SNPs, 7413 were found to be variable, and 99% of these SNPs were successfully called in at least 93% of all individuals. We used the SNP‐chip to demonstrate the ability of such genome‐wide marker data to detect population sub‐division, and compared these results to similar analyses using microsatellites. The SNP‐chip will be used to map Quantitative Trait Loci (QTL) for fitness‐related phenotypic traits in natural populations.  相似文献   

9.
Reduced representation genome sequencing such as restriction‐site‐associated DNA (RAD) sequencing is finding increased use to identify and genotype large numbers of single‐nucleotide polymorphisms (SNPs) in model and nonmodel species. We generated a unique resource of novel SNP markers for the European eel using the RAD sequencing approach that was simultaneously identified and scored in a genome‐wide scan of 30 individuals. Whereas genomic resources are increasingly becoming available for this species, including the recent release of a draft genome, no genome‐wide set of SNP markers was available until now. The generated SNPs were widely distributed across the eel genome, aligning to 4779 different contigs and 19 703 different scaffolds. Significant variation was identified, with an average nucleotide diversity of 0.00529 across individuals. Results varied widely across the genome, ranging from 0.00048 to 0.00737 per locus. Based on the average nucleotide diversity across all loci, long‐term effective population size was estimated to range between 132 000 and 1 320 000, which is much higher than previous estimates based on microsatellite loci. The generated SNP resource consisting of 82 425 loci and 376 918 associated SNPs provides a valuable tool for future population genetics and genomics studies and allows for targeting specific genes and particularly interesting regions of the eel genome.  相似文献   

10.
A high-density single-nucleotide polymorphism (SNP) map was developed for Xq25–q28 using a targeted approach to SNP discovery. This high-density map includes 217 new SNP markers, and 117 are informative in the CEPH parent population with >20% minor allele frequency. The average distance between SNP markers is 100 kb in the targeted regions. This is the densest genetic map of Xq25–q28 to date. The SNP markers are presented in order by their distance in megabases along the X chromosome, and the markers from the current genetic map are placed using the same scale to produce an integrated map of the region.  相似文献   

11.
Copy number variations (CNVs) have recently been identified as promising sources of genetic variation, complementary to single nucleotide polymorphisms (SNPs). As a result, detection of CNVs has attracted a great deal of attention. In this study, we performed genome‐wide CNV detection using Illumina Bovine HD BeadChip (770k) data on 792 Simmental cattle. A total of 263 CNV regions (CNVRs) were identified, which included 137 losses, 102 gains and 24 regions classified as both loss and gain, covering 35.48 Mb (1.41%) of the bovine genome. The length of these CNVRs ranged from 10.18 kb to 1.76 Mb, with an average length of 134.78 kb and a median length of 61.95 kb. In 136 of these regions, a total of 313 genes were identified related to biological functions such as transmembrane activity and olfactory transduction activity. To validate the results, we performed quantitative PCR to detect nine randomly selected CNVRs and successfully confirmed seven (77.6%) of them. Our results present a map of cattle CNVs derived from high‐density SNP data, which expands the current CNV map of the cattle genome and provides useful information for investigation of genomic structural variation in cattle.  相似文献   

12.
Single nucleotide polymorphisms (SNPs) are essential to the understanding of population genetic variation and diversity. Here, we performed restriction‐site‐associated DNA sequencing (RAD‐seq) on 72 individuals from 13 Chinese indigenous and three introduced chicken breeds. A total of 620 million reads were obtained using an Illumina Hiseq2000 sequencer. An average of 75 587 SNPs were identified from each individual. Further filtering strictly validated 28 895 SNPs candidates for all populations. When compared with the NCBI dbSNP (chicken_9031), 15 404 SNPs were new discoveries. In this study, RAD‐seq was performed for the first time on chickens, implicating the remarkable effectiveness and potential applications on genetic analysis and breeding technique for whole‐genome selection in chicken and other agricultural animals.  相似文献   

13.
With its vast territory and complex natural environment, China boasts rich cattle genetic resources. To gain the further insight into the genetic diversity and paternal origins of Chinese cattle, we analyzed the polymorphism of Y‐SNPs (UTY19 and ZFY10) and Y‐STRs (INRA189 and BM861) in 34 Chinese cattle breeds/populations, including 606 males representative of 24 cattle breeds/populations collected in this study as well as previously published data for 302 bulls. Combined genotypic data identified 14 Y‐chromosome haplotypes that represented three haplogroups. Y2‐104‐158 and Y2‐102‐158 were the most common taurine haplotypes detected mainly in northern and central China, whereas the indicine haplotype Y3‐88‐156 predominates in southern China. Haplotypes Y2‐108‐158, Y2‐110‐158, Y2‐112‐158 and Y3‐92‐156 were private to Chinese cattle. The population structure revealed by multidimensional scaling analysis differentiated Tibetan cattle from the other three groups of cattle. Analysis of molecular variance showed that the majority of the genetic variation was explained by the genetic differences among groups. Overall, our study indicates that Chinese cattle retain high paternal diversity (= 0.607 ± 0.016) and probably much of the original lineages that derived from the domestication center in the Near East without strong admixture from commercial cattle carrying Y1 haplotypes.  相似文献   

14.
The genus Agapornis, or lovebirds, are popular pet parrots worldwide. Currently, breeders are dependent on pedigree records as a selection tool as no molecular parentage verification test is available for any of the nine species. The A. roseicollis reference genome was recently assembled. This was followed by the sequencing of the whole genomes of the parents of the reference genome individual at 30× coverage. The parents’ reads were mapped against the reference genome to identify SNPs. Over 1.6 million SNPs, shared between the parents, were discovered using the Genome Analysis Toolkit pipeline. SNPs were filtered to a panel of 480 SNPs based on Genome Analysis Toolkit parameters. The panel of 480 SNPs was genotyped in a population of 960 lovebirds across seven species. A panel of 262 SNPs was compiled that included SNPs successfully amplified across all species. The 262‐SNP panel was reduced based on the observed heterozygosity (HO) and minor allele frequency (MAF) values per SNP to include the lowest number of SNPs with the highest exclusion power for parentage verification. Two smaller panels consisting of 195 SNPs with MAF and HO values >0.1 and 40 SNPs with MAF and HO values >0.3, were constructed. The panels were verified using 43 families from different species with known relationships to evaluate the exclusion power of each panel. The 195 SNP panel with an average exclusion probability of 99.9% and MAF and HO values >0.1 was proposed as the routine Agapornis parentage verification panel.  相似文献   

15.
The European rabbit (Oryctolagus cuniculus) is a domesticated species with one of the broadest ranges of economic and scientific applications and fields of investigation. Rabbit genome information and assembly are available (oryCun2.0), but so far few studies have investigated its variability, and massive discovery of polymorphisms has not been published yet for this species. Here, we sequenced two reduced representation libraries (RRLs) to identify single nucleotide polymorphisms (SNPs) in the rabbit genome. Genomic DNA of 10 rabbits belonging to different breeds was pooled and digested with two restriction enzymes (HaeIII and RsaI) to create two RRLs which were sequenced using the Ion Torrent Personal Genome Machine. The two RRLs produced 2 917 879 and 4 046 871 reads, for a total of 280.51 Mb (248.49 Mb with quality >20) and 417.28 Mb (360.89 Mb with quality >20) respectively of sequenced DNA. About 90% and 91% respectively of the obtained reads were mapped on the rabbit genome, covering a total of 15.82% of the oryCun2.0 genome version. The mapping and ad hoc filtering procedures allowed to reliably call 62 491 SNPs. SNPs in a few genomic regions were validated by Sanger sequencing. The Variant Effect Predictor Web tool was used to map SNPs on the current version of the rabbit genome. The obtained results will be useful for many applied and basic research programs for this species and will contribute to the development of cost‐effective solutions for high‐throughput SNP genotyping in the rabbit.  相似文献   

16.
Cultivated peanut (Arachis hypogaea L.) is an important grain legume providing high‐quality cooking oil, rich proteins and other nutrients. Shelling percentage (SP) is the 2nd most important agronomic trait after pod yield and this trait significantly affects the economic value of peanut in the market. Deployment of diagnostic markers through genomics‐assisted breeding (GAB) can accelerate the process of developing improved varieties with enhanced SP. In this context, we deployed the QTL‐seq approach to identify genomic regions and candidate genes controlling SP in a recombinant inbred line population (Yuanza 9102 × Xuzhou 68‐4). Four libraries (two parents and two extreme bulks) were constructed and sequenced, generating 456.89–790.32 million reads and achieving 91.85%–93.18% genome coverage and 14.04–21.37 mean read depth. Comprehensive analysis of two sets of data (Yuanza 9102/two bulks and Xuzhou 68‐4/two bulks) using the QTL‐seq pipeline resulted in discovery of two overlapped genomic regions (2.75 Mb on A09 and 1.1 Mb on B02). Nine candidate genes affected by 10 SNPs with non‐synonymous effects or in UTRs were identified in these regions for SP. Cost‐effective KASP (Kompetitive Allele‐Specific PCR) markers were developed for one SNP from A09 and three SNPs from B02 chromosome. Genotyping of the mapping population with these newly developed KASP markers confirmed the major control and stable expressions of these genomic regions across five environments. The identified candidate genomic regions and genes for SP further provide opportunity for gene cloning and deployment of diagnostic markers in molecular breeding for achieving high SP in improved varieties.  相似文献   

17.
Body weight is a complex trait in cattle associated with commonly used commercial breeding measurements related to growth. Although many quantitative trait loci (QTL) for body weight have been identified in cattle so far, searching for genetic determinants in different breeds or environments is promising. Therefore, we carried out a genome‐wide association study (GWAS) in two cattle populations from the Russian Federation (Siberian region) using the GGP HD150K array containing 139 376 single nucleotide polymorphism (SNP) markers. Association tests for 107 550 SNPs left after filtering revealed five statistically significant SNPs on BTA5, considering a false discovery rate of less than 0.05. The chromosomal region containing these five SNPs contains the CCND2 gene, which was previously associated with average daily weight gain and body mass index in US beef cattle populations and in humans respectively. Our study is the first GWAS for body weight in beef cattle populations from the Russian Federation. The results provided here suggest that, despite the existence of breed‐ and species‐specific QTL, the genetic architecture of body weight could be evolutionarily conserved in mammals.  相似文献   

18.
With the access to draft genome sequence assemblies and whole‐genome resequencing data from population samples, molecular ecology studies will be able to take truly genome‐wide approaches. This now applies to an avian model system in ecological and evolutionary research: Old World flycatchers of the genus Ficedula, for which we recently obtained a 1.1 Gb collared flycatcher genome assembly and identified 13 million single‐nucleotide polymorphism (SNP)s in population resequencing of this species and its sister species, pied flycatcher. Here, we developed a custom 50K Illumina iSelect flycatcher SNP array with markers covering 30 autosomes and the Z chromosome. Using a number of selection criteria for inclusion in the array, both genotyping success rate and polymorphism information content (mean marker heterozygosity = 0.41) were high. We used the array to assess linkage disequilibrium (LD) and hybridization in flycatchers. Linkage disequilibrium declined quickly to the background level at an average distance of 17 kb, but the extent of LD varied markedly within the genome and was more than 10‐fold higher in ‘genomic islands’ of differentiation than in the rest of the genome. Genetic ancestry analysis identified 33 F1 hybrids but no later‐generation hybrids from sympatric populations of collared flycatchers and pied flycatchers, contradicting earlier reports of backcrosses identified from much fewer number of markers. With an estimated divergence time as recently as <1 Ma, this suggests strong selection against F1 hybrids and unusually rapid evolution of reproductive incompatibility in an avian system.  相似文献   

19.
SNP arrays are widely used in genetic research and agricultural genomics applications, and the quality of SNP genotyping data is of paramount importance. In the present study, SNP genotyping concordance and discordance were evaluated for commercial bovine SNP arrays based on two types of quality assurance (QA) samples provided by Neogen GeneSeek. The genotyping discordance rates (GDRs) between chips were on average between 0.06% and 0.37% based on the QA type I data and between 0.05% and 0.15% based on the QA type II data. The average genotyping error rate (GER) pertaining to single SNP chips, based on the QA type II data, varied between 0.02% and 0.08% per SNP and between 0.01% and 0.06% per sample. These results indicate that genotyping concordance rate was high (i.e. from 99.63% to 99.99%). Nevertheless, mitochondrial and Y chromosome SNPs had considerably elevated GDRs and GERs compared to the SNPs on the 29 autosomes and X chromosome. The majority of genotyping errors resulted from single allotyping errors, which also included the opposite instances for allele ‘dropout’ (i.e. from AB to AA or BB). Simultaneous allotyping errors on both alleles (e.g. mistaking AA for BB or vice versa) were relatively rare. Finally, a list of SNPs with a GER greater than 1% is provided. Interpretation of association effects of these SNPs, for example in genome‐wide association studies, needs to be taken with caution. The genotyping concordance information needs to be considered in the optimal design of future bovine SNP arrays.  相似文献   

20.
Detailed linkage and recombination rate maps are necessary to use the full potential of genome sequencing and population genomic analyses. We used a custom collared flycatcher 50 K SNP array to develop a high‐density linkage map with 37 262 markers assigned to 34 linkage groups in 33 autosomes and the Z chromosome. The best‐order map contained 4215 markers, with a total distance of 3132 cM and a mean genetic distance between markers of 0.12 cM . Facilitated by the array being designed to include markers from most scaffolds, we obtained a second‐generation assembly of the flycatcher genome that approaches full chromosome sequences (N50 super‐scaffold size 20.2 Mb and with 1.042 Gb (of 1.116 Gb) anchored to and mostly ordered and oriented along chromosomes). We found that flycatcher and zebra finch chromosomes are entirely syntenic but that inversions at mean rates of 1.5–2.0 event (6.6–7.5 Mb) per My have changed the organization within chromosomes, rates high enough for inversions to potentially have been involved with many speciation events during avian evolution. The mean recombination rate was 3.1 cM /Mb and correlated closely with chromosome size, from 2 cM /Mb for chromosomes >100 Mb to >10 cM /Mb for chromosomes <10 Mb. This size dependence seemed entirely due to an obligate recombination event per chromosome; if 50 cM was subtracted from the genetic lengths of chromosomes, the rate per physical unit DNA was constant across chromosomes. Flycatcher recombination rate showed similar variation along chromosomes as chicken but lacked the large interior recombination deserts characteristic of zebra finch chromosomes.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号