首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
    
Previously we extended the utility of mapping‐by‐sequencing by combining it with sequence capture and mapping sequence data to pseudo‐chromosomes that were organized using wheat–Brachypodium synteny. This, with a bespoke haplotyping algorithm, enabled us to map the flowering time locus in the diploid wheat Triticum monococcum L. identifying a set of deleted genes (Gardiner et al., 2014). Here, we develop this combination of gene enrichment and sliding window mapping‐by‐synteny analysis to map the Yr6 locus for yellow stripe rust resistance in hexaploid wheat. A 110 MB NimbleGen capture probe set was used to enrich and sequence a doubled haploid mapping population of hexaploid wheat derived from an Avalon and Cadenza cross. The Yr6 locus was identified by mapping to the POPSEQ chromosomal pseudomolecules using a bespoke pipeline and algorithm (Chapman et al., 2015). Furthermore the same locus was identified using newly developed pseudo‐chromosome sequences as a mapping reference that are based on the genic sequence used for sequence enrichment. The pseudo‐chromosomes allow us to demonstrate the application of mapping‐by‐sequencing to even poorly defined polyploidy genomes where chromosomes are incomplete and sub‐genome assemblies are collapsed. This analysis uniquely enabled us to: compare wheat genome annotations; identify the Yr6 locus – defining a smaller genic region than was previously possible; associate the interval with one wheat sub‐genome and increase the density of SNP markers associated. Finally, we built the pipeline in iPlant, making it a user‐friendly community resource for phenotype mapping.  相似文献   

2.
Pine cones that remain closed and retain seeds until fire causes the cones to open (cone serotiny) represent a key adaptive trait in a variety of pine species. In lodgepole pine, there is substantial geographical variation in serotiny across the Rocky Mountain region. This variation in serotiny has evolved as a result of geographically divergent selection, with consequences that extend to forest communities and ecosystems. An understanding of the genetic architecture of this trait is of interest owing to the wide-reaching ecological consequences of serotiny and also because of the repeated evolution of the trait across the genus. Here, we present and utilize an inexpensive and time-effective method for generating population genomic data. The method uses restriction enzymes and PCR amplification to generate a library of fragments that can be sequenced with a high level of multiplexing. We obtained data for more than 95,000 single nucleotide polymorphisms across 98 serotinous and nonserotinous lodgepole pines from three populations. We used a Bayesian generalized linear model (GLM) to test for an association between genotypic variation at these loci and serotiny. The probability of serotiny varied by genotype at 11 loci, and the association between genotype and serotiny at these loci was consistent in each of the three populations of pines. Genetic variation across these 11 loci explained 50% of the phenotypic variation in serotiny. Our results provide a first genome-wide association map of serotiny in pines and demonstrate an inexpensive and efficient method for generating population genomic data.  相似文献   

3.
    
Single nucleotide polymorphisms (SNPs) are replacing microsatellites for population genetic analyses, but it is not apparent how many SNPs are needed or how well SNPs correlate with microsatellites. We used data from the gopher tortoise, Gopherus polyphemus—a species with small populations, to compare SNPs and microsatellites to estimate population genetic parameters. Specifically, we compared one SNP data set (16 tortoises from four populations sequenced at 17 901 SNPs) to two microsatellite data sets, a full data set of 101 tortoises and a partial data set of 16 tortoises previously genotyped at 10 microsatellites. For the full microsatellite data set, observed heterozygosity, expected heterozygosity and FST were correlated between SNPs and microsatellites; however, allelic richness was not. The same was true for the partial microsatellite data set, except that allelic richness, but not observed heterozygosity, was correlated. The number of clusters estimated by structure differed for each data set (SNPs = 2; partial microsatellite = 3; full microsatellite = 4). Principle component analyses (PCA) showed four clusters for all data sets. More than 800 SNPs were needed to correlate with allelic richness, observed heterozygosity and expected heterozygosity, but only 100 were needed for FST. The number of SNPs typically obtained from next‐generation sequencing (NGS) far exceeds the number needed to correlate with microsatellite parameter estimates. Our study illustrates that diversity, FST and PCA results from microsatellites can mirror those obtained with SNPs. These results may be generally applicable to small populations, a defining feature of endangered and threatened species, because theory predicts that genetic drift will tend to outweigh selection in small populations.  相似文献   

4.
    
The computer program exonsampler automates the sampling of thousands of exon sequences from publicly available reference genome sequences and gene annotation databases. It was designed to provide exon sequences for the efficient, next‐generation gene sequencing method called exon capture. The exon sequences can be sampled by a list of gene name abbreviations (e.g. IFNG, TLR1), or by sampling exons from genes spaced evenly across chromosomes. It provides a list of genomic coordinates (a bed file), as well as a set of sequences in fasta format. User‐adjustable parameters for collecting exon sequences include a minimum and maximum acceptable exon length, maximum number of exonic base pairs (bp) to sample per gene, and maximum total bp for the entire collection. It allows for partial sampling of very large exons. It can preferentially sample upstream (5 prime) exons, downstream (3 prime) exons, both external exons, or all internal exons. It is written in the Python programming language using its free libraries. We describe the use of exonsampler to collect exon sequences from the domestic cow (Bos taurus) genome for the design of an exon‐capture microarray to sequence exons from related species, including the zebu cow and wild bison. We collected ~10% of the exome (~3 million bp), including 155 candidate genes, and ~16 000 exons evenly spaced genomewide. We prioritized the collection of 5 prime exons to facilitate discovery and genotyping of SNPs near upstream gene regulatory DNA sequences, which control gene expression and are often under natural selection.  相似文献   

5.
6.
    
Crop wild relatives (CWR) provide an important source of allelic diversity for any given crop plant species for counteracting the erosion of genetic diversity caused by domestication and elite breeding bottlenecks. Hordeum bulbosum L. is representing the secondary gene pool of the genus Hordeum. It has been used as a source of genetic introgressions for improving elite barley germplasm (Hordeum vulgare L.). However, genetic introgressions from Hbulbosum have yet not been broadly applied, due to a lack of suitable molecular tools for locating, characterizing, and decreasing by recombination and marker‐assisted backcrossing the size of introgressed segments. We applied next‐generation sequencing (NGS) based strategies for unlocking genetic diversity of three diploid introgression lines of cultivated barley containing chromosomal segments of its close relative H. bulbosum. Firstly, exome capture‐based (re)‐sequencing revealed large numbers of single nucleotide polymorphisms (SNPs) enabling the precise allocation of H. bulbosum introgressions. This SNP resource was further exploited by designing a custom multiplex SNP genotyping assay. Secondly, two‐enzyme‐based genotyping‐by‐sequencing (GBS) was employed to allocate the introgressed H. bulbosum segments and to genotype a mapping population. Both methods provided fast and reliable detection and mapping of the introgressed segments and enabled the identification of recombinant plants. Thus, the utilization of H. bulbosum as a resource of natural genetic diversity in barley crop improvement will be greatly facilitated by these tools in the future.  相似文献   

7.
8.
    
Infectious diseases are a type of disease caused by pathogenic microorganisms. Although the discovery of antibiotics changed the treatment of infectious diseases and reduced the mortality of bacterial infections, resistant bacterial strains have emerged. Anti‐infective therapy based on aetiological evidence is the gold standard for clinical treatment, but the time lag and low positive culture rate of traditional methods of pathogen diagnosis leads to relative difficulty in obtaining the evidence of pathogens. Compared with traditional methods of pathogenic diagnosis, next‐generation and third‐generation sequencing technologies have many advantages in the detection of pathogenic microorganisms. In this review, we mainly introduce recent progress in research on pathogenic diagnostic technology and the applications of sequencing technology in the diagnosis of pathogenic microorganisms. This review provides new insights into the application of sequencing technology in the clinical diagnosis of microorganisms.  相似文献   

9.
10.
    
Mapping‐by‐sequencing analyses have largely required a complete reference sequence and employed whole genome re‐sequencing. In species such as wheat, no finished genome reference sequence is available. Additionally, because of its large genome size (17 Gb), re‐sequencing at sufficient depth of coverage is not practical. Here, we extend the utility of mapping by sequencing, developing a bespoke pipeline and algorithm to map an early‐flowering locus in einkorn wheat (Triticum monococcum L.) that is closely related to the bread wheat genome A progenitor. We have developed a genomic enrichment approach using the gene‐rich regions of hexaploid bread wheat to design a 110‐Mbp NimbleGen SeqCap EZ in solution capture probe set, representing the majority of genes in wheat. Here, we use the capture probe set to enrich and sequence an F2 mapping population of the mutant. The mutant locus was identified in T. monococcum, which lacks a complete genome reference sequence, by mapping the enriched data set onto pseudo‐chromosomes derived from the capture probe target sequence, with a long‐range order of genes based on synteny of wheat with Brachypodium distachyon. Using this approach we are able to map the region and identify a set of deleted genes within the interval.  相似文献   

11.
    
Next‐generation whole‐genome shotgun assemblies of complex genomes are highly useful, but fail to link nearby sequence contigs with each other or provide a linear order of contigs along individual chromosomes. Here, we introduce a strategy based on sequencing progeny of a segregating population that allows de novo production of a genetically anchored linear assembly of the gene space of an organism. We demonstrate the power of the approach by reconstructing the chromosomal organization of the gene space of barley, a large, complex and highly repetitive 5.1 Gb genome. We evaluate the robustness of the new assembly by comparison to a recently released physical and genetic framework of the barley genome, and to various genetically ordered sequence‐based genotypic datasets. The method is independent of the need for any prior sequence resources, and will enable rapid and cost‐efficient establishment of powerful genomic information for many species.  相似文献   

12.
    
The more demanding requirements of DNA preservation for genomic research can be difficult to meet when field conditions limit the methodological approaches that can be used or cause samples to be stored in suboptimal conditions. Such limitations may increase rates of DNA degradation, potentially rendering samples unusable for applications such as genome‐wide sequencing. Nonetheless, little is known about the impact of suboptimal sampling conditions. We evaluated the performance of two widely used preservation solutions (1. DESS: 20% DMSO, 0.25 M EDTA, NaCl saturated solution, and 2. Ethanol >99.5%) under a range of storage conditions over a three‐month period (sampling at 1 day, 1 week, 2 weeks, 1 month, and 3 months) to provide practical guidelines for DNA preservation. DNA degradation was quantified as the reduction in average DNA fragment size over time (DNA fragmentation) because the size distribution of DNA segments plays a key role in generating genomic datasets. Tissues were collected from a marine teleost species, the Australasian snapper, Chrysophrys auratus. We found that the storage solution has a strong effect on DNA preservation. In DESS, DNA was only moderately degraded after three months of storage while DNA stored in ethanol showed high levels of DNA degradation already within 24 hr, making samples unsuitable for next‐generation sequencing. Here, we conclude that DESS was the most promising solution when storing samples for genomic applications. We recognize that the best preservation protocol is highly dependent on the organism, tissue type, and study design. We highly recommend performing similar experiments before beginning a study. This study highlights the importance of testing sample preservation protocols and provides both practical and economical advice to improve DNA preservation when sampling for genome‐wide applications.  相似文献   

13.
14.
    
Target‐capture approach has improved over the past years, proving to be very efficient tool for selectively sequencing genetic regions of interest. These methods have also allowed the use of noninvasive samples such as faeces (characterized by their low quantity and quality of endogenous DNA) to be used in conservation genomic, evolution and population genetic studies. Here we aim to test different protocols and strategies for exome capture using the Roche SeqCap EZ Developer kit (57.5 Mb). First, we captured a complex pool of DNA libraries. Second, we assessed the influence of using more than one faecal sample, extract and/or library from the same individual, to evaluate its effect on the molecular complexity of the experiment. We validated our experiments with 18 chimpanzee faecal samples collected from two field sites as a part of the Pan African Programme: The Cultured Chimpanzee. Those two field sites are in Kibale National Park, Uganda (N = 9) and Loango National Park, Gabon (N = 9). We demonstrate that at least 16 libraries can be pooled, target enriched through hybridization, and sequenced allowing for the genotyping of 951,949 exome markers for population genetic analyses. Further, we observe that molecule richness, and thus, data acquisition, increase when using multiple libraries from the same extract or multiple extracts from the same sample. Finally, repeated captures significantly decrease the proportion of off‐target reads from 34.15% after one capture round to 7.83% after two capture rounds, supporting our conclusion that two rounds of target enrichment are advisable when using complex faecal samples.  相似文献   

15.
16.
17.
18.
    
By combining high‐throughput sequencing with target enrichment (‘hybridization capture’), researchers are able to obtain molecular data from genomic regions of interest for projects that are otherwise constrained by sample quality (e.g. degraded and contamination‐rich samples) or a lack of a priori sequence information (e.g. studies on nonmodel species). Despite the use of hybridization capture in various fields of research for many years, the impact of enrichment conditions on capture success is not yet thoroughly understood. We evaluated the impact of a key parameter – hybridization temperature – on the capture success of mitochondrial genomes across the carnivoran family Felidae. Capture was carried out for a range of sample types (fresh, archival, ancient) with varying levels of sequence divergence between bait and target (i.e. across a range of species) using pools of individually indexed libraries on Agilent SureSelect? arrays. Our results suggest that hybridization capture protocols require specific optimization for the sample type that is being investigated. Hybridization temperature affected the proportion of on‐target sequences following capture: for degraded samples, we obtained the best results with a hybridization temperature of 65 °C, while a touchdown approach (65 °C down to 50 °C) yielded the best results for fresh samples. Evaluation of capture performance at a regional scale (sliding window approach) revealed no significant improvement in the recovery of DNA fragments with high sequence divergence from the bait at any of the tested hybridization temperatures, suggesting that hybridization temperature may not be the critical parameter for the enrichment of divergent fragments.  相似文献   

19.
Next generation sequencing (NGS) has revolutionized genomics research, making it difficult to overstate its impact on studies of Biology. NGS will immediately allow researchers working in non‐mainstream species to obtain complete genomes together with a comprehensive catalogue of variants. In addition, RNA‐seq will be a decisive way to annotate genes that cannot be predicted purely by computational or comparative approaches. Future applications include whole genome sequence association studies, as opposed to classical SNP‐based association, and implementing this new source of information into breeding programmes. For these purposes, one of the main advantages of sequencing vs. genotyping is the possibility of identifying copy number variants. Currently, experimental design is a topic of utmost interest, and here we discuss some of the options available, including pools and reduced representation libraries. Although bioinformatics is still an important bottleneck, this limitation is only transient and should not deter animal geneticists from embracing these technologies.  相似文献   

20.
    
Museum specimens provide a wealth of information to biologists, but obtaining genetic data from formalin‐fixed and fluid‐preserved specimens remains challenging. While DNA sequences have been recovered from such specimens, most approaches are time‐consuming and produce low data quality and quantity. Here, we use a modified DNA extraction protocol combined with high‐throughput sequencing to recover DNA from formalin‐fixed and fluid‐preserved snakes that were collected over a century ago and for which little or no modern genetic materials exist in public collections. We successfully extracted DNA and sequenced ultraconserved elements ( = 2318 loci) from 10 fluid‐preserved snakes and included them in a phylogeny with modern samples. This phylogeny demonstrates the general use of such specimens in phylogenomic studies and provides evidence for the placement of enigmatic snakes, such as the rare and never‐before sequenced Indian Xylophis stenorhynchus. Our study emphasizes the relevance of museum collections in modern research and simultaneously provides a protocol that may prove useful for specimens that have been previously intractable for DNA sequencing.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号