共查询到20条相似文献,搜索用时 0 毫秒
1.
RAD‐tag is a powerful tool for high‐throughput genotyping. It relies on PCR amplification of the starting material, following enzymatic digestion and sequencing adaptor ligation. Amplification introduces duplicate reads into the data, which arise from the same template molecule and are statistically nonindependent, potentially introducing errors into genotype calling. In shotgun sequencing, data duplicates are removed by filtering reads starting at the same position in the alignment. However, restriction enzymes target specific locations within the genome, causing reads to start in the same place, and making it difficult to estimate the extent of PCR duplication. Here, we introduce a slight change to the Illumina sequencing adaptor chemistry, appending a unique four‐base tag to the first index read, which allows duplicate discrimination in aligned data. This approach was validated on the Illumina MiSeq platform, using double‐digest libraries of ants (Wasmannia auropunctata) and yeast (Saccharomyces cerevisiae) with known genotypes, producing modest though statistically significant gains in the odds of calling a genotype accurately. More importantly, removing duplicates also corrected for strong sample‐to‐sample variability of genotype calling accuracy seen in the ant samples. For libraries prepared from low‐input degraded museum bird samples (Mixornis gularis), which had low complexity, having been generated from relatively few starting molecules, adaptor tags show that virtually all of the genotypes were called with inflated confidence as a result of PCR duplicates. Quantification of library complexity by adaptor tagging does not significantly increase the difficulty of the overall workflow or its cost, but corrects for differences in quality between samples and permits analysis of low‐input material. 相似文献
2.
Knowledge of kin relationships between members of wild animal populations has broad application in ecology and evolution research by allowing the investigation of dispersal dynamics, mating systems, inbreeding avoidance, kin recognition, and kin selection as well as aiding the management of endangered populations. However, the assessment of kinship among members of wild animal populations is difficult in the absence of detailed multigenerational pedigrees. Here, we first review the distinction between genetic relatedness and kinship derived from pedigrees and how this makes the identification of kin using genetic data inherently challenging. We then describe useful approaches to kinship classification, such as parentage analysis and sibship reconstruction, and explain how the combined use of marker systems with biparental and uniparental inheritance, demographic information, likelihood analyses, relatedness coefficients, and estimation of misclassification rates can yield reliable classifications of kinship in groups with complex kin structures. We outline alternative approaches for cases in which explicit knowledge of dyadic kinship is not necessary, but indirect inferences about kinship on a group‐ or population‐wide scale suffice, such as whether more highly related dyads are in closer spatial proximity. Although analysis of highly variable microsatellite loci is still the dominant approach for studies on wild populations, we describe how the long‐awaited use of large‐scale single‐nucleotide polymorphism and sequencing data derived from noninvasive low‐quality samples may eventually lead to highly accurate assessments of varying degrees of kinship in wild populations. 相似文献
3.
The objectives of this study were to develop breed-specific single nucleotide polymorphisms (SNPs) in five pig breeds sequenced with Illumina's Genome Analyzer and to investigate their usefulness for breed assignment purposes. DNA pools were prepared for Duroc, Landrace, Large White, Pietrain and Wild Boar. The total number of animals used for sequencing was 153. SNP discovery was performed by aligning the filtered reads against Build 7 of the pig genome. A total of 313,964 high confidence SNPs were identified and analysed for the presence of breed-specific SNPs (defined in this context as SNPs for which one of the alleles was detected in only one breed). There were 29,146 putative breed-specific SNPs identified, of which 4441 were included in the PorcineSNP60 beadchip. Upon re-examining the genotypes obtained using the beadchip, 193 SNPs were confirmed as being breed specific. These 193 SNPs were subsequently used to assign an additional 490 individuals from the same breeds, using the sequenced individuals as reference populations. In total, four breed assignment tests were performed. Results showed that for all methods tested 99% of the animals were correctly assigned, with an average probability of assignment of at least 99.2%, indicating the high utility of breed-specific markers for breed assignment and traceability. This study provides a blueprint for the way next-generation sequencing technologies can be used for the identification of breed-specific SNPs, as well as evidence that these SNPs may be a powerful tool for breed assignment and traceability of animal products to their breeds of origin. 相似文献
4.
John R. Candy Nathan R. Campbell Matthew H. Grinnell Terry D. Beacham Wesley A. Larson Shawn R. Narum 《Molecular ecology resources》2015,15(6):1421-1434
Twelve eulachon (Thaleichthys pacificus, Osmeridae) populations ranging from Cook Inlet, Alaska and along the west coast of North America to the Columbia River were examined by restriction‐site‐associated DNA (RAD) sequencing to elucidate patterns of neutral and adaptive variation in this high geneflow species. A total of 4104 single‐nucleotide polymorphisms (SNPs) were discovered across the genome, with 193 putatively adaptive SNPs as determined by FST outlier tests. Estimates of population structure in eulachon with the putatively adaptive SNPs were similar, but provided greater resolution of stocks compared with a putatively neutral panel of 3911 SNPs or previous estimates with 14 microsatellites. A cline of increasing measures of genetic diversity from south to north was found in the adaptive panel, but not in the neutral markers (SNPs or microsatellites). This may indicate divergent selective pressures in differing freshwater and marine environments between regional eulachon populations and that these adaptive diversity patterns not seen with neutral markers could be a consideration when determining genetic boundaries for conservation purposes. Estimates of effective population size (Ne) were similar with the neutral SNP panel and microsatellites and may be utilized to monitor population status for eulachon where census sizes are difficult to obtain. Greater differentiation with the panel of putatively adaptive SNPs provided higher individual assignment accuracy compared to the neutral panel or microsatellites for stock identification purposes. This study presents the first SNPs that have been developed for eulachon, and analyses with these markers highlighted the importance of integrating genome‐wide neutral and adaptive genetic variation for the applications of conservation and management. 相似文献
5.
A. A. Malik R. Sharma S. Ahlawat R. Deb M. S. Negi S. B. Tripathi 《Animal genetics》2018,49(3):242-245
Genetic relatedness of 24 animals belonging to seven Indian cattle breeds was studied using high throughput genotyping‐by‐sequencing (GBS) markers. GBS produced 93.6 million reads with an average of about 3.9 million reads per animal. A total of 107 488 SNPs were identified in these individuals. When only one SNP per read was considered, a total of 60 261 SNPs representing independent reads were identified with an average SNP‐to‐SNP distance of 45 kb across the bovine reference genome. About 24% of the GBS‐SNP markers were more than 100 kb apart. Of these, 58 322 SNPs mapped to autosomes, 1645 to the X chromosome and 28 to the Y chromosome. The average SNP‐to‐SNP distance on the X chromosome was 91.3 kb, whereas on the Y chromosome it was 1546.4 kb. The minor allele frequency within the Indian cattle varied from 0.103 (Ongole) to 0.177 (Siri), whereas Holstein cattle had the lowest value of 0.089. This is the first application of GBS in cattle of South Asia. The baseline information generated in this study might prompt implementation of GBS in breeding of cattle belonging to this region. 相似文献
6.
Michael Imelfort Chris Duran Jacqueline Batley David Edwards 《Plant biotechnology journal》2009,7(4):312-317
The ongoing revolution in DNA sequencing technology now enables the reading of thousands of millions of nucleotide bases in a single instrument run. However, this data quantity is often compromised by poor confidence in the read quality. The identification of genetic polymorphisms from this data is therefore problematic and, combined with the vast quantity of data, poses a major bioinformatics challenge. However, once these difficulties have been addressed, next-generation sequencing will offer a means to identify and characterize the wealth of genetic polymorphisms underlying the vast phenotypic variation in biological systems. We describe the recent advances in next-generation sequencing technology, together with preliminary approaches that can be applied for single nucleotide polymorphism discovery in plant species. 相似文献
7.
Parentage analysis is a cornerstone of molecular ecology that has delivered fundamental insights into behaviour, ecology and evolution. Microsatellite markers have long been the king of parentage, their hypervariable nature conferring sufficient power to correctly assign offspring to parents. However, microsatellite markers have seen a sharp decline in use with the rise of next‐generation sequencing technologies, especially in the study of population genetics and local adaptation. The time is ripe to review the current state of parentage analysis and see how it stands to be affected by the emergence of next‐generation sequencing approaches. We find that single nucleotide polymorphisms (SNPs), the typical next‐generation sequencing marker, remain underutilized in parentage analysis but are gaining momentum, with 58 SNP‐based parentage analyses published thus far. Many of these papers, particularly the earlier ones, compare the power of SNPs and microsatellites in a parentage context. In virtually every case, SNPs are at least as powerful as microsatellite markers. As few as 100–500 SNPs are sufficient to resolve parentage completely in most situations. We also provide an overview of the analytical programs that are commonly used and compatible with SNP data. As the next‐generation parentage enterprise grows, a reliance on likelihood and Bayesian approaches, as opposed to strict exclusion, will become increasingly important. We discuss some of the caveats surrounding the use of next‐generation sequencing data for parentage analysis and conclude that the future is bright for this important realm of molecular ecology. 相似文献
8.
Allen AM Barker GL Berry ST Coghill JA Gwilliam R Kirby S Robinson P Brenchley RC D'Amore R McKenzie N Waite D Hall A Bevan M Hall N Edwards KJ 《Plant biotechnology journal》2011,9(9):1086-1099
Food security is a global concern and substantial yield increases in cereal crops are required to feed the growing world population. Wheat is one of the three most important crops for human and livestock feed. However, the complexity of the genome coupled with a decline in genetic diversity within modern elite cultivars has hindered the application of marker‐assisted selection (MAS) in breeding programmes. A crucial step in the successful application of MAS in breeding programmes is the development of cheap and easy to use molecular markers, such as single‐nucleotide polymorphisms. To mine selected elite wheat germplasm for intervarietal single‐nucleotide polymorphisms, we have used expressed sequence tags derived from public sequencing programmes and next‐generation sequencing of normalized wheat complementary DNA libraries, in combination with a novel sequence alignment and assembly approach. Here, we describe the development and validation of a panel of 1114 single‐nucleotide polymorphisms in hexaploid bread wheat using competitive allele‐specific polymerase chain reaction genotyping technology. We report the genotyping results of these markers on 23 wheat varieties, selected to represent a broad cross‐section of wheat germplasm including a number of elite UK varieties. Finally, we show that, using relatively simple technology, it is possible to rapidly generate a linkage map containing several hundred single‐nucleotide polymorphism markers in the doubled haploid mapping population of Avalon × Cadenza. 相似文献
9.
Chenmiao Liu Huiling Chen Zhanjun Ren Chengdong Zhang Xuejiao Yang 《Ecology and evolution》2019,9(19):11232-11242
Restriction site‐associated DNA sequencing (RAD‐seq) is one of the most effective high‐throughput sequencing technologies for SNP development and utilization and has been applied to studying the origin and evolution of various species. The domestic Bactrian camels play an important role in economic trade and cultural construction. They are precious species resources and indispensable animals in China's agricultural production. Recently, the rapid development of modern transportation and agriculture, and the deterioration of the environment have led to a sharp decline in the number of camels. Although there have been some reports on the evolution history of the domestic Bactrian camel in China, the origin, evolutionary relationship, and genetic diversity of the camels are unclear due to the limitations of sample size and sequencing technology. Therefore, 47 samples of seven domestic Bactrian camel species from four regions (Inner Mongolia, Gansu, Qinghai, and Xinjiang) were prepared for RAD‐seq analysis to study the evolutionary relationship and genetic diversity. In addition, seven domestic Bactrian camel species are located in different ecological zones, forming different characteristics and having potential development value. A total of 6,487,849 SNPs were genotyped. On the one hand, the filtered SNP information was used to conduct polymorphism mapping construction, LD attenuation analysis, and nucleotide diversity analysis. The results showed that the number of SNPs in Dongjiang camel was the highest, the LD coefficient decayed the fastest, and the nucleotide diversity was the highest. It indicates that Dongjiang camel has the highest genetic diversity. On the other hand, the filtered SNPs information was used to construct the phylogenetic tree, and FST analysis, inbreeding coefficient analysis, principal component analysis, and population structure analysis were carried out. The results showed that Nanjiang camel and Beijiang camels grouped together, and the other five Bactrian camel populations gathered into another branch. It may be because the mountains in the northern part of Xinjiang and the desert in the middle isolate the two groups from the other five groups. 相似文献
10.
Alexandra M. Allen Gary L. A. Barker Paul Wilkinson Amanda Burridge Mark Winfield Jane Coghill Cristobal Uauy Simon Griffiths Peter Jack Simon Berry Peter Werner James P. E. Melichar Jane McDougall Rhian Gwilliam Phil Robinson Keith J. Edwards 《Plant biotechnology journal》2013,11(3):279-295
Globally, wheat is the most widely grown crop and one of the three most important crops for human and livestock feed. However, the complex nature of the wheat genome has, until recently, resulted in a lack of single nucleotide polymorphism (SNP)‐based molecular markers of practical use to wheat breeders. Recently, large numbers of SNP‐based wheat markers have been made available via the use of next‐generation sequencing combined with a variety of genotyping platforms. However, many of these markers and platforms have difficulty distinguishing between heterozygote and homozygote individuals and are therefore of limited use to wheat breeders carrying out commercial‐scale breeding programmes. To identify exome‐based co‐dominant SNP‐based assays, which are capable of distinguishing between heterozygotes and homozygotes, we have used targeted re‐sequencing of the wheat exome to generate large amounts of genomic sequences from eight varieties. Using a bioinformatics approach, these sequences have been used to identify 95 266 putative single nucleotide polymorphisms, of which 10 251 were classified as being putatively co‐dominant. Validation of a subset of these putative co‐dominant markers confirmed that 96% were true polymorphisms and 65% were co‐dominant SNP assays. The new co‐dominant markers described here are capable of genotypic classification of a segregating locus in polyploid wheat and can be used on a variety of genotyping platforms; as such, they represent a powerful tool for wheat breeders. These markers and related information have been made publically available on an interactive web‐based database to facilitate their use on genotyping programmes worldwide. 相似文献
11.
12.
Understanding patterns of reproduction, dispersal and recruitment in deep‐sea communities is increasingly important with the need to manage resource extraction and conserve species diversity. Glass sponges are usually found in deep water (>1000 m) worldwide but form kilometre‐long reefs on the continental shelf of British Columbia and Alaska that are under threat from trawling and resource exploration. Due to their deep‐water habitat, larvae have not yet been found and the level of genetic connectivity between reefs and nonreef communities is unknown. The genetic structure of Aphrocallistes vastus, the primary reef‐building species in the Strait of Georgia (SoG) British Columbia, was studied using single nucleotide polymorphisms (SNPs). Pairwise comparisons of multilocus genotypes were used to assess whether sexual reproduction is common. Structure was examined 1) between individuals in reefs, 2) between reefs and 3) between sites in and outside the SoG. Sixty‐seven SNPs were genotyped in 91 samples from areas in and around the SoG, including four sponge reefs and nearby nonreef sites. The results show that sponge reefs are formed through sexual reproduction. Within a reef and across the SoG basin, the genetic distance between individuals does not vary with geographic distance (r = ?0.005 to 0.014), but populations within the SoG basin are genetically distinct from populations in Barkley Sound, on the west coast of Vancouver Island. Population structure was seen across all sample sites (global FST = 0.248), especially between SoG and non‐SoG locations (average pairwise FST = 0.251). Our results suggest that genetic mixing occurs across sponge reefs via larvae that disperse widely. 相似文献
13.
Nathan R. Campbell Stephanie A. Harmon Shawn R. Narum 《Molecular ecology resources》2015,15(4):855-867
14.
M. A. Bernal N. L. Sinai C. Rocha M. R. Gaither F. Dunker L. A. Rocha 《Journal of fish biology》2015,86(3):1171-1176
This study investigated the birth of a brownbanded bamboo shark Chiloscyllium punctatum at the Steinhart Aquarium. Genetic analyses suggest this is the longest documented case of sperm storage for any species of shark (45 months). 相似文献
15.
P. F. Roux S. Marthey A. Djari M. Moroldo D. Esquerré J. Estellé C. Klopp O. Demeure 《Animal genetics》2015,46(1):82-86
The number of polymorphisms identified with next‐generation sequencing approaches depends directly on the sequencing depth and therefore on the experimental cost. Although higher levels of depth ensure more sensitive and more specific SNP calls, economic constraints limit the increase of depth for whole‐genome resequencing (WGS). For this reason, capture resequencing is used for studies focusing on only some specific regions of the genome. However, several biases in capture resequencing are known to have a negative impact on the sensitivity of SNP detection. Within this framework, the aim of this study was to compare the accuracy of WGS and capture resequencing on SNP detection and genotype calling, which differ in terms of both sequencing depth and biases. Indeed, we have evaluated the SNP calling and genotyping accuracy in a WGS dataset (13X) and in a capture resequencing dataset (87X) performed on 11 individuals. The percentage of SNPs not identified due to a sevenfold sequencing depth decrease was estimated at 7.8% using a down‐sampling procedure on the capture sequencing dataset. A comparison of the 87X capture sequencing dataset with the WGS dataset revealed that capture‐related biases were leading with the loss of 5.2% of SNPs detected with WGS. Nevertheless, when considering the SNPs detected by both approaches, capture sequencing appears to achieve far better SNP genotyping, with about 4.4% of the WGS genotypes that can be considered as erroneous and even 10% focusing on heterozygous genotypes. In conclusion, WGS and capture deep sequencing can be considered equivalent strategies for SNP detection, as the rate of SNPs not identified because of a low sequencing depth in the former is quite similar to SNPs missed because of method biases of the latter. On the other hand, capture deep sequencing clearly appears more adapted for studies requiring great accuracy in genotyping. 相似文献
16.
Vinay Kumar Aamir W. Khan Rachit K. Saxena Vanika Garg Rajeev K. Varshney 《Plant biotechnology journal》2016,14(8):1673-1681
Whole genome re‐sequencing (WGRS) was conducted on a panel of 20 Cajanus spp. accessions (crossing parentals of recombinant inbred lines, introgression lines, multiparent advanced generation intercross and nested association mapping population) comprising of two wild species and 18 cultivated species accessions. A total of 791.77 million paired‐end reads were generated with an effective mapping depth of ~12X per accession. Analysis of WGRS data provided 5 465 676 genome‐wide variations including 4 686 422 SNPs and 779 254 InDels across the accessions. Large structural variations in the form of copy number variations (2598) and presence and absence variations (970) were also identified. Additionally, 2 630 904 accession‐specific variations comprising of 2 278 571 SNPs (86.6%), 166 243 deletions (6.3%) and 186 090 insertions (7.1%) were also reported. Identified polymorphic sites in this study provide the first‐generation HapMap in Cajanus spp. which will be useful in mapping the genomic regions responsible for important traits. 相似文献
17.
18.
Emily D. Fountain Jonathan N. Pauli Brendan N. Reid Per J. Palsbøll M. Zachariah Peery 《Molecular ecology resources》2016,16(4):966-978
Restriction‐enzyme‐based sequencing methods enable the genotyping of thousands of single nucleotide polymorphism (SNP) loci in nonmodel organisms. However, in contrast to traditional genetic markers, genotyping error rates in SNPs derived from restriction‐enzyme‐based methods remain largely unknown. Here, we estimated genotyping error rates in SNPs genotyped with double digest RAD sequencing from Mendelian incompatibilities in known mother–offspring dyads of Hoffman's two‐toed sloth (Choloepus hoffmanni) across a range of coverage and sequence quality criteria, for both reference‐aligned and de novo‐assembled data sets. Genotyping error rates were more sensitive to coverage than sequence quality and low coverage yielded high error rates, particularly in de novo‐assembled data sets. For example, coverage ≥5 yielded median genotyping error rates of ≥0.03 and ≥0.11 in reference‐aligned and de novo‐assembled data sets, respectively. Genotyping error rates declined to ≤0.01 in reference‐aligned data sets with a coverage ≥30, but remained ≥0.04 in the de novo‐assembled data sets. We observed approximately 10‐ and 13‐fold declines in the number of loci sampled in the reference‐aligned and de novo‐assembled data sets when coverage was increased from ≥5 to ≥30 at quality score ≥30, respectively. Finally, we assessed the effects of genotyping coverage on a common population genetic application, parentage assignments, and showed that the proportion of incorrectly assigned maternities was relatively high at low coverage. Overall, our results suggest that the trade‐off between sample size and genotyping error rates be considered prior to building sequencing libraries, reporting genotyping error rates become standard practice, and that effects of genotyping errors on inference be evaluated in restriction‐enzyme‐based SNP studies. 相似文献
19.
Nicolas Dussex Helen R. Taylor Willam R. Stovall Kim Rutherford Ken G. Dodds Shannon M. Clarke Neil J. Gemmell 《Ecology and evolution》2018,8(17):8736-8749
Next‐generation reduced representation sequencing (RRS) approaches show great potential for resolving the structure of wild populations. However, the population structure of species that have shown rapid demographic recovery following severe population bottlenecks may still prove difficult to resolve due to high gene flow between subpopulations. Here, we tested the effectiveness of the RRS method Genotyping‐By‐Sequencing (GBS) for describing the population structure of the New Zealand fur seal (NZFS, Arctocephalus forsteri), a species that was heavily exploited by the 19th century commercial sealing industry and has since rapidly recolonized most of its former range from a few isolated colonies. Using 26,026 neutral single nucleotide polymorphisms (SNPs), we assessed genetic variation within and between NZFS colonies. We identified low levels of population differentiation across the species range (<1% of variation explained by regional differences) suggesting a state of near panmixia. Nonetheless, we observed subtle population substructure between West Coast and Southern East Coast colonies and a weak, but significant (p = 0.01), isolation‐by‐distance pattern among the eight colonies studied. Furthermore, our demographic reconstructions supported severe bottlenecks with potential 10‐fold and 250‐fold declines in response to Polynesian and European hunting, respectively. Finally, we were able to assign individuals treated as unknowns to their regions of origin with high confidence (96%) using our SNP data. Our results indicate that while it may be difficult to detect population structure in species that have experienced rapid recovery, next‐generation markers and methods are powerful tools for resolving fine‐scale structure and informing conservation and management efforts. 相似文献