SimRAD: an R package for simulation‐based prediction of the number of loci expected in RADseq and similar genotyping by sequencing approaches |
| |
Authors: | Olivier Lepais Jason T Weir |
| |
Institution: | 1. INRA, UMR 1224, Ecologie Comportementale et Biologie des Populations de Poissons, INRA, , Saint Pée sur Nivelle, France;2. Univ Pau & Pays Adour, UMR 1224, Ecologie Comportementale et Biologie des Populations de Poissons, UFR Sciences et Techniques de la C?te Basque, Univ Pau and Pays Adour, , Anglet, France;3. Department of Biological Sciences, and Department of Ecology and Evolutionary Biology, University of Toronto Scarborough, , Toronto, ON, M1C 1A4 Canada |
| |
Abstract: | Application of high‐throughput sequencing platforms in the field of ecology and evolutionary biology is developing quickly with the introduction of efficient methods to reduce genome complexity. Numerous approaches for genome complexity reduction have been developed using different combinations of restriction enzymes, library construction strategies and fragment size selection. As a result, the choice of which techniques to use may become cumbersome, because it is difficult to anticipate the number of loci resulting from each method. We developed SimRAD, an R package that performs in silico restriction enzyme digests and fragment size selection as implemented in most restriction site associated DNA polymorphism and genotyping by sequencing methods. In silico digestion is performed on a reference genome or on a randomly generated DNA sequence when no reference genome sequence is available. SimRAD accurately predicts the number of loci under alternative protocols when a reference genome sequence is available for the targeted species (or a close relative) but may be unreliable when no reference genome is available. SimRAD is also useful for fine‐tuning a given protocol to adjust the number of targeted loci. Here, we outline the functionality of SimRAD and provide an illustrative example of the use of the package (available on the CRAN at http://cran.r-project.org/web/packages/SimRAD ). |
| |
Keywords: |
GBS
genome complexity reduction in silico digestion next generation sequencing restriction site associated DNA polymorphism single nucleotide polymorphism |
|
|