Population genetic simulation: Benchmarking frameworks for non-standard models of natural selection |
| |
Authors: | Olivia L. Johnson Raymond Tobler Joshua M. Schmidt Christian D. Huber |
| |
Affiliation: | 1. School of Biological Sciences, University of Adelaide, Adelaide, South Australia, Australia;2. Evolution of Cultural Diversity Initiative, The Australian National University, Canberra, Australian Capital Territory, Australia;3. Department of Ophthalmology, College of Medicine and Public Health, Flinders University, Adelaide, South Australia, Australia |
| |
Abstract: | Population genetic simulation has emerged as a common tool for investigating increasingly complex evolutionary and demographic models. Software capable of handling high-level model complexity has recently been developed, and the advancement of tree sequence recording now allows simulations to merge the efficiency and genealogical insight of coalescent simulations with the flexibility of forward simulations. However, frameworks utilizing these features have not yet been compared and benchmarked. Here, we evaluate various simulation workflows using the coalescent simulator msprime and the forward simulator SLiM, to assess resource efficiency and determine an optimal simulation framework. Three aspects were evaluated: (1) the burn-in, to establish an equilibrium level of neutral diversity in the population; (2) the forward simulation, in which temporally fluctuating selection is acting; and (3) the final computation of summary statistics. We provide typical memory and computation time requirements for each step. We find that the fastest framework, a combination of coalescent and forward simulation with tree sequence recording, increases simulation speed by over twenty times compared to classical forward simulations without tree sequence recording, although it does require six times more memory. Overall, using efficient simulation workflows can lead to a substantial improvement when modelling complex evolutionary scenarios—although the optimal framework ultimately depends on the available computational resources. |
| |
Keywords: | fluctuating selection msprime population genetic simulation SLiM tree sequence recording |
|
|