High performance computation of landscape genomic models including local indicators of spatial association |
| |
Authors: | S. Stucki P. Orozco‐terWengel B. R. Forester S. Duruz L. Colli C. Masembe R. Negrini E. Landguth M. R. Jones The NEXTGEN Consortium M. W. Bruford P. Taberlet S. Joost |
| |
Affiliation: | 1. Laboratory of Geographic Information Systems (LASIG), School of Architecture, Civil and Environmental Engineering (ENAC), Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland;2. School of Biosciences, Cardiff University, Cardiff, UK;3. Nicholas School of the Environment, University Program in Ecology, Duke University, Durham, NC, USA;4. BioDNA ‐ Centro di Ricerca sulla Biodiversità e sul DNA Antico, Istituto di Zootecnica, Università Cattolica del S. Cuore, Piacenza, Italy;5. Department of Zoology, Entomology and Fisheries Sciences, College of Natural Sciences, Makerere University, Kampala, Uganda;6. Associazione Italiana Allevatori, Roma, Italy;7. Division of Biological Sciences, University of Montana, Missoula, MT, USA;8. http://nextgen.epfl.ch;9. Laboratoire d'Ecologie Alpine (LECA), CNRS, Grenoble, France;10. Laboratoire d'Ecologie Alpine (LECA), Univ. Grenoble Alpes, Grenoble, France |
| |
Abstract: | With the increasing availability of both molecular and topo‐climatic data, the main challenges facing landscape genomics – that is the combination of landscape ecology with population genomics – include processing large numbers of models and distinguishing between selection and demographic processes (e.g. population structure). Several methods address the latter, either by estimating a null model of population history or by simultaneously inferring environmental and demographic effects. Here we present sam βada , an approach designed to study signatures of local adaptation, with special emphasis on high performance computing of large‐scale genetic and environmental data sets. sam βada identifies candidate loci using genotype–environment associations while also incorporating multivariate analyses to assess the effect of many environmental predictor variables. This enables the inclusion of explanatory variables representing population structure into the models to lower the occurrences of spurious genotype–environment associations. In addition, sam βada calculates local indicators of spatial association for candidate loci to provide information on whether similar genotypes tend to cluster in space, which constitutes a useful indication of the possible kinship between individuals. To test the usefulness of this approach, we carried out a simulation study and analysed a data set from Ugandan cattle to detect signatures of local adaptation with sam βada , bayenv , lfmm and an FST outlier method (FDIST approach in arlequin ) and compare their results. sam βada – an open source software for Windows, Linux and Mac OS X available at http://lasig.epfl.ch/sambada – outperforms other approaches and better suits whole‐genome sequence data processing. |
| |
Keywords: | environmental correlations genome scans high performance computing landscape genomics local adaptation spatial autocorrelation |
|
|