共查询到20条相似文献,搜索用时 15 毫秒
1.
Approximate Bayesian computation (ABC) is widely used to infer demographic history of populations and species using DNA markers. Genomic markers can now be developed for nonmodel species using reduced representation library (RRL) sequencing methods that select a fraction of the genome using targeted sequence capture or restriction enzymes (genotyping‐by‐sequencing, GBS). We explored the influence of marker number and length, knowledge of gametic phase, and tradeoffs between sample size and sequencing depth on the quality of demographic inferences performed with ABC. We focused on two‐population models of recent spatial expansion with varying numbers of unknown parameters. Performing ABC on simulated data sets with known parameter values, we found that the timing of a recent spatial expansion event could be precisely estimated in a three‐parameter model. Taking into account uncertainty in parameters such as initial population size and migration rate collectively decreased the precision of inferences dramatically. Phasing haplotypes did not improve results, regardless of sequence length. Numerous short sequences were as valuable as fewer, longer sequences, and performed best when a large sample size was sequenced at low individual depth, even when sequencing errors were added. ABC results were similar to results obtained with an alternative method based on the site frequency spectrum (SFS) when performed with unphased GBS‐type markers. We conclude that unphased GBS‐type data sets can be sufficient to precisely infer simple demographic models, and discuss possible improvements for the use of ABC with genomic data. 相似文献
2.
The analysis of genetic variation to estimate demographic and historical parameters and to quantitatively compare alternative scenarios recently gained a powerful and flexible approach: the Approximate Bayesian Computation (ABC). The likelihood functions does not need to be theoretically specified, but posterior distributions can be approximated by simulation even assuming very complex population models including both natural and human‐induced processes. Prior information can be easily incorporated and the quality of the results can be analysed with rather limited additional effort. ABC is not a statistical analysis per se, but rather a statistical framework and any specific application is a sort of hybrid between a simulation and a data‐analysis study. Complete software packages performing the necessary steps under a set of models and for specific genetic markers are already available, but the flexibility of the method is better exploited combining different programs. Many questions relevant in ecology can be addressed using ABC, but adequate amount of time should be dedicated to decide among alternative options and to evaluate the results. In this paper we will describe and critically comment on the different steps of an ABC analysis, analyse some of the published applications of ABC and provide user guidelines. 相似文献
3.
João Carvalho Hernán E. Morales Rui Faria Roger K. Butlin Vítor C. Sousa 《Molecular ecology resources》2023,23(7):1737-1755
Next-generation sequencing of pooled samples (Pool-seq) is a popular method to assess genome-wide diversity patterns in natural and experimental populations. However, Pool-seq is associated with specific sources of noise, such as unequal individual contributions. Consequently, using Pool-seq for the reconstruction of evolutionary history has remained underexplored. Here we describe a novel Approximate Bayesian Computation (ABC) method to infer demographic history, explicitly modelling Pool-seq sources of error. By jointly modelling Pool-seq data, demographic history and the effects of selection due to barrier loci, we obtain estimates of demographic history parameters accounting for technical errors associated with Pool-seq. Our ABC approach is computationally efficient as it relies on simulating subsets of loci (rather than the whole-genome) and on using relative summary statistics and relative model parameters. Our simulation study results indicate Pool-seq data allows distinction between general scenarios of ecotype formation (single versus parallel origin) and to infer relevant demographic parameters (e.g. effective sizes and split times). We exemplify the application of our method to Pool-seq data from the rocky-shore gastropod Littorina saxatilis, sampled on a narrow geographical scale at two Swedish locations where two ecotypes (Wave and Crab) are found. Our model choice and parameter estimates show that ecotypes formed before colonization of the two locations (i.e. single origin) and are maintained despite gene flow. These results indicate that demographic modelling and inference can be successful based on pool-sequencing using ABC, contributing to the development of suitable null models that allow for a better understanding of the genetic basis of divergent adaptation. 相似文献
4.
Xin Tong Jian‐Hui Li Kai Jiang Yuan‐Yuan Li Chao‐Nan Liu Shekhar Biswas Rong Wang Xiao‐Yong Chen 《植物分类学报:英文版》2022,60(5):1158-1171
Both paleoclimatic change and anthropogenic habitat destruction can have adverse effects on species demography and, in turn, could lead a species towards being endangered and rare. Understanding the relative importance of these natural and anthropogenic factors driving species endangerment and rarity is thus crucial for effective conservation planning but remains elusive. Here, we examine the phylogeography and demographic history of an endangered conifer species in China, Torreya jackii Chun, and assess the relative importance of natural and anthropogenic factors that might have put the species in its endangered state. We collected tissue samples from all the 13 extant wild populations, and analyzed the genetic variation using eight nuclear microsatellites and four chloroplast and one mitochondrial DNA fragments. We found low genetic and nucleotide diversities, which could explain the absence of spatial and phylogeographic structure. Using a hierarchical approximate Bayesian computation technique, we identified the demographic scenario that best fits the genetic data and found that effective population size was low at least 200 000 years ago but expanded after the last glacial maximum (LGM). The paleoclimatic niche model revealed a profound effect of precipitation on the distribution of T. jackii and predicted that the current distribution areas were suitable during the LGM. Despite the post-LGM expansion, the best-supported scenario showed a dramatic population collapse during the past 300 years, when anthropogenic disturbances also increased dramatically. Overall, our study sheds light on how historical factors and human impacts jointly threaten the persistence of a species, and these aspects should be duly considered in species conservation planning. 相似文献
5.
With the availability of whole-genome sequence data biologists are able to test hypotheses regarding the demography of populations. Furthermore, the advancement of the Approximate Bayesian Computation (ABC) methodology allows the demographic inference to be performed in a simple framework using summary statistics. We present here msABC, a coalescent-based software that facilitates the simulation of multi-locus data, suitable for an ABC analysis. msABC is based on Hudson's ms algorithm, which is used extensively for simulating neutral demographic histories of populations. The flexibility of the original algorithm has been extended so that sample size may vary among loci, missing data can be incorporated in simulations and calculations, and a multitude of summary statistics for single or multiple populations is generated. The source code of msABC is available at http://bio.lmu.de/~pavlidis/msabc or upon request from the authors. 相似文献
6.
The Kingman coalescent and its developments are often considered among the most important advances in population genetics of the last decades. Demographic inference based on coalescent theory has been used to reconstruct the population dynamics and evolutionary history of several species, including Mycobacterium tuberculosis (MTB), an important human pathogen causing tuberculosis. One key assumption of the Kingman coalescent is that the number of descendants of different individuals does not vary strongly, and violating this assumption could lead to severe biases caused by model misspecification. Individual lineages of MTB are expected to vary strongly in reproductive success because 1) MTB is potentially under constant selection due to the pressure of the host immune system and of antibiotic treatment, 2) MTB undergoes repeated population bottlenecks when it transmits from one host to the next, and 3) some hosts show much higher transmission rates compared with the average (superspreaders).Here, we used an approximate Bayesian computation approach to test whether multiple-merger coalescents (MMC), a class of models that allow for large variation in reproductive success among lineages, are more appropriate models to study MTB populations. We considered 11 publicly available whole-genome sequence data sets sampled from local MTB populations and outbreaks and found that MMC had a better fit compared with the Kingman coalescent for 10 of the 11 data sets. These results indicate that the null model for analyzing MTB outbreaks should be reassessed and that past findings based on the Kingman coalescent need to be revisited. 相似文献
7.
The simultaneous analysis of intra‐ and interspecies variation is challenging mainly because our knowledge about patterns of polymorphisms where both intra‐ and interspecies samples coexist is limited. In this study, we present CoMuS (Coalescent of Multiple Species), a multispecies coalescent software that can simulate intra‐ and interspecies polymorphisms. CoMuS supports a variety of speciation models and demographic scenarios related to the history of each species. In CoMuS, speciation can be accompanied by either instant or gradual isolation between sister species. Sampling may also occur in the past, and thus, we can study simultaneously extinct and extant species. Our software supports both the infinite‐ and the finite‐site model, with substitution rate heterogeneity among sites and a user‐defined proportion of invariable sites. We demonstrate the usage of CoMuS in various applications: species delimitation, software testing, model selection and parameter inference involving present‐day and ancestral samples, comparison between gradual and instantaneous isolation models, estimation of speciation time between human and chimpanzee using both intra‐ and interspecies variation. We expect that CoMuS will be particularly useful for studies where species have been separated recently from their common ancestor and phenomena such as incomplete lineage sorting or introgression still occur. 相似文献
8.
Model‐based analyses are common in phylogeographic inference because they parameterize processes such as population division, gene flow and expansion that are of interest to biologists. Approximate Bayesian computation is a model‐based approach that can be customized to any empirical system and used to calculate the relative posterior probability of several models, provided that suitable models can be identified for comparison. The question of how to identify suitable models is explored using data from Plethodon idahoensis, a salamander that inhabits the North American inland northwest temperate rainforest. First, we conduct an ABC analysis using five models suggested by previous research, calculate the relative posterior probabilities and find that a simple model of population isolation has the best fit to the data (PP = 0.70). In contrast to this subjective choice of models to include in the analysis, we also specify models in a more objective manner by simulating prior distributions for 143 models that included panmixia, population isolation, change in effective population size, migration and range expansion. We then identify a smaller subset of models for comparison by generating an expectation of the highest posterior probability that a false model is likely to achieve due to chance and calculate the relative posterior probabilities of only those models that exceed this expected level. A model that parameterized divergence with population expansion and gene flow in one direction offered the best fit to the P. idahoensis data (in contrast to an isolation‐only model from the first analysis). Our investigation demonstrates that the determination of which models to include in ABC model choice experiments is a vital component of model‐based phylogeographic analysis. 相似文献
9.
10.
Parul Johri Kellen Riall Hannes Becher Laurent Excoffier Brian Charlesworth Jeffrey D. Jensen 《Molecular biology and evolution》2021,38(7):2986
Current procedures for inferring population history generally assume complete neutrality—that is, they neglect both direct selection and the effects of selection on linked sites. We here examine how the presence of direct purifying selection and background selection may bias demographic inference by evaluating two commonly-used methods (MSMC and fastsimcoal2), specifically studying how the underlying shape of the distribution of fitness effects and the fraction of directly selected sites interact with demographic parameter estimation. The results show that, even after masking functional genomic regions, background selection may cause the mis-inference of population growth under models of both constant population size and decline. This effect is amplified as the strength of purifying selection and the density of directly selected sites increases, as indicated by the distortion of the site frequency spectrum and levels of nucleotide diversity at linked neutral sites. We also show how simulated changes in background selection effects caused by population size changes can be predicted analytically. We propose a potential method for correcting for the mis-inference of population growth caused by selection. By treating the distribution of fitness effect as a nuisance parameter and averaging across all potential realizations, we demonstrate that even directly selected sites can be used to infer demographic histories with reasonable accuracy. 相似文献
11.
Jeffrey D. Jensen 《Molecular ecology resources》2015,15(1):87-98
With novel developments in sequencing technologies, time‐sampled data are becoming more available and accessible. Naturally, there have been efforts in parallel to infer population genetic parameters from these data sets. Here, we compare and analyse four recent approaches based on the Wright–Fisher model for inferring selection coefficients (s) given effective population size (Ne), with simulated temporal data sets. Furthermore, we demonstrate the advantage of a recently proposed approximate Bayesian computation (ABC)‐based method that is able to correctly infer genomewide average Ne from time‐serial data, which is then set as a prior for inferring per‐site selection coefficients accurately and precisely. We implement this ABC method in a new software and apply it to a classical time‐serial data set of the medionigra genotype in the moth Panaxia dominula. We show that a recessive lethal model is the best explanation for the observed variation in allele frequency by implementing an estimator of the dominance ratio (h). 相似文献
12.
Accounting for historical demographic features, such as the strength and timing of gene flow and divergence times between closely related lineages, is vital for many inferences in evolutionary biology. Approximate Bayesian computation (ABC) is one method commonly used to estimate demographic parameters. However, the DNA sequences used as input for this method, often microsatellites or RADseq loci, usually represent a small fraction of the genome. Whole genome sequencing (WGS) data, on the other hand, have been used less often with ABC, and questions remain about the potential benefit of, and how to best implement, this type of data; we used pseudo‐observed data sets to explore such questions. Specifically, we addressed the potential improvements in parameter estimation accuracy that could be associated with WGS data in multiple contexts; namely, we quantified the effects of (a) more data, (b) haplotype‐based summary statistics, and (c) locus length. Compared with a hypothetical RADseq data set with 2.5 Mbp of data, using a 1 Gbp data set consisting of 100 Kbp sequences led to substantial gains in the accuracy of parameter estimates, which was mostly due to haplotype statistics and increased data. We also quantified the effects of including (a) locus‐specific recombination rates, and (b) background selection information in ABC analyses. Importantly, assuming uniform recombination or ignoring background selection had a negative effect on accuracy in many cases. Software and results from this method validation study should be useful for future demographic history analyses. 相似文献
13.
Géraldine Loot Charlotte Veyssière Benjamin Roche Simon Blanchet 《Molecular ecology》2015,24(21):5348-5363
Emerging pathogens constitute a severe threat for human health and biodiversity. Determining the status (native or non‐native) of emerging pathogens, and tracing back their spatio‐temporal dynamics, is crucial to understand the eco‐evolutionary factors promoting their emergence, to control their spread and mitigate their impacts. However, tracing back the spatio‐temporal dynamics of emerging wildlife pathogens is challenging because (i) they are often neglected until they become sufficiently abundant and pose socio‐economical concerns and (ii) their geographical range is often little known. Here, we combined classical population genetics tools and approximate Bayesian computation (i.e. ABC) to retrace the dynamics of Tracheliastes polycolpus, a poorly documented pathogenic ectoparasite emerging in Western Europe that threatens several freshwater fish species. Our results strongly suggest that populations of T. polycolpus in France emerged from individuals originating from a unique genetic pool that were most likely introduced in the 1920s in central France. From this initial population, three waves of colonization occurred into peripheral watersheds within the next two decades. We further demonstrated that populations remained at low densities, and hence undetectable, during 10 years before a major demographic expansion occurred, and before its official detection in France. These findings corroborate and expand the few historical records available for this emerging pathogen. More generally, our study demonstrates how ABC can be used to determine the status, reconstruct the colonization history and infer key evolutionary parameters of emerging wildlife pathogens with low data availability, and for which samples from the putative native area are inaccessible. 相似文献
14.
Spatial and environmental heterogeneity are major factors in structuring species distributions in alpine landscapes. These landscapes have also been affected by glacial advances and retreats, causing alpine taxa to undergo range shifts and demographic changes. These nonequilibrium population dynamics have the potential to obscure the effects of environmental factors on the distribution of genetic variation. Here, we investigate how demographic change and environmental factors influence genetic variation in the alpine butterfly Colias behrii. Data from 14 microsatellite loci provide evidence of bottlenecks in all population samples. We test several alternative models of demography using approximate Bayesian computation (ABC), with the results favouring a model in which a recent bottleneck precedes rapid population growth. Applying independent calibrations to microsatellite loci and a nuclear gene, we estimate that this bottleneck affected both northern and southern populations 531–281 years ago, coinciding with a period of global cooling. Using regression approaches, we attempt to separate the effects of population structure, geographical distance and landscape on patterns of population genetic differentiation. Only 40% of the variation in FST is explained by these models, with geographical distance and least‐cost distance among meadow patches selected as the best predictors. Various measures of genetic diversity within populations are also decoupled from estimates of local abundance and habitat patch characteristics. Our results demonstrate that demographic change can have a disproportionate influence on genetic diversity in alpine species, contrasting with other studies that suggest landscape features control contemporary demographic processes in high‐elevation environments. 相似文献
15.
Jiyeon Kim Gang Ni Taeho Kim Jae‐Yong Chun Elizabeth M. A. Kern Joong‐Ki Park 《Evolutionary Applications》2019,12(2):324-336
Plant‐parasitic nematodes (PPNs) threaten crop production worldwide. Yet few studies have examined their intraspecific genetic diversity or patterns of invasion, critical data for managing the spread of these cryptic pests. The sugar beet nematode Heterodera schachtii, a global invader that parasitizes over 200 plant species, represents a model for addressing important questions about the invasion genetics of PPNs. Here, a phylogeographic study using 15 microsatellite markers was conducted on 231 H. schachtii individuals sampled from four continents, and invasion history was reconstructed through an approximate Bayesian computation approach, with emphasis on the origin of newly discovered populations in Korea. Multiple analyses confirmed the existence of cryptic lineages within this species, with the Korean populations comprising one group (group 1) and the populations from Europe, Australia, North America, and western Asia comprising another (group 2). No multilocus genotypes were shared between the two groups, and large genetic distance was inferred between them. Population subdivision was also revealed among the populations of group 2 in both population comparison and STRUCTURE analyses, mostly due to different divergent times between invasive and source populations. The Korean populations showed substantial genetic homogeneity and likely originated from a single invasion event. However, none of the other studied populations were implicated as the source. Further studies with additional populations are needed to better describe the distribution of the potential source population for the East Asian lineage. 相似文献
16.
Anna Brüniche-Olsen Menna E. Jones Jeremy J. Austin Christopher P. Burridge Barbara R. Holland 《Biology letters》2014,10(11)
The Tasmanian devil (Sarcophilus harrisii) was widespread in Australia during the Late Pleistocene but is now endemic to the island of Tasmania. Low genetic diversity combined with the spread of devil facial tumour disease have raised concerns for the species’ long-term survival. Here, we investigate the origin of low genetic diversity by inferring the species'' demographic history using temporal sampling with summary statistics, full-likelihood and approximate Bayesian computation methods. Our results show extensive population declines across Tasmania correlating with environmental changes around the last glacial maximum and following unstable climate related to increased ‘El Niño–Southern Oscillation’ activity. 相似文献
17.
Understanding how assemblages of species responded to past climate change is a central goal of comparative phylogeography and comparative population genomics, an endeavour that has increasing potential to integrate with community ecology. New sequencing technology now provides the potential to perform complex demographic inference at unprecedented resolution across assemblages of nonmodel species. To this end, we introduce the aggregate site frequency spectrum (aSFS), an expansion of the site frequency spectrum to use single nucleotide polymorphism (SNP) data sets collected from multiple, co‐distributed species for assemblage‐level demographic inference. We describe how the aSFS is constructed over an arbitrary number of independent population samples and then demonstrate how the aSFS can differentiate various multispecies demographic histories under a wide range of sampling configurations while allowing effective population sizes and expansion magnitudes to vary independently. We subsequently couple the aSFS with a hierarchical approximate Bayesian computation (hABC) framework to estimate degree of temporal synchronicity in expansion times across taxa, including an empirical demonstration with a data set consisting of five populations of the threespine stickleback (Gasterosteus aculeatus). Corroborating what is generally understood about the recent postglacial origins of these populations, the joint aSFS/hABC analysis strongly suggests that the stickleback data are most consistent with synchronous expansion after the Last Glacial Maximum (posterior probability = 0.99). The aSFS will have general application for multilevel statistical frameworks to test models involving assemblages and/or communities, and as large‐scale SNP data from nonmodel species become routine, the aSFS expands the potential for powerful next‐generation comparative population genomic inference. 相似文献
18.
A. Klimova C. D. Phillips K. Fietz M. T. Olsen J. Harwood W. Amos J. I. Hoffman 《Molecular ecology》2014,23(16):3999-4017
Although the grey seal Halichoerus grypus is one of the most familiar and intensively studied of all pinniped species, its global population structure remains to be elucidated. Little is also known about how the species as a whole may have historically responded to climate‐driven changes in habitat availability and anthropogenic exploitation. We therefore analysed samples from over 1500 individuals collected from 22 colonies spanning the Western and Eastern Atlantic and the Baltic Sea regions, represented by 350 bp of the mitochondrial hypervariable region and up to nine microsatellites. Strong population structure was observed at both types of marker, and highly asymmetrical patterns of gene flow were also inferred, with the Orkney Islands being identified as a source of emigrants to other areas in the Eastern Atlantic. The Baltic and Eastern Atlantic regions were estimated to have diverged a little over 10 000 years ago, consistent with the last proposed isolation of the Baltic Sea. Approximate Bayesian computation also identified genetic signals consistent with postglacial population expansion across much of the species range, suggesting that grey seals are highly responsive to changes in habitat availability. 相似文献
19.
Stephen J. Taerum Adam Konečný Z. Wilhelm de Beer David Cibrián‐Tovar Michael J. Wingfield 《Biological journal of the Linnean Society. Linnean Society of London》2016,118(3):486-502
Exotic forest insects and their symbionts pose an increasing threat to forest health. This is apparently true for the red turpentine beetle (Dendroctonus valens), which was unintentionally introduced to China, where the beetle has killed millions of healthy native pine trees. Previous population genetics studies that used cytochrome oxidase I as a marker concluded that the source of D. valens in China was western North America. In contrast, surveys of fungi associated with D. valens demonstrated that more fungal species are shared between China and eastern North America than between China and western North America, suggesting that the source population of D. valens could be eastern North America. In this study, we used microsatellite markers to determine population structure of D. valens in North America as well as the source population of the beetle in China. The analyses revealed that four genetically distinct populations (herein named the West, Central, Northeast and Mexico) represent the native range of D. valens. Clustering analyses and a simulation‐based approximate Bayesian computation (ABC) approach supported the hypothesis that western North America is the source of the invasive D. valens population. This study provides a demonstration of non‐congruence between patterns inferred by studies on population genetics and symbiont assemblages in an invasive bark beetle. 相似文献
20.
The estimation of effective population size from one sample of genotypes has been problematic because most estimators have been proven imprecise or biased. We developed a web-based program, onesamp that uses approximate Bayesian computation to estimate effective population size from a sample of microsatellite genotypes. onesamp requires an input file of sampled individuals' microsatellite genotypes along with information about several sampling and biological parameters. onesamp provides an estimate of effective population size, along with 95% credible limits. We illustrate the use of onesamp with an example data set from a re-introduced population of ibex Capra ibex. 相似文献