首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 7 毫秒
1.
A population genetic model with a single locus at which balancing selection acts and many linked loci at which neutral mutations can occur is analysed using the coalescent approach. The model incorporates geographic subdivision with migration, as well as mutation, recombination, and genetic drift of neutral variation. It is found that geographic subdivision can affect genetic variation even with high rates of migration, providing that selection is strong enough to maintain different allele frequencies at the selected locus. Published sequence data from the alcohol dehydrogenase locus of Drosophila melanogaster are found to fit the proposed model slightly better than a similar model without subdivision.  相似文献   

2.
Island models and the coalescent process   总被引:2,自引:1,他引:1  
Using a coalescent approach, we derive several classical results and extend them to more general models. We find that the classic result for constant population size and constant migration rates holds in models with varying population size and varying migration rates with the obvious substitution of effective population size and mean migration fraction. In addition, the relationship of a 'local' F ST to local gene flow is derived. This result may be useful for analysing gene flow in a regional subset of a large global population, using only data from the regional subset.  相似文献   

3.
The genealogical structure of neutral populations in which reproductive success is highly-skewed has been the subject of many recent studies. Here we derive a coalescent dual process for a related class of continuous-time Moran models with viability selection. In these models, individuals can give birth to multiple offspring whose survival depends on both the parental genotype and the brood size. This extends the dual process construction for a multi-type Moran model with genic selection described in Etheridge and Griffiths (2009). We show that in the limit of infinite population size the non-neutral Moran models converge to a Markov jump process which we call the Λ-Fleming-Viot process with viability selection and we derive a coalescent dual for this process directly from the generator and as a limit from the Moran models. The dual is a branching-coalescing process similar to the Ancestral Selection Graph which follows the typed ancestry of genes backwards in time with real and virtual lineages. As an application, the transition functions of the non-neutral Moran and Λ-coalescent models are expressed as mixtures of the transition functions of the dual process.  相似文献   

4.
Matsen FA  Wakeley J 《Genetics》2006,172(1):701-708
In this article we apply some graph-theoretic results to the study of coalescence in a structured population with migration. The graph is the pattern of migration among subpopulations, or demes, and we use the theory of random walks on graphs to characterize the ease with which ancestral lineages can traverse the habitat in a series of migration events. We identify conditions under which the coalescent process in populations with restricted migration, such that individuals cannot traverse the habitat freely in a single migration event, nonetheless becomes identical to the coalescent process in the island migration model in the limit as the number of demes tends to infinity. Specifically, we first note that a sequence of symmetric graphs with Diaconis-Stroock constant bounded above has an unstructured Kingman-type coalescent in the limit for a sample of size two from two different demes. We then show that circular and toroidal models with long-range but restricted migration have an upper bound on this constant and so have an unstructured-migration coalescent in the limit. We investigate the rate of convergence to this limit using simulations.  相似文献   

5.
A coalescent dual process for a multi-type Moran model with genic selection is derived using a generator approach. This leads to an expansion of the transition functions in the Moran model and the Wright–Fisher diffusion process limit in terms of the transition functions for the coalescent dual. A graphical representation of the Moran model (in the spirit of Harris) identifies the dual as a strong dual process following typed lines backwards in time. An application is made to the harmonic measure problem of finding the joint probability distribution of the time to the first loss of an allele from the population and the distribution of the surviving alleles at the time of loss. Our dual process mirrors the Ancestral Selection Graph of [Krone, S. M., Neuhauser, C., 1997. Ancestral processes with selection. Theoret. Popul. Biol. 51, 210–237; Neuhauser, C., Krone, S. M., 1997. The genealogy of samples in models with selection. Genetics 145, 519–534], which allows one to reconstruct the genealogy of a random sample from a population subject to genic selection. In our setting, we follow [Stephens, M., Donnelly, P., 2002. Ancestral inference in population genetics models with selection. Aust. N. Z. J. Stat. 45, 395–430] in assuming that the types of individuals in the sample are known. There are also close links to [Fearnhead, P., 2002. The common ancestor at a nonneutral locus. J. Appl. Probab. 39, 38–54]. However, our methods and applications are quite different. This work can also be thought of as extending a dual process construction in a Wright–Fisher diffusion in [Barbour, A.D., Ethier, S.N., Griffiths, R.C., 2000. A transition function expansion for a diffusion model with selection. Ann. Appl. Probab. 10, 123–162]. The application to the harmonic measure problem extends a construction provided in the setting of a neutral diffusion process model in [Ethier, S.N., Griffiths, R.C., 1991. Harmonic measure for random genetic drift. In: Pinsky, M.A. (Ed.), Diffusion Processes and Related Problems in Analysis, vol. 1. In: Progress in Probability Series, vol. 22, Birkhäuser, Boston, pp. 73–81].  相似文献   

6.
Brownian motions on coalescent structures have a biological relevance, either as an approximation of the stepwise mutation model for microsatellites, or as a model of spatial evolution considering the locations of individuals at successive generations. We discuss estimation procedures for the dispersal parameter of a Brownian motion defined on coalescent trees. First, we consider the mean square distance unbiased estimator and compute its variance. In a second approach, we introduce a phylogenetic estimator. Given the UPGMA topology, the likelihood of the parameter is computed thanks to a new dynamical programming method. By a proper correction, an unbiased estimator is derived from the pseudomaximum of the likelihood. The last approach consists of computing the likelihood by a Markov chain Monte Carlo sampling method. In the one-dimensional Brownian motion, this method seems less reliable than pseudomaximum-likelihood.  相似文献   

7.
We present a new method for simulating samples of marker haplotypes, genotypes, or diplotypes in case-control studies in which the markers are linked to a disease locus in any specified region of the genome. The method allows realistic features to be incorporated into the simulations, including selection acting on disease alleles, sample ascertainment of disease chromosomes and polymorphic markers, a genetic dominance model of disease expression that allows incomplete penetrance and phenocopies, and an accurate genetic map of recombination rates and hotspots for recombination in the human genome (or, alternatively, an improved method for simulating the distribution of hotspots). The new method uses an approach that combines simulation of the coalescent process for the sampled chromosomes with a diffusion process used to model the evolution of the disease-mutation frequency over time. Examples illustrate how the method may be used to study the expected power of a marker-disease association study.  相似文献   

8.
K Zeng 《Heredity》2013,110(4):363-371
There is increasing evidence that background selection, the effects of the elimination of recurring deleterious mutations by natural selection on variability at linked sites, may be a major factor shaping genome-wide patterns of genetic diversity. To accurately quantify the importance of background selection, it is vital to have computationally efficient models that include essential biological features. To this end, a structured coalescent procedure is used to construct a model of background selection that takes into account the effects of recombination, recent changes in population size and variation in selection coefficients against deleterious mutations across sites. Furthermore, this model allows a flexible organization of selected and neutral sites in the region concerned, and has the ability to generate sequence variability at both selected and neutral sites, allowing the correlation between these two types of sites to be studied. The accuracy of the model is verified by checking against the results of forward simulations. These simulations also reveal several patterns of diversity that are in qualitative agreement with observations reported in recent studies of DNA sequence polymorphisms. These results suggest that the model should be useful for data analysis.  相似文献   

9.
We report the analysis results of the Genetic Analysis Workshop 14 simulated microsatellite marker dataset, using replicate 50 from the Danacaa population. We applied several methods for association analysis of multi-allelic markers to case-control data to study the association between Kofendrerd Personality Disorder and multi-allelic markers in a candidate region previously identified by the linkage analysis. Evidence for association was found for marker D03S0127 (p < 0.01). The analyses were done without any prior knowledge of the answers.  相似文献   

10.
Lohse K  Harrison RJ  Barton NH 《Genetics》2011,189(3):977-987
Analysis of genomic data requires an efficient way to calculate likelihoods across very large numbers of loci. We describe a general method for finding the distribution of genealogies: we allow migration between demes, splitting of demes [as in the isolation-with-migration (IM) model], and recombination between linked loci. These processes are described by a set of linear recursions for the generating function of branch lengths. Under the infinite-sites model, the probability of any configuration of mutations can be found by differentiating this generating function. Such calculations are feasible for small numbers of sampled genomes: as an example, we show how the generating function can be derived explicitly for three genes under the two-deme IM model. This derivation is done automatically, using Mathematica. Given data from a large number of unlinked and nonrecombining blocks of sequence, these results can be used to find maximum-likelihood estimates of model parameters by tabulating the probabilities of all relevant mutational configurations and then multiplying across loci. The feasibility of the method is demonstrated by applying it to simulated data and to a data set previously analyzed by Wang and Hey (2010) consisting of 26,141 loci sampled from Drosophila simulans and D. melanogaster. Our results suggest that such likelihood calculations are scalable to genomic data as long as the numbers of sampled individuals and mutations per sequence block are small.  相似文献   

11.
Natural populations are structured spatially into local populations and genetically into diverse 'genetic backgrounds' defined by different combinations of selected alleles. If selection maintains genetic backgrounds at constant frequency then neutral diversity is enhanced. By contrast, if background frequencies fluctuate then diversity is reduced. Provided that the population size of each background is large enough, these effects can be described by the structured coalescent process. Almost all the extant results based on the coalescent deal with a single selected locus. Yet we know that very large numbers of genes are under selection and that any substantial effects are likely to be due to the cumulative effects of many loci. Here, we set up a general framework for the extension of the coalescent to multilocus scenarios and we use it to study the simplest model, where strong balancing selection acting on a set of n loci maintains 2n backgrounds at constant frequencies and at linkage equilibrium. Analytical results show that the expected linked neutral diversity increases exponentially with the number of selected loci and can become extremely large. However, simulation results reveal that the structured coalescent approach breaks down when the number of backgrounds approaches the population size, because of stochastic fluctuations in background frequencies. A new method is needed to extend the structured coalescent to cases with large numbers of backgrounds.  相似文献   

12.
To extract full information from samples of DNA sequence data, it is necessary to use sophisticated model-based techniques such as importance sampling under the coalescent. However, these are limited in the size of datasets they can handle efficiently. Chen and Liu (2000) introduced the idea of stopping-time resampling and showed that it can dramatically improve the efficiency of importance sampling methods under a finite-alleles coalescent model. In this paper, a new framework is developed for designing stopping-time resampling schemes under more general models. It is implemented on data both from infinite sites and stepwise models of mutation, and extended to incorporate crossover recombination. A simulation study shows that this new framework offers a substantial improvement in the accuracy of likelihood estimation over a range of parameters, while a direct application of the scheme of Chen and Liu (2000) can actually diminish the estimate. The method imposes no additional computational burden and is robust to the choice of parameters.  相似文献   

13.
Tufto J 《Genetical research》2000,76(3):285-293
The evolution of a quantitative trait subject to stabilizing selection and immigration, with the immigrants deviating from the local optimum, is considered under a number of different models of the underlying genetic basis of the trait. By comparing exact predictions under the infinitesimal model obtained using numerical methods with predictions of a simplified approximate model based on ignoring linkage disequilibrium, the increase in the expressed genetic variance as a result of linkage disequilibrium generated by migration is shown to be relatively small and negligible, provided that the genetic variance relative to the squared deviation of immigrants from the local optimum is sufficiently large or selection and migration is sufficiently weak. Deviation from normality is shown to be less important by comparing predictions of the infinitesimal model with a model presupposing normality. For a more realistic symmetric model, involving a finite number of loci only, no linkage and equal effects and frequencies across loci, additional changes in the genetic variance arise as a result of changes in underlying allele frequencies. Again, provided that the genetic variance relative to the squared deviation of the immigrants from the local optimum is small, the difference between the predictions of infinitesimal and the symmetric model are small unless the number of loci is very small. However, if the genetic variance relative to the squared deviation of the immigrants from the local optimum is large, or if selection and migration are strong, both linkage disequilibrium and changes in the genetic variance as a result of changes in underlying allele frequencies become important.  相似文献   

14.

Background  

Coalescent simulations are playing a large role in interpreting large scale intra-specific sequence or polymorphism surveys and for planning and evaluating association studies. Coalescent simulations of data sets under different models can be compared to the actual data to test the importance of different evolutionary factors and thus get insight into these.  相似文献   

15.
A numerical method is developed for solving a nonstandard singular system of second-order differential equations arising from a problem in population genetics concerning the coalescent process for a sample from a population undergoing selection. The nonstandard feature of the system is that there are terms in the equations that approach infinity as one approaches the boundary. The numerical recipe is patterned after the LU decomposition for tridiagonal matrices. Although there is no analytic proof that this method leads to the correct solution, various examples are presented that suggest that the method works. This method allows one to calculate the expected number of segregating sites in a random sample of n genes from a population whose evolution is described by a model which is not selectively neutral.  相似文献   

16.
We shall extend Kingman's coalescent to the geographically structured population model with migration among colonies. It is described by a continuous-time Markov chain, which is proved to be a dual process of the diffusion process of stepping-stone model. We shall derive a system of equations for the spatial distribution of a common ancestor of sampled genes from colonies and the mean time to getting to one common ancestor. These equations are solved in three particular models; a two-population model, the island model and the one-dimensional stepping-stone model with symmetric nearest-neighbour migration.  相似文献   

17.
18.
We study the ancestral genetic process for samples from two large, subdivided populations that are connected by migration to, from, and within a small set of subpopulations, or demes. We consider convergence to an ancestral limit process as the numbers of demes in the two large, subdivided populations tend to infinity. We show that the ancestral limit process for a sample includes a recent instantaneous adjustment to the sample size and structure followed by a more ancient process that is identical to the usual structured coalescent, but with different scaled parameters. This justifies the application of a modified structured coalescent to some hierarchically structured populations.  相似文献   

19.
We study the ancestral genetic process for samples from two large, subdivided populations that are connected by migration to, from, and within a small set of subpopulations, or demes. We consider convergence to an ancestral limit process as the numbers of demes in the two large, subdivided populations tend to infinity. We show that the ancestral limit process for a sample includes a recent instantaneous adjustment to the sample size and structure followed by a more ancient process that is identical to the usual structured coalescent, but with different scaled parameters. This justifies the application of a modified structured coalescent to some hierarchically structured populations.  相似文献   

20.
A versatile algorithm is developed to model PCR on a computer. The method is based on a modification of the coalescent process and provides a general framework to analyse data from PCR. It allows for incorporation of the dynamics of the replication process as described in terms of the number of starting template molecules and cycle-dependent PCR efficiency. The simulation method generates, as a first step, the genealogy of a set of sequences sampled from a final PCR product. In a second step a mutation process is superimposed and the resulting data set is analysed. The efficiency of our algorithm enables us to get reliable approximations of various sample distributions. We demonstrate the relevance of our method with two applications: maximum likelihood estimation of the error rate in PCR and a test of homogeneity of the template.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号