首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Eldon B  Wakeley J 《Genetics》2008,178(3):1517-1532
Correlations in coalescence times between two loci are derived under selectively neutral population models in which the offspring of an individual can number on the order of the population size. The correlations depend on the rates of recombination and random drift and are shown to be functions of the parameters controlling the size and frequency of these large reproduction events. Since a prediction of linkage disequilibrium can be written in terms of correlations in coalescence times, it follows that the prediction of linkage disequilibrium is a function not only of the rate of recombination but also of the reproduction parameters. Low linkage disequilibrium is predicted if the offspring of a single individual frequently replace almost the entire population. However, high linkage disequilibrium can be predicted if the offspring of a single individual replace an intermediate fraction of the population. In some cases the model reproduces the standard Wright-Fisher predictions. Contrary to common intuition, high linkage disequilibrium can be predicted despite frequent recombination, and low linkage disequilibrium under infrequent recombination. Simulations support the analytical results but show that the variance of linkage disequilibrium is very large.  相似文献   

2.
The stepwise mutation model, which was at one time chiefly of interest in studying the evolution of protein charge-states, has recently undergone a resurgence of interest with the new popularity of microsatellites as phylogenetic markers. In this paper we describe a method which makes it possible to transfer many population genetics results from the standard infinite sites model to the stepwise mutation model. We study in detail the properties of pairwise differences in microsatellite repeat number between randomly chosen alleles. We show that the problem of finding the expected squared distance between two individuals and finding the variance of the squared distance can be reduced for a wide range of population models to finding the mean and mean square coalescence times. In many cases the distributions of coalescence times have already been studied for infinite site problems. In this study we show how to calculate these quantities for several population models. We also calculate the variance in mean squared pairwise distance (an estimator of mutation rate × population size) for samples of arbitrary size and show that this variance does not approach zero as the sample size increases. We can also use our method to study alleles at linked microsatellite loci. We suggest a metric which quantifies the level of association between loci—effectively a measure of linkage disequilibrium. It is shown that there can be linkage disequilibrium between partially linked loci at mutation–drift equilibrium.  相似文献   

3.
To model deviations from selectively neutral genetic variation caused by different forms of selection, it is necessary to first understand patterns of neutral variation. Best understood is neutral genetic variation at a single locus. But, as is well known, additional insights can be gained by investigating multiple loci. The resulting patterns reflect the degree of association (linkage) between loci and provide information about the underlying multilocus gene genealogies. The statistical properties of two-locus gene genealogies have been intensively studied for populations of constant size, as well as for simple demographic histories such as exponential population growth and single bottlenecks. By contrast, the combined effect of recombination and sustained demographic fluctuations is poorly understood. Addressing this issue, we study a two-locus Wright-Fisher model of a population subject to recurrent bottlenecks. We derive coalescent approximations for the covariance of the times to the most recent common ancestor at two loci in samples of two chromosomes. This covariance reflects the degree of association and thus linkage disequilibrium between these loci. We find, first, that an effective population-size approximation describes the numerically observed association between two loci provided that recombination occurs either much faster or much more slowly than the population-size fluctuations. Second, when recombination occurs frequently between but rarely within bottlenecks, we observe that the association of gene histories becomes independent of physical distance over a certain range of distances. Third, we show that in this case, a commonly used measure of linkage disequilibrium, σ(2)(d) (closely related to r(2)), fails to capture the long-range association between two loci. The reason is that constituent terms, each reflecting the long-range association, cancel. Fourth, we analyze a limiting case in which the long-range association can be described in terms of a Xi coalescent allowing for simultaneous multiple mergers of ancestral lines.  相似文献   

4.
The structure of linkage disequilibrium around a selective sweep   总被引:1,自引:0,他引:1       下载免费PDF全文
McVean G 《Genetics》2007,175(3):1395-1406
The fixation of advantageous mutations by natural selection has a profound impact on patterns of linked neutral variation. While it has long been appreciated that such selective sweeps influence the frequency spectrum of nearby polymorphism, it has only recently become clear that they also have dramatic effects on local linkage disequilibrium. By extending previous results on the relationship between genealogical structure and linkage disequilibrium, I obtain simple expressions for the influence of a selective sweep on patterns of allelic association. I show that sweeps can increase, decrease, or even eliminate linkage disequilibrium (LD) entirely depending on the relative position of the selected and neutral loci. I also show the importance of the age of the neutral mutations in predicting their degree of association and describe the consequences of such results for the interpretation of empirical data. In particular, I demonstrate that while selective sweeps can eliminate LD, they generate patterns of genetic variation very different from those expected from recombination hotspots.  相似文献   

5.
Linkage Disequilibrium in Growing and Stable Populations   总被引:23,自引:6,他引:17       下载免费PDF全文
M. Slatkin 《Genetics》1994,137(1):331-336
Nonrandom associations between alleles at different loci can be tested for using Fisher's exact test. Extensive simulations show that there is a substantial probability of obtaining significant nonrandom associations between closely or completely linked polymorphic neutral loci in a population of constant size at equilibrium under mutation and genetic drift. In a rapidly growing population, however, there will be little chance of finding significant nonrandom associations even between completely linked loci if the growth has been sufficiently rapid. This result is illustrated by the analysis of mitochondrial DNA sequence data from humans. In comparing all pairs of informative sites, fewer than 5% of the pairs show significant disequilibrium in Sardinians, which have apparently undergone rapid population growth, while 20% to 30% in !Kung and Pygmies, which apparently have not undergone rapid growth, show significance. The extent of linkage disequilibrium in a population is closely related to the gene genealogies of the loci examined, with ``starlike' genealogies making significant linkage disequilibrium unlikely.  相似文献   

6.
Richard R. Hudson 《Genetics》1985,109(3):611-631
The sampling distributions of several statistics that measure the association of alleles on gametes (linkage disequilibrium) are estimated under a two-locus neutral infinite allele model using an efficient Monte Carlo method. An often used approximation for the mean squared linkage disequilibrium is shown to be inaccurate unless the proper statistical conditioning is used. The joint distribution of linkage disequilibrium and the allele frequencies in the sample is studied. This estimated joint distribution is sufficient for obtaining an approximate maximum likelihood estimate of C = 4Nc, where N is the population size and c is the recombination rate. It has been suggested that observations of high linkage disequilibrium might be a good basis for rejecting a neutral model in favor of a model in which natural selection maintains genetic variation. It is found that a single sample of chromosomes, examined at two loci cannot provide sufficient information for such a test if C less than 10, because with C this small, very high levels of linkage disequilibrium are not unexpected under the neutral model. In samples of size 50, it is found that, even when C is as large as 50, the distribution of linkage disequilibrium conditional on the allele frequencies is substantially different from the distribution when there is no linkage between the loci. When conditioned on the number of alleles at each locus in the sample, all of the sample statistics examined are nearly independent of theta = 4N mu, where mu is the neutral mutation rate.  相似文献   

7.
Variants of different Class I alcohol dehydrogenase (ADH) genes have been shown to be associated with an effect that is protective against alcoholism. Previous work from our laboratory has shown that the two sites showing the association are in linkage disequilibrium and has identified the ADH1B Arg47His site as causative, with the ADH1C Ile349Val site showing association only because of the disequilibrium. Here, we describe an initial study of the nature of linkage disequilibrium and genetic variation, in population samples from different regions of the world, in a larger segment of the ADH cluster (including the three Class I ADH genes and ADH7). Linkage disequilibrium across approximately 40 kb of the Class I ADH cluster is moderate to strong in all population samples that we studied. We observed nominally significant pairwise linkage disequilibrium, in some populations, between the ADH7 site and some Class I ADH sites, at moderate values and at a molecular distance as great as 100 kb. Our data indicate (1) that most ADH-alcoholism association studies have failed to consider many sites in the ADH cluster that may harbor etiologically significant alleles and (2) that the relevance of the various ADH sites will be population dependent. Some individual sites in the Class I ADH cluster show Fst values that are among the highest seen among several dozen unlinked sites that were studied in the same subset of populations. The high Fst values can be attributed to the discrepant frequencies of specific alleles in eastern Asia relative to those in other regions of the world. These alleles are part of a single haplotype that exists at high (>65%) frequency only in the eastern-Asian samples. It seems unlikely that this haplotype, which is rare or unobserved in other populations, reached such high frequency because of random genetic drift alone.  相似文献   

8.
The model of genetic hitchhiking predicts a reduction in sequence diversity at a neutral locus closely linked to a beneficial allele. In addition, it has been shown that the same process results in a specific pattern of correlations (linkage disequilibrium) between neutral polymorphisms along the chromosome at the time of fixation of the beneficial allele. During the hitchhiking event, linkage disequilibrium on either side of the beneficial allele is built up whereas it is destroyed across the selected site. We derive explicit formulas for the expectation of the covariance measure D and standardized linkage disequilibrium sigma 2D between a pair of polymorphic sites. For our analysis we use the approximation of a star-like genealogy at the selected site. The resulting expressions are approximately correct in the limit of large selection coefficients. Using simulations we show that the resulting pattern of linkage disequilibrium is quickly-i.e., in <0.1N generations-destroyed after the fixation of the beneficial allele for moderately distant neutral loci, where N is the diploid population size.  相似文献   

9.
Hitchhiking: A Comparison of Linkage and Partial Selfing   总被引:5,自引:2,他引:3       下载免费PDF全文
Philip W. Hedrick 《Genetics》1980,94(3):791-808
Genetic hitchhiking occurs when alleles at unselected loci are changed in frequency because of an association with alleles at a selected locus. This association may be mediated either by linkage or partial selfing (inbreeding) and can affect the gene frequency and gametic disequilibrium at the neutral loci. Hitchhiking from partial selfing (unlinked loci) occurs more quickly than linkage hitchhiking and generally has a greater effect. In addition, partial-selfing hitchhiking can cause increases or changes in sign in gametic disequilibrium between neutral loci. The effects of the two types of hitchhiking with different levels of dominance, zygotic frequencies and number of selected loci are also examined. The general conditions for linkage and partial-selfing hitchhiking are outlined and the implications of hitchhiking are discussed for marker or electrophoretic loci.  相似文献   

10.
The quantitative genetic variance-covariance that can be maintained in a random environment is studied, assuming overlapping generations and Gaussian stabilizing selection with a fluctuating optimum. The phenotype of an individual is assumed to be determined by additive contributions from each locus on paternal and maternal gametes (i.e., no epistasis and no dominance). Recurrent mutation is ignored, but linkage between loci is arbitrary. The genotype distribution in the evolutionarily stable population is generically discrete: only a finite number of polymorphic alleles with distinctly different effects are maintained, even though we allow a continuum of alleles with arbitrary phenotypic contributions to invade. Fluctuating selection maintains nonzero genetic variance in the evolutionarily stable population if the environmental heterogeneity is larger than a certain threshold. Explicit asymptotic expressions for the standing variance-covariance components are derived for the population near the threshold, or for large generational overlap, as a function of environmental variability and genetic parameters (i.e., number of loci, recombination rate, etc.), using the fact that the genotype distribution is discrete. Above the threshold, the population maintains considerable genetic variance in the form of positive linkage disequilibrium and positive gamete covariance (Hardy-Weinberg disequilibrium) as well as allelic variance. The relative proportion of these disequilibrium variances in the total genetic variance increases with the environmental variability.  相似文献   

11.
The signature of positive selection at randomly chosen loci   总被引:35,自引:0,他引:35  
Przeworski M 《Genetics》2002,160(3):1179-1189
In Drosophila and humans, there are accumulating examples of loci with a significant excess of high-frequency-derived alleles or high levels of linkage disequilibrium, relative to a neutral model of a random-mating population of constant size. These are features expected after a recent selective sweep. Their prevalence suggests that positive directional selection may be widespread in both species. However, as I show here, these features do not persist long after the sweep ends: The high-frequency alleles drift to fixation and no longer contribute to polymorphism, while linkage disequilibrium is broken down by recombination. As a result, loci chosen without independent evidence of recent selection are not expected to exhibit either of these features, even if they have been affected by numerous sweeps in their genealogical history. How then can we explain the patterns in the data? One possibility is population structure, with unequal sampling from different subpopulations. Alternatively, positive selection may not operate as is commonly modeled. In particular, the rate of fixation of advantageous mutations may have increased in the recent past.  相似文献   

12.
An analysis is undertaken for a finite random mating population of the linkage disequilibrium between two loci, at both of which all alleles are neutral, all mutant alleles differ from existing ones and several may be segregating at any time. Formulae are derived for the expected total squared disequilibrium, measured as the sum of squares of disequilibria between all pairs of alleles. The ratio of this quantity to the expected value of the product of the heterozygosities at the two loci is similar to that obtained previously by Ohta and Kimura for two nucleotide sites at each of which not more than two mutant types can segregate at any time.  相似文献   

13.
Using a stochastic model of a finite population in which there is mutation to partially recessive detrimental alleles at many loci, we study the effects of population size and linkage between the loci on the population mean fitness and inbreeding depression values. Although linkage between the selected loci decreases the amount of inbreeding depression, neither population size nor recombination rate have strong effects on these quantities, unless extremely small values are assumed. We also investigate how partial linkage between the loci that determine fitness affects the invasion of populations by alleles at a modifier locus that controls the selfing rate. In most of the cases studied, the direction of selection on modifiers was consistent with that found in our previous deterministic calculations. However, there was some evidence that linkage between the modifier locus and the selected loci makes outcrossing less likely to evolve; more losses of alleles promoting outcrossing occurred in runs with linkage than in runs with free recombination. We also studied the fate of neutral alleles introduced into populations carrying detrimental mutations. The times to loss of neutral alleles introduced at low frequency were shorter than those predicted for alleles in the absence of selected loci, taking into account the reduction of the effective population size due to inbreeding. Previous studies have been confined to outbreeding populations, and to alleles at frequencies close to one-half, and have found an effect in the opposite direction. It therefore appears that associations between neutral and selected loci may produce effects that differ according to the initial frequencies of the neutral alleles.  相似文献   

14.
Linkage Disequilibrium in Subdivided Populations   总被引:27,自引:6,他引:21       下载免费PDF全文
The linkage disequilibrium in a subdivided populaton is shown to be equal to the sum of the average linkage disequilibrium for all subpopulations and the covariance between gene frequencies of the loci concerned. Thus, in a subdivided population the linkage disequilibrium may not be 0 even if the linkage disequilibrium in each subpopulation is 0. If a population is divided into two subpopulations between which migration occurs, the asymptotic rate of approach to linkage equilibrium is equal to either r or 2(m(1) + m(2)) - (m(1) + m(2))(2), whichever is smaller, where r is the recombination value and m(1) and m(2) are the proportions of immigrants in subpopulations 1 and 2, respectively. Thus, if migration rate is high compared with recombination value, the change of linkage disequilibrium in subdivided populations is similar to that of a single random mating population. On the other hand, if migration rate is low, the approach to lnkage equilibrium may be retarded in subdivided populations. If isolated populations begin to exchange genes by migration, linkage disequilibrium may increase temporarily even for neutral loci. If overdominant selection operates and the equilibrium gene frequencies are different in the two subpopulations, a permanent linkage disequilibrium may be produced without epistasis in each subpopulation.  相似文献   

15.
A 3.5-kb segment of the alcohol dehydrogenase (Adh) region that includes the Adh and Adh-related genes was sequenced in 139 Drosophila pseudoobscura strains collected from 13 populations. The Adh gene encodes four protein alleles and rejects a neutral model of protein evolution with the McDonald-Kreitman test, although the number of segregating synonymous sites is too high to conclude that adaptive selection has operated. The Adh-related gene encodes 18 protein haplotypes and fails to reject an equilibrium neutral model. The populations fail to show significant geographic differentiation of the Adh-related haplotypes. Eight of 404 single nucleotide polymorphisms (SNPs) in the Adh region were in significant linkage disequilibrium with three ADHR protein alleles. Coalescent simulations with and without recombination were used to derive the expected levels of significant linkage disequilibrium between SNPs and 18 protein haplotypes. Maximum levels of linkage disequilibrium are expected for protein alleles at moderate frequencies. In coalescent models without recombination, linkage disequilibrium decays between SNPs and high frequency haplotypes because common alleles mutate to haplotypes that are rare or that reach moderate frequency. The implication of this study is that linkage disequilibrium mapping has the highest probability of success with disease-causing alleles at frequencies of 10%.  相似文献   

16.
Barton NH  Etheridge AM 《Genetics》2004,166(2):1115-1131
The coalescent process can describe the effects of selection at linked loci only if selection is so strong that genotype frequencies evolve deterministically. Here, we develop methods proposed by Kaplan, Darden, and Hudson to find the effects of weak selection. We show that the overall effect is given by an extension to Price's equation: the change in properties such as moments of coalescence times is equal to the covariance between those properties and the fitness of the sample of genes. The distribution of coalescence times differs substantially between allelic classes, even in the absence of selection. However, the average coalescence time between randomly chosen genes is insensitive to the current allele frequency and is affected significantly by purifying selection only if deleterious mutations are common and selection is strong (i.e., the product of population size and selection coefficient, Ns>3). Balancing selection increases mean coalescence times, but the effect becomes large only when mutation rates between allelic classes are low and when selection is extremely strong. Our analysis supports previous simulations that show that selection has surprisingly little effect on genealogies. Moreover, small fluctuations in allele frequency due to random drift can greatly reduce any such effects. This will make it difficult to detect the action of selection from neutral variation alone.  相似文献   

17.
Navarro A  Barton NH 《Genetics》2002,161(2):849-863
We studied the effect of multilocus balancing selection on neutral nucleotide variability at linked sites by simulating a model where diallelic polymorphisms are maintained at an arbitrary number of selected loci by means of symmetric overdominance. Different combinations of alleles define different genetic backgrounds that subdivide the population and strongly affect variability. Several multilocus fitness regimes with different degrees of epistasis and gametic disequilibrium are allowed. Analytical results based on a multilocus extension of the structured coalescent predict that the expected linked neutral diversity increases exponentially with the number of selected loci and can become extremely large. Our simulation results show that although variability increases with the number of genetic backgrounds that are maintained in the population, it is reduced by random fluctuations in the frequencies of those backgrounds and does not reach high levels even in very large populations. We also show that previous results on balancing selection in single-locus systems do not extend to the multilocus scenario in a straightforward way. Different patterns of linkage disequilibrium and of the frequency spectrum of neutral mutations are expected under different degrees of epistasis. Interestingly, the power to detect balancing selection using deviations from a neutral distribution of allele frequencies seems to be diminished under the fitness regime that leads to the largest increase of variability over the neutral case. This and other results are discussed in the light of data from the Mhc.  相似文献   

18.
Grote MN 《Genetics》2007,176(4):2405-2420
I derive a covariance structure model for pairwise linkage disequilibrium (LD) between binary markers in a recently admixed population and use a generalized least-squares method to fit the model to two different data sets. Both linked and unlinked marker pairs are incorporated in the model. Under the model, a pairwise LD matrix is decomposed into two component matrices, one containing LD attributable to admixture, and another containing, in an aggregate form, LD specific to the populations forming the mixture. I use population genetics theory to show that the latter matrix has block-diagonal structure. For the data sets considered here, I show that the number of source populations can be determined by statistical inference on the canonical correlations of the sample LD matrix.  相似文献   

19.
Studies of the genetic covariance between habitat preference and performance have reported conflicting outcomes ranging from no covariance to strong covariance. The causes of this variability remain unclear. Here we show that variation in the magnitude of genetic covariance can result from variability in migration regimes. Using data from walking stick insects and a mathematical model, we find that genetic covariance within populations between host plant preference and a trait affecting performance on different hosts (cryptic color pattern) varies in magnitude predictably among populations according to migration regimes. Specifically, genetic covariance within populations is high in heterogeneous habitats where migration between populations locally adapted to different host plants generates nonrandom associations (i.e., linkage disequilibrium) between alleles at color pattern and host preference loci. Conversely, genetic covariance is low in homogeneous habitats where a single host exists and migration between hosts does not occur. Our results show that habitat structure and patterns of migration can strongly affect the evolution and variability of genetic covariance within populations.  相似文献   

20.
A general analytical formula is derived, which predicts the effects of background selection on population differentiation at a neutral locus as a result of its linkage with selected loci of deleterious mutations. The theory is based on the assumptions of random mating, multiplicative fitness, and weak selection in hermaphrodite plants in the island model of population structure. The analytical results show that Fst at the neutral locus increases as a result of the effects of background selection, regardless of the dependence or independence among linked background selective loci. The increment in Fst is closely related to the magnitude of linkage disequilibria between the neutral locus and selected loci, and can be estimated by the ratio of Fst with background selection to Fst without background selection minus one. The steady-state linkage disequilibrium between a neutral locus and a selected locus in subpopulations, primarily attained by gene flow, decreases with the recombination rate, and can be enhanced when there are dependence among linked selected loci. Monte Carlo computer simulations with two- and three-locus models show that the analytical formulae perform well under general conditions. Application of the present theory may aid in analyzing the genome-wide mapping of the effect of background selection in terms of Fst.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号