Similar articles
1.
DNA sequence variation in a 1410-bp region including the Cu,Zn Sod locus was examined in 41 homozygous lines of Drosophila melanogaster. Fourteen lines were from Barcelona, Spain, 25 were from California populations and the other two were from laboratory stocks. Two common electromorphs, SOD(S) and SOD(F), are segregating in the populations. Our sample of 41 lines included 19 Sod(S) and 22 Sod(F) alleles (henceforward referred to as Slow and Fast alleles). All 19 Slow alleles were identical in sequence. Of the 22 Fast alleles sequenced, nine were identical in sequence and are referred to as the Fast A haplotypes. The Slow allele sequence differed from the Fast A haplotype at a single nucleotide site, the site that accounts for the amino acid difference between SOD(S) and SOD(F). There were nine other haplotypes among the remaining 13 Fast alleles sequenced. The overall level of nucleotide diversity (π) in this sample is not greatly different from that found at other loci in D. melanogaster. It is concluded that the Slow/Fast polymorphism is a recently arisen polymorphism, not an old balanced polymorphism. The large group of nearly identical haplotypes suggests that a recent mutation, at the Sod locus or tightly linked to it, has increased rapidly in frequency to around 50%, both in California and Spain. The application of a new statistical test demonstrates that the occurrence of such large numbers of haplotypes with so little variation among them is very unlikely under the usual equilibrium neutral model. We suggest that the high frequency of some haplotypes is due to natural selection at the Sod locus or at a tightly linked locus.
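
The nucleotide diversity (π) mentioned above is conventionally the average number of pairwise differences per site. The following is a minimal sketch of that calculation, not the authors' own procedure; the function name `nucleotide_diversity` and the toy alignment are hypothetical and are not data from the study.

```python
from itertools import combinations


def nucleotide_diversity(seqs):
    """Average pairwise differences per site (pi) for aligned sequences."""
    if len(seqs) < 2:
        raise ValueError("need at least two sequences")
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("sequences must be aligned to the same length")
    pairs = list(combinations(seqs, 2))
    # Count mismatching positions over every unordered pair of sequences.
    total_diffs = sum(sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs)
    return total_diffs / (len(pairs) * length)


# Hypothetical toy alignment (illustration only, not Sod data).
toy = ["ACGTACGT", "ACGTACGA", "ACGTACGA", "ACCTACGA"]
print(nucleotide_diversity(toy))
```

A sample dominated by identical haplotypes, as described for the Slow alleles, would contribute many zero-difference pairs to this average.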

2.
The Coalescent Process in Models with Selection (Cited by: 23; self-citations: 12; citations by others: 11)
N. L. Kaplan, T. Darden, R. R. Hudson. Genetics, 1988, 120(3):819-829
Statistical properties of the process describing the genealogical history of a random sample of genes are obtained for a class of population genetics models with selection. For models with selection, in contrast to models without selection, the distribution of this process, the coalescent process, depends on the distribution of the frequencies of alleles in the ancestral generations. If the ancestral frequency process can be approximated by a diffusion, then the mean and the variance of the number of segregating sites due to selectively neutral mutations in random samples can be numerically calculated. The calculations are greatly simplified if the frequencies of the alleles are tightly regulated. If the mutation rates between alleles maintained by balancing selection are low, then the number of selectively neutral segregating sites in a random sample of genes is expected to substantially exceed the number predicted under a neutral model.
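
For comparison, the neutral baseline against which such an excess would be judged can be sketched with the standard infinite-sites results for a sample of n genes without recombination (Watterson's formulas). This is an illustrative sketch only; the function name and the parameter `theta` (the scaled mutation rate 4Nu for the region) are assumptions introduced here, not notation from the abstract.

```python
def neutral_segregating_sites(n, theta):
    """Mean and variance of the number of segregating sites under neutrality.

    Assumes the infinite-sites model with no recombination:
    E[S] = theta * a1 and Var[S] = theta * a1 + theta**2 * a2.
    """
    if n < 2:
        raise ValueError("sample size must be at least 2")
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i**2 for i in range(1, n))
    return theta * a1, theta * a1 + theta**2 * a2


mean_s, var_s = neutral_segregating_sites(n=10, theta=5.0)
print(mean_s, var_s)
```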

3.
The ancestral selection graph, conditioned on the allelic types in the sample, is used to obtain a limiting gene genealogical process under strong selection. In an equilibrium, two-allele system with strong selection, neutral gene genealogies are predicted for random samples and for samples containing at most one unfavorable allele. Samples containing more than one unfavorable allele have gene genealogies that differ greatly from neutral predictions. However, they are related to neutral gene genealogies via the well-known Ewens sampling formula. Simulations show rapid convergence to limiting analytical predictions as the strength of selection increases. These results extend the idea of a soft selective sweep to deleterious alleles and have implications for the interpretation of polymorphism among disease-causing alleles in humans.
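
For readers unfamiliar with the Ewens sampling formula referred to above, the sketch below evaluates it for a given allelic configuration. It illustrates the formula only and does not reproduce the paper's genealogical argument; the function name, the `counts` encoding and `theta` are assumptions made for this example.

```python
from math import factorial


def ewens_probability(counts, theta):
    """Probability of an allelic configuration under the Ewens sampling formula.

    counts[j - 1] is the number of allelic types seen exactly j times,
    so the sample size is n = sum(j * counts[j - 1]).
    """
    n = sum(j * a for j, a in enumerate(counts, start=1))
    rising = 1.0  # theta * (theta + 1) * ... * (theta + n - 1)
    for i in range(n):
        rising *= theta + i
    prob = factorial(n) / rising
    for j, a in enumerate(counts, start=1):
        prob *= theta**a / (j**a * factorial(a))
    return prob


# Sample of two genes: two distinct alleles versus one allele seen twice.
print(ewens_probability([2, 0], theta=1.0), ewens_probability([0, 1], theta=1.0))
```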

4.
The "Hitchhiking Effect" Revisited (Cited by: 49; self-citations: 18; citations by others: 49)
N. L. Kaplan, R. R. Hudson, C. H. Langley. Genetics, 1989, 123(4):887-899
The number of selectively neutral polymorphic sites in a random sample of genes can be affected by ancestral selectively favored substitutions at linked loci. The degree to which this happens depends on when in the history of the sample the selected substitutions happen, the strength of selection and the amount of crossing over between the sampled locus and the loci at which the selected substitutions occur. This phenomenon is commonly called hitchhiking. Using the coalescent process for a random sample of genes from a selectively neutral locus that is linked to a locus at which selection is taking place, a stochastic, finite population model is developed that describes the steady state effect of hitchhiking on the distribution of the number of selectively neutral polymorphic sites in a random sample. A prediction of the model is that, in regions of low crossing over, strongly selected substitutions in the history of the sample can substantially reduce the number of polymorphic sites in a random sample of genes from that expected under a neutral model.

5.
Polymorphism and Balancing Selection at Major Histocompatibility Complex Loci (Cited by: 11; self-citations: 0; citations by others: 11)
N. Takahata, Y. Satta, J. Klein. Genetics, 1992, 130(4):925-938
Amino acid replacements in the peptide-binding region (PBR) of the functional major histocompatibility complex (Mhc) genes appear to be driven by balancing selection. Of the various types of balancing selection, we have examined a model equivalent to overdominance that confers heterozygote advantage. As discussed by A. Robertson, overdominance selection tends to maintain alleles that have more or less the same degree of heterozygote advantage. Because of this symmetry, the model makes various testable predictions about the genealogical relationships among different alleles and provides ways of analyzing DNA sequences of Mhc alleles. In this paper, we analyze DNA sequences of 85 alleles at the HLA-A, -B, -C, -DRB1 and -DQB1 loci with respect to the number of alleles and extent of nucleotide differences at the PBR, as well as at the synonymous (presumably neutral) sites. Theory suggests that the number of alleles that differ at the sites targeted by selection (presumably the nonsynonymous sites in the PBR) should be equal to the mean number of nucleotide substitutions among pairs of alleles. We also demonstrate that the nucleotide substitution rate at the targeted sites relative to that of neutral sites may be much larger than 1. The predictions of the presented model are in surprisingly good agreement with the actual data and thus provide means for inferring certain population parameters. For overdominance selection in a finite population at equilibrium, the product of selection intensity (s) against homozygotes and the effective population size (N) is estimated to be 350-3000, being largest at the B locus and smallest at the C locus. We argue that N is of the order of 10^5 and s is several percent at most, if the mutation rate per site per generation is 10^-8.
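
The closing estimate can be checked with simple arithmetic: dividing the reported range of the product Ns by the assumed population size gives the implied range of s. This is only a back-of-the-envelope reading of the figures quoted in the abstract, taking N as exactly 10^5 for illustration.

```latex
s = \frac{Ns}{N}: \qquad
\frac{350}{10^{5}} = 3.5 \times 10^{-3}
\quad\text{to}\quad
\frac{3000}{10^{5}} = 3 \times 10^{-2}
```

That is, s would lie between roughly 0.35% and 3%, which is consistent with the statement that s is several percent at most.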

6.
Kim Y, Stephan W. Genetics, 2000, 155(3):1415-1427
Due to relatively high rates of strongly selected deleterious mutations, directional selection on favorable alleles (causing hitchhiking effects on linked neutral polymorphisms) is expected to occur while a deleterious mutation-selection balance is present in a population. We analyze this interaction of directional selection and background selection and study their combined effects on neutral variation, using a three-locus model in which each locus is subjected to either deleterious, favorable, or neutral mutations. Average heterozygosity is measured by simulations (1) at the stationary state under the assumption of recurrent hitchhiking events and (2) as a transient level after a single hitchhiking event. The simulation results are compared to theoretical predictions. It is shown that known analytical solutions describing the hitchhiking effect without background selection can be modified such that they accurately predict the joint effects of hitchhiking and background selection on linked, neutral variation. Generalization of these results to a more appropriate multilocus model (such that background selection can occur at multiple sites) suggests that, in regions of very low recombination rates, stationary levels of nucleotide diversity are primarily determined by hitchhiking, whereas in regions of high recombination, background selection is the dominant force. The implications of these results for the identification and estimation of the relevant parameters of the model are discussed.

7.
The simple codominant expression of the alleles at the self-incompatibility locus (S) of Solanaceae and their extraordinary spectrum of sequence diversity present an ideal case for understanding the molecular forces that shape the sequence polymorphism of genes involved in recognition reactions. Here, with unprecedented conspecific sequences of 33 Physalis longifolia S alleles, including 27 of their single introns, the tempo-spatial nucleotide substitution patterns were first detailed for the S locus and analyzed in genealogical time scales. Three major genealogical clades of the 33 P. longifolia S alleles were used to divide the genealogical time into the within-clade and the between-clade periods. During the within-clade period, the average nonsynonymous substitution rate was 50% higher than the intron substitution rate but the opposite trend emerged for the between-clade period. A new and simple method developed here was utilized to estimate the selection intensity in the coding regions. The magnitudes and the distribution of these estimates, in conjunction with the spatial substitution pattern among closely related S sequences, revealed an initial short-term action of strong positive selection and a continuous but weaker action of negative selection brought by functional/structural constraints on the S alleles. The two modes of selection can significantly modify the branch lengths of an S genealogy and may be ubiquitous to recognition systems.

8.
M. Kreitman, R. R. Hudson. Genetics, 1991, 127(3):565-582
The DNA sequences of 11 Drosophila melanogaster lines are compared across three contiguous regions, the Adh and Adh-dup loci and a noncoding 5' flanking region of Adh. Ninety-eight of approximately 4750 sites are segregating in the sample, 36 in the 5' flanking region, 38 in Adh and 24 in Adh-dup. Several methods are presented to test whether the patterns and levels of polymorphism are consistent with neutral molecular evolution. The analysis of within- and between-species polymorphism indicates that the region is evolving in a nonneutral and complex fashion. A graphical analysis of the data provides support for a hypothesized balanced polymorphism at or near position 1490, site of the amino acid replacement difference between Adh^f and Adh^s. The Adh-dup locus is less polymorphic than Adh and all 24 of its polymorphisms occur at low frequency, suggestive of a recent selective substitution in the Adh-dup region. Adh^s alleles form two distinct evolutionary lineages that differ one from another at a total of nineteen sites in the Adh and Adh-dup loci. The polymorphisms are in complete linkage disequilibrium. A recombination experiment failed to find evidence for recombination suppression between the two allelic classes. Two hypotheses are presented to account for the widespread distribution of the two divergent lineages in natural populations. Natural selection appears to have played an important role in governing the overall patterns of nucleotide variation across the two-gene region.

9.
Within-population variation at the DNA level will rarely be studied by sequencing of loci of randomly chosen individuals. Instead, individuals will usually be chosen for sequencing based on some knowledge of their genotype. Data collected in this way require new sampling theory. Motivated by these observations, we have examined the sampling properties of a finite population model with two mutation processes and with no selection or recombination. One mutation process generates new alleles according to an infinite-alleles model, and the other generates polymorphisms at sites according to an infinite-sites model. A sample of n genes is considered. The stationary distribution of the number of segregating sites in a subsample from one of the allelic classes in the sample conditional on the allelic configuration of the sample is studied. A recursive scheme is developed to compute the moments of this distribution, and it is shown that the distribution is functionally independent of the number of additional alleles in the sample and their respective frequencies in the sample. For the case in which the sample contains only two alleles, the distribution of the number of segregating sites in a subsample containing both alleles conditional on the sample frequencies of the alleles is studied. The results are applied to the analysis of DNA sequences of two alleles found at the Adh locus of Drosophila melanogaster. No significant departure from the neutral model is detected.

10.
B. S. Gaut, M. T. Clegg. Genetics, 1993, 135(4):1091-1097
We investigated nucleotide polymorphism in the Adh1 locus of pearl millet (Pennisetum glaucum) (Poaceae) by determining the DNA sequence of 20 alleles from 10 individuals. The individuals were sampled from throughout pearl millet's indigenous range and represent both wild and cultivated accessions. Our results indicated that there is little nucleotide polymorphism in the Adh1 locus. Estimates of per site nucleotide polymorphism did not differ significantly between cultivated and wild millet accessions. We compared nucleotide polymorphism in pearl millet Adh1 with nucleotide polymorphism in maize (Zea mays) Adh1 and conclude that the maize Adh1 sample is more polymorphic. Increased polymorphism in maize Adh1 may be attributable, in part, to faster substitution rates in the maize lineage. Analysis suggests that substitution rates in the maize Adh1 lineage are ~1.7 times faster than substitution rates in the millet Adh1 lineage.

11.
The assumption that selection alters the genealogical tree of a sample of alleles from a population relative to the neutral expectation underlies several "tests of neutrality." Two recent papers have studied the effect of purifying selection; their suggestive but incomplete results indicate that, in the single site case, the shape of a gene genealogy for a locus may differ only slightly from the neutral expectation. We verify this finding for weak selection using the "ancestral selection graph." We consider a wider range of models, including both a four-allele single-site model and an infinite-sites model. Our results confirm the previous claim for the symmetric-mutation single site model. We emphasize, however, that a neutral-seeming genealogy is consistent with detectable effects of selection on the distribution of allele frequencies within the sample. With selection operating, the information about a sample cannot be reduced to the genealogy. As a result, a distinction needs to be made between the selected sites themselves, for which the genealogy offers insufficient information, and linked neutral variation. This distinction seems to have been overlooked in previous papers, yet it has significant implications for the interpretation of data on DNA sequence variation. In particular, it predicts that under purifying selection, the frequency spectrum of neutral mutations will not reflect the skew toward rare polymorphisms at replacement sites even if there is no recombination between them. We caution, however, that the effect of weak selection on the genealogy is specific to the model; a (more realistic) model of multiple linked sites could lead to a more distorted genealogy than is observed for a single site.

12.
13.
The effect of selection on patterns of genetic structure within and between populations may be studied by contrasting observed patterns at the genes targeted by selection with those of unlinked neutral marker loci. Local directional selection on target genes will produce stronger population genetic structure than at neutral loci, whereas the reverse is expected for balancing selection. However, theoretical predictions on the intensity of this signal under precise models of balancing selection are still lacking. Using negative frequency-dependent selection acting on self-incompatibility systems in plants as a model of balancing selection, we investigated the effect of such selection on patterns of spatial genetic structure within a continuous population. Using numerical simulations, we tested the effect of the type of self-incompatibility system, the number of alleles at the self-incompatibility locus and the dominance interactions among them, the extent of gene dispersal, and the immigration rate on spatial genetic structure at the selected locus and at unlinked neutral loci. We confirm that frequency-dependent selection is expected to reduce the extent of spatial genetic structure as compared to neutral loci, particularly in situations with a low number of alleles at the self-incompatibility locus, a high frequency of codominant interactions among alleles, restricted gene dispersal and restricted immigration from outside populations. Hence the signature of selection on spatial genetic structure is expected to vary across species and populations, and we show that empirical data from the literature as well as data reported here on three natural populations of the herb Arabidopsis halleri confirm these theoretical results.

14.
A 3.5-kb segment of the alcohol dehydrogenase (Adh) region that includes the Adh and Adh-related genes was sequenced in 139 Drosophila pseudoobscura strains collected from 13 populations. The Adh gene encodes four protein alleles and rejects a neutral model of protein evolution with the McDonald-Kreitman test, although the number of segregating synonymous sites is too high to conclude that adaptive selection has operated. The Adh-related gene encodes 18 protein haplotypes and fails to reject an equilibrium neutral model. The populations fail to show significant geographic differentiation of the Adh-related haplotypes. Eight of 404 single nucleotide polymorphisms (SNPs) in the Adh region were in significant linkage disequilibrium with three ADHR protein alleles. Coalescent simulations with and without recombination were used to derive the expected levels of significant linkage disequilibrium between SNPs and 18 protein haplotypes. Maximum levels of linkage disequilibrium are expected for protein alleles at moderate frequencies. In coalescent models without recombination, linkage disequilibrium decays between SNPs and high frequency haplotypes because common alleles mutate to haplotypes that are rare or that reach moderate frequency. The implication of this study is that linkage disequilibrium mapping has the highest probability of success with disease-causing alleles at frequencies of 10%.
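
The McDonald-Kreitman test mentioned above compares replacement and synonymous changes that are fixed between species with those that are polymorphic within a species, arranged as a 2×2 table. Below is a minimal sketch using a two-sided Fisher exact test written with the standard library; the function name `mk_test` and the counts in the usage line are hypothetical and are not the values from this study, which may also have used a different test statistic.

```python
from math import comb


def mk_test(dn, ds, pn, ps):
    """McDonald-Kreitman 2x2 test.

    dn, ds: fixed replacement and synonymous differences between species.
    pn, ps: polymorphic replacement and synonymous sites within species.
    Returns (neutrality_index, two-sided Fisher exact p-value).
    """
    row1, row2 = dn + ds, pn + ps
    col1 = dn + pn
    total = row1 + row2

    def table_prob(x):
        # Hypergeometric probability that the top-left cell equals x.
        return comb(row1, x) * comb(row2, col1 - x) / comb(total, col1)

    observed = table_prob(dn)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum every table that is no more probable than the observed one.
    p_value = sum(
        table_prob(x) for x in range(lo, hi + 1)
        if table_prob(x) <= observed * (1 + 1e-9)
    )
    if dn and ds and ps:
        ni = (pn / ps) / (dn / ds)
    else:
        ni = float("nan")  # index undefined when a needed cell is zero
    return ni, min(p_value, 1.0)


# Hypothetical counts for illustration only.
print(mk_test(dn=8, ds=10, pn=3, ps=30))
```

A neutrality index below 1 indicates an excess of fixed replacement changes relative to polymorphism, which is the pattern usually read as evidence of adaptive substitution.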

15.
Ohashi J, Tokunaga K. Genetics, 2000, 155(2):921-927
The sojourn times until fixation of an overdominant allele were investigated based on the diffusion equation. Furthermore, the rate of accumulation of mutations, or the substitution rate, was predicted from the mean extinction time of a common overdominant allele. The substitution rate calculated theoretically agreed well with that determined by computer simulation. Overdominant selection enhances the polymorphism at linked loci, while its effect on the sojourn times and the substitution rate at linked loci has not been studied yet. To solve these problems, a model that assumed two linked loci, each with infinite alleles, was examined by computer simulation. A decrease in the recombination rate between two loci markedly changed the distribution of sojourn times of a neutral allele. Although overdominant selection obviously increased the sojourn times and the polymorphism at a linked locus, the rate of nucleotide substitution at the neutral locus was not influenced significantly even if complete linkage was assumed. These results suggest that, in regions containing overdominant genes, linked neutral loci will exhibit elevated levels of polymorphism, but their rate of molecular evolution remains that predicted by neutral theory.

16.
Current methods of identifying positively selected regions in the genome are limited in two key ways: the underlying models cannot account for the timing of adaptive events and the comparison between models of selective sweeps and sequence data is generally made via simple summaries of genetic diversity. Here, we develop a tractable method of describing the effect of positive selection on the genealogical histories in the surrounding genome, explicitly modeling both the timing and context of an adaptive event. In addition, our framework allows us to go beyond analyzing polymorphism data via the site frequency spectrum or summaries thereof and instead leverage information contained in patterns of linked variants. Tests on both simulations and a human data example, as well as a comparison to SweepFinder2, show that even with very small sample sizes, our analytic framework has higher power to identify old selective sweeps and to correctly infer both the time and strength of selection. Finally, we derived the marginal distribution of genealogical branch lengths at a locus affected by selection acting at a linked site. This provides a much-needed link between our analytic understanding of the effects of sweeps on sequence variation and recent advances in simulation and heuristic inference procedures that allow researchers to examine the sequence of genealogical histories along the genome.

17.
X. Vekemans, M. Slatkin. Genetics, 1994, 137(4):1157-1165
The properties of gene and allelic genealogies at a gametophytic self-incompatibility locus in plants have been investigated analytically and checked against extensive numerical simulations. It is found that, as with overdominant loci, there are two genealogical processes with markedly different time scales. First, functionally distinct allelic lines diverge on an extremely long time scale which is inversely related to the mutation rate to new alleles. These alleles show a genealogical structure which is similar, after an appropriate rescaling of time, to that described by the coalescent process for genes at a neutral locus. Second, gene copies sampled within the same functional allelic line show genealogical relationships similar to neutral gene genealogies but on a much shorter time scale, which is on the same order of magnitude as the harmonic mean of the number of gene copies within an allelic line. These results are discussed in relation to data showing trans-specific polymorphisms for alleles at the gametophytic self-incompatibility locus in the Solanaceae. It is shown that population sizes on the order of 4 × 10^5 and a mutation rate per locus per generation as high as 10^-6 could account for estimated allelic divergence times in this family.

18.
J Wang, W G Hill. Genetics, 1999, 153(3):1475-1489
Transition matrices for selfing and full-sib mating were derived to investigate the effect of selection against deleterious mutations on the process of inbreeding at a linked neutral locus. Selection was allowed to act within lines only (selection type I) or equally within and between lines (type II). For selfing lines under selection type I, inbreeding is always retarded, the retardation being determined by the recombination fraction between the neutral and selected loci and the inbreeding depression from the selected locus, irrespective of the selection coefficient (s) and dominance coefficient (h) of the mutant allele. For selfing under selection type II or full-sib mating under both selection types, inbreeding is delayed by weak selection (small s and sh), due to the associative overdominance created at the neutral locus, and accelerated by strong selection, due to the elevated differential contributions between alternative alleles at the neutral locus within individuals and between lines (for selection type II). For multiple fitness loci under selection, stochastic simulations were run for populations with selfing, full-sib mating, and random mating, using empirical estimates of mutation parameters and inbreeding load in Drosophila. The simulation results are in general compatible with empirical observations.

19.
The effect of multi-allelic balancing selection on nucleotide diversity at linked neutral sites was investigated by simulations of subdivided populations. The motivation is to understand the behaviour of self-recognition systems such as the MHC and plant self-incompatibility. For neutral sites, two types of subdivision are present: (1) into demes (connected by migration), and (2) into classes defined by different functional alleles at the selected locus (connected by recombination). Previous theoretical studies of each type of subdivision separately have shown that each increases diversity, and decreases the relative frequencies of low-frequency variants, at neutral sites or loci. We show here that the two types of subdivision act non-additively when sampling is at the whole population level, and that subdivision produces some non-intuitive results. For instance, in highly subdivided populations, genetic diversity at neutral sites may decrease with tighter linkage to a selected locus or site. Another conclusion is that, if there is population subdivision, balancing selection leads to decreased expected FST values for neutral sites linked to the selected locus. Finally, we show that the ability to detect balancing selection by its effects on linked variation, using tests such as Tajima's D, is reduced when genes in a subdivided population are sampled from the total population, rather than within demes.

20.
Nucleotide variation at the alcohol dehydrogenase locus (Adh) was studied in the outcrossing Arabidopsis lyrata, a close relative of the selfing Arabidopsis thaliana. Overall, estimated nucleotide diversity in the North American ssp. lyrata and two European ssp. petraea populations was 0.0038, lower than the corresponding species-wide estimate for A. thaliana at the same set of nucleotide sites. The distribution of segregating sites across the gene differed between the two species. Estimated sequence diversity within an A. lyrata population with a large sample size (0.0023) was much higher than has previously been observed for A. thaliana. This North American population has an excess of sites at intermediate frequencies compared with neutral expectation (Tajima's D = 2.3, P < 0.005), suggestive of linked balancing selection or a recent population bottleneck. In contrast, an excess of rare polymorphisms has been found in A. thaliana. Polymorphism within A. lyrata and divergence from A. thaliana appear to be correlated across the Adh gene sequence. The geographic distribution of polymorphism was quite different from that of A. thaliana, for which earlier studies of several genes found low within-population nucleotide site polymorphism and no overall continental differentiation of variation despite large differences in site frequencies between local populations. Differences between the outcrossing A. lyrata and the selfing A. thaliana reflect the impact of differences in mating system and the influence of bottlenecks in A. thaliana during rapid colonization on DNA sequence polymorphism. The influence of additional variability-reducing mechanisms, such as background selection or hitchhiking, may not be discernible.
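
Tajima's D, reported above, contrasts the mean number of pairwise differences with the number of segregating sites: positive values indicate an excess of intermediate-frequency variants and negative values an excess of rare ones. The following is a sketch of the standard statistic using Tajima's normalizing constants; the function name and the two toy samples are hypothetical and unrelated to the Adh data.

```python
from itertools import combinations
from math import sqrt


def tajimas_d(seqs):
    """Tajima's D for aligned, equal-length sequences (needs n >= 4)."""
    n = len(seqs)
    if n < 4:
        raise ValueError("Tajima's D needs at least four sequences")
    # S: number of segregating (polymorphic) alignment columns.
    seg_sites = sum(1 for column in zip(*seqs) if len(set(column)) > 1)
    if seg_sites == 0:
        return float("nan")  # undefined without polymorphism
    # k: mean number of pairwise differences.
    pairs = list(combinations(seqs, 2))
    k = sum(sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs) / len(pairs)
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i**2 for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1**2
    e1 = c1 / a1
    e2 = c2 / (a1**2 + a2)
    variance = e1 * seg_sites + e2 * seg_sites * (seg_sites - 1)
    return (k - seg_sites / a1) / sqrt(variance)


# Hypothetical toy samples: singletons only versus balanced variants.
rare = ["AAAA", "TAAA", "ATAA", "AATA"]
balanced = ["AA", "AA", "TT", "TT"]
print(tajimas_d(rare), tajimas_d(balanced))
```

In this sketch the singleton-only sample yields a negative D and the balanced sample a positive D, matching the qualitative contrast drawn between A. thaliana and the North American A. lyrata population.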
