共查询到20条相似文献,搜索用时 15 毫秒
1.
It has previously been shown that, conditional on its fixation, the time to fixation of a semi-dominant deleterious autosomal mutation in a randomly mating population is the same as that of an advantageous mutation. This result implies that deleterious mutations could generate selective sweep-like effects. Although their fixation probabilities greatly differ, the much larger input of deleterious relative to beneficial mutations suggests that this phenomenon could be important. We here examine how the fixation of mildly deleterious mutations affects levels and patterns of polymorphism at linked sites—both in the presence and absence of interference amongst deleterious mutations—and how this class of sites may contribute to divergence between-populations and species. We find that, while deleterious fixations are unlikely to represent a significant proportion of outliers in polymorphism-based genomic scans within populations, minor shifts in the frequencies of deleterious mutations can influence the proportions of private variants and the value of FST after a recent population split. As sites subject to deleterious mutations are necessarily found in functional genomic regions, interpretations in terms of recurrent positive selection may require reconsideration. 相似文献
2.
There are few methods tailored for detecting signals of positive selection in populations directly ancestral to multiple descendent populations. We introduce the ancestral branch statistic (ABS), a four‐population summary statistic for identifying selective sweeps occurring in the direct ancestor of a pair of populations. Simulations show that ABS performs at least as well as, and often better under model violations, than the complementary likelihood approach of 3P‐CLR across diverse selection scenarios and parameter values. We first applied ABS to contemporary human genomic data to identify genes that may have been adaptive in ancestral East Asian populations, uncovering the well‐established candidate EDAR, as well as a novel candidate SLC35F3, which encodes a putative thiamine transporter that may have been involved in adaptation to eating polished grains. Next, we performed scans with ancient European genomic data to reexamine evidence of recent positive selection in ancestral Europeans. The MCM6/LCT cluster and the SLC45A2 and HERC2 genes are strong outliers, agreeing with previous studies. Novel candidates, such as SLC30A9 and CYP1A2, may have been involved in adaptation to local nutrient sufficiency and lifestyle changes. Finally, we provide open‐source software, CalcABS, which can perform genomic scans of ancestral sweeps with ABS from population allele frequency data. 相似文献
3.
We study the trajectory of an allele that affects a polygenic trait selected toward a phenotypic optimum. Furthermore, conditioning on this trajectory we analyze the effect of the selected mutation on linked neutral variation. We examine the well-characterized two-locus two-allele model but we also provide results for diallelic models with up to eight loci. First, when the optimum phenotype is that of the double heterozygote in a two-locus model, and there is no dominance or epistasis of effects on the trait, the trajectories of selected mutations rarely reach fixation; instead, a polymorphic equilibrium at both loci is approached. Whether a polymorphic equilibrium is reached (rather than fixation at both loci) depends on the intensity of selection and the relative distances to the optimum of the homozygotes at each locus. Furthermore, if both loci have similar effects on the trait, fixation of an allele at a given locus is less likely when it starts at low frequency and the other locus is polymorphic (with alleles at intermediate frequencies). Weaker selection increases the probability of fixation of the studied allele, as the polymorphic equilibrium is less stable in this case. When we do not require the double heterozygote to be at the optimum we find that the polymorphic equilibrium is more difficult to reach, and fixation becomes more likely. Second, increasing the number of loci decreases the probability of fixation, because adaptation to the optimum is possible by various combinations of alleles. Summaries of the genealogy (height, total length, and imbalance) and of sequence polymorphism (number of polymorphisms, frequency spectrum, and haplotype structure) next to a selected locus depend on the frequency that the selected mutation approaches at equilibrium. We conclude that multilocus response to selection may in some cases prevent selective sweeps from being completed, as described in previous studies, but that conditions causing this to happen strongly depend on the genetic architecture of the trait, and that fixation of selected mutations is likely in many instances. 相似文献
4.
Nair S Nash D Sudimack D Jaidee A Barends M Uhlemann AC Krishna S Nosten F Anderson TJ 《Molecular biology and evolution》2007,24(2):562-573
When selection is strong and beneficial alleles have a single origin, local reductions in genetic diversity are expected. However, when beneficial alleles have multiple origins or were segregating in the population prior to a change in selection regime, the impact on genetic diversity may be less clear. We describe an example of such a "soft" selective sweep in the malaria parasite Plasmodium falciparum that involves adaptive genome rearrangements. Amplification in copy number of genome regions containing the pfmdr1 gene on chromosome 5 confer resistance to mefloquine and spread rapidly in the 1990s. Using flanking microsatellite data and real-time polymerase chain reaction determination of copy number, we show that 5-15 independent amplification events have occurred in parasites on the Thailand/Burma border. The amplified genome regions (amplicons) range in size from 14.7 to 49 kb and contain 2-11 genes, with 2-4 copies arranged in tandem. To examine the impact of drug selection on flanking variation, we genotyped 48 microsatellites on chromosome 5 in 326 parasites from a single Thai location. Diversity was reduced in a 170- to 250-kb (10-15 cM) region of chromosomes containing multiple copies of pfmdr1, consistent with hitchhiking resulting from the rapid recent spread of selected chromosomes. However, diversity immediately flanking pfmdr1 is reduced by only 42% on chromosomes bearing multiple amplicons relative to chromosomes carrying a single copy. We highlight 2 features of these results: 1) All amplicon break points occur in monomeric A/T tracts (9-45 bp). Given the abundance of these tracts in P. falciparum, we expect that duplications will occur frequently at multiple genomic locations and have been underestimated as drivers of phenotypic evolution in this pathogen. 2) The signature left by the spread of amplified genome segments is broad, but results in only limited reduction in diversity. If such "soft" sweeps are common in nature, statistical methods based on diversity reduction may be inefficient at detecting evidence for selection in genome-wide marker screens. This may be particularly likely when mutation rate is high, as appears to be the case for gene duplications, and in pathogen populations where effective population sizes are typically very large. 相似文献
5.
MEIKE THOMAS FRIEDRICH M
LLER THOMAS WIEHE DIETHARD TAUTZ 《Molecular ecology resources》2007,7(3):400-403
We have evaluated a pooling approach that can reduce the number of polymerase chain reactions in a screen for selective sweeps by more than an order of magnitude. We show that the complex peak pattern that results from pooling of all samples from a given population is a faithful reflection of the composite pattern of the individual alleles, although with an under‐representation of the larger alleles. Candidate loci for selective sweeps can be identified by visual inspection of the pool patterns. We have also implemented a software tool, which can find suitable microsatellite loci in the vicinity of annotated genes. 相似文献
6.
Selective sweeps are typically associated with a local reduction of genetic diversity around the adaptive site. However, selective sweeps can also quickly carry neutral mutations to observable population frequencies if they arise early in a sweep and hitchhike with the adaptive allele. We show that the interplay between mutation and exponential amplification through hitchhiking results in a characteristic frequency spectrum of the resulting novel haplotype variation that depends only on the ratio of the mutation rate and the selection coefficient of the sweep. On the basis of this result, we develop an estimator for the selection coefficient driving a sweep. Since this estimator utilizes the novel variation arising from mutations during a sweep, it does not rely on preexisting variation and can also be applied to loci that lack recombination. Compared with standard approaches that infer selection coefficients from the size of dips in genetic diversity around the adaptive site, our estimator requires much shorter sequences but sampled at high population depth to capture low-frequency variants; given such data, it consistently outperforms standard approaches. We investigate analytically and numerically how the accuracy of our estimator is affected by the decay of the sweep pattern over time as a consequence of random genetic drift and discuss potential effects of recombination, soft sweeps, and demography. As an example for its use, we apply our estimator to deep sequencing data from human immunodeficiency virus populations. 相似文献
7.
Wolfgang Stephan 《Ecology and evolution》2021,11(22):15896
Detecting selective sweeps driven by strong positive selection and localizing the targets of selection in the genome play a major role in modern population genetics and genomics. Most of these analyses are based on the classical model of genetic hitchhiking proposed by Maynard Smith and Haigh (1974, Genetical Research, 23, 23). Here, we consider extensions of the classical two‐locus model. Introducing mutation at the strongly selected site, we analyze the conditions under which soft sweeps may arise. We identify a new parameter (the ratio of the beneficial mutation rate to the selection coefficient) that characterizes the occurrence of multiple‐origin soft sweeps. Furthermore, we quantify the hitchhiking effect when the polymorphism at the linked locus is not neutral but maintained in a mutation‐selection balance. In this case, we find a smaller relative reduction of heterozygosity at the linked site than for a neutral polymorphism. In our analysis, we use a semi‐deterministic approach; i.e., we analyze the frequency process of the beneficial allele in an infinitely large population when its frequency is above a certain threshold; however, for very small frequencies in the initial phase after the onset of selection we rely on diffusion theory. 相似文献
8.
Current methods of identifying positively selected regions in the genome are limited in two key ways: the underlying models cannot account for the timing of adaptive events and the comparison between models of selective sweeps and sequence data is generally made via simple summaries of genetic diversity. Here, we develop a tractable method of describing the effect of positive selection on the genealogical histories in the surrounding genome, explicitly modeling both the timing and context of an adaptive event. In addition, our framework allows us to go beyond analyzing polymorphism data via the site frequency spectrum or summaries thereof and instead leverage information contained in patterns of linked variants. Tests on both simulations and a human data example, as well as a comparison to SweepFinder2, show that even with very small sample sizes, our analytic framework has higher power to identify old selective sweeps and to correctly infer both the time and strength of selection. Finally, we derived the marginal distribution of genealogical branch lengths at a locus affected by selection acting at a linked site. This provides a much-needed link between our analytic understanding of the effects of sweeps on sequence variation and recent advances in simulation and heuristic inference procedures that allow researchers to examine the sequence of genealogical histories along the genome. 相似文献
9.
10.
Rmi Tournebize Valrie Poncet Mattias Jakobsson Yves Vigouroux Stphanie Manel 《Molecular ecology resources》2019,19(1):283-295
Inferring the mode and tempo of natural selection helps further our understanding of adaptation to past environmental changes. Here, we introduce McSwan, a method to detect and date past and recent natural selection events in the case of a hard sweep. The method is based on the comparison of site frequency spectra obtained under various demographic models that include selection. McSwan demonstrated high power (high sensitivity and specificity) in capturing hard selective sweep events without requiring haplotype phasing. It performed slightly better than SweeD when the recent effective population size was low and the genomic region was small. We then applied our method to a European (CEU) and an African (LWK) human re‐sequencing data set. Most hard sweeps were detected in the CEU population (96%). Moreover, hard sweeps in the African population were estimated to have occurred further back in time (mode: 43,625 years BP) compared to those of Europeans (mode: 24,850 years BP). Most of the estimated ages of hard sweeps in Europeans were associated with the Last Glacial Maximum and were enriched in immunity‐associated genes. 相似文献
11.
Simon Boitard Robert Kofler Pierre Françoise David Robelin Christian Schlötterer Andreas Futschik 《Molecular ecology resources》2013,13(2):337-340
Due to its cost effectiveness, next generation sequencing of pools of individuals (Pool‐Seq) is becoming a popular strategy for genome‐wide estimation of allele frequencies in population samples. As the allele frequency spectrum provides information about past episodes of selection, Pool‐seq is also a promising design for genomic scans for selection. However, no software tool has yet been developed for selection scans based on Pool‐Seq data. We introduce Pool‐hmm, a Python program for the estimation of allele frequencies and the detection of selective sweeps in a Pool‐Seq sample. Pool‐hmm includes several options that allow a flexible analysis of Pool‐Seq data, and can be run in parallel on several processors. Source code and documentation for Pool‐hmm is freely available at https://qgsp.jouy.inra.fr/ . 相似文献
12.
Genome scans of DNA variability in humans reveal evidence for selective sweeps outside of Africa 总被引:17,自引:0,他引:17
The last 50,000-150,000 years of human history have been characterized by rapid demographic expansions and the colonization of novel environments outside of sub-Saharan Africa. Mass migrations outside the ancestral species range likely entailed many new selection pressures, suggesting that genetic adaptation to local environmental conditions may have been more prevalent in colonizing populations outside of sub-Saharan Africa. Here we report a test of this hypothesis using genome-wide patterns of DNA polymorphism. We conducted a multilocus scan of microsatellite variability to identify regions of the human genome that may have been subject to continent-specific hitchhiking events. Using published polymorphism data for a total of 624 autosomal loci in multiple populations of humans, we used coalescent simulations to identify candidate loci for geographically restricted selective sweeps. We identified a total of 13 loci that appeared as outliers in replicated population comparisons involving different reference samples for Africa. A disproportionate number of these loci exhibited reduced levels of relative variability in non-African populations alone, suggesting that recent episodes of positive selection have been more prevalent outside of sub-Saharan Africa. 相似文献
13.
Computer simulation is an essential tool in the analysis ofDNA sequence variation for mapping events of recent adaptiveevolution in the genome. Various simulation methods are employedto predict the signature of selection in sequence variation.The most informative and efficient method currently in use iscoalescent simulation. However, this method is limited to simplemodels of directional selection. Whole-population forward-in-timesimulations are the alternative to coalescent simulations formore complex models. The notorious problem of excessive computationalcost in forward-in-time simulations can be overcome by varioussimplifying amendments. Overall, the success of simulationsdepends on the creative application of some population genetictheory to the simulation algorithm. 相似文献
14.
A report on the 46th annual PopGroup conference, Glasgow, UK, December 18-21,2012. 相似文献
15.
Benjamin C. Haller Jared Galloway Jerome Kelleher Philipp W. Messer Peter L. Ralph 《Molecular ecology resources》2019,19(2):552-566
There is an increasing demand for evolutionary models to incorporate relatively realistic dynamics, ranging from selection at many genomic sites to complex demography, population structure, and ecological interactions. Such models can generally be implemented as individual‐based forward simulations, but the large computational overhead of these models often makes simulation of whole chromosome sequences in large populations infeasible. This situation presents an important obstacle to the field that requires conceptual advances to overcome. The recently developed tree‐sequence recording method (Kelleher, Thornton, Ashander, & Ralph, 2018), which stores the genealogical history of all genomes in the simulated population, could provide such an advance. This method has several benefits: (1) it allows neutral mutations to be omitted entirely from forward‐time simulations and added later, thereby dramatically improving computational efficiency; (2) it allows neutral burn‐in to be constructed extremely efficiently after the fact, using “recapitation”; (3) it allows direct examination and analysis of the genealogical trees along the genome; and (4) it provides a compact representation of a population's genealogy that can be analysed in Python using the msprime package. We have implemented the tree‐sequence recording method in SLiM 3 (a free, open‐source evolutionary simulation software package) and extended it to allow the recording of non‐neutral mutations, greatly broadening the utility of this method. To demonstrate the versatility and performance of this approach, we showcase several practical applications that would have been beyond the reach of previously existing methods, opening up new horizons for the modelling and exploration of evolutionary processes. 相似文献
16.
Roven Rommel Fuentes Dick de Ridder Aalt D J van Dijk Sander A Peters 《Molecular biology and evolution》2022,39(1)
Meiotic recombination is a biological process of key importance in breeding, to generate genetic diversity and develop novel or agronomically relevant haplotypes. In crop tomato, recombination is curtailed as manifested by linkage disequilibrium decay over a longer distance and reduced diversity compared with wild relatives. Here, we compared domesticated and wild populations of tomato and found an overall conserved recombination landscape, with local changes in effective recombination rate in specific genomic regions. We also studied the dynamics of recombination hotspots resulting from domestication and found that loss of such hotspots is associated with selective sweeps, most notably in the pericentromeric heterochromatin. We detected footprints of genetic changes and structural variants, among them associated with transposable elements, linked with hotspot divergence during domestication, likely causing fine-scale alterations to recombination patterns and resulting in linkage drag. 相似文献
17.
Genomic analysis of a migratory divide reveals candidate genes for migration and implicates selective sweeps in generating islands of differentiation 下载免费PDF全文
Kira E. Delmore Sariel Hübner Nolan C. Kane Richard Schuster Rose L. Andrew Francisco Câmara Roderic Guigó Darren E. Irwin 《Molecular ecology》2015,24(8):1873-1888
Differential gene flow, reductions in diversity following linked selection and/or features of the genome can structure patterns of genomic differentiation during the process of speciation. Possible sources of reproductive isolation are well studied between coastal and inland subspecies groups of Swainson's thrushes, with differences in seasonal migratory behaviour likely playing a key role in reducing hybrid fitness. We assembled and annotated a draft reference genome for this species and generated whole‐genome shotgun sequence data for populations adjacent to the hybrid zone between these groups. We documented substantial genomewide heterogeneity in relative estimates of genetic differentiation between the groups. Within population diversity was lower in areas of high relative differentiation, supporting a role for selective sweeps in generating this pattern. Absolute genetic differentiation was reduced in these areas, further suggesting that recurrent selective sweeps in the ancestral population and/or between divergent populations following secondary contact likely occurred. Relative genetic differentiation was also higher near centromeres and on the Z chromosome, suggesting that features of the genome also contribute to genomewide heterogeneity. Genes linked to migratory traits were concentrated in islands of differentiation, supporting previous suggestions that seasonal migration is under divergent selection between Swainson's thrushes. Differences in migratory behaviour likely play a central role in the speciation of many taxa; we developed the infrastructure here to permit future investigations into the role several candidate genes play in reducing gene flow between not only Swainson's thrushes but other species as well. 相似文献
18.
Low levels of nucleotide diversity in mammalian Y chromosomes 总被引:10,自引:0,他引:10
Sex chromosomes provide a useful context for the study of the relative importance of evolutionary forces affecting genetic diversity. The human Y chromosome shows levels of nucleotide diversity 20% that of autosomes, which is significantly less than expected when differences in effective population size and sex-specific mutation rates are taken into account. To study the generality of low levels of Y chromosome variability in mammalian genomes, we investigated nucleotide diversity in intron sequences of X (1.1-3.0 kb) and Y (0.7-3.5 kb) chromosome genes of five mammals: lynx, wolf, reindeer, cattle, and field vole. For all species, nucleotide diversity was found to be lower on Y than on X, with no segregating site observed in Y-linked sequences of lynx, reindeer, and cattle. For X chromosome sequences, nucleotide diversity was in the range of 1.6 x 10(-4) (lynx) to 8.0 x 10(-4) (field vole). When differences in effective population size and the extent of the male mutation bias were taken into account, all five species showed evidence of reduced levels of Y chromosome variability. Reduced levels of Y chromosome variability have also been observed in Drosophila and in plants, as well as in the female-specific W chromosome of birds. Among the different factors proposed to explain low levels of genetic variability in the sex-limited chromosome (Y/W), we note that selection is the only factor that is broadly applicable irrespective of mode of reproduction and whether there is male or female heterogamety. 相似文献
19.
A haplotype method detects diverse scenarios of local adaptation from genomic sequence variation 下载免费PDF全文
Identifying genomic targets of population‐specific positive selection is a major goal in several areas of basic and applied biology. However, it is unclear how often such selection should act on new mutations versus standing genetic variation or recurrent mutation, and furthermore, favoured alleles may either become fixed or remain variable in the population. Very few population genetic statistics are sensitive to all of these modes of selection. Here, we introduce and evaluate the Comparative Haplotype Identity statistic (χMD), which assesses whether pairwise haplotype sharing at a locus in one population is unusually large compared with another population, relative to genomewide trends. Using simulations that emulate human and Drosophila genetic variation, we find that χMD is sensitive to a wide range of selection scenarios, and for some very challenging cases (e.g. partial soft sweeps), it outperforms other two‐population statistics. We also find that, as with FST, our haplotype approach has the ability to detect surprisingly ancient selective sweeps. Particularly for the scenarios resembling human variation, we find that χMD outperforms other frequency‐ and haplotype‐based statistics for soft and/or partial selective sweeps. Applying χMD and other between‐population statistics to published population genomic data from D. melanogaster, we find both shared and unique genes and functional categories identified by each statistic. The broad utility and computational simplicity of χMD will make it an especially valuable tool in the search for genes targeted by local adaptation. 相似文献
20.
Hitchhiking both ways: effect of two interfering selective sweeps on linked neutral variation 下载免费PDF全文
The neutral polymorphism pattern in the vicinity of a selective sweep can be altered by both stochastic and deterministic factors. Here, we focus on the impact of another selective sweep in the region of influence of a first one. We study the signature left on neutral polymorphism by positive selection at two closely linked loci, when both beneficial mutations reach fixation. We show that, depending on the timing of selective sweeps and on their selection coefficients, the two hitchhiking effects can interfere with each other, leading to less reduction in heterozygosity than a single selective sweep of the same magnitude and more importantly to an excess of intermediate-frequency variants relative to neutrality under some parameter values. This pattern can be sustained and potentially alter the detection of positive selection, including by provoking spurious detection of balancing selection. In situations where positive selection is suspected a priori at several closely linked loci, the polymorphism pattern in the region may also be informative about their selective histories. 相似文献