首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Genomic regions under positive selection harbor variation linked for example to adaptation. Most tools for detecting positively selected variants have computational resource requirements rendering them impractical on population genomic datasets with hundreds of thousands of individuals or more. We have developed and implemented an efficient haplotype-based approach able to scan large datasets and accurately detect positive selection. We achieve this by combining a pattern matching approach based on the positional Burrows–Wheeler transform with model-based inference which only requires the evaluation of closed-form expressions. We evaluate our approach with simulations, and find it to be both sensitive and specific. The computational resource requirements quantified using UK Biobank data indicate that our implementation is scalable to population genomic datasets with millions of individuals. Our approach may serve as an algorithmic blueprint for the era of “big data” genomics: a combinatorial core coupled with statistical inference in closed form.  相似文献   

2.
A key challenge in understanding how organisms adapt to their environments is to identify the mutations and genes that make it possible. By comparing patterns of sequence variation to neutral predictions across genomes, the targets of positive selection can be located. We applied this logic to house mice that invaded Gough Island (GI), an unusual population that shows phenotypic and ecological hallmarks of selection. We used massively parallel short-read sequencing to survey the genomes of 14 GI mice. We computed a set of summary statistics to capture diverse aspects of variation across these genome sequences, used approximate Bayesian computation to reconstruct a null demographic model, and then applied machine learning to estimate the posterior probability of positive selection in each region of the genome. Using a conservative threshold, 1,463 5-kb windows show strong evidence for positive selection in GI mice but not in a mainland reference population of German mice. Disproportionate shares of these selection windows contain genes that harbor derived nonsynonymous mutations with large frequency differences. Over-represented gene ontologies in selection windows emphasize neurological themes. Inspection of genomic regions harboring many selection windows with high posterior probabilities pointed to genes with known effects on exploratory behavior and body size as potential targets. Some genes in these regions contain candidate adaptive variants, including missense mutations and/or putative regulatory mutations. Our results provide a genomic portrait of adaptation to island conditions and position GI mice as a powerful system for understanding the genetic component of natural selection.  相似文献   

3.
The recent advent of high-throughput sequencing and genotyping technologies makes it possible to produce, easily and cost effectively, large amounts of detailed data on the genotype composition of populations. Detecting locus-specific effects may help identify those genes that have been, or are currently, targeted by natural selection. How best to identify these selected regions, loci, or single nucleotides remains a challenging issue. Here, we introduce a new model-based method, called SelEstim, to distinguish putative selected polymorphisms from the background of neutral (or nearly neutral) ones and to estimate the intensity of selection at the former. The underlying population genetic model is a diffusion approximation for the distribution of allele frequency in a population subdivided into a number of demes that exchange migrants. We use a Markov chain Monte Carlo algorithm for sampling from the joint posterior distribution of the model parameters, in a hierarchical Bayesian framework. We present evidence from stochastic simulations, which demonstrates the good power of SelEstim to identify loci targeted by selection and to estimate the strength of selection acting on these loci, within each deme. We also reanalyze a subset of SNP data from the Stanford HGDP–CEPH Human Genome Diversity Cell Line Panel to illustrate the performance of SelEstim on real data. In agreement with previous studies, our analyses point to a very strong signal of positive selection upstream of the LCT gene, which encodes for the enzyme lactase–phlorizin hydrolase and is associated with adult-type hypolactasia. The geographical distribution of the strength of positive selection across the Old World matches the interpolated map of lactase persistence phenotype frequencies, with the strongest selection coefficients in Europe and in the Indus Valley.  相似文献   

4.
5.
Comparisons of neutral marker and quantitative trait divergence can provide important insights into the relative roles of natural selection and neutral genetic drift in population differentiation. We investigated phenotypic and genetic differentiation among Fennoscandian threespine stickleback (Gasterosteus aculeatus) populations, and found that the highest degree of differentiation occurred between sea and freshwater habitats. Within habitats, morphological divergence was highest among the different freshwater populations. Pairwise phenotypic and neutral genetic distances among populations were positively correlated, suggesting that genetic drift may have contributed to the morphological differentiation among habitats. On the other hand, the degree of phenotypic differentiation (PST) clearly surpassed the neutral expectation set by FST, suggesting a predominant role for natural selection over genetic drift as an explanation for the observed differentiation. However, separate PST/FST comparisons by habitats revealed that body shape divergence between lake and marine populations, and even among marine populations, can be strongly influenced by natural selection. On the other hand, genetic drift can play an important role in the differentiation among lake populations.  相似文献   

6.
Examples of parallel evolution of phenotypic traits have been repeatedly demonstrated in threespine sticklebacks (Gasterosteus aculeatus) across their global distribution. Using these as a model, we performed a targeted genome scan--focusing on physiologically important genes potentially related to freshwater adaptation--to identify genetic signatures of parallel physiological evolution on a global scale. To this end, 50 microsatellite loci, including 26 loci within or close to (<6 kb) physiologically important genes, were screened in paired marine and freshwater populations from six locations across the Northern Hemisphere. Signatures of directional selection were detected in 24 loci, including 17 physiologically important genes, in at least one location. Although no loci showed consistent signatures of selection in all divergent population pairs, several outliers were common in multiple locations. In particular, seven physiologically important genes, as well as reference ectodysplasin gene (EDA), showed signatures of selection in three or more locations. Hence, although these results give some evidence for consistent parallel molecular evolution in response to freshwater colonization, they suggest that different evolutionary pathways may underlie physiological adaptation to freshwater habitats within the global distribution of the threespine stickleback.  相似文献   

7.
Genome scans have identified candidate regions of the genome undergoing selection in a wide variety of organisms, yet have rarely been applied to broadly dispersing marine organisms experiencing divergent selection pressures, where high recombination rates can reduce the extent of linkage disequilibrium (LD) and the ability to detect genomic regions under selection. The broadly dispersing periwinkle Echinolittorina hawaiiensis exhibits a heritable shell sculpture polymorphism that is correlated with environmental variation. To elucidate the genetic basis of phenotypic variation, a genome scan using over 1000 AFLP loci was conducted on smooth and sculptured snails from divergent habitats at four replicate sites. Approximately 5% of loci were identified as outliers with Dfdist, whereas no outliers were identified by BayeScan. Closer examination of the Dfdist outliers supported the conclusion that these loci were false positives. These results highlight the importance of controlling for Type I error using multiple outlier detection approaches, multitest corrections and replicate population comparisons. Assuming shell phenotypes have a genetic basis, our failure to detect outliers suggests that the life history of the target species needs to be considered when designing a genome scan.  相似文献   

8.
Adaptive evolutionary change is contingent on variation and selection; thus, understanding adaptive divergence and ultimately speciation requires information on both the genetic basis of adaptive traits as well as an understanding of the role of divergent natural selection on those traits. The lake whitefish (Coregonus clupeaformis) consists of several sympatric "dwarf" (limnetic) and normal (benthic) species pairs that co-inhabit northern postglacial lakes. These young species pairs have evolved independently and display parallelism in life history, behavioral, and morphological divergence associated with the use of distinct trophic resources. We identified phenotype-environment associations and determined the genetic architecture and the role of selection modulating population genetic divergence in sympatric dwarf and normal lake whitefish. The genetic architecture of 9 adaptive traits was analyzed in 2 hybrid backcrosses individually phenotyped throughout their life history. Significant quantitative trait loci (QTL) were associated with swimming behavior (habitat selection and predator avoidance), growth rate, morphology (condition factor and gill rakers), and life history (onset of maturity and fecundity). Genome scans among 4 natural sympatric pairs, using loci segregating in the map, revealed a signature of selection for 24 loci. Loci exhibiting a signature of selection were associated with QTL relative to other regions of the genome more often than expected by chance alone. Two parallel QTL outliers for growth and condition factor exhibited segregation distortion in both mapping families, supporting the hypothesis that adaptive divergence contributing to parallel reductions of gene flow among natural populations may cause genetic incompatibilities. Overall, these findings offer evidence that the genetic architecture of ecological speciation is associated with signatures of selection in nature, providing strong support for the hypothesis that divergent natural selection is currently maintaining adaptive differentiation and promoting ecological speciation in lake whitefish species pairs.  相似文献   

9.
A key objective of population genomics is to identify portions of the genome that have been shaped by natural selection rather than by neutral divergence. A previously recognized but underappreciated challenge to this objective is that observations of allele frequencies across genomes in natural populations often correspond to a single, unreplicated instance of the outcome of evolution. This is because the composition of each individual genomic region and population is expected to be the outcome of a unique array of evolutionary processes. Given a single observation, inference of the evolutionary processes that led to the observed state of a locus is associated with considerable uncertainty. This constraint on inference can be ameliorated by utilizing multi-allelic (e.g. DNA haplotypes) rather than bi-allelic markers, by analysing two or more populations with certain models and by utilizing studies of replicated experimental evolution. Future progress in population genomics will follow from research that recognizes the 'n = 1 constraint' and that utilizes appropriate and explicit evolutionary models for analysis.  相似文献   

10.
Rapid increases in global temperature are likely to impose strong directional selection on many plant populations, which must therefore adapt if they are to survive. Within populations, microgeographic genetic differentiation of individuals with respect to climate suggests that some populations may adapt to changing temperatures in the short-term through rapid changes in gene frequency. We used a genome scan to identify temperature-related adaptive differentiation of individuals of the tree species Fagus sylvatica. By combining molecular marker and dendrochronological data we assessed spatial and temporal variation in gene frequency at the locus identified as being under selection. We show that gene frequency at this locus varies predictably with temperature. The probability of the presence of the dominant marker allele shows a declining trend over the latter half of the 20th century, in parallel with rising temperatures in the region. Our results show that F. sylvatica populations may show some capacity for an in situ adaptive response to climate change. However as reported ongoing distributional changes demonstrate, this response is not enough to allow all populations of this species to persist in all of their current locations.  相似文献   

11.
Outlier detection methods were used to scan the genome of the boreal conifer black spruce (Picea mariana [Mill.] B.S.P.) for gene single-nucleotide polymorphisms (SNPs) potentially involved in adaptations to temperature and precipitation variations. The scan involved 583 SNPs from 313 genes potentially playing adaptive roles. Differentiation estimates among population groups defined following variation in temperature and precipitation were moderately high for adaptive quantitative characters such as the timing of budset or tree height (Q(ST) = 0.189-0.314). Average differentiation estimates for gene SNPs were null, with F(ST) values of 0.005 and 0.006, respectively, among temperature and precipitation population groups. Using two detection approaches, a total of 26 SNPs from 25 genes distributed among 11 of the 12 linkage groups of black spruce were detected as outliers with F(ST) as high as 0.078. Nearly half of the outlier SNPs were located in exons and half of those were nonsynonymous. The functional annotations of genes carrying outlier SNPs and regression analyses between the frequencies of these SNPs and climatic variables supported their implication in adaptive processes. Several genes carrying outlier SNPs belonged to gene families previously found to harbour outlier SNPs in a reproductively isolated but largely sympatric congeneric species, suggesting differential subfunctionalization of gene duplicates. Selection coefficient estimates (S) were moderate but well above the magnitude of drift (>1/N(e)), indicating that the signature of natural selection could be detected at the nucleotide level despite the recent establishment of these populations during the Holocene.  相似文献   

12.
Identifying local adaptation is crucial in conservation biology to define ecotypes and establish management guidelines. Local adaptation is often inferred from the detection of loci showing a high differentiation between populations, the so‐called FST outliers. Methods of detection of loci under selection are reputed to be robust in most spatial population models. However, using simulations we showed that FST outlier tests provided a high rate of false‐positives (up to 60%) in fractal environments such as river networks. Surprisingly, the number of sampled demes was correlated with parameters of population genetic structure, such as the variance of FSTs, and hence strongly influenced the rate of outliers. This unappreciated property of river networks therefore needs to be accounted for in genetic studies on adaptation and conservation of river organisms.  相似文献   

13.
Local adaptation to variable environments can generate clinal variation in morphological traits. Alternatively, similar patterns of clinal variation may be generated simply as a result of genetic drift/migration balance. Teasing apart these different processes is a continuing focus in evolutionary ecology. We compare genetic differentiation at molecular loci and quantitative traits to analyse the effect of these different processes in a morphological latitudinal cline of the barn swallow, Hirundo rustica, breeding across Europe. The results obtained show no structuring at neutral microsatellite loci, which contrasts with positive structuring at five quantitative morphometric traits. This supports the hypothesis that the observed morphometric cline in barn swallows is the result of selection acting in a spatially heterogeneous environment. © 2010 The Linnean Society of London, Biological Journal of the Linnean Society, 2010, 99 , 306–314.  相似文献   

14.
We investigated the performance of two of the most popular differentiation-based methods to detect loci under selection (dfdist/fdist and bayescan) in order to ascertain the average chromosome map distance between the detected outlier markers and the nearest loci under selection. We used a model of neutral markers genetically linked to selected loci (QTL) controlling a quantitative trait subject to divergent selection in two subpopulations connected by migration. The results are not particularly encouraging because for chromosome lengths above 0.5 morgan, at least 30% of outliers detected were positioned in chromosomes where QTL were absent, clearly denoting false positives. Outliers linked to QTL were on average closer to the nearest QTL than randomly chosen markers, but the methods showed a substantial uncertainty about the genetic association between markers and selected loci, as this association could be shown significantly only in a moderate number of replicates for most scenarios. At equal conditions, bayescan seemed to perform somewhat more efficiently than dfdist/fdist, with little difference between results for dominant and codominant markers.  相似文献   

15.
Abstract We know little about how natural selection on a species is altered when a closely related species consuming similar resources appears in its environment. In a pond experiment with threespine sticklebacks I tested the prediction that divergent natural selection between competitors is frequency-dependent, changing with the distribution of phe-notypes in the environment. Differential growth and survival of phenotypes in a target stickleback population were contrasted between two treatments. In one treatment an offshore zooplankton feeder (the limnetic stickleback species) was added to the same pond as the target. In the other treatment I added the benthic stickleback instead, a species adapted to feeding on invertebrates from sediments and inshore vegetation. The target population was ecologically and morphologically intermediate with phenotypic variance artificially inflated by hybridization. Growth rates of phenotypes within the target population differed between treatments as predicted by character displacement. The impact of adding a second species always fell most heavily on those phenotypes in the target population resembling the added species most closely. However, those individuals in the target population that most resembled the added species did not experience reduced survival. Instead, consistent survival differences between populations suggested the presence of an inshore –offshore gradient in mortality risk. These results provide further support for the hypothesis of character displacement in sympatric sticklebacks. They suggest that displacement along the resource gradient also led to divergence in vulnerability to agents of mortality, probably including predation.  相似文献   

16.
Cereal genes are classified into two distinct classes according to the guanine-cytosine(GC)content at the third codonsites(GC_3).Natural selection and mutation bias have been proposed to affect the GC content.However,there has beencontroversy about the cause of GC variation.Here,we characterized the GC content of 1092 paralogs and other single-copygenes in the duplicated chromosomal regions of the rice genome(ssp.indica)and classified the paralogs into GC_3-richand GC_3-poor groups.By referring to out-group sequences from Arabidopsis and maize,we confirmed that the averagesynonymous substitution rate of the GC_3-rich genes is significantly lower than that of the GC_3-poor genes.Furthermore,we explored the other possible factors corresponding to the GC variation including the length of coding sequences,thenumber of exons in each gene,the number of genes in each family,the location of genes on chromosomes and the proteinfunctions.Consequently,we propose that natural selection rather than mutation bias was the primary cause of the GCvariation.  相似文献   

17.
Parallel evolution of sexual isolation in sticklebacks   总被引:10,自引:0,他引:10  
Mechanisms of speciation are not well understood, despite decades of study. Recent work has focused on how natural and sexual selection cause sexual isolation. Here, we investigate the roles of divergent natural and sexual selection in the evolution of sexual isolation between sympatric species of threespine sticklebacks. We test the importance of morphological and behavioral traits in conferring sexual isolation and examine to what extent these traits have diverged in parallel between multiple, independently evolved species pairs. We use the patterns of evolution in ecological and mating traits to infer the likely nature of selection on sexual isolation. Strong parallel evolution implicates ecologically based divergent natural and/or sexual selection, whereas arbitrary directionality implicates nonecological sexual selection or drift. In multiple pairs we find that sexual isolation arises in the same way: assortative mating on body size and asymmetric isolation due to male nuptial color. Body size and color have diverged in a strongly parallel manner, similar to ecological traits. The data implicate ecologically based divergent natural and sexual selection as engines of speciation in this group.  相似文献   

18.
Understanding the genetic mechanisms of adaptive population divergence is one of the most fundamental endeavours in evolutionary biology and is becoming increasingly important as it will allow predictions about how organisms will respond to global environmental crisis. This is particularly important for the honey bee, a species of unquestionable ecological and economical importance that has been exposed to increasing human‐mediated selection pressures. Here, we conducted a single nucleotide polymorphism (SNP)‐based genome scan in honey bees collected across an environmental gradient in Iberia and used four FST‐based outlier tests to identify genomic regions exhibiting signatures of selection. Additionally, we analysed associations between genetic and environmental data for the identification of factors that might be correlated or act as selective pressures. With these approaches, 4.4% (17 of 383) of outlier loci were cross‐validated by four FST‐based methods, and 8.9% (34 of 383) were cross‐validated by at least three methods. Of the 34 outliers, 15 were found to be strongly associated with one or more environmental variables. Further support for selection, provided by functional genomic information, was particularly compelling for SNP outliers mapped to different genes putatively involved in the same function such as vision, xenobiotic detoxification and innate immune response. This study enabled a more rigorous consideration of selection as the underlying cause of diversity patterns in Iberian honey bees, representing an important first step towards the identification of polymorphisms implicated in local adaptation and possibly in response to recent human‐mediated environmental changes.  相似文献   

19.
Cereal genes are classified into two distinct classes according to the guanine-cytosine (GC) content at the third codon sites (GC3). Natural selection and mutation bias have been proposed to affect the GC content. However, there has been controversy about the cause of GC variation. Here, we characterized the GC content of 1 092 paralogs and other single-copy genes in the duplicated chromosomal regions of the rice genome (ssp. indica) and classified the paralogs into GC3-rich and GC3-poor groups. By referring to out-group sequences from Arabidopsis and maize, we confirmed that the average synonymous substitution rate of the GC3-rich genes is significantly lower than that of the GC3-poor genes. Furthermore, we explored the other possible factors corresponding to the GC variation including the length of coding sequences, the number of exons in each gene, the number of genes in each family, the location of genes on chromosomes and the protein functions. Consequently, we propose that natural selection rather than mutation bias was the primary cause of the GC variation.  相似文献   

20.
The role of natural selection in promoting reproductive isolation has received substantial renewed interest within the last two decades. As a consequence, the study of ecological speciation has become an extremely productive research area in modern evolutionary biology. Recent innovations in sequencing technologies offer an unprecedented opportunity to study the mechanisms involved in ecological speciation. Genome scans provide significant insights but have some important limitations; efforts are needed to integrate them with other approaches to make full use of the sequencing data deluge. An international conference ‘Advances in Ecological Speciation’ organized by the University of Porto (Portugal) aimed to review current progress in ecological speciation. Using some of the examples presented at the conference, we highlight the benefits of integrating ecological and genomic data and discuss different mechanisms of parallel evolution. Finally, future avenues of research are suggested to advance our knowledge concerning the role of natural selection in the establishment of reproductive isolation during ecological speciation.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号