首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Different populations suffer from different rates of obesity and type-2 diabetes (T2D). Little is known about the genetic or adaptive component, if any, that underlies these differences. Given the cultural, geographic, and dietary variation that accumulated among humans over the last 60,000 years, we examined whether loci identified by genome-wide association studies for these traits have been subject to recent selection pressures. Using genome-wide SNP data on 938 individuals in 53 populations from the Human Genome Diversity Panel, we compare population differentiation and haplotype patterns at these loci to the rest of the genome. Using an “expanding window” approach (100–1,600 kb) for the individual loci as well as the loci as ensembles, we find a high degree of differentiation for the ensemble of T2D loci. This differentiation is most pronounced for East Asians and sub-Saharan Africans, suggesting that these groups experienced natural selection at loci associated with T2D. Haplotype analysis suggests an excess of obesity loci with evidence of recent positive selection among South Asians and Europeans, compared to sub-Saharan Africans and Native Americans. We also identify individual loci that may have been subjected to natural selection, such as the T2D locus, HHEX, which displays both elevated differentiation and extended haplotype homozygosity in comparisons of East Asians with other groups. Our findings suggest that there is an evolutionary genetic basis for population differences in these traits, and we have identified potential group-specific genetic risk factors.  相似文献   

2.
When a selective sweep occurs in the chromosomal region around a target gene in two populations that have recently separated, it produces three dramatic genomic consequences: 1) decreased multi-locus heterozygosity in the region; 2) elevated or diminished genetic divergence (F(ST)) of multiple polymorphic variants adjacent to the selected locus between the divergent populations, due to the alternative fixation of alleles; and 3) a consequent regional increase in the variance of F(ST) (S(2)F(ST)) for the same clustered variants, due to the increased alternative fixation of alleles in the loci surrounding the selection target. In the first part of our study, to search for potential targets of directional selection, we developed and validated a resampling-based computational approach; we then scanned an array of 31 different-sized moving windows of SNP variants (5-65 SNPs) across the human genome in a set of European and African American population samples with 183,997 SNP loci after correcting for the recombination rate variation. The analysis revealed 180 regions of recent selection with very strong evidence in either population or both. In the second part of our study, we compared the newly discovered putative regions to those sites previously postulated in the literature, using methods based on inspecting patterns of linkage disequilibrium, population divergence and other methodologies. The newly found regions were cross-validated with those found in nine other studies that have searched for selection signals. Our study was replicated especially well in those regions confirmed by three or more studies. These validated regions were independently verified, using a combination of different methods and different databases in other studies, and should include fewer false positives. The main strength of our analysis method compared to others is that it does not require dense genotyping and therefore can be used with data from population-based genome SNP scans from smaller studies of humans or other species.  相似文献   

3.
Throughout human evolutionary history, large-scale migrations have led to intermixing (i.e., admixture) between previously separated human groups. Although classical and recent work have shown that studying admixture can yield novel historical insights, the extent to which this process contributed to adaptation remains underexplored. Here, we introduce a novel statistical model, specific to admixed populations, that identifies loci under selection while determining whether the selection likely occurred post-admixture or prior to admixture in one of the ancestral source populations. Through extensive simulations, we show that this method is able to detect selection, even in recently formed admixed populations, and to accurately differentiate between selection occurring in the ancestral or admixed population. We apply this method to genome-wide SNP data of ∼4,000 individuals in five admixed Latin American cohorts from Brazil, Chile, Colombia, Mexico, and Peru. Our approach replicates previous reports of selection in the human leukocyte antigen region that are consistent with selection post-admixture. We also report novel signals of selection in genomic regions spanning 47 genes, reinforcing many of these signals with an alternative, commonly used local-ancestry-inference approach. These signals include several genes involved in immunity, which may reflect responses to endemic pathogens of the Americas and to the challenge of infectious disease brought by European contact. In addition, some of the strongest signals inferred to be under selection in the Native American ancestral groups of modern Latin Americans overlap with genes implicated in energy metabolism phenotypes, plausibly reflecting adaptations to novel dietary sources available in the Americas.  相似文献   

4.
In recent years it has emerged that structural variants have a substantial impact on genomic variation. Inversion polymorphisms represent a significant class of structural variant, and despite the challenges in their detection, data on inversions in the human genome are increasing rapidly. Statistical methods for inferring parameters such as the recombination rate and the selection coefficient have generally been developed without accounting for the presence of inversions. Here we exploit new software for simulating inversions in population genetic data, invertFREGENE, to assess the potential impact of inversions on such methods. Using data simulated by invertFREGENE, as well as real data from several sources, we test whether large inversions have a disruptive effect on widely applied population genetics methods for inferring recombination rates, for detecting selection, and for controlling for population structure in genome-wide association studies (GWAS). We find that recombination rates estimated by LDhat are biased downward at inversion loci relative to the true contemporary recombination rates at the loci but that recombination hotspots are not falsely inferred at inversion breakpoints as may have been expected. We find that the integrated haplotype score (iHS) method for detecting selection appears robust to the presence of inversions. Finally, we observe a strong bias in the genome-wide results of principal components analysis (PCA), used to control for population structure in GWAS, in the presence of even a single large inversion, confirming the necessity to thin SNPs by linkage disequilibrium at large physical distances to obtain unbiased results.  相似文献   

5.
Celiac disease is a common small intestinal inflammatory condition induced by wheat gluten and related proteins from rye and barley. Left untreated, the clinical presentation of CD can include failure to thrive, malnutrition, and distension in juveniles. The disease can additionally lead to vitamin deficiencies, anemia, and osteoporosis. Therefore, CD potentially negatively affected fitness in past populations utilizing wheat, barley, and rye. Previous analyses of CD risk variants have uncovered evidence for positive selection on some of these loci. These studies also suggest the possibility that risk for common autoimmune conditions such as CD may be the result of positive selection on immune related loci in the genome to fight infection. Under this evolutionary scenario, disease phenotypes may be a trade-off from positive selection on immunity. If this hypothesis is generally true, we can expect to find a signal of natural selection when we survey across the network of loci known to influence CD risk. This study examines the non-HLA autosomal network of gene loci associated with CD risk in Europe. We reject the null hypothesis of neutrality on this network of CD risk loci. Additionally, we can localize evidence of selection in time and space by adding information from the genome of the Tyrolean Iceman. While we can show significant differentiation between continental regions across the CD network, the pattern of evidence is not consistent with primarily recent (Holocene) selection across this network in Europe. Further localization of ancient selection on this network may illuminate the ecological pressures acting on the immune system during this critically interesting phase of our evolution.  相似文献   

6.
Recent genome sequencing studies with large sample sizes in humans have discovered a vast quantity of low-frequency variants, providing an important source of information to analyze how selection is acting on human genetic variation. In order to estimate the strength of natural selection acting on low-frequency variants, we have developed a likelihood-based method that uses the lengths of pairwise identity-by-state between haplotypes carrying low-frequency variants. We show that in some nonequilibrium populations (such as those that have had recent population expansions) it is possible to distinguish between positive or negative selection acting on a set of variants. With our new framework, one can infer a fixed selection intensity acting on a set of variants at a particular frequency, or a distribution of selection coefficients for standing variants and new mutations. We show an application of our method to the UK10K phased haplotype dataset of individuals.  相似文献   

7.
Genome-wide association studies (GWAS) have identified ∼100 loci associated with blood lipid levels, but much of the trait heritability remains unexplained, and at most loci the identities of the trait-influencing variants remain unknown. We conducted a trans-ethnic fine-mapping study at 18, 22, and 18 GWAS loci on the Metabochip for their association with triglycerides (TG), high-density lipoprotein cholesterol (HDL-C), and low-density lipoprotein cholesterol (LDL-C), respectively, in individuals of African American (n = 6,832), East Asian (n = 9,449), and European (n = 10,829) ancestry. We aimed to identify the variants with strongest association at each locus, identify additional and population-specific signals, refine association signals, and assess the relative significance of previously described functional variants. Among the 58 loci, 33 exhibited evidence of association at P<1×10−4 in at least one ancestry group. Sequential conditional analyses revealed that ten, nine, and four loci in African Americans, Europeans, and East Asians, respectively, exhibited two or more signals. At these loci, accounting for all signals led to a 1.3- to 1.8-fold increase in the explained phenotypic variance compared to the strongest signals. Distinct signals across ancestry groups were identified at PCSK9 and APOA5. Trans-ethnic analyses narrowed the signals to smaller sets of variants at GCKR, PPP1R3B, ABO, LCAT, and ABCA1. Of 27 variants reported previously to have functional effects, 74% exhibited the strongest association at the respective signal. In conclusion, trans-ethnic high-density genotyping and analysis confirm the presence of allelic heterogeneity, allow the identification of population-specific variants, and limit the number of candidate SNPs for functional studies.  相似文献   

8.
Genome-wide association studies (GWAS) have in recent years discovered thousands of associated markers for hundreds of phenotypes. However, associated loci often only explain a relatively small fraction of heritability and the link between association and causality has yet to be uncovered for most loci. Rare causal variants have been suggested as one scenario that may partially explain these shortcomings. Specifically, Dickson et al. recently reported simulations of rare causal variants that lead to association signals of common, tag single nucleotide polymorphisms, dubbed "synthetic associations". However, an open question is what practical implications synthetic associations have for GWAS. Here, we explore the signatures exhibited by such "synthetic associations" and their implications based on patterns of genetic variation observed in human populations, thus accounting for human evolutionary history -a force disregarded in previous simulation studies. This is made possible by human population genetic data from HapMap 3 consisting of both resequencing and array-based genotyping data for the same set of individuals from multiple populations. We report that synthetic associations tend to be further away from the underlying risk alleles compared to "natural associations" (i.e. associations due to underlying common causal variants), but to a much lesser extent than previously predicted, with both the age and the effect size of the risk allele playing a part in this phenomenon. We find that while a synthetic association has a lower probability of capturing causal variants within its linkage disequilibrium block, sequencing around the associated variant need not extend substantially to have a high probability of capturing at least one causal variant. We also show that the minor allele frequency of synthetic associations is lower than of natural associations for most, but not all, loci that we explored. Finally, we find the variance in associated allele frequency to be a potential indicator of synthetic associations.  相似文献   

9.
Hedrick PW 《Heredity》2011,107(4):283-304
The high mortality and widespread impact of malaria have resulted in this disease being the strongest evolutionary selective force in recent human history, and genes that confer resistance to malaria provide some of the best-known case studies of strong positive selection in modern humans. I begin by reviewing JBS Haldane''s initial contribution to the potential of malaria genetic resistance in humans. Further, I discuss the population genetics aspects of many of the variants, including globin, G6PD deficiency, Duffy, ovalocytosis, ABO and human leukocyte antigen variants. Many of the variants conferring resistance to malaria are ‘loss-of-function'' mutants and appear to be recent polymorphisms from the last 5000–10 000 years or less. I discuss estimation of selection coefficients from case–control data and make predictions about the change for S, C and G6PD-deficiency variants. In addition, I consider the predicted joint changes when the two β-globin alleles S and C are both variable in the same population and when there is a variation for α-thalassemia and S, two unlinked, but epistatic variants. As more becomes known about genes conferring genetic resistance to malaria in humans, population genetics approaches can contribute both to investigating past selection and predicting the consequences in future generations for these variants.  相似文献   

10.
The identification of the causative genetic variants in quantitative trait loci (QTL) influencing phenotypic traits is challenging, especially in crosses between outbred strains. We have previously identified several QTL influencing tameness and aggression in a cross between two lines of wild-derived, outbred rats (Rattus norvegicus) selected for their behavior towards humans. Here, we use targeted sequence capture and massively parallel sequencing of all genes in the strongest QTL in the founder animals of the cross. We identify many novel sequence variants, several of which are potentially functionally relevant. The QTL contains several regions where either the tame or the aggressive founders contain no sequence variation, and two regions where alternative haplotypes are fixed between the founders. A re-analysis of the QTL signal showed that the causative site is likely to be fixed among the tame founder animals, but that several causative alleles may segregate among the aggressive founder animals. Using a formal test for the detection of positive selection, we find 10 putative positively selected regions, some of which are close to genes known to influence behavior. Together, these results show that the QTL is probably not caused by a single selected site, but may instead represent the joint effects of several sites that were targets of polygenic selection.  相似文献   

11.
The recent advent of high-throughput sequencing and genotyping technologies makes it possible to produce, easily and cost effectively, large amounts of detailed data on the genotype composition of populations. Detecting locus-specific effects may help identify those genes that have been, or are currently, targeted by natural selection. How best to identify these selected regions, loci, or single nucleotides remains a challenging issue. Here, we introduce a new model-based method, called SelEstim, to distinguish putative selected polymorphisms from the background of neutral (or nearly neutral) ones and to estimate the intensity of selection at the former. The underlying population genetic model is a diffusion approximation for the distribution of allele frequency in a population subdivided into a number of demes that exchange migrants. We use a Markov chain Monte Carlo algorithm for sampling from the joint posterior distribution of the model parameters, in a hierarchical Bayesian framework. We present evidence from stochastic simulations, which demonstrates the good power of SelEstim to identify loci targeted by selection and to estimate the strength of selection acting on these loci, within each deme. We also reanalyze a subset of SNP data from the Stanford HGDP–CEPH Human Genome Diversity Cell Line Panel to illustrate the performance of SelEstim on real data. In agreement with previous studies, our analyses point to a very strong signal of positive selection upstream of the LCT gene, which encodes for the enzyme lactase–phlorizin hydrolase and is associated with adult-type hypolactasia. The geographical distribution of the strength of positive selection across the Old World matches the interpolated map of lactase persistence phenotype frequencies, with the strongest selection coefficients in Europe and in the Indus Valley.  相似文献   

12.
It was expected that studies of electrophoretic variability in natural populations would resolve longstanding controversies concerning the form of natural selection and its effect on genetic variance in fitness. Recent studies of fitness components for allozymes in E. coli and Drosophila, where genetic backgrounds have been rigidly controlled, and experiments designed to detect small selection coefficients, suggest that selection is much weaker than earlier investigations would indicate. However, perturbing the metabolic background associated with specific loci often allows functional differences to be amplified to an experimentally measurable level. Frequencies of null activity variants in natural populations indicate that the fitness consequences of reduced activity in heterozygoles are probably very small. These results are supported by recent theoretical considerations suggesting that the activity variation associated with electrophoretic variation will have little effect on overall flux in many pathways.  相似文献   

13.
Analyses aimed at identifying genes that have been targeted by past selection provide a powerful means for investigating the molecular basis of adaptive differentiation. In the case of crop plants, such studies have the potential to not only shed light on important evolutionary processes, but also to identify genes of agronomic interest. In this study, we test for evidence of positive selection at the DNA sequence level in a set of candidate genes previously identified in a genome-wide scan for genotypic evidence of selection during the evolution of cultivated sunflower. In the majority of cases, we were able to confirm the effects of selection in shaping diversity at these loci. Notably, the genes that were found to be under selection via our sequence-based analyses were devoid of variation in the cultivated sunflower gene pool. This result confirms a possible strategy for streamlining the search for adaptively-important loci process by pre-screening the derived population to identify the strongest candidates before sequencing them in the ancestral population.  相似文献   

14.
Spatially varying selection can lead to population‐specific adaptation, which is often recognized at the phenotypic level; however, the genetic evidence is weaker in many groups of organisms. In plants, environmental shifts that occur due to colonization of a novel environment may require adaptive changes in the timing of growth and flowering, which are often governed by location‐specific environmental cues such as day length. We studied locally varying selection in 19 flowering time loci in nine populations of the perennial herb Arabidopsis lyrata, which has a wide but patchy distribution in temperate and boreal regions of the northern hemisphere. The populations differ in their recent population demographic and colonization histories and current environmental conditions, especially in the growing season length. We searched for population‐specific molecular signatures of directional selection by comparing a set of candidate flowering time loci with a genomic reference set within each population using multiple approaches and contrasted the patterns of different populations. The candidate loci possessed approximately 20% of the diversity of the reference loci. On average the flowering time loci had more rare alleles (a smaller Tajima's D) and an excess of highly differentiated sites relative to the reference, suggesting positive selection. The strongest signal of selection was detected in photoperiodic pathway loci in the colonizing populations of Northwestern Europe, whereas no evidence of positive selection was detected in the Central European populations. These findings emphasized the population‐specific nature of selection and suggested that photoperiodic adaptation was important during postglacial colonization of the species.  相似文献   

15.
Glinka S  Ometto L  Mousset S  Stephan W  De Lorenzo D 《Genetics》2003,165(3):1269-1278
Demography and selection have been recognized for their important roles in shaping patterns of nucleotide variability. To investigate the relative effects of these forces in the genome of Drosophila melanogaster, we used a multi-locus scan (105 fragments) of X-linked DNA sequence variation in a putatively ancestral African and a derived European population. Surprisingly, we found evidence for a recent size expansion in the African population, i.e., a significant excess of singletons at a chromosome-wide level. In the European population, such an excess was not detected. In contrast to the African population, we found evidence for positive natural selection in the European sample: (i) a large number of loci with low levels of variation and (ii) a significant excess of derived variants at the low-variation loci that are fixed in the European sample but rare in the African population. These results are consistent with the hypothesis that the European population has experienced frequent selective sweeps in the recent past during its adaptation to new habitats. Our study shows the advantages of a genomic approach (over a locus-specific analysis) in disentangling demographic and selective forces.  相似文献   

16.
Humans inhabit a remarkably diverse range of environments, and adaptation through natural selection has likely played a central role in the capacity to survive and thrive in extreme climates. Unlike numerous studies that used only population genetic data to search for evidence of selection, here we scan the human genome for selection signals by identifying the SNPs with the strongest correlations between allele frequencies and climate across 61 worldwide populations. We find a striking enrichment of genic and nonsynonymous SNPs relative to non-genic SNPs among those that are strongly correlated with these climate variables. Among the most extreme signals, several overlap with those from GWAS, including SNPs associated with pigmentation and autoimmune diseases. Further, we find an enrichment of strong signals in gene sets related to UV radiation, infection and immunity, and cancer. Our results imply that adaptations to climate shaped the spatial distribution of variation in humans.  相似文献   

17.
Multilocus surveys of sequence variation can be used to identify targets of directional selection, which are expected to have reduced levels of variation. Following a population bottleneck, the signal of directional selection may be hard to detect because many loci may have low variation by chance and the frequency spectrum of variation may be perturbed in ways that resemble the effects of selection. Cultivated Sorghum bicolor contains a subset of the genetic diversity found in its wild ancestor(s) due to the combined effects of a domestication bottleneck and human selection on traits associated with agriculture. As a framework for distinguishing between the effects of demography and selection, we sequenced 204 loci in a diverse panel of 17 cultivated S. bicolor accessions. Genomewide patterns of diversity depart strongly from equilibrium expectations with regard to the variance of the number of segregating sites, the site frequency spectrum, and haplotype configuration. Furthermore, gene genealogies of most loci with an excess of low frequency variants and/or an excess of segregating sites do not show the characteristic signatures of directional and diversifying selection, respectively. A simple bottleneck model provides an improved but inadequate fit to the data, suggesting the action of other population-level factors, such as population structure and migration. Despite a known history of recent selection, we find little evidence for directional selection, likely due to low statistical power and lack of an appropriate null model.  相似文献   

18.
Genome-wide association studies (GWAS) have identified 14 tagging single nucleotide polymorphisms (tagSNPs) that are associated with the risk of colorectal cancer (CRC), and several of these tagSNPs are near bone morphogenetic protein (BMP) pathway loci. The penalty of multiple testing implicit in GWAS increases the attraction of complementary approaches for disease gene discovery, including candidate gene- or pathway-based analyses. The strongest candidate loci for additional predisposition SNPs are arguably those already known both to have functional relevance and to be involved in disease risk. To investigate this proposition, we searched for novel CRC susceptibility variants close to the BMP pathway genes GREM1 (15q13.3), BMP4 (14q22.2), and BMP2 (20p12.3) using sample sets totalling 24,910 CRC cases and 26,275 controls. We identified new, independent CRC predisposition SNPs close to BMP4 (rs1957636, P = 3.93×10(-10)) and BMP2 (rs4813802, P = 4.65×10(-11)). Near GREM1, we found using fine-mapping that the previously-identified association between tagSNP rs4779584 and CRC actually resulted from two independent signals represented by rs16969681 (P = 5.33×10(-8)) and rs11632715 (P = 2.30×10(-10)). As low-penetrance predisposition variants become harder to identify-owing to small effect sizes and/or low risk allele frequencies-approaches based on informed candidate gene selection may become increasingly attractive. Our data emphasise that genetic fine-mapping studies can deconvolute associations that have arisen owing to independent correlation of a tagSNP with more than one functional SNP, thus explaining some of the apparently missing heritability of common diseases.  相似文献   

19.
Levels and patterns of human DNA sequence variation vary widely among loci. However, some of this variation may be due to the different populations used in different studies. So far, few studies of diverse human populations have compared different genetic loci for the same samples of populations and individuals. Here, we present new polymorphism data from intron 4 of the Factor IX gene (FIX) sequenced in diverse Old World populations. An explicit comparison is made with another X-linked gene, PDHA1, for which the sampling of individuals was very similar. Despite having a similar amount of divergence from chimpanzees, as do other nuclear genes, FIX has comparatively much less DNA sequence variation among humans. Nucleotide diversity at FIX is the lowest among the existing non-Y chromosome nuclear gene datasets and is less than 10% of the diversity found at PDHA1. Estimates of effective population size based on FIX are 8,558, about half of the value obtained for PDHA1, and the time to the most recent common ancestry among human FIX gene copies (282,000 years) is one of the most recent estimates reported for human genes. Analyses presented here suggest a history for the FIX region that includes recent positive directional selection, or background, selection. The general conclusion emerging is that very large variations can exist between the histories of similar genomic regions, even when sampling differences are minimized.  相似文献   

20.
All non-human great apes are endangered in the wild, and it is therefore important to gain an understanding of their demography and genetic diversity. Whole genome assembly projects have provided an invaluable foundation for understanding genetics in all four genera, but to date genetic studies of multiple individuals within great ape species have largely been confined to mitochondrial DNA and a small number of other loci. Here, we present a genome-wide survey of genetic variation in gorillas using a reduced representation sequencing approach, focusing on the two lowland subspecies. We identify 3,006,670 polymorphic sites in 14 individuals: 12 western lowland gorillas (Gorilla gorilla gorilla) and 2 eastern lowland gorillas (Gorilla beringei graueri). We find that the two species are genetically distinct, based on levels of heterozygosity and patterns of allele sharing. Focusing on the western lowland population, we observe evidence for population substructure, and a deficit of rare genetic variants suggesting a recent episode of population contraction. In western lowland gorillas, there is an elevation of variation towards telomeres and centromeres on the chromosomal scale. On a finer scale, we find substantial variation in genetic diversity, including a marked reduction close to the major histocompatibility locus, perhaps indicative of recent strong selection there. These findings suggest that despite their maintaining an overall level of genetic diversity equal to or greater than that of humans, population decline, perhaps associated with disease, has been a significant factor in recent and long-term pressures on wild gorilla populations.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号