首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.

Background

Population differentiation has proved to be effective for identifying loci under geographically localized positive selection, and has the potential to identify loci subject to balancing selection. We have previously investigated the pattern of genetic differentiation among human populations at 36.8 million genomic variants to identify sites in the genome showing high frequency differences. Here, we extend this dataset to include additional variants, survey sites with low levels of differentiation, and evaluate the extent to which highly differentiated sites are likely to result from selective or other processes.

Results

We demonstrate that while sites with low differentiation represent sampling effects rather than balancing selection, sites showing extremely high population differentiation are enriched for positive selection events and that one half may be the result of classic selective sweeps. Among these, we rediscover known examples, where we actually identify the established functional SNP, and discover novel examples including the genes ABCA12, CALD1 and ZNF804, which we speculate may be linked to adaptations in skin, calcium metabolism and defense, respectively.

Conclusions

We identify known and many novel candidate regions for geographically restricted positive selection, and suggest several directions for further research.  相似文献   

2.
Selection signals of Korean cattle might be attributed largely to artificial selection for meat quality. Rapidly increased intragenic markers of newly annotated genes in the bovine genome would help overcome limited findings of genetic markers associated with meat quality at the selection signals in a previous study. The present study examined genetic associations of marbling score (MS) with intragenic nucleotide variants at selection signals of Korean cattle. A total of 39 092 nucleotide variants of 407 Korean cattle were utilized in the association analysis. A total of 129 variants were selected within newly annotated genes in the bovine genome. Their genetic associations were analyzed using the mixed model with random polygenic effects based on identical-by-state genetic relationships among animals in order to control for spurious associations produced by population structure. Genetic associations of MS were found (P<3.88×10−4) with six intragenic nucleotide variants on bovine autosomes 3 (cache domain containing 1, CACHD1), 5 (like-glycosyltransferase, LARGE), 16 (cell division cycle 42 binding protein kinase alpha, CDC42BPA) and 21 (snurportin 1, SNUPN; protein tyrosine phosphatase, non-receptor type 9, PTPN9; chondroitin sulfate proteoglycan 4, CSPG4). In particular, the genetic associations with CDC42BPA and LARGE were confirmed using an independent data set of Korean cattle. The results implied that allele frequencies of functional variants and their proximity variants have been augmented by directional selection for greater MS and remain selection signals in the bovine genome. Further studies of fine mapping would be useful to incorporate favorable alleles in marker-assisted selection for MS of Korean cattle.  相似文献   

3.
Common genetic variants have been shown to explain a fraction of the inherited variation for many common diseases and quantitative traits, including height, a classic polygenic trait. The extent to which common variation determines the phenotype of highly heritable traits such as height is uncertain, as is the extent to which common variation is relevant to individuals with more extreme phenotypes. To address these questions, we studied 1,214 individuals from the top and bottom extremes of the height distribution (tallest and shortest ∼1.5%), drawn from ∼78,000 individuals from the HUNT and FINRISK cohorts. We found that common variants still influence height at the extremes of the distribution: common variants (49/141) were nominally associated with height in the expected direction more often than is expected by chance (p<5×10−28), and the odds ratios in the extreme samples were consistent with the effects estimated previously in population-based data. To examine more closely whether the common variants have the expected effects, we calculated a weighted allele score (WAS), which is a weighted prediction of height for each individual based on the previously estimated effect sizes of the common variants in the overall population. The average WAS is consistent with expectation in the tall individuals, but was not as extreme as expected in the shortest individuals (p<0.006), indicating that some of the short stature is explained by factors other than common genetic variation. The discrepancy was more pronounced (p<10−6) in the most extreme individuals (height<0.25 percentile). The results at the extreme short tails are consistent with a large number of models incorporating either rare genetic non-additive or rare non-genetic factors that decrease height. We conclude that common genetic variants are associated with height at the extremes as well as across the population, but that additional factors become more prominent at the shorter extreme.  相似文献   

4.
Controlling for background demographic effects is important for accurately identifying loci that have recently undergone positive selection. To date, the effects of demography have not yet been explicitly considered when identifying loci under selection during dog domestication. To investigate positive selection on the dog lineage early in the domestication, we examined patterns of polymorphism in six canid genomes that were previously used to infer a demographic model of dog domestication. Using an inferred demographic model, we computed false discovery rates (FDR) and identified 349 outlier regions consistent with positive selection at a low FDR. The signals in the top 100 regions were frequently centered on candidate genes related to brain function and behavior, including LHFPL3, CADM2, GRIK3, SH3GL2, MBP, PDE7B, NTAN1, and GLRA1. These regions contained significant enrichments in behavioral ontology categories. The 3rd top hit, CCRN4L, plays a major role in lipid metabolism, that is supported by additional metabolism related candidates revealed in our scan, including SCP2D1 and PDXC1. Comparing our method to an empirical outlier approach that does not directly account for demography, we found only modest overlaps between the two methods, with 60% of empirical outliers having no overlap with our demography-based outlier detection approach. Demography-aware approaches have lower-rates of false discovery. Our top candidates for selection, in addition to expanding the set of neurobehavioral candidate genes, include genes related to lipid metabolism, suggesting a dietary target of selection that was important during the period when proto-dogs hunted and fed alongside hunter-gatherers.  相似文献   

5.
6.
Environmental gradients influence the distribution and taxonomic composition of planktonic taxa, including Daphnia. In canyon-shaped reservoirs with pronounced horizontal gradients of food supply, predation pressure and other factors, not only species and interspecific hybrids but also clones within these taxa are non-randomly distributed. Using a long-term data set from a reservoir mostly dominated by a single Daphnia species, we evaluated whether intraspecific genetic differentiation can be frequently detected between upstream and downstream reservoir regions with different environmental conditions. We analysed variation at four allozyme loci (two species-specific and two polymorphic) to assess the taxonomic composition and intraspecific variation of Daphnia collected in different years (between 1995 and 2005) and periods of the growing season. D. galeata dominated in all samples, although other species and hybrids with D. galeata were also occasionally detected. Despite limited variation at the analysed loci, D. galeata from upstream and downstream regions were significantly genetically differentiated on seven out of twelve sampling dates. Although genetic drift in geographically distant subpopulations may contribute to differentiation, we presume that the observed patterns are primarily due to different selection regimes. We predict that a significant genetic differentiation within planktonic populations also occurs more frequently in natural water bodies.  相似文献   

7.
Strain selection and strain improvement are the first, and arguably most important, steps in the industrial production of biological compounds by microorganisms. While traditional methods of mutagenesis and selection have been effective in improving production of compounds at a commercial scale, the genetic changes underpinning the altered phenotypes have remained largely unclear. We utilized high-throughput Illumina short read sequencing of a wild Penicillium chrysogenum strain in order to make whole genome comparisons to a sequenced improved strain (WIS 54–1255). We developed an assembly-free method of identifying chromosomal rearrangements and validated the in silico predictions with a PCR-based assay and Sanger sequencing. Despite many rounds of mutagen treatment and artificial selection, WIS 54–1255 differs from its wild progenitor at only one of the identified rearrangements. We suggest that natural variants predisposed for high penicillin production were instrumental in the success of WIS 54–1255 as an industrial strain. In addition to finding a previously published inversion in the penicillin biosynthesis cluster, we located several genes related to penicillin production associated with these rearrangements. By comparing the configuration of rearrangement events among several historically important strains known to be high penicillin producers to a collection of recently isolated wild strains, we suggest that wild strains with rearrangements similar to those in known high penicillin producers may be viable candidates for further improvement efforts.  相似文献   

8.
There is an ongoing discussion in the literature on whether human mitochondrial DNA (mtDNA) evolves neutrally. There have been previous claims for natural selection on human mtDNA based on an excess of non-synonymous mutations and higher evolutionary persistence of specific mitochondrial mutations in Arctic populations. However, these findings were not supported by the reanalysis of larger datasets. Using a geographical framework, we perform the first direct test of the relative extent to which climate and past demography have shaped the current spatial distribution of mtDNA sequences worldwide. We show that populations living in colder environments have lower mitochondrial diversity and that the genetic differentiation between pairs of populations correlates with difference in temperature. These associations were unique to mtDNA; we could not find a similar pattern in any other genetic marker. We were able to identify two correlated non-synonymous point mutations in the ND3 and ATP6 genes characterized by a clear association with temperature, which appear to be plausible targets of natural selection producing the association with climate. The same mutations have been previously shown to be associated with variation in mitochondrial pH and calcium dynamics. Our results indicate that natural selection mediated by climate has contributed to shape the current distribution of mtDNA sequences in humans.  相似文献   

9.
Cancer stem cells (CSCs) represent a population of cancer cells that possess unique self-renewal and differentiation characteristics required for tumorigenesis and are resistant to chemotherapy-induced apoptosis. Lung CSCs can be enriched by several markers including drug-resistant side population (SP), CD133pos and ALDHhigh. Using human non-small cell lung adenocarcinoma cell lines and patient-derived primary tumor cells, we demonstrate that SP cells represent a subpopulation distinct from other cancer stem/progenitor cell (CS/PC) populations marked by CD133pos or ALDHhigh. The non-CS/PCs and CS/PCs of each subpopulation are interconvertible. Epithelial-mesenchymal transition (EMT) promotes the formation of CD133pos and ALDHhigh CS/PC subpopulations while suppressing the SP CS/PC subpopulation. Rac1 GTPase activity is significantly increased in cells that have undergone EMT, and targeting Rac1 is effective in inhibiting the dynamic conversion of non-CS/PCs to CS/PCs, as well as the CS/PC activity. These results imply that various subpopulations of CS/PCs and non-CS/PCs may achieve a stochastic equilibrium in a defined microenvironment, and eliminating multiple subpopulations of CS/PCs and effectively blocking non-CS/PC to CS/PC transition, by an approach such as targeting Rac1, can be a more effective therapy.  相似文献   

10.
Nonalcoholic fatty liver disease (NAFLD) clusters in families, but the only known common genetic variants influencing risk are near PNPLA3. We sought to identify additional genetic variants influencing NAFLD using genome-wide association (GWA) analysis of computed tomography (CT) measured hepatic steatosis, a non-invasive measure of NAFLD, in large population based samples. Using variance components methods, we show that CT hepatic steatosis is heritable (∼26%–27%) in family-based Amish, Family Heart, and Framingham Heart Studies (n = 880 to 3,070). By carrying out a fixed-effects meta-analysis of genome-wide association (GWA) results between CT hepatic steatosis and ∼2.4 million imputed or genotyped SNPs in 7,176 individuals from the Old Order Amish, Age, Gene/Environment Susceptibility-Reykjavik study (AGES), Family Heart, and Framingham Heart Studies, we identify variants associated at genome-wide significant levels (p<5×10−8) in or near PNPLA3, NCAN, and PPP1R3B. We genotype these and 42 other top CT hepatic steatosis-associated SNPs in 592 subjects with biopsy-proven NAFLD from the NASH Clinical Research Network (NASH CRN). In comparisons with 1,405 healthy controls from the Myocardial Genetics Consortium (MIGen), we observe significant associations with histologic NAFLD at variants in or near NCAN, GCKR, LYPLAL1, and PNPLA3, but not PPP1R3B. Variants at these five loci exhibit distinct patterns of association with serum lipids, as well as glycemic and anthropometric traits. We identify common genetic variants influencing CT–assessed steatosis and risk of NAFLD. Hepatic steatosis associated variants are not uniformly associated with NASH/fibrosis or result in abnormalities in serum lipids or glycemic and anthropometric traits, suggesting genetic heterogeneity in the pathways influencing these traits.  相似文献   

11.
Knowing how microevolutionary processes, such as genetic drift and natural selection, shape variation in adaptive traits is strategic for conservation measures. One way to estimate local adaptation is to compare divergences in quantitative traits (QST) and neutral loci (FST). Therefore, we have assessed the pattern of phenotypic and molecular genetic divergence among natural subpopulations of the fruit tree Eugenia dysenterica DC. A provenance and progeny test was performed to assess the quantitative traits of the subpopulations collected in a wide distribution area of the species in the Brazilian Cerrado. The sampled environments are in a biodiversity hotspot with heterogeneous soil and climate conditions. By associating quantitative trait variation in initial seedling development with neutral microsatellite marker variation, we tested the local adaptation of the traits by the QSTFST contrast. Genetic drift was prevalent in the phenotypic differentiation among the subpopulations, although the traits seedling emergence time and root green mass, which are relevant for adaptation to the Cerrado climate, showed signs of uniform selection. Our results suggest that E. dysenterica has a spatial genetic structure divided into two large groups, separated by a line that divides the Cerrado biome in a southwestern to northeastern direction. This structure must be taken into account for managing E. dysenterica genetic resources both for conservation and breeding purposes.  相似文献   

12.
The number of recombination events per meiosis varies extensively among individuals. This recombination phenotype differs between female and male, and also among individuals of each gender. In this study, we used high-density SNP genotypes of over 2,300 individuals and their offspring in two datasets to characterize recombination landscape and to map the genetic variants that contribute to variation in recombination phenotypes. We found six genetic loci that are associated with recombination phenotypes. Two of these (RNF212 and an inversion on chromosome 17q21.31) were previously reported in the Icelandic population, and this is the first replication in any other population. Of the four newly identified loci (KIAA1462, PDZK1, UGCG, NUB1), results from expression studies provide support for their roles in meiosis. Each of the variants that we identified explains only a small fraction of the individual variation in recombination. Notably, we found different sequence variants associated with female and male recombination phenotypes, suggesting that they are regulated by different genes. Characterization of genetic variants that influence natural variation in meiotic recombination will lead to a better understanding of normal meiotic events as well as of non-disjunction, the primary cause of pregnancy loss.  相似文献   

13.
Discovering local adaptation, its genetic underpinnings, and environmental drivers is important for conserving forest species. Ecological genomic approaches coupled with next‐generation sequencing are useful means to detect local adaptation and uncover its underlying genetic basis in nonmodel species. We report results from a study on flowering dogwood trees (Cornus florida L.) using genotyping by sequencing (GBS). This species is ecologically important to eastern US forests but is severely threatened by fungal diseases. We analyzed subpopulations in divergent ecological habitats within North Carolina to uncover loci under local selection and associated with environmental–functional traits or disease infection. At this scale, we tested the effect of incorporating additional sequencing before scaling for a broader examination of the entire range. To test for biases of GBS, we sequenced two similarly sampled libraries independently from six populations of three ecological habitats. We obtained environmental–functional traits for each subpopulation to identify associations with genotypes via latent factor mixed modeling (LFMM) and gradient forests analysis. To test whether heterogeneity of abiotic pressures resulted in genetic differentiation indicative of local adaptation, we evaluated Fst per locus while accounting for genetic differentiation between coastal subpopulations and Piedmont‐Mountain subpopulations. Of the 54 candidate loci with sufficient evidence of being under selection among both libraries, 28–39 were Arlequin–BayeScan Fst outliers. For LFMM, 45 candidates were associated with climate (of 54), 30 were associated with soil properties, and four were associated with plant health. Reanalysis of combined libraries showed that 42 candidate loci still showed evidence of being under selection. We conclude environment‐driven selection on specific loci has resulted in local adaptation in response to potassium deficiencies, temperature, precipitation, and (to a marginal extent) disease. High allele turnover along ecological gradients further supports the adaptive significance of loci speculated to be under selection.  相似文献   

14.
《PloS one》2012,7(12)
Meta-analyses of European populations has successfully identified genetic variants in over 100 loci associated with lipid levels, but our knowledge in other ethnicities remains limited. To address this, we performed dense genotyping of ∼2,000 candidate genes in 7,657 African Americans, 1,315 Hispanics and 841 East Asians, using the IBC array, a custom ∼50,000 SNP genotyping array. Meta-analyses confirmed 16 lipid loci previously established in European populations at genome-wide significance level, and found multiple independent association signals within these lipid loci. Initial discovery and in silico follow-up in 7,000 additional African American samples, confirmed two novel loci: rs5030359 within ICAM1 is associated with total cholesterol (TC) and low-density lipoprotein cholesterol (LDL-C) (p = 8.8×10−7 and p = 1.5×10−6 respectively) and a nonsense mutation rs3211938 within CD36 is associated with high-density lipoprotein cholesterol (HDL-C) levels (p = 13.5×10−12). The rs3211938-G allele, which is nearly absent in European and Asian populations, has been previously found to be associated with CD36 deficiency and shows a signature of selection in Africans and African Americans. Finally, we have evaluated the effect of SNPs established in European populations on lipid levels in multi-ethnic populations and show that most known lipid association signals span across ethnicities. However, differences between populations, especially differences in allele frequency, can be leveraged to identify novel signals, as shown by the discovery of ICAM1 and CD36 in the current report.  相似文献   

15.
DNA methylation is an important molecular-level phenotype that links genotypes and complex disease traits. Previous studies have found local correlation between genetic variants and DNA methylation levels (cis-meQTLs). However, general mechanisms underlying cis-meQTLs are unclear. We conducted a cis-meQTL analysis of the Genetics of Lipid Lowering Drugs and Diet Network data (n = 593). We found that over 80% of genetic variants at CpG sites (meSNPs) are meQTL loci (P-value < 10−9), and meSNPs account for over two thirds of the strongest meQTL signals (P-value < 10−200). Beyond direct effects on the methylation of the meSNP site, the CpG-disrupting allele of meSNPs were associated with lowered methylation of CpG sites located within 45 bp. The effect of meSNPs extends to as far as 10 kb and can contribute to the observed meQTL signals in the surrounding region, likely through correlated methylation patterns and linkage disequilibrium. Therefore, meSNPs are behind a large portion of observed meQTL signals and play a crucial role in the biological process linking genetic variation to epigenetic changes.  相似文献   

16.
The link between long-term host–parasite coevolution and genetic diversity is key to understanding genetic epidemiology and the evolution of resistance. The model of Red Queen host–parasite coevolution posits that high genetic diversity is maintained when rare host resistance variants have a selective advantage, which is believed to be the mechanistic basis for the extraordinarily high levels of diversity at disease-related genes such as the major histocompatibility complex in jawed vertebrates and R-genes in plants. The parasites that drive long-term coevolution are, however, often elusive. Here we present evidence for long-term balancing selection at the phenotypic (variation in resistance) and genomic (resistance locus) level in a particular host–parasite system: the planktonic crustacean Daphnia magna and the bacterium Pasteuria ramosa. The host shows widespread polymorphisms for pathogen resistance regardless of geographic distance, even though there is a clear genome-wide pattern of isolation by distance at other sites. In the genomic region of a previously identified resistance supergene, we observed consistent molecular signals of balancing selection, including higher genetic diversity, older coalescence times, and lower differentiation between populations, which set this region apart from the rest of the genome. We propose that specific long-term coevolution by negative-frequency-dependent selection drives this elevated diversity at the host''s resistance loci on an intercontinental scale and provide an example of a direct link between the host’s resistance to a virulent pathogen and the large-scale diversity of its underlying genes.  相似文献   

17.
Researchers have successfully applied exome sequencing to discover causal variants in selected individuals with familial, highly penetrant disorders. We demonstrate the utility of exome sequencing followed by imputation for discovering low-frequency variants associated with complex quantitative traits. We performed exome sequencing in a reference panel of 761 African Americans and then imputed newly discovered variants into a larger sample of more than 13,000 African Americans for association testing with the blood cell traits hemoglobin, hematocrit, white blood count, and platelet count. First, we illustrate the feasibility of our approach by demonstrating genome-wide-significant associations for variants that are not covered by conventional genotyping arrays; for example, one such association is that between higher platelet count and an MPL c.117G>T (p.Lys39Asn) variant encoding a p.Lys39Asn amino acid substitution of the thrombpoietin receptor gene (p = 1.5 × 10−11). Second, we identified an association between missense variants of LCT and higher white blood count (p = 4 × 10−13). Third, we identified low-frequency coding variants that might account for allelic heterogeneity at several known blood cell-associated loci: MPL c.754T>C (p.Tyr252His) was associated with higher platelet count; CD36 c.975T>G (p.Tyr325) was associated with lower platelet count; and several missense variants at the α-globin gene locus were associated with lower hemoglobin. By identifying low-frequency missense variants associated with blood cell traits not previously reported by genome-wide association studies, we establish that exome sequencing followed by imputation is a powerful approach to dissecting complex, genetically heterogeneous traits in large population-based studies.  相似文献   

18.
BackgroundIn wild plant populations, genetic divergence within continuous stands is common, sometimes at very short geographical scales. While restrictions to gene flow combined with local inbreeding and genetic drift may cause neutral differentiation among subpopulations, microgeographical variations in environmental conditions can drive adaptive divergence through natural selection at some targeted loci. Such phenomena have recurrently been observed in plant populations occurring across sharp environmental boundaries, but the interplay between selective processes and neutral genetic divergence has seldom been studied.MethodsWe assessed the extent of within-stand neutral and environmentally-driven divergence in the Neotropical tree Eperua falcate Aubl. (Fabaceae) through a genome-scan approach. Populations of this species grow in dense stands that cross the boundaries between starkly contrasting habitats. Within-stand phenotypic and candidate-gene divergence have already been proven, making this species a suitable model for the study of genome-wide microgeographic divergence. Thirty trees from each of two habitats (seasonally flooded swamps and well-drained plateaus) in two separate populations were genotyped using thousands of AFLPs markers. To avoid genotyping errors and increase marker reliability, each sample was genotyped twice and submitted to a rigorous procedure for data cleaning, which resulted in 1196 reliable and reproducible markers.ResultsDespite the short spatial distances, we detected within-populations genetic divergence, probably caused by neutral processes, such as restrictions in gene flow. Moreover, habitat-structured subpopulations belonging to otherwise continuous stands also diverge in relation to environmental variability and habitat patchiness: we detected convincing evidence of divergent selection at the genome-wide level and for a fraction of the analyzed loci (comprised between 0.25% and 1.6%). Simulations showed that the levels of differentiation for these outliers are compatible with scenarios of strong divergent selection.  相似文献   

19.
Genetic diversity, especially at genes important for immune functioning within the Major Histocompatibility Complex (MHC), has been associated with fitness-related traits, including disease resistance, in many species. Recently, genetic diversity has been associated with mate preferences in humans. Here we asked whether these preferences are adaptive in terms of obtaining healthier mates. We investigated whether genetic diversity (heterozygosity and standardized mean d2) at MHC and nonMHC microsatellite loci, predicted health in 153 individuals. Individuals with greater allelic diversity (d2) at nonMHC loci and at one MHC locus, linked to HLA-DRB1, reported fewer symptoms over a four-month period than individuals with lower d2. In contrast, there were no associations between MHC or nonMHC heterozygosity and health. NonMHC-d2 has previously been found to predict male preferences for female faces. Thus, the current findings suggest that nonMHC diversity may play a role in both natural and sexual selection acting on human populations.  相似文献   

20.
While progress has been made in identifying common genetic variants associated with human diseases, for most of common complex diseases, the identified genetic variants only account for a small proportion of heritability. Challenges remain in finding additional unknown genetic variants predisposing to complex diseases. With the advance in next-generation sequencing technologies, sequencing studies have become commonplace in genetic research. The ongoing exome-sequencing and whole-genome-sequencing studies generate a massive amount of sequencing variants and allow researchers to comprehensively investigate their role in human diseases. The discovery of new disease-associated variants can be enhanced by utilizing powerful and computationally efficient statistical methods. In this paper, we propose a functional analysis of variance (FANOVA) method for testing an association of sequence variants in a genomic region with a qualitative trait. The FANOVA has a number of advantages: (1) it tests for a joint effect of gene variants, including both common and rare; (2) it fully utilizes linkage disequilibrium and genetic position information; and (3) allows for either protective or risk-increasing causal variants. Through simulations, we show that FANOVA outperform two popularly used methods – SKAT and a previously proposed method based on functional linear models (FLM), – especially if a sample size of a study is small and/or sequence variants have low to moderate effects. We conduct an empirical study by applying three methods (FANOVA, SKAT and FLM) to sequencing data from Dallas Heart Study. While SKAT and FLM respectively detected ANGPTL 4 and ANGPTL 3 associated with obesity, FANOVA was able to identify both genes associated with obesity.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号