首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Genome-wide association study (GWAS) data on a disease are increasingly available from multiple related populations. In this scenario, meta-analyses can improve power to detect homogeneous genetic associations, but if there exist ancestry-specific effects, via interactions on genetic background or with a causal effect that co-varies with genetic background, then these will typically be obscured. To address this issue, we have developed a robust statistical method for detecting susceptibility gene-ancestry interactions in multi-cohort GWAS based on closely-related populations. We use the leading principal components of the empirical genotype matrix to cluster individuals into “ancestry groups” and then look for evidence of heterogeneous genetic associations with disease or other trait across these clusters. Robustness is improved when there are multiple cohorts, as the signal from true gene-ancestry interactions can then be distinguished from gene-collection artefacts by comparing the observed interaction effect sizes in collection groups relative to ancestry groups. When applied to colorectal cancer, we identified a missense polymorphism in iron-absorption gene CYBRD1 that associated with disease in individuals of English, but not Scottish, ancestry. The association replicated in two additional, independently-collected data sets. Our method can be used to detect associations between genetic variants and disease that have been obscured by population genetic heterogeneity. It can be readily extended to the identification of genetic interactions on other covariates such as measured environmental exposures. We envisage our methodology being of particular interest to researchers with existing GWAS data, as ancestry groups can be easily defined and thus tested for interactions.  相似文献   

2.
Schizophrenia is a severe and highly heritable neuropsychiatric disorder. Recent genetic analyses including genome-wide association studies (GWAS) have implicated multiple genome-wide significant variants for schizophrenia among European populations. However, many of these risk variants were not largely validated in other populations of different ancestry such as Asians. To validate whether these European GWAS significant loci are associated with schizophrenia in Asian populations, we conducted a systematic literature search and meta-analyses on 19 single nucleotide polymorphisms (SNPs) in Asian populations by combining all available case-control and family-based samples, including up to 30,000 individuals. We employed classical fixed (or random) effects inverse variance weighted methods to calculate summary odds ratios (ORs) and 95 % confidence intervals (CIs). Among the 19 GWAS loci, we replicated the risk associations of nine markers (e.g., SNPs at VRK2, ITIH3/4, NDST3, NOTCH4) surpassing significance level (two-tailed P?<?0.05), and three additional SNPs in MIR137 and ZNF804A also showed trend associations (one-tailed P?<?0.05). These risk associations are in the same directions of allelic effects between Asian replication samples and initial European GWAS findings, and the successful replications of these GWAS loci in a different ethnic group provide stronger evidence for their clinical associations with schizophrenia. Further studies, focusing on the molecular mechanisms of these GWAS significant loci, will become increasingly important for understanding of the pathogenesis to schizophrenia.  相似文献   

3.
Recent genome-wide association studies (GWAS) have successfully identified several gene loci associated with multiple sclerosis (MS) susceptibility, severity or interferon-beta (IFN-ß) response. However, due to the nature of these studies, the functional relevance of these loci is not yet fully understood. We have utilized a systems biology based approach to explore the genetic interactomes of these MS related traits. We hypothesised that genes and pathways associated with the 3 MS related phenotypes might interact collectively to influence the heterogeneity and unpredictable clinical outcomes observed. Individual genetic interactomes for each trait were constructed and compared, followed by prioritization of common interactors based on their frequencies. Pathway enrichment analyses were performed to highlight shared functional pathways. Biologically relevant genes ABL1, GRB2, INPP5D, KIF1B, PIK3R1, PLCG1, PRKCD, SRC, TUBA1A and TUBA4A were identified as common to all 3 MS phenotypes. We observed that the highest number of first degree interactors were shared between MS susceptibility and MS severity (p = 1.34×10−79) with UBC as the most prominent first degree interactor for this phenotype pair from the prioritisation analysis. As expected, pairwise comparisons showed that MS susceptibility and severity interactomes shared the highest number of pathways. Pathways from signalling molecules and interaction, and signal transduction categories were found to be highest shared pathways between 3 phenotypes. Finally, FYN was the most common first degree interactor in the MS drugs-gene network. By applying the systems biology based approach, additional significant information can be extracted from GWAS. Results of our interactome analyses are complementary to what is already known in the literature and also highlight some novel interactions which await further experimental validation. Overall, this study illustrates the potential of using a systems biology based approach in an attempt to unravel the biological significance of gene loci identified in large GWAS.  相似文献   

4.
Blood lipid concentrations are heritable risk factors associated with atherosclerosis and cardiovascular diseases. Lipid traits exhibit considerable variation among populations of distinct ancestral origin as well as between individuals within a population. We performed association analyses to identify genetic loci influencing lipid concentrations in African American and Hispanic American women in the Women’s Health Initiative SNP Health Association Resource. We validated one African-specific high-density lipoprotein cholesterol locus at CD36 as well as 14 known lipid loci that have been previously implicated in studies of European populations. Moreover, we demonstrate striking similarities in genetic architecture (loci influencing the trait, direction and magnitude of genetic effects, and proportions of phenotypic variation explained) of lipid traits across populations. In particular, we found that a disproportionate fraction of lipid variation in African Americans and Hispanic Americans can be attributed to genomic loci exhibiting statistical evidence of association in Europeans, even though the precise genes and variants remain unknown. At the same time, we found substantial allelic heterogeneity within shared loci, characterized both by population-specific rare variants and variants shared among multiple populations that occur at disparate frequencies. The allelic heterogeneity emphasizes the importance of including diverse populations in future genetic association studies of complex traits such as lipids; furthermore, the overlap in lipid loci across populations of diverse ancestral origin argues that additional knowledge can be gleaned from multiple populations.  相似文献   

5.
The range of hosts a pathogen infects (host specificity) is a key element of disease risk that may be influenced by both shared phylogenetic history and shared ecological attributes of prospective hosts. Phylospecificity indices quantify host specificity in terms of host relatedness, but can fail to capture ecological attributes that increase susceptibility. For instance, similarity in habitat niche may expose phylogenetically unrelated host species to similar pathogen assemblages. Using a recently proposed method that integrates multiple distances, we assess the relative contributions of host phylogenetic and functional distances to pathogen host specificity (functional–phylogenetic host specificity). We apply this index to a data set of avian malaria parasite (Plasmodium and Haemoproteus spp.) infections from Melanesian birds to show that multihost parasites generally use hosts that are closely related, not hosts with similar habitat niches. We also show that host community phylogenetic ß‐diversity (Pßd) predicts parasite Pßd and that individual host species carry phylogenetically clustered Haemoproteus parasite assemblages. Our findings were robust to phylogenetic uncertainty, and suggest that phylogenetic ancestry of both hosts and parasites plays important roles in driving avian malaria host specificity and community assembly. However, restricting host specificity analyses to either recent or historical timescales identified notable exceptions, including a ‘habitat specialist’ parasite that infects a diversity of unrelated host species with similar habitat niches. This work highlights that integrating ecological and phylogenetic distances provides a powerful approach to better understand drivers of pathogen host specificity and community assembly.  相似文献   

6.
Understanding the relationship between population genetic structure and phenotypic diversity is a fundamental question in evolutionary biology. Yeasts display wide genetic diversity and exhibit remarkably diverse heterotrophic metabolisms that allow a variety of niche occupations. However, little is known about how intra-species genetic population structure is related to trait diversity in yeasts. In this study, we investigated the link between intra-species genetic population structure and trait diversity in the floral nectar-inhabiting yeast Metschnikowia reukaufii (Ascomycota). A total of 73 strains obtained from 11 plant species were genotyped by whole genome sequencing, followed by single nucleotide polymorphism (SNP) calling, and phenotyped using a robot-assisted high-throughput screening platform. Analysis of the population structure estimated the number of ancestral populations to be K = 5, each one including strains from different locations and host plants, and 26% of strains showed significant genetic admixture (<80% ancestry from a single population). These mosaic strains were scattered throughout a maximum-likelihood phylogenetic tree built from SNP data, and differed widely in their ancestry. While yeast strains varied in nutrient assimilation and tolerance to inhibitors, trait differentiation among genetic lineages was in most cases negligible. Notably, outlier phenotypes largely corresponded to the mosaic strains, and removal of these from the data had a dramatic effect on the intra-species phylogenetic signal of studied phenotypes and patterns of trait evolution. Overall, these results suggest that genetic mosaicism broadens the phenotypic landscape explored by M. reukaufii and may allow adaptation to highly variable nectar environments.  相似文献   

7.
A major goal in evolutionary biology is to understand the genetic basis of adaptive traits. In migratory birds, wing morphology is such a trait. Our previous work on the great reed warbler (Acrocephalus arundinaceus) shows that wing length is highly heritable and under sexually antagonistic selection. Moreover, a quantitative trait locus (QTL) mapping analysis detected a pronounced QTL for wing length on chromosome 2, suggesting that wing morphology is partly controlled by genes with large effects. Here, we re‐evaluate the genetic basis of wing length in great reed warblers using a genomewide association study (GWAS) approach based on restriction site‐associated DNA sequencing (RADseq) data. We use GWAS models that account for relatedness between individuals and include covariates (sex, age and tarsus length). The resulting association landscape was flat with no peaks on chromosome 2 or elsewhere, which is in line with expectations for polygenic traits. Analysis of the distribution of p‐values did not reveal biases, and the inflation factor was low. Effect sizes were however not uniformly distributed on some chromosomes, and the Z chromosome had weaker associations than autosomes. The level of linkage disequilibrium (LD) in the population decayed to background levels within c. 1 kbp. There could be several reasons to why our QTL study and GWAS gave contrasting results including differences in how associations are modelled (cosegregation in pedigree vs. LD associations), how covariates are accounted for in the models, type of marker used (multi‐ vs. biallelic), difference in power or a combination of these. Our study highlights that the genetic architecture even of highly heritable traits is difficult to characterize in wild populations.  相似文献   

8.
Current methods for studying the genetic basis of adaptation evaluate genetic associations with ecologically relevant traits or single environmental variables, under the implicit assumption that natural selection imposes correlations between phenotypes, environments and genotypes. In practice, observed trait and environmental data are manifestations of unknown selective forces and are only indirectly associated with adaptive genetic variation. In theory, improved estimation of these forces could enable more powerful detection of loci under selection. Here we present an approach in which we approximate adaptive variation by modeling phenotypes as a function of the environment and using the predicted trait in multivariate and univariate genome-wide association analysis (GWAS). Based on computer simulations and published flowering time data from the model plant Arabidopsis thaliana, we find that environmentally predicted traits lead to higher recovery of functional loci in multivariate GWAS and are more strongly correlated to allele frequencies at adaptive loci than individual environmental variables. Our results provide an example of the use of environmental data to obtain independent and meaningful information on adaptive genetic variation.  相似文献   

9.
Host genetics has recently been shown to be a driver of plant microbiome composition. However, identifying the underlying genetic loci controlling microbial selection remains challenging. Genome-wide association studies (GWAS) represent a potentially powerful, unbiased method to identify microbes sensitive to the host genotype and to connect them with the genetic loci that influence their colonization. Here, we conducted a population-level microbiome analysis of the rhizospheres of 200 sorghum genotypes. Using 16S rRNA amplicon sequencing, we identify rhizosphere-associated bacteria exhibiting heritable associations with plant genotype, and identify significant overlap between these lineages and heritable taxa recently identified in maize. Furthermore, we demonstrate that GWAS can identify host loci that correlate with the abundance of specific subsets of the rhizosphere microbiome. Finally, we demonstrate that these results can be used to predict rhizosphere microbiome structure for an independent panel of sorghum genotypes based solely on knowledge of host genotypic information.Subject terms: Agricultural genetics, Plant ecology, Soil microbiology  相似文献   

10.
The adaptive potential of pathogens in novel or heterogeneous environments underpins the risk of disease epidemics. Antagonistic pleiotropy or differential resource allocation among life-history traits can constrain pathogen adaptation. However, we lack understanding of how the genetic architecture of individual traits can generate trade-offs. Here, we report a large-scale study based on 145 global strains of the fungal wheat pathogen Zymoseptoria tritici from four continents. We measured 50 life-history traits, including virulence and reproduction on 12 different wheat hosts and growth responses to several abiotic stressors. To elucidate the genetic basis of adaptation, we used genome-wide association mapping coupled with genetic correlation analyses. We show that most traits are governed by polygenic architectures and are highly heritable suggesting that adaptation proceeds mainly through allele frequency shifts at many loci. We identified negative genetic correlations among traits related to host colonization and survival in stressful environments. Such genetic constraints indicate that pleiotropic effects could limit the pathogen’s ability to cause host damage. In contrast, adaptation to abiotic stress factors was likely facilitated by synergistic pleiotropy. Our study illustrates how comprehensive mapping of life-history trait architectures across diverse environments allows to predict evolutionary trajectories of pathogens confronted with environmental perturbations.Subject terms: Population genetics, Plant sciences, Molecular evolution, Fungi  相似文献   

11.
Understanding the contribution of genetic variation to drug response can improve the delivery of precision medicine. However, genome-wide association studies (GWAS) for drug response are uncommon and are often hindered by small sample sizes. We present a high-throughput framework to efficiently identify eligible patients for genetic studies of adverse drug reactions (ADRs) using “drug allergy” labels from electronic health records (EHRs). As a proof-of-concept, we conducted GWAS for ADRs to 14 common drug/drug groups with 81,739 individuals from Vanderbilt University Medical Center’s BioVU DNA Biobank. We identified 7 genetic loci associated with ADRs at P < 5 × 10−8, including known genetic associations such as CYP2D6 and OPRM1 for CYP2D6-metabolized opioid ADR. Additional expression quantitative trait loci and phenome-wide association analyses added evidence to the observed associations. Our high-throughput framework is both scalable and portable, enabling impactful pharmacogenomic research to improve precision medicine.  相似文献   

12.
Genome-wide association studies (GWAS) have identified ∼100 loci associated with blood lipid levels, but much of the trait heritability remains unexplained, and at most loci the identities of the trait-influencing variants remain unknown. We conducted a trans-ethnic fine-mapping study at 18, 22, and 18 GWAS loci on the Metabochip for their association with triglycerides (TG), high-density lipoprotein cholesterol (HDL-C), and low-density lipoprotein cholesterol (LDL-C), respectively, in individuals of African American (n = 6,832), East Asian (n = 9,449), and European (n = 10,829) ancestry. We aimed to identify the variants with strongest association at each locus, identify additional and population-specific signals, refine association signals, and assess the relative significance of previously described functional variants. Among the 58 loci, 33 exhibited evidence of association at P<1×10−4 in at least one ancestry group. Sequential conditional analyses revealed that ten, nine, and four loci in African Americans, Europeans, and East Asians, respectively, exhibited two or more signals. At these loci, accounting for all signals led to a 1.3- to 1.8-fold increase in the explained phenotypic variance compared to the strongest signals. Distinct signals across ancestry groups were identified at PCSK9 and APOA5. Trans-ethnic analyses narrowed the signals to smaller sets of variants at GCKR, PPP1R3B, ABO, LCAT, and ABCA1. Of 27 variants reported previously to have functional effects, 74% exhibited the strongest association at the respective signal. In conclusion, trans-ethnic high-density genotyping and analysis confirm the presence of allelic heterogeneity, allow the identification of population-specific variants, and limit the number of candidate SNPs for functional studies.  相似文献   

13.
Genome-wide association studies (GWAS) have detected many disease associations. However, the reported variants tend to explain small fractions of risk, and there are doubts about issues such as the portability of findings over different ethnic groups or the relative roles of rare versus common variants in the genetic architecture of complex disease. Studying the degree of sharing of disease-associated variants across populations can help in solving these issues. We present a comprehensive survey of GWAS replicability across 28 diseases. Most loci and SNPs discovered in Europeans for these conditions have been extensively replicated using peoples of European and East Asian ancestry, while the replication with individuals of African ancestry is much less common. We found a strong and significant correlation of Odds Ratios across Europeans and East Asians, indicating that underlying causal variants are common and shared between the two ancestries. Moreover, SNPs that failed to replicate in East Asians map into genomic regions where Linkage Disequilibrium patterns differ significantly between populations. Finally, we observed that GWAS with larger sample sizes have detected variants with weaker effects rather than with lower frequencies. Our results indicate that most GWAS results are due to common variants. In addition, the sharing of disease alleles and the high correlation in their effect sizes suggest that most of the underlying causal variants are shared between Europeans and East Asians and that they tend to map close to the associated marker SNPs.  相似文献   

14.
15.
Plasma levels of aspartate aminotransferase (AST), a liver enzyme, are elevated in patients with visceral obesity. This study examined whether adipocyte volume is under the influence of genetic factors and evaluated its genetic correlations with AST. Fasting plasma levels of 344 pedigreed baboons from the Southwest National Primate Research Center in San Antonio, TX, USA, were assayed for AST. Adipocyte volume was measured using biopsies of omental adipose tissue. Adipocyte volume, body weight, and plasma AST were heritable. Genetic correlations between the measured adiposity-related phenotypes and AST were significant. A quantitative trait locus (LOD score 3.2) for adipocyte volume was identified on the baboon homolog of human chromosome 6 near marker D6S1028. These results suggest that omental adipocyte volume is under genetic regulation and that shared genetic factors influence adiposity-associated traits and AST.  相似文献   

16.
Polymorphisms that affect complex traits or quantitative trait loci (QTL) often affect multiple traits. We describe two novel methods (1) for finding single nucleotide polymorphisms (SNPs) significantly associated with one or more traits using a multi-trait, meta-analysis, and (2) for distinguishing between a single pleiotropic QTL and multiple linked QTL. The meta-analysis uses the effect of each SNP on each of n traits, estimated in single trait genome wide association studies (GWAS). These effects are expressed as a vector of signed t-values (t) and the error covariance matrix of these t values is approximated by the correlation matrix of t-values among the traits calculated across the SNP (V). Consequently, t''V−1t is approximately distributed as a chi-squared with n degrees of freedom. An attractive feature of the meta-analysis is that it uses estimated effects of SNPs from single trait GWAS, so it can be applied to published data where individual records are not available. We demonstrate that the multi-trait method can be used to increase the power (numbers of SNPs validated in an independent population) of GWAS in a beef cattle data set including 10,191 animals genotyped for 729,068 SNPs with 32 traits recorded, including growth and reproduction traits. We can distinguish between a single pleiotropic QTL and multiple linked QTL because multiple SNPs tagging the same QTL show the same pattern of effects across traits. We confirm this finding by demonstrating that when one SNP is included in the statistical model the other SNPs have a non-significant effect. In the beef cattle data set, cluster analysis yielded four groups of QTL with similar patterns of effects across traits within a group. A linear index was used to validate SNPs having effects on multiple traits and to identify additional SNPs belonging to these four groups.  相似文献   

17.
Genome-wide association studies (GWAS) are routinely being used to examine the genetic contribution to complex human traits, such as high-density lipoprotein cholesterol (HDL-C). Although HDL-C levels are highly heritable (h2∼0.7), the genetic determinants identified through GWAS contribute to a small fraction of the variance in this trait. Reasons for this discrepancy may include rare variants, structural variants, gene-environment (GxE) interactions, and gene-gene (GxG) interactions. Clinical practice-based biobanks now allow investigators to address these challenges by conducting GWAS in the context of comprehensive electronic medical records (EMRs). Here we apply an EMR-based phenotyping approach, within the context of routine care, to replicate several known associations between HDL-C and previously characterized genetic variants: CETP (rs3764261, p = 1.22e-25), LIPC (rs11855284, p = 3.92e-14), LPL (rs12678919, p = 1.99e-7), and the APOA1/C3/A4/A5 locus (rs964184, p = 1.06e-5), all adjusted for age, gender, body mass index (BMI), and smoking status. By using a novel approach which censors data based on relevant co-morbidities and lipid modifying medications to construct a more rigorous HDL-C phenotype, we identified an association between HDL-C and TRIB1, a gene which previously resisted identification in studies with larger sample sizes. Through the application of additional analytical strategies incorporating biological knowledge, we further identified 11 significant GxG interaction models in our discovery cohort, 8 of which show evidence of replication in a second biobank cohort. The strongest predictive model included a pairwise interaction between LPL (which modulates the incorporation of triglyceride into HDL) and ABCA1 (which modulates the incorporation of free cholesterol into HDL). These results demonstrate that gene-gene interactions modulate complex human traits, including HDL cholesterol.  相似文献   

18.
Progress in genomics and the associated technological, statistical and bioinformatics advances have facilitated the successful implementation of genome-wide association studies (GWAS) towards understanding the genetic basis of common diseases. Infectious diseases contribute significantly to the global burden of disease and there is robust epidemiological evidence that host genetic factors are important determinants of the outcome of interactions between host and pathogen. Indeed, infectious diseases have exerted profound selective pressure on human evolution. However, the application of GWAS to infectious diseases has been relatively limited compared with non-communicable diseases. Here we review GWAS findings for important infectious diseases, including malaria, tuberculosis and HIV. We highlight some of the pitfalls recognized more generally for GWAS, as well as issues specific to infection, including the role of the pathogen which also has a genome. We also discuss the challenges encountered when studying African populations which are genetically more ancient and more diverse that other populations and disproportionately bear the main global burden of serious infectious diseases.  相似文献   

19.
Understanding the genetic architecture of complex traits is a major objective in biology. The standard approach for doing so is genome-wide association studies (GWAS), which aim to identify genetic polymorphisms responsible for variation in traits of interest. In human genetics, consistency across studies is commonly used as an indicator of reliability. However, if traits are involved in adaptation to the local environment, we do not necessarily expect reproducibility. On the contrary, results may depend on where you sample, and sampling across a wide range of environments may decrease the power of GWAS because of increased genetic heterogeneity. In this study, we examine how sampling affects GWAS in the model plant species Arabidopsis thaliana. We show that traits like flowering time are indeed influenced by distinct genetic effects in local populations. Furthermore, using gene expression as a molecular phenotype, we show that some genes are globally affected by shared variants, whereas others are affected by variants specific to subpopulations. Remarkably, the former are essentially all cis-regulated, whereas the latter are predominately affected by trans-acting variants. Our result illustrate that conclusions about genetic architecture can be extremely sensitive to sampling and population structure.  相似文献   

20.
Genome-wide association studies (GWAS) have yielded novel genetic loci underlying common diseases. We propose a systems genetics approach to utilize these discoveries for better understanding of the genetic architecture of rheumatoid arthritis (RA). Current evidence of genetic associations with RA was sought through PubMed and the NHGRI GWAS catalog. The associations of 15 single nucleotide polymorphisms and HLA-DRB1 alleles were confirmed in 1,287 cases and 1,500 controls of Japanese subjects. Among these, HLA-DRB1 alleles and eight SNPs showed significant associations and all but one of the variants had the same direction of effect as identified in the previous studies, indicating that the genetic risk factors underlying RA are shared across populations. By receiver operating characteristic curve analysis, the area under the curve (AUC) for the genetic risk score based on the selected variants was 68.4%. For seropositive RA patients only, the AUC improved to 70.9%, indicating good but suboptimal predictive ability. A simulation study shows that more than 200 additional loci with similar effect size as recent GWAS findings or 20 rare variants with intermediate effects are needed to achieve AUC = 80.0%. We performed the random walk with restart (RWR) algorithm to prioritize genes for future mapping studies. The performance of the algorithm was confirmed by leave-one-out cross-validation. The RWR algorithm pointed to ZAP70 in the first rank, in which mutation causes RA-like autoimmune arthritis in mice. By applying the hierarchical clustering method to a subnetwork comprising RA-associated genes and top-ranked genes by the RWR, we found three functional modules relevant to RA etiology: “leukocyte activation and differentiation”, “pattern-recognition receptor signaling pathway”, and “chemokines and their receptors”.These results suggest that the systems genetics approach is useful to find directions of future mapping strategies to illuminate biological pathways.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号