首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 78 毫秒
1.
Genome-wide association studies (GWAS) have rapidly become a standard method for disease gene discovery. A substantial number of recent GWAS indicate that for most disorders, only a few common variants are implicated and the associated SNPs explain only a small fraction of the genetic risk. This review is written from the viewpoint that findings from the GWAS provide preliminary genetic information that is available for additional analysis by statistical procedures that accumulate evidence, and that these secondary analyses are very likely to provide valuable information that will help prioritize the strongest constellations of results. We review and discuss three analytic methods to combine preliminary GWAS statistics to identify genes, alleles, and pathways for deeper investigations. Meta-analysis seeks to pool information from multiple GWAS to increase the chances of finding true positives among the false positives and provides a way to combine associations across GWAS, even when the original data are unavailable. Testing for epistasis within a single GWAS study can identify the stronger results that are revealed when genes interact. Pathway analysis of GWAS results is used to prioritize genes and pathways within a biological context. Following a GWAS, association results can be assigned to pathways and tested in aggregate with computational tools and pathway databases. Reviews of published methods with recommendations for their application are provided within the framework for each approach.  相似文献   

2.
Aging is the single largest risk factor for chronic disease. Studies in model organisms have identified conserved pathways that modulate aging rate and the onset and progression of multiple age‐related diseases, suggesting that common pathways of aging may influence age‐related diseases in humans as well. To determine whether there is genetic evidence supporting the notion of common pathways underlying age‐related diseases, we analyzed the genes and pathways found to be associated with five major categories of age‐related disease using a total of 410 genomewide association studies (GWAS). While only a small number of genes are shared among all five disease categories, those found in at least three of the five major age‐related disease categories are highly enriched for apoliprotein metabolism genes. We found that a more substantial number of gene ontology (GO) terms are shared among the 5 age‐related disease categories and shared GO terms include canonical aging pathways identified in model organisms, such as nutrient‐sensing signaling, translation, proteostasis, stress responses, and genome maintenance. Taking advantage of the vast amount of genetic data from the GWAS, our findings provide the first direct evidence that conserved pathways of aging simultaneously influence multiple age‐related diseases in humans as has been demonstrated in model organisms.  相似文献   

3.
Genome-wide association studies (GWAS) with hundreds of żthousands of single nucleotide polymorphisms (SNPs) are popular strategies to reveal the genetic basis of human complex diseases. Despite many successes of GWAS, it is well recognized that new analytical approaches have to be integrated to achieve their full potential. Starting with a list of SNPs, found to be associated with disease in GWAS, here we propose a novel methodology to devise functionally important KEGG pathways through the identification of genes within these pathways, where these genes are obtained from SNP analysis. Our methodology is based on functionalization of important SNPs to identify effected genes and disease related pathways. We have tested our methodology on WTCCC Rheumatoid Arthritis (RA) dataset and identified: i) previously known RA related KEGG pathways (e.g., Toll-like receptor signaling, Jak-STAT signaling, Antigen processing, Leukocyte transendothelial migration and MAPK signaling pathways); ii) additional KEGG pathways (e.g., Pathways in cancer, Neurotrophin signaling, Chemokine signaling pathways) as associated with RA. Furthermore, these newly found pathways included genes which are targets of RA-specific drugs. Even though GWAS analysis identifies 14 out of 83 of those drug target genes; newly found functionally important KEGG pathways led to the discovery of 25 out of 83 genes, known to be used as drug targets for the treatment of RA. Among the previously known pathways, we identified additional genes associated with RA (e.g. Antigen processing and presentation, Tight junction). Importantly, within these pathways, the associations between some of these additionally found genes, such as HLA-C, HLA-G, PRKCQ, PRKCZ, TAP1, TAP2 and RA were verified by either OMIM database or by literature retrieved from the NCBI PubMed module. With the whole-genome sequencing on the horizon, we show that the full potential of GWAS can be achieved by integrating pathway and network-oriented analysis and prior knowledge from functional properties of a SNP.  相似文献   

4.
Torkamani A  Topol EJ  Schork NJ 《Genomics》2008,92(5):265-272
Recent genome-wide association studies (GWAS) have identified DNA sequence variations that exhibit unequivocal statistical associations with many common chronic diseases. However, the vast majority of these studies identified variations that explain only a very small fraction of disease burden in the population at large, suggesting that other factors, such as multiple rare or low-penetrance variations and interacting environmental factors, are major contributors to disease susceptibility. Identifying multiple low-penetrance variations (or "polygenes") contributing to disease susceptibility will be difficult. We present a pathway analysis approach to characterizing the likely polygenic basis of seven common diseases using the Wellcome Trust Case Control Consortium (WTCCC) GWAS results. We identify numerous pathways implicated in disease predisposition that would have not been revealed using standard single-locus GWAS statistical analysis criteria. Many of these pathways have long been assumed to contain polymorphic genes that lead to disease predisposition. Additionally, we analyze the genetic relationships between the seven diseases, and based upon similarities with respect to the associated genes and pathways affected in each, propose a new way of categorizing the diseases.  相似文献   

5.
Genome-wide association studies (GWAS) for Autism Spectrum Disorder (ASD) thus far met limited success in the identification of common risk variants, consistent with the notion that variants with small individual effects cannot be detected individually in single SNP analysis. To further capture disease risk gene information from ASD association studies, we applied a network-based strategy to the Autism Genome Project (AGP) and the Autism Genetics Resource Exchange GWAS datasets, combining family-based association data with Human Protein-Protein interaction (PPI) data. Our analysis showed that autism-associated proteins at higher than conventional levels of significance (P<0.1) directly interact more than random expectation and are involved in a limited number of interconnected biological processes, indicating that they are functionally related. The functionally coherent networks generated by this approach contain ASD-relevant disease biology, as demonstrated by an improved positive predictive value and sensitivity in retrieving known ASD candidate genes relative to the top associated genes from either GWAS, as well as a higher gene overlap between the two ASD datasets. Analysis of the intersection between the networks obtained from the two ASD GWAS and six unrelated disease datasets identified fourteen genes exclusively present in the ASD networks. These are mostly novel genes involved in abnormal nervous system phenotypes in animal models, and in fundamental biological processes previously implicated in ASD, such as axon guidance, cell adhesion or cytoskeleton organization. Overall, our results highlighted novel susceptibility genes previously hidden within GWAS statistical “noise” that warrant further analysis for causal variants.  相似文献   

6.
The major mental disorders, schizophrenia and bipolar disorder are substantially heritable. Recent genomic studies have identified a small number of common and rare risk genes contributing to both disorders and support epidemiological evidence that genetic susceptibility overlaps between them. Prompted by the question of whether risk genes cluster in specific molecular pathways or implicate discrete mechanisms we and others have developed hypothesis-free methods of investigating genome-wide association datasets at a pathway-level. The application of our method to the 212 experimentally-derived pathways in the Kyoto Encycolpaedia of Genes and Genomes (KEGG) database identified significant association between the cell adhesion molecule (CAM) pathway and both schizophrenia and bipolar disorder susceptibility across three GWAS datasets. Interestingly, a similar approach applied to an autistic spectrum disorders (ASDs) sample identified a similar pathway and involved many of the same genes. Disruption of a number of these genes (including NRXN1, CNTNAP2 and CASK) are known to cause diverse neurodevelopmental brain disorder phenotypes including schizophenia, autism, learning disability and specific language disorder. Taken together these studies bring the CAM pathway sharply into focus for more comprehensive DNA sequencing to identify the critical genes, and investigate their relationships and interaction with environmental risk factors in the expression of many seemingly different neurodevelopmental disorders.  相似文献   

7.
Li H 《Human genetics》2012,131(9):1395-1401
Many common human diseases are complex and are expected to be highly heterogeneous, with multiple causative loci and multiple rare and common variants at some of the causative loci contributing to the risk of these diseases. Data from the genome-wide association studies (GWAS) and metadata such as known gene functions and pathways provide the possibility of identifying genetic variants, genes and pathways that are associated with complex phenotypes. Single-marker-based tests have been very successful in identifying thousands of genetic variants for hundreds of complex phenotypes. However, these variants only explain very small percentages of the heritabilities. To account for the locus- and allelic-heterogeneity, gene-based and pathway-based tests can be very useful in the next stage of the analysis of GWAS data. U-statistics, which summarize the genomic similarity between pair of individuals and link the genomic similarity to phenotype similarity, have proved to be very useful for testing the associations between a set of single nucleotide polymorphisms and the phenotypes. Compared to single marker analysis, the advantages afforded by the U-statistics-based methods is large when the number of markers involved is large. We review several formulations of U-statistics in genetic association studies and point out the links of these statistics with other similarity-based tests of genetic association. Finally, potential application of U-statistics in analysis of the next-generation sequencing data and rare variants association studies are discussed.  相似文献   

8.

Background

Polycystic ovary syndrome (PCOS) is one of the most common endocrine disorders in women of reproductive age, and it is affected by both environmental and genetic factors. Although the genetic component of PCOS is evident, studies aiming to identify susceptibility genes have shown controversial results. This study conducted a pathway-based analysis using a dataset obtained through a genome-wide association study (GWAS) to elucidate the biological pathways that contribute to PCOS susceptibility and the associated genes.

Methods

We used GWAS data on 636,797 autosomal single nucleotide polymorphisms (SNPs) from 1,221 individuals (432 PCOS patients and 789 controls) for analysis. A pathway analysis was conducted using meta-analysis gene-set enrichment of variant associations (MAGENTA). Top-ranking pathways or gene sets associated with PCOS were identified, and significant genes within the pathways were analyzed.

Results

The pathway analysis of the GWAS dataset identified significant pathways related to oocyte meiosis and the regulation of insulin secretion by acetylcholine and free fatty acids (all nominal gene-set enrichment analysis (GSEA) P-values < 0.05). In addition, INS, GNAQ, STXBP1, PLCB3, PLCB2, SMC3 and PLCZ1 were significant genes observed within the biological pathways (all gene P-values < 0.05).

Conclusions

By applying MAGENTA pathway analysis to PCOS GWAS data, we identified significant pathways and candidate genes involved in PCOS. Our findings may provide new leads for understanding the mechanisms underlying the development of PCOS.  相似文献   

9.
Celiac disease is a common autoimmune disorder characterized by an intestinal inflammation triggered by gluten, a storage protein found in wheat, rye and barley. Similar to other autoimmune diseases such as type 1 diabetes, psoriasis and rheumatoid arthritis, celiac disease is the result of an immune response to self-antigens leading to tissue destruction and production of autoantibodies. Common diseases like celiac disease have a complex pattern of inheritance with inputs from both environmental as well as additive and non-additive genetic factors. In the past few years, Genome Wide Association Studies (GWAS) have been successful in finding genetic risk variants behind many common diseases and traits. To complement and add to the previous findings, we performed a GWAS including 206 trios from 97 nuclear Swedish and Norwegian families affected with celiac disease. By stratifying for HLA-DQ, we identified a new genome-wide significant risk locus covering the DUSP10 gene. To further investigate the associations from the GWAS we performed pathway analyses and two-locus interaction analyses. These analyses showed an over-representation of genes involved in type 2 diabetes and identified a set of candidate mechanisms and genes of which some were selected for mRNA expression analysis using small intestinal biopsies from 98 patients. Several genes were expressed differently in the small intestinal mucosa from patients with celiac autoimmunity compared to intestinal mucosa from control patients. From top-scoring regions we identified susceptibility genes in several categories: 1) polarity and epithelial cell functionality; 2) intestinal smooth muscle; 3) growth and energy homeostasis, including proline and glutamine metabolism; and finally 4) innate and adaptive immune system. These genes and pathways, including specific functions of DUSP10, together reveal a new potential biological mechanism that could influence the genesis of celiac disease, and possibly also other chronic disorders with an inflammatory component.  相似文献   

10.
Standard approaches to data analysis in genome-wide association studies (GWAS) ignore any potential functional relationships between gene variants. In contrast gene pathways analysis uses prior information on functional structure within the genome to identify pathways associated with a trait of interest. In a second step, important single nucleotide polymorphisms (SNPs) or genes may be identified within associated pathways. The pathways approach is motivated by the fact that genes do not act alone, but instead have effects that are likely to be mediated through their interaction in gene pathways. Where this is the case, pathways approaches may reveal aspects of a trait''s genetic architecture that would otherwise be missed when considering SNPs in isolation. Most pathways methods begin by testing SNPs one at a time, and so fail to capitalise on the potential advantages inherent in a multi-SNP, joint modelling approach. Here, we describe a dual-level, sparse regression model for the simultaneous identification of pathways and genes associated with a quantitative trait. Our method takes account of various factors specific to the joint modelling of pathways with genome-wide data, including widespread correlation between genetic predictors, and the fact that variants may overlap multiple pathways. We use a resampling strategy that exploits finite sample variability to provide robust rankings for pathways and genes. We test our method through simulation, and use it to perform pathways-driven gene selection in a search for pathways and genes associated with variation in serum high-density lipoprotein cholesterol levels in two separate GWAS cohorts of Asian adults. By comparing results from both cohorts we identify a number of candidate pathways including those associated with cardiomyopathy, and T cell receptor and PPAR signalling. Highlighted genes include those associated with the L-type calcium channel, adenylate cyclase, integrin, laminin, MAPK signalling and immune function.  相似文献   

11.
12.
Plasma triglyceride (TG) concentration is reemerging as an important cardiovascular disease risk factor. More complete understanding of the genes and variants that modulate plasma TG should enable development of markers for risk prediction, diagnosis, prognosis, and response to therapies and might help specify new directions for therapeutic interventions. Recent genome-wide association studies (GWAS) have identified both known and novel loci associated with plasma TG concentration. However, genetic variation at these loci explains only ~10% of overall TG variation within the population. As the GWAS approach may be reaching its limit for discovering genetic determinants of TG, alternative genetic strategies, such as rare variant sequencing studies and evaluation of animal models, may provide complementary information to flesh out knowledge of clinically and biologically important pathways in TG metabolism. Herein, we review genes recently implicated in TG metabolism and describe how some of these genes likely modulate plasma TG concentration. We also discuss lessons regarding plasma TG metabolism learned from various genomic and genetic experimental approaches. Treatment of patients with moderate to severe hypertriglyceridemia with existing therapies is often challenging; thus, gene products and pathways found in recent genetic research studies provide hope for development of more effective clinical strategies.  相似文献   

13.
14.
Maize (Zea mays mays) oil is a rich source of polyunsaturated fatty acids (FAs) and energy, making it a valuable resource for human food, animal feed, and bio‐energy. Although this trait has been studied via conventional genome‐wide association study (GWAS), the single nucleotide polymorphism (SNP)‐trait associations generated by GWAS may miss the underlying associations when traits are based on many genes, each with small effects that can be overshadowed by genetic background and environmental variation. Detecting these SNPs statistically is also limited by the levels set for false discovery rate. A complementary pathways analysis that emphasizes the cumulative aspects of SNP‐trait associations, rather than just the significance of single SNPs, was performed to understand the balance of lipid metabolism, conversion, and catabolism in this study. This pathway analysis indicated that acyl‐lipid pathways, including biosynthesis of wax esters, sphingolipids, phospholipids and flavonoids, along with FA and triacylglycerol (TAG) biosynthesis, were important for increasing oil and FA content. The allelic variation found among the genes involved in many degradation pathways, and many biosynthesis pathways leading from FAs and carbon partitioning pathways, was critical for determining final FA content, changing FA ratios and, ultimately, to final oil content. The pathways and pathway networks identified in this study, and especially the acyl‐lipid associated pathways identified beyond what had been found with GWAS alone, provide a real opportunity to precisely and efficiently manipulate high‐oil maize genetic improvement.  相似文献   

15.
Li C  Han J  Shang D  Li J  Wang Y  Wang Y  Zhang Y  Yao Q  Zhang C  Li K  Li X 《Gene》2012,503(1):101-109
Most methods for genome-wide association studies (GWAS) focus on discovering a single genetic variant, but the pathogenesis of complex diseases is thought to arise from the joint effect of multiple genetic variants. Information about pathway structure, such as the interactions and distances between gene products within pathways, can help us learn more about the functions and joint effect of genes associated with disease risk. We developed a novel sub-pathway based approach to study the joint effect of multiple genetic variants that are modestly associated with disease. The approach prioritized sub-pathways based on the significance values of single nucleotide polymorphisms (SNPs) and the interactions and distances between gene products within pathways. We applied the method to seven complex diseases. The result showed that our method can efficiently identify statistically significant sub-pathways associated with the pathogenesis of complex diseases. The approach identified sub-pathways that may inform the interpretation of GWAS data.  相似文献   

16.
17.
Chen M  Cho J  Zhao H 《PLoS genetics》2011,7(4):e1001353
Genome-wide association studies (GWAS) examine a large number of markers across the genome to identify associations between genetic variants and disease. Most published studies examine only single markers, which may be less informative than considering multiple markers and multiple genes jointly because genes may interact with each other to affect disease risk. Much knowledge has been accumulated in the literature on biological pathways and interactions. It is conceivable that appropriate incorporation of such prior knowledge may improve the likelihood of making genuine discoveries. Although a number of methods have been developed recently to prioritize genes using prior biological knowledge, such as pathways, most methods treat genes in a specific pathway as an exchangeable set without considering the topological structure of a pathway. However, how genes are related with each other in a pathway may be very informative to identify association signals. To make use of the connectivity information among genes in a pathway in GWAS analysis, we propose a Markov Random Field (MRF) model to incorporate pathway topology for association analysis. We show that the conditional distribution of our MRF model takes on a simple logistic regression form, and we propose an iterated conditional modes algorithm as well as a decision theoretic approach for statistical inference of each gene's association with disease. Simulation studies show that our proposed framework is more effective to identify genes associated with disease than a single gene-based method. We also illustrate the usefulness of our approach through its applications to a real data example.  相似文献   

18.

Background  

Recently we have witnessed a surge of interest in using genome-wide association studies (GWAS) to discover the genetic basis of complex diseases. Many genetic variations, mostly in the form of single nucleotide polymorphisms (SNPs), have been identified in a wide spectrum of diseases, including diabetes, cancer, and psychiatric diseases. A common theme arising from these studies is that the genetic variations discovered by GWAS can only explain a small fraction of the genetic risks associated with the complex diseases. New strategies and statistical approaches are needed to address this lack of explanation. One such approach is the pathway analysis, which considers the genetic variations underlying a biological pathway, rather than separately as in the traditional GWAS studies. A critical challenge in the pathway analysis is how to combine evidences of association over multiple SNPs within a gene and multiple genes within a pathway. Most current methods choose the most significant SNP from each gene as a representative, ignoring the joint action of multiple SNPs within a gene. This approach leads to preferential identification of genes with a greater number of SNPs.  相似文献   

19.
Recent evidence suggests that inflammation plays a pivotal role in the development of lung cancer. In this study, we used a two-stage approach to investigate associations between genetic variants in inflammation pathways and lung cancer risk based on genome-wide association study (GWAS) data. A total of 7,650 sequence variants from 720 genes relevant to inflammation pathways were identified using keyword and pathway searches from Gene Cards and Gene Ontology databases. In Stage 1, six GWAS datasets from the International Lung Cancer Consortium were pooled (4,441 cases and 5,094 controls of European ancestry), and a hierarchical modeling (HM) approach was used to incorporate prior information for each of the variants into the analysis. The prior matrix was constructed using (1) role of genes in the inflammation and immune pathways; (2) physical properties of the variants including the location of the variants, their conservation scores and amino acid coding; (3) LD with other functional variants and (4) measures of heterogeneity across the studies. HM affected the priority ranking of variants particularly among those having low prior weights, imprecise estimates and/or heterogeneity across studies. In Stage 2, we used an independent NCI lung cancer GWAS study (5,699 cases and 5,818 controls) for in silico replication. We identified one novel variant at the level corrected for multiple comparisons (rs2741354 in EPHX2 at 8q21.1 with p value = 7.4 × 10?6), and confirmed the associations between TERT (rs2736100) and the HLA region and lung cancer risk. HM allows for prior knowledge such as from bioinformatic sources to be incorporated into the analysis systematically, and it represents a complementary analytical approach to the conventional GWAS analysis.  相似文献   

20.
The relationship between obesity, diabetes, hyperlipidemia, hypertension, kidney disease and cardiovascular disease (CVD) is established when looked at from a clinical, epidemiological or pathophysiological perspective. Yet, when viewed from a genetic perspective, there is comparatively little data synthesis that these conditions have an underlying relationship. We sought to investigate the overlap of genetic variants independently associated with each of these commonly co-existing conditions from the NHGRI genome-wide association study (GWAS) catalog, in an attempt to replicate the established notion of shared pathophysiology and risk. We used pathway-based analyses to detect subsets of pleiotropic genes involved in similar biological processes. We identified 107 eligible GWAS studies related to CVD and its established comorbidities and risk factors and assigned genes that correspond to the associated signals based on their position. We found 44 positional genes shared across at least two CVD-related phenotypes that independently recreated the established relationship between the six phenotypes, but only if studies representing non-European populations were included. Seven genes revealed pleiotropy across three or more phenotypes, mostly related to lipid transport and metabolism. Yet, many genes had no relationship to each other or to genes with established functional connection. Whilst we successfully reproduced established relationships between CVD risk factors using GWAS findings, interpretation of biological pathways involved in the observed pleiotropy was limited. Further studies linking genetic variation to gene expression, as well as describing novel biological pathways will be needed to take full advantage of GWAS results.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号