首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.

Background  

Single Nucleotide Polymorphism (SNP) analysis only captures a small proportion of associated genetic variants in Genome-Wide Association Studies (GWAS) partly due to small marginal effects. Pathway level analysis incorporating prior biological information offers another way to analyze GWAS's of complex diseases, and promises to reveal the mechanisms leading to complex diseases. Biologically defined pathways are typically comprised of numerous genes. If only a subset of genes in the pathways is associated with disease then a joint analysis including all individual genes would result in a loss of power. To address this issue, we propose a pathway-based method that allows us to test for joint effects by using a pre-selected gene subset. In the proposed approach, each gene is considered as the basic unit, which reduces the number of genetic variants considered and hence reduces the degrees of freedom in the joint analysis. The proposed approach also can be used to investigate the joint effect of several genes in a candidate gene study.  相似文献   

2.
Recently, genome-wide association studies (GWAS) have led to the discovery of hundreds of susceptibility loci that are associated with complex metabolic diseases, such as type 2 diabetes and hyperthyroidism. The majority of the susceptibility loci are common across different races or populations; while some of them show ethnicity-specific distribution. Though the abundant novel susceptibility loci identified by GWAS have provided insight into biology through the discovery of new genes or pathways that were previously not known, most of them are in introns and the associated variants cumulatively explain only a small fraction of total heritability. Here we reviewed the genetic studies on the metabolic disorders, mainly type 2 diabetes and hyperthyroidism, including candidate genes-based findings and more recently the GWAS discovery; we also included the clinical relevance of these novel loci and the gene-environmental interactions. Finally, we discussed the future direction about the genetic study on the exploring of the pathogenesis of the metabolic diseases.  相似文献   

3.
Most common diseases are complex, involving multiple genetic and environmental factors and their interactions. In the past decade, genome-wide association studies (GWAS) have successfully identified thousands of genetic variants underlying susceptibility to complex diseases. However, the results from these studies often do not provide evidence on how the variants affect downstream pathways and lead to the disease. Therefore, in the post-GWAS era the greatest challenge lies in combining GWAS findings with additional molecular data to functionally characterize the associations. The advances in various ~omics techniques have made it possible to investigate the effect of risk variants on intermediate molecular levels, such as gene expression, methylation, protein abundance or metabolite levels. As disease aetiology is complex, no single molecular analysis is expected to fully unravel the disease mechanism. Multiple molecular levels can interact and also show plasticity in different physiological conditions, cell types and disease stages. There is therefore a great need for new integrative approaches that can combine data from different molecular levels and can help construct the causal inference from genotype to phenotype. Systems genetics is such an approach; it is used to study genetic effects within the larger scope of systems biology by integrating genotype information with various ~omics datasets as well as with environmental and physiological variables. In this review, we describe this approach and discuss how it can help us unravel the molecular mechanisms through which genetic variation causes disease. This article is part of a Special Issue entitled: From Genome to Function.  相似文献   

4.
Genome-wide association studies (GWAS) have been widely used for identifying common variants associated with complex diseases. Despite remarkable success in uncovering many risk variants and providing novel insights into disease biology, genetic variants identified to date fail to explain the vast majority of the heritability for most complex diseases. One explanation is that there are still a large number of common variants that remain to be discovered, but their effect sizes are generally too small to be detected individually. Accordingly, gene set analysis of GWAS, which examines a group of functionally related genes, has been proposed as a complementary approach to single-marker analysis. Here, we propose a flexible and adaptive test for gene sets (FLAGS), using summary statistics. Extensive simulations showed that this method has an appropriate type I error rate and outperforms existing methods with increased power. As a proof of principle, through real data analyses of Crohn’s disease GWAS data and bipolar disorder GWAS meta-analysis results, we demonstrated the superior performance of FLAGS over several state-of-the-art association tests for gene sets. Our method allows for the more powerful application of gene set analysis to complex diseases, which will have broad use given that GWAS summary results are increasingly publicly available.  相似文献   

5.
Li C  Li Y  Xu J  Lv J  Ma Y  Shao T  Gong B  Tan R  Xiao Y  Li X 《Gene》2011,489(2):119-129
Detection of the synergetic effects between variants, such as single-nucleotide polymorphisms (SNPs), is crucial for understanding the genetic characters of complex diseases. Here, we proposed a two-step approach to detect differentially inherited SNP modules (synergetic SNP units) from a SNP network. First, SNP-SNP interactions are identified based on prior biological knowledge, such as their adjacency on the chromosome or degree of relatedness between the functional relationships of their genes. These interactions form SNP networks. Second, disease-risk SNP modules (or sub-networks) are prioritised by their differentially inherited properties in IBD (Identity by Descent) profiles of affected and unaffected sibpairs. The search process is driven by the disease information and follows the structure of a SNP network. Simulation studies have indicated that this approach achieves high accuracy and a low false-positive rate in the identification of known disease-susceptible SNPs. Applying this method to an alcoholism dataset, we found that flexible patterns of susceptible SNP combinations do play a role in complex diseases, and some known genes were detected through these risk SNP modules. One example is GRM7, a known alcoholism gene successfully detected by a SNP module comprised of two SNPs, but neither of the two SNPs was significantly associated with the disease in single-locus analysis. These identified genes are also enriched in some pathways associated with alcoholism, including the calcium signalling pathway, axon guidance and neuroactive ligand-receptor interaction. The integration of network biology and genetic analysis provides putative functional bridges between genetic variants and candidate genes or pathways, thereby providing new insight into the aetiology of complex diseases.  相似文献   

6.
Coronary artery disease(CAD) is a complex human disease, involving multiple genes and their nonlinear interactions, which often act in a modular fashion. Genome-wide single nucleotide polymorphism(SNP) profiling provides an effective technique to unravel these underlying genetic interplays or their functional involvements for CAD. This study aimed to identify the susceptible pathways and modules for CAD based on SNP omics. First, the Wellcome Trust Case Control Consortium(WTCCC) SNP datasets of CAD and control samples were used to assess the jointeffect of multiple genetic variants at the pathway level, using logistic kernel machine regression model. Then, an expanded genetic network was constructed by integrating statistical gene–gene interactions involved in these susceptible pathways with their protein–protein interaction(PPI)knowledge. Finally, risk functional modules were identified by decomposition of the network. Of 276 KEGG pathways analyzed, 6 pathways were found to have a significant effect on CAD. Other than glycerolipid metabolism, glycosaminoglycan biosynthesis, and cardiac muscle contraction pathways, three pathways related to other diseases were also revealed, including Alzheimer's disease, non-alcoholic fatty liver disease, and Huntington's disease. A genetic epistatic network of 95 genes was further constructed using the abovementioned integrative approach. Of 10 functional modules derived from the network, 6 have been annotated to phospholipase C activity and cell adhesion molecule binding, which also have known functional involvement in Alzheimer's disease.These findings indicate an overlap of the underlying molecular mechanisms between CAD and Alzheimer's disease, thus providing new insights into the molecular basis for CAD and its molecular relationships with other diseases.  相似文献   

7.
Genome-wide association studies (GWAS) have successfully identified many genetic variants associated with complex diseases and traits. However, functional consequence of genetic variants studied in GWAS is not yet fully investigated, which would hinder the application of GWAS. We therefore performed a systematic functional analysis of HapMap SNPs, which have been most commonly used as the reference panel for GWAS. Our study highlights several characteristics of HapMap SNPs and identifies subsets of genetic variants with interesting functional implication. The results show that HapMap SNPs have good coverage within RefSeq genes, especially within known disease-related genes. On the other hand, only a small percentage of SNPs are non-synonymous SNPs while many SNPs are actually located at gene deserts. Moreover, many functionally important variants are not yet still interrogated. A redesigned SNP reference panel with additional functionally important variants would be useful to identify disease-causal variants in the future genome-wide studies.  相似文献   

8.
Recent advances in genotyping methodologies have allowed genome-wide association studies (GWAS) to accurately identify genetic variants that associate with common or pathological complex traits. Although most GWAS have focused on associations with single genetic variants, joint identification of multiple genetic variants, and how they interact, is essential for understanding the genetic architecture of complex phenotypic traits. Here, we propose an efficient stepwise method based on the Cochran-Mantel-Haenszel test (for stratified categorical data) to identify causal joint multiple genetic variants in GWAS. This method combines the CMH statistic with a stepwise procedure to detect multiple genetic variants associated with specific categorical traits, using a series of associated I × J contingency tables and a null hypothesis of no phenotype association. Through a new stratification scheme based on the sum of minor allele count criteria, we make the method more feasible for GWAS data having sample sizes of several thousands. We also examine the properties of the proposed stepwise method via simulation studies, and show that the stepwise CMH test performs better than other existing methods (e.g., logistic regression and detection of associations by Markov blanket) for identifying multiple genetic variants. Finally, we apply the proposed approach to two genomic sequencing datasets to detect linked genetic variants associated with bipolar disorder and obesity, respectively.  相似文献   

9.
牛大彦  严卫丽 《遗传》2015,37(12):1204-1210
心血管疾病、2型糖尿病、原发性高血压、哮喘、肥胖、肿瘤等复杂疾病在全球范围内流行,并成为人类死亡的主要原因。越来越多的人开始关注遗传易感性在复杂疾病发病机制中的作用。至今,与复杂疾病相关的易感基因和基因序列变异仍未完全清楚。人们希望通过遗传关联研究来阐明复杂疾病的遗传基础。近年来,全基因组关联研究和候选基因研究发现了大量与复杂疾病有关的基因序列变异。这些与复杂疾病有因果和(或)关联关系的基因序列变异的发现促进了复杂疾病预测和防治方法的产生和发展。遗传风险评分(Genetic risk score,GRS)作为探索单核苷酸多态(Single nucleotide polymorphisms,SNPs)与复杂疾病临床表型之间关系的新兴方法,综合了若干SNPs的微弱效应,使基因多态对疾病的预测性大幅度提升。该方法在许多复杂疾病遗传学研究中得到成功应用。本文重点介绍了GRS的计算方法和评价标准,简要列举了运用GRS取得的系列成果,并对运用过程中所存在的局限性进行了探讨,最后对遗传风险评分的未来发展方向进行了展望。  相似文献   

10.

Background  

Recently we have witnessed a surge of interest in using genome-wide association studies (GWAS) to discover the genetic basis of complex diseases. Many genetic variations, mostly in the form of single nucleotide polymorphisms (SNPs), have been identified in a wide spectrum of diseases, including diabetes, cancer, and psychiatric diseases. A common theme arising from these studies is that the genetic variations discovered by GWAS can only explain a small fraction of the genetic risks associated with the complex diseases. New strategies and statistical approaches are needed to address this lack of explanation. One such approach is the pathway analysis, which considers the genetic variations underlying a biological pathway, rather than separately as in the traditional GWAS studies. A critical challenge in the pathway analysis is how to combine evidences of association over multiple SNPs within a gene and multiple genes within a pathway. Most current methods choose the most significant SNP from each gene as a representative, ignoring the joint action of multiple SNPs within a gene. This approach leads to preferential identification of genes with a greater number of SNPs.  相似文献   

11.
Standard approaches to data analysis in genome-wide association studies (GWAS) ignore any potential functional relationships between gene variants. In contrast gene pathways analysis uses prior information on functional structure within the genome to identify pathways associated with a trait of interest. In a second step, important single nucleotide polymorphisms (SNPs) or genes may be identified within associated pathways. The pathways approach is motivated by the fact that genes do not act alone, but instead have effects that are likely to be mediated through their interaction in gene pathways. Where this is the case, pathways approaches may reveal aspects of a trait''s genetic architecture that would otherwise be missed when considering SNPs in isolation. Most pathways methods begin by testing SNPs one at a time, and so fail to capitalise on the potential advantages inherent in a multi-SNP, joint modelling approach. Here, we describe a dual-level, sparse regression model for the simultaneous identification of pathways and genes associated with a quantitative trait. Our method takes account of various factors specific to the joint modelling of pathways with genome-wide data, including widespread correlation between genetic predictors, and the fact that variants may overlap multiple pathways. We use a resampling strategy that exploits finite sample variability to provide robust rankings for pathways and genes. We test our method through simulation, and use it to perform pathways-driven gene selection in a search for pathways and genes associated with variation in serum high-density lipoprotein cholesterol levels in two separate GWAS cohorts of Asian adults. By comparing results from both cohorts we identify a number of candidate pathways including those associated with cardiomyopathy, and T cell receptor and PPAR signalling. Highlighted genes include those associated with the L-type calcium channel, adenylate cyclase, integrin, laminin, MAPK signalling and immune function.  相似文献   

12.
Type 2 diabetes (T2D) and coronary artery disease (CAD) are closely related chronic diseases with high prevalence and morbidity. However, a comprehensive comparison of the two diseases is lacking. Recent genome-wide association studies (GWAS) have identified a handful of single nucleotide polymorphisms (SNPs) that are significantly associated with the risk of T2D and CAD. These most significant findings may help interpret the pathogenesis of T2D and CAD. However, tremendous results from these GWAS are ignored. Here we revisited the raw datasets of these GWAS and performed an integrated gene network analysis to unveil the relationship between T2D and CAD by combining multiple datasets including protein–protein interaction (PPI) database, publication libraries, and pathway datasets. Our results showed that majority of genes were involved in the first module (1122 genes in T2D and 895 in CAD). Four pathways were found to be common in both T2D and CAD, including regulation of actin cytoskeleton, calcium signaling pathway, MAPK signaling pathway and focal adhesion (all P < 0.00001). MAX which was involved in small cell lung cancer pathway was a hub gene unique to T2D (OR = 1.2, P = 0.006) but not in CAD. In contrast, three hub genes including PLEKHG5 (T2D: OR = 1, P = 1; CAD: OR = 1.12, P = 0.006), TIAM1 (T2D: OR = 1, P = 1; CAD: OR = 1.48, P = 0.004) and AKAP13 (T2D: OR = 1, P = 1; CAD: OR = 1.38, P = 0.001) were hub genes unique to CAD. Moreover, for some hub genes (such as SMAD3) that were susceptible to both T2D and CAD, their associated polymorphisms were unique to each of the two diseases. Our findings might provide a landscape of the relationship between T2D and CAD.  相似文献   

13.
Chronic inflammatory diseases such as rheumatoid arthritis, inflammatory bowel disease, spondyloarthritis and psoriasis cause significant morbidity and are a considerable burden for the patients in terms of pain, impaired function and diminished quality of life, as well as for society, because of the associated high health-care costs, and loss of productivity. Our limited understanding of the pathogenic mechanisms involved in these diseases currently hinders early diagnosis and the development of more specific and effective therapies.The past years have been marked by considerable progress in our insight of the genetic basis of many diseases. In particular, genome-wide association studies (GWAS) performed with thousands of patients have provided detailed information about the genetic variants associated with a large number of chronic inflammatory diseases. These studies have brought to the forefront many genes linked to signaling pathways that were not previously known to be involved in pathogenesis, pointing to new directions in the study of disease mechanisms. GWAS also provided fundamental evidence for a key role of the immune system in the pathogenesis of these diseases, because many of the identified loci map to genes involved in different immune processes. However, the mechanisms by which disease-associated genetic variants act on disease development and the targeted cell populations remain poorly understood. The challenge of the post-GWAS era is to understand how these variants affect pathogenesis, to allow translation of genetic data into better diagnostics and innovative treatment strategies.Here, we review recent results that document the importance of the IL-23/IL-17 pathway for the pathogenesis of several chronic inflammatory diseases and summarize data that demonstrate how therapeutic targeting of this pathway can benefit affected patients.  相似文献   

14.
Li H 《Human genetics》2012,131(9):1395-1401
Many common human diseases are complex and are expected to be highly heterogeneous, with multiple causative loci and multiple rare and common variants at some of the causative loci contributing to the risk of these diseases. Data from the genome-wide association studies (GWAS) and metadata such as known gene functions and pathways provide the possibility of identifying genetic variants, genes and pathways that are associated with complex phenotypes. Single-marker-based tests have been very successful in identifying thousands of genetic variants for hundreds of complex phenotypes. However, these variants only explain very small percentages of the heritabilities. To account for the locus- and allelic-heterogeneity, gene-based and pathway-based tests can be very useful in the next stage of the analysis of GWAS data. U-statistics, which summarize the genomic similarity between pair of individuals and link the genomic similarity to phenotype similarity, have proved to be very useful for testing the associations between a set of single nucleotide polymorphisms and the phenotypes. Compared to single marker analysis, the advantages afforded by the U-statistics-based methods is large when the number of markers involved is large. We review several formulations of U-statistics in genetic association studies and point out the links of these statistics with other similarity-based tests of genetic association. Finally, potential application of U-statistics in analysis of the next-generation sequencing data and rare variants association studies are discussed.  相似文献   

15.
全基因组关联分析的进展与反思   总被引:1,自引:0,他引:1  
Tu X  Shi LS  Wang F  Wang Q 《生理科学进展》2010,41(2):87-94
全基因组关联分析(genomewide association study,GWAS)是应用人类基因组中数以百万计的单核苷酸多态性(single nucleotide polymorphism,SNP)为标记进行病例-对照关联分析,以期发现影响复杂性疾病发生的遗传特征的一种新策略。近年来,随着人类基因组计划和基因组单倍体图谱计划的实施,人们已通过GWAS方法发现并鉴定了大量与人类性状或复杂性疾病关联的遗传变异,为进一步了解控制人类复杂性疾病发生的遗传特征提供了重要的线索。然而,由于造成复杂性疾病/性状的因素较多,而且GWAS研究系统较为复杂,因此目前GWAS本身亦存在诸多的问题。本文将从研究方式、研究对象、遗传标记,以及统计分析等方面,探讨GWAS的研究现状以及存在的潜在问题,并展望GWAS今后的发展方向。  相似文献   

16.
Leeyoung Park  Ju H. Kim 《Genetics》2015,199(4):1007-1016
Causal models including genetic factors are important for understanding the presentation mechanisms of complex diseases. Familial aggregation and segregation analyses based on polygenic threshold models have been the primary approach to fitting genetic models to the family data of complex diseases. In the current study, an advanced approach to obtaining appropriate causal models for complex diseases based on the sufficient component cause (SCC) model involving combinations of traditional genetics principles was proposed. The probabilities for the entire population, i.e., normal–normal, normal–disease, and disease–disease, were considered for each model for the appropriate handling of common complex diseases. The causal model in the current study included the genetic effects from single genes involving epistasis, complementary gene interactions, gene–environment interactions, and environmental effects. Bayesian inference using a Markov chain Monte Carlo algorithm (MCMC) was used to assess of the proportions of each component for a given population lifetime incidence. This approach is flexible, allowing both common and rare variants within a gene and across multiple genes. An application to schizophrenia data confirmed the complexity of the causal factors. An analysis of diabetes data demonstrated that environmental factors and gene–environment interactions are the main causal factors for type II diabetes. The proposed method is effective and useful for identifying causal models, which can accelerate the development of efficient strategies for identifying causal factors of complex diseases.  相似文献   

17.
Uncovering the underlying genetic component of any disease is key to the understanding of its pathophysiology and may open new avenues for development of therapeutic strategies and biomarkers. In the past several years, there has been an explosion of genome-wide association studies (GWAS) resulting in the discovery of novel candidate genes conferring risk for complex diseases, including neurodegenerative diseases. Despite this success, there still remains a substantial genetic component for many complex traits and conditions that is unexplained by the GWAS findings. Additionally, in many cases, the mechanism of action of the newly discovered disease risk variants is not inherently obvious. Furthermore, a genetic region with multiple genes may be identified via GWAS, making it difficult to discern the true disease risk gene. Several alternative approaches are proposed to overcome these potential shortcomings of GWAS, including the use of quantitative, biologically relevant phenotypes. Gene expression levels represent an important class of endophenotypes. Genetic linkage and association studies that utilize gene expression levels as endophenotypes determined that the expression levels of many genes are under genetic influence. This led to the postulate that there may exist many genetic variants that confer disease risk via modifying gene expression levels. Results from the handful of genetic studies which assess gene expression level endophenotypes in conjunction with disease risk suggest that this combined phenotype approach may both increase the power for gene discovery and lead to an enhanced understanding of their mode of action. This review summarizes the evidence in support of gene expression levels as promising endophenotypes in the discovery and characterization of novel candidate genes for complex diseases, which may also represent a novel approach in the genetic studies of Alzheimer's and other neurodegenerative diseases.  相似文献   

18.
Keloid disorder is a tumour-like disease with invasive growth and a high recurrence rate. Genetic contribution is well expected due to the presence of autosomal dominant inheritance and various genetic mutations in keloid lesions. However, GWAS failed to reveal functional variants in exon regions but single nucleotide polymorphisms in the non-coding regions, suggesting the necessity of innovative genetic investigation. This study employed combined GWAS, RNA-sequence and Hi-C analyses to dissect keloid disorder genetic mechanisms using paired keloid tissues and normal skins. Differentially expressed genes, miRNAs and lncRNAs mined by RNA-sequence were identified to construct a network. From which, 8 significant pathways involved in keloid disorder pathogenesis were enriched and 6 of them were verified. Furthermore, topologically associated domains at susceptible loci were located via the Hi-C database and ten differentially expressed RNAs were identified. Among them, the functions of six molecules for cell proliferation, cell cycle and apoptosis were particularly examined and confirmed by overexpressing and knocking-down assays. This study firstly revealed unknown key biomarkers and pathways in keloid lesions using RNA-sequence and previously reported mutation loci, indicating a feasible approach to reveal the genetic contribution to keloid disorder and possibly to other diseases that are failed by GWAS analysis alone.  相似文献   

19.
Genome-wide association studies (GWAS) have rapidly become a standard method for disease gene discovery. A substantial number of recent GWAS indicate that for most disorders, only a few common variants are implicated and the associated SNPs explain only a small fraction of the genetic risk. This review is written from the viewpoint that findings from the GWAS provide preliminary genetic information that is available for additional analysis by statistical procedures that accumulate evidence, and that these secondary analyses are very likely to provide valuable information that will help prioritize the strongest constellations of results. We review and discuss three analytic methods to combine preliminary GWAS statistics to identify genes, alleles, and pathways for deeper investigations. Meta-analysis seeks to pool information from multiple GWAS to increase the chances of finding true positives among the false positives and provides a way to combine associations across GWAS, even when the original data are unavailable. Testing for epistasis within a single GWAS study can identify the stronger results that are revealed when genes interact. Pathway analysis of GWAS results is used to prioritize genes and pathways within a biological context. Following a GWAS, association results can be assigned to pathways and tested in aggregate with computational tools and pathway databases. Reviews of published methods with recommendations for their application are provided within the framework for each approach.  相似文献   

20.
Genome-wide association studies (GWAS) have identified at least 133 ulcerative colitis (UC) associated loci. The role of genetic factors in clinical practice is not clearly defined. The relevance of genetic variants to disease pathogenesis is still uncertain because of not characterized gene–gene and gene–environment interactions. We examined the predictive value of combining the 133 UC risk loci with genetic interactions in an ongoing inflammatory bowel disease (IBD) GWAS. The Wellcome Trust Case–Control Consortium (WTCCC) IBD GWAS was used as a replication cohort. We applied logic regression (LR), a novel adaptive regression methodology, to search for high-order interactions. Exploratory genotype correlations with UC sub-phenotypes [extent of disease, need of surgery, age of onset, extra-intestinal manifestations and primary sclerosing cholangitis (PSC)] were conducted. The combination of 133 UC loci yielded good UC risk predictability [area under the curve (AUC) of 0.86]. A higher cumulative allele score predicted higher UC risk. Through LR, several lines of evidence for genetic interactions were identified and successfully replicated in the WTCCC cohort. The genetic interactions combined with the gene-smoking interaction significantly improved predictability in the model (AUC, from 0.86 to 0.89, P = 3.26E?05). Explained UC variance increased from 37 to 42 % after adding the interaction terms. A within case analysis found suggested genetic association with PSC. Our study demonstrates that the LR methodology allows the identification and replication of high-order genetic interactions in UC GWAS datasets. UC risk can be predicted by a 133 loci and improved by adding gene–gene and gene–environment interactions.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号