首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Whole genome pathway analysis is a powerful tool for the exploration of the combined effects of gene-sets within biological pathways. This study applied Interval Based Enrichment Analysis (INRICH) to perform whole-genome pathway analysis of body-mass index (BMI). We used a discovery set composed of summary statistics from a meta-analysis of 123,865 subjects performed by the GIANT Consortium, and an independent sample of 8,632 subjects to assess replication of significant pathways. We examined SNPs within nominally significant pathways using linear mixed models to estimate their contribution to overall BMI heritability. Six pathways replicated as having significant enrichment for association after correcting for multiple testing, including the previously unknown relationships between BMI and the Reactome regulation of ornithine decarboxylase pathway, the KEGG lysosome pathway, and the Reactome stabilization of P53 pathway. Two non-overlapping sets of genes emerged from the six significant pathways. The clustering of shared genes based on previously identified protein-protein interactions listed in PubMed and OMIM supported the relatively independent biological effects of these two gene-sets. We estimate that the SNPs located in examined pathways explain ∼20% of the heritability for BMI that is tagged by common SNPs (3.35% of the 16.93% total).  相似文献   

2.
The power of genome-wide SNP association studies is limited, among others, by the large number of false positive test results. To provide a remedy, we combined SNP association analysis with the pathway-driven gene set enrichment analysis (GSEA), recently developed to facilitate handling of genome-wide gene expression data. The resulting GSEA-SNP method rests on the assumption that SNPs underlying a disease phenotype are enriched in genes constituting a signaling pathway or those with a common regulation. Besides improving power for association mapping, GSEA-SNP may facilitate the identification of disease-associated SNPs and pathways, as well as the understanding of the underlying biological mechanisms. GSEA-SNP may also help to identify markers with weak effects, undetectable in association studies without pathway consideration. The program is freely available and can be downloaded from our website.  相似文献   

3.
Recent genome‐wide association (GWA) studies have identified a number of novel genes/variants predisposing to obesity. However, most GWA studies have focused on individual single‐nucleotide polymorphism (SNPs)/genes with a strong statistical association with a phenotypic trait without considering potential biological interplay of the tested genes. In this study, we performed biological pathway‐based GWA analysis for BMI and body fat mass. We used individual level genotype data generated from 1,000 unrelated US whites that were genotyped for ~500,000 SNPs. Statistical analysis of pathways was performed using a modification of the Gene Set Enrichment Algorithm. A total of 963 pathways extracted from the BioCarta, Kyoto Encyclopedia of Genes and Genomes (KEGG), Ambion GeneAssist, and Gene Ontology (GO) databases were analyzed. Among all of the pathways analyzed, the vasoactive intestinal peptide (VIP) pathway was most strongly associated with fat mass (nominal P = 0.0009) and was the third most strongly associated pathway with BMI (nominal P = 0.0006). After multiple testing correction, the VIP pathway achieved false‐discovery rate (FDR) q values of 0.042 and 0.120 for fat mass and BMI, respectively. Our study is the first to demonstrate that the VIP pathway may play an important role in development of obesity. The study also highlights the importance of pathway‐based GWA analysis in identification of additional genes/variants for complex human diseases.  相似文献   

4.
Pathway analysis of genome-wide association studies (GWAS) offer a unique opportunity to collectively evaluate genetic variants with effects that are too small to be detected individually. We applied a pathway analysis to a bladder cancer GWAS containing data from 3,532 cases and 5,120 controls of European background (n = 5 studies). Thirteen hundred and ninety-nine pathways were drawn from five publicly available resources (Biocarta, Kegg, NCI-PID, HumanCyc, and Reactome), and we constructed 22 additional candidate pathways previously hypothesized to be related to bladder cancer. In total, 1421 pathways, 5647 genes and ∼90,000 SNPs were included in our study. Logistic regression model adjusting for age, sex, study, DNA source, and smoking status was used to assess the marginal trend effect of SNPs on bladder cancer risk. Two complementary pathway-based methods (gene-set enrichment analysis [GSEA], and adapted rank-truncated product [ARTP]) were used to assess the enrichment of association signals within each pathway. Eighteen pathways were detected by either GSEA or ARTP at P≤0.01. To minimize false positives, we used the I2 statistic to identify SNPs displaying heterogeneous effects across the five studies. After removing these SNPs, seven pathways (‘Aromatic amine metabolism’ [PGSEA = 0.0100, PARTP = 0.0020], ‘NAD biosynthesis’ [PGSEA = 0.0018, PARTP = 0.0086], ‘NAD salvage’ [PARTP = 0.0068], ‘Clathrin derived vesicle budding’ [PARTP = 0.0018], ‘Lysosome vesicle biogenesis’ [PGSEA = 0.0023, PARTP<0.00012], ’Retrograde neurotrophin signaling’ [PGSEA = 0.00840], and ‘Mitotic metaphase/anaphase transition’ [PGSEA = 0.0040]) remained. These pathways seem to belong to three fundamental cellular processes (metabolic detoxification, mitosis, and clathrin-mediated vesicles). Identification of the aromatic amine metabolism pathway provides support for the ability of this approach to identify pathways with established relevance to bladder carcinogenesis.  相似文献   

5.
6.
In spite of the success of genome-wide association studies (GWASs), only a small proportion of heritability for each complex trait has been explained by identified genetic variants, mainly SNPs. Likely reasons include genetic heterogeneity (i.e., multiple causal genetic variants) and small effect sizes of causal variants, for which pathway analysis has been proposed as a promising alternative to the standard single-SNP-based analysis. A pathway contains a set of functionally related genes, each of which includes multiple SNPs. Here we propose a pathway-based test that is adaptive at both the gene and SNP levels, thus maintaining high power across a wide range of situations with varying numbers of the genes and SNPs associated with a trait. The proposed method is applicable to both common variants and rare variants and can incorporate biological knowledge on SNPs and genes to boost statistical power. We use extensively simulated data and a WTCCC GWAS dataset to compare our proposal with several existing pathway-based and SNP-set-based tests, demonstrating its promising performance and its potential use in practice.  相似文献   

7.
Genome-wide association studies (GWAS) are designed to identify the portion of single-nucleotide polymorphisms (SNPs) in genome sequences associated with a complex trait. Strategies based on the gene list enrichment concept are currently applied for the functional analysis of GWAS, according to which a significant overrepresentation of candidate genes associated with a biological pathway is used as a proxy to infer overrepresentation of candidate SNPs in the pathway. Here we show that such inference is not always valid and introduce the program SNP2GO, which implements a new method to properly test for the overrepresentation of candidate SNPs in biological pathways.  相似文献   

8.
Braun R  Buetow K 《PLoS genetics》2011,7(6):e1002101
Genome-wide association studies (GWAS) have become increasingly common due to advances in technology and have permitted the identification of differences in single nucleotide polymorphism (SNP) alleles that are associated with diseases. However, while typical GWAS analysis techniques treat markers individually, complex diseases (cancers, diabetes, and Alzheimers, amongst others) are unlikely to have a single causative gene. Thus, there is a pressing need for multi-SNP analysis methods that can reveal system-level differences in cases and controls. Here, we present a novel multi-SNP GWAS analysis method called Pathways of Distinction Analysis (PoDA). The method uses GWAS data and known pathway-gene and gene-SNP associations to identify pathways that permit, ideally, the distinction of cases from controls. The technique is based upon the hypothesis that, if a pathway is related to disease risk, cases will appear more similar to other cases than to controls (or vice versa) for the SNPs associated with that pathway. By systematically applying the method to all pathways of potential interest, we can identify those for which the hypothesis holds true, i.e., pathways containing SNPs for which the samples exhibit greater within-class similarity than across classes. Importantly, PoDA improves on existing single-SNP and SNP-set enrichment analyses, in that it does not require the SNPs in a pathway to exhibit independent main effects. This permits PoDA to reveal pathways in which epistatic interactions drive risk. In this paper, we detail the PoDA method and apply it to two GWAS: one of breast cancer and the other of liver cancer. The results obtained strongly suggest that there exist pathway-wide genomic differences that contribute to disease susceptibility. PoDA thus provides an analytical tool that is complementary to existing techniques and has the power to enrich our understanding of disease genomics at the systems-level.  相似文献   

9.
The widely used pathway-based approach for interpreting Genome Wide Association Studies (GWAS), assumes that since function is executed through the interactions of multiple genes, different perturbations of the same pathway would result in a similar phenotype. This assumption, however, was not systemically assessed on a large scale. To determine whether SNPs associated with a given complex phenotype affect the same pathways more than expected by chance, we analyzed 368 phenotypes that were studied in >5000 GWAS. We found 216 significant phenotype-pathway associations between 70 of the phenotypes we analyzed and known pathways. We also report 391 strong phenotype-phenotype associations between phenotypes that are affected by the same pathways. While some of these associations confirm previously reported connections, others are new and could shed light on the molecular basis of these diseases. Our findings confirm that phenotype-associated SNPs cluster into pathways much more than expected by chance. However, this is true for <20% (70/368) of the phenotypes. Different types of phenotypes show markedly different tendencies: Virtually all autoimmune phenotypes show strong clustering of SNPs into pathways, while most cancers and metabolic conditions, and all electrophysiological phenotypes, could not be significantly associated with any pathway despite being significantly associated with a large number of SNPs. While this may be due to missing data, it may also suggest that these phenotypes could result only from perturbations of specific genes and not from other perturbations of the same pathway. Further analysis of pathway-associated versus gene-associated phenotypes is, therefore, needed in order to understand disease etiology and in order to promote better drug target selection.  相似文献   

10.
We performed a gene–smoking interaction analysis using families from an early-onset coronary artery disease cohort (GENECARD). This analysis was focused on validating and expanding results from previous studies implicating single nucleotide polymorphisms (SNPs) on chromosome 3 in smoking-mediated coronary artery disease. We analyzed 430 SNPs on chromosome 3 and identified 16 SNPs that showed a gene–smoking interaction at P < 0.05 using association in the presence of linkage—ordered subset analysis, a method that uses permutations of the data to empirically estimate the strength of the association signal. Seven of the 16 SNPs were in the Rho-GTPase pathway indicating a 1.87-fold enrichment for this pathway. A meta-analysis of gene–smoking interactions in three independent studies revealed that rs9289231 in KALRN had a Fisher’s combined P value of 0.0017 for the interaction with smoking. In a gene-based meta-analysis KALRN had a P value of 0.026. Finally, a pathway-based analysis of the association results using WebGestalt revealed several enriched pathways including the regulation of the actin cytoskeleton pathway as defined by the Kyoto Encyclopedia of Genes and Genomes.  相似文献   

11.
12.
13.
Nicotine dependence is the primary addictive stage of cigarette smoking. Although a lot of studies have been performed to explore the molecular mechanism underlying nicotine dependence, our understanding on this disorder is still far from complete. Over the past decades, an increasing number of candidate genes involved in nicotine dependence have been identified by different technical approaches, including the genetic association analysis. In this study, we performed a comprehensive collection of candidate genes reported to be genetically associated with nicotine dependence. Then, the biochemical pathways enriched in these genes were identified by considering the gene’s propensity to be related to nicotine dependence. One of the most widely used pathway enrichment analysis approach, over-representation analysis, ignores the function non-equivalence of genes in candidate gene set and may have low discriminative power in identifying some dysfunctional pathways. To overcome such drawbacks, we constructed a comprehensive human protein–protein interaction network, and then assigned a function weighting score to each candidate gene based on their network topological features. Evaluation indicated the function weighting score scheme was consistent with available evidence. Finally, the function weighting scores of the candidate genes were incorporated into pathway analysis to identify the dysfunctional pathways involved in nicotine dependence, and the interactions between pathways was detected by pathway crosstalk analysis. Compared to conventional over-representation-based pathway analysis tool, the modified method exhibited improved discriminative power and detected some novel pathways potentially underlying nicotine dependence. In summary, we conducted a comprehensive collection of genes associated with nicotine dependence and then detected the biochemical pathways enriched in these genes using a modified pathway enrichment analysis approach with function weighting score of candidate genes integrated. Our results may provide insight into the molecular mechanism underlying nicotine dependence.  相似文献   

14.
There is increasing evidence suggesting that higher intakes of fish or n-3 polyunsaturated fatty acids supplements may decrease the risk of preterm delivery (PTD). We hypothesized that genetic variants of the enzymes critical to fatty acids biosynthesis and metabolism may be associated with PTD. We genotyped 231 potentially functional single nucleotide polymorphisms (SNPs) and tagSNPs in 9 genes (FADS1, FADS2, PTGS1, PTGS2, ALOX5, ALOX5AP, PTGES, PTGES2, and PTGES3) among 1,110 black mothers, including 542 mothers who delivered preterm (<37 weeks gestation) and 568 mothers who delivered full-term babies (≥37 weeks gestation) at Boston Medical Center. After excluding SNPs that are in complete linkage disequilibrium or have lower minor allele frequency (<1%) or call rate (<90%), we examined the association of 206 SNPs with PTD using multiple logistic regression models. We also imputed 190 HapMap SNPs via program MACH and examined their associations with PTD. Finally, we explored gene-level and pathway-level associations with PTD using the adaptive rank truncated product (ARTP) methods. A total of 21 SNPs were associated with PTD (p value ranging from 0.003 to 0.05), including 3 imputed SNPs. Gene-level ARTP statistics indicated that the gene PTGES2 was significantly associated with PTD with a gene-based p value equal to 0.01. No pathway-based association was found. In this large and comprehensive candidate gene study, we found a modest association of genes in fatty acid metabolism pathway with PTD. Further investigation of these gene polymorphisms jointly with fatty acid measures and other genetic factors would help better understand the pathogenesis of PTD.  相似文献   

15.
OBJECTIVE: Function of the renin-angiotensin system is important to human hypertension, but its genetic etiology remains elusive. We set out to examine a hypothesis that multiple genetic variants in the system act together in blood pressure regulation, via intermediate phenotypes such as blood pressure reactivity. METHODS: A sample of 531 hypertensive cases and 417 controls was selected from the HyperGEN study. Hypertension-related traits including blood pressure responses to challenges to math test, handgrip and postural change (mathBP, gripBP, and postBP), and body mass index (BMI) were analyzed for association with 10 single nucleotide polymorphisms (SNPs) in the angiotensinogen (AGT) gene. Single-marker and haplotype analyses were performed to examine the effects of both individual and multiple variants. Multiple-trait profiling was used to assess interaction of latent intermediate factors with susceptible haplotypes. RESULTS: In Blacks, two SNPs in exon 5 and 3'UTR showed significant association with gripBP, and two promoter SNPs were strongly associated with postBP. In Whites, only borderline association was found for 2 promoter SNPs with mathBP. Haplotype analyses in Blacks confirmed association with gripBP, and detected significant association of a haplotype to BMI (p=0.029). With the interactions modeled, haplotype associations found in Blacks remain significant, while significant associations to BMI (p=0.009) and gripSBP emerged in Whites. CONCLUSION: Genetic variants in regulatory regions of AGT showed strong association with blood pressure reactivity. Interaction of promoter and genic SNPs in AGT revealed collective action of multiple variants on blood pressure reactivity and BMI both in Blacks and in Whites, possibly following different pathways.  相似文献   

16.
Recent results indicate that genome-wide association studies (GWAS) have the potential to explain much of the heritability of common complex phenotypes, but methods are lacking to reliably identify the remaining associated single nucleotide polymorphisms (SNPs). We applied stratified False Discovery Rate (sFDR) methods to leverage genic enrichment in GWAS summary statistics data to uncover new loci likely to replicate in independent samples. Specifically, we use linkage disequilibrium-weighted annotations for each SNP in combination with nominal p-values to estimate the True Discovery Rate (TDR = 1−FDR) for strata determined by different genic categories. We show a consistent pattern of enrichment of polygenic effects in specific annotation categories across diverse phenotypes, with the greatest enrichment for SNPs tagging regulatory and coding genic elements, little enrichment in introns, and negative enrichment for intergenic SNPs. Stratified enrichment directly leads to increased TDR for a given p-value, mirrored by increased replication rates in independent samples. We show this in independent Crohn''s disease GWAS, where we find a hundredfold variation in replication rate across genic categories. Applying a well-established sFDR methodology we demonstrate the utility of stratification for improving power of GWAS in complex phenotypes, with increased rejection rates from 20% in height to 300% in schizophrenia with traditional FDR and sFDR both fixed at 0.05. Our analyses demonstrate an inherent stratification among GWAS SNPs with important conceptual implications that can be leveraged by statistical methods to improve the discovery of loci.  相似文献   

17.
SNP-based gene-set enrichment analysis from single nucleotide polymorphisms, or GSEA-SNP, is a tool to identify candidate genes based on enrichment analysis of sets of genes rather than single SNP associations. The objective of this study was to identify modest-effect genes associated with Mycobacterium avium subsp. paratuberculosis (Map) tissue infection or fecal shedding using GSEA-SNP applied to KEGG pathways or Gene Ontology (GO) gene sets. The Illumina Bovine SNP50 BeadChip was used to genotype 209 Holstein cows for the GSEA-SNP analyses. For each of 13,744 annotated genes genome-wide located within 50 kb of a Bovine SNP50 SNP, the single SNP with the highest Cochran-Armitage Max statistic was used as a proxy statistic for that gene’s strength of affiliation with Map. Gene-set enrichment was tested using a weighted Kolmogorov-Smirnov-like running sum statistic with data permutation to adjust for multiple testing. For tissue infection and fecal shedding, no gene sets in KEGG pathways or in GO sets for molecular function or cellular component were enriched for signal. The GO biological process gene set for positive regulation of cell motion (GO:0051272, q = 0.039, 5/11 genes contributing to the core enrichment) was enriched for Map tissue infection, while no GO biological process gene sets were enriched for fecal shedding. GSEA-SNP complements traditional SNP association approaches to identify genes of modest effects as well as genes with larger effects as demonstrated by the identification of one locus that we previously found to be associated with Map tissue infection using a SNP-by-SNP genome-wide association study.  相似文献   

18.
Li H 《Human genetics》2012,131(9):1395-1401
Many common human diseases are complex and are expected to be highly heterogeneous, with multiple causative loci and multiple rare and common variants at some of the causative loci contributing to the risk of these diseases. Data from the genome-wide association studies (GWAS) and metadata such as known gene functions and pathways provide the possibility of identifying genetic variants, genes and pathways that are associated with complex phenotypes. Single-marker-based tests have been very successful in identifying thousands of genetic variants for hundreds of complex phenotypes. However, these variants only explain very small percentages of the heritabilities. To account for the locus- and allelic-heterogeneity, gene-based and pathway-based tests can be very useful in the next stage of the analysis of GWAS data. U-statistics, which summarize the genomic similarity between pair of individuals and link the genomic similarity to phenotype similarity, have proved to be very useful for testing the associations between a set of single nucleotide polymorphisms and the phenotypes. Compared to single marker analysis, the advantages afforded by the U-statistics-based methods is large when the number of markers involved is large. We review several formulations of U-statistics in genetic association studies and point out the links of these statistics with other similarity-based tests of genetic association. Finally, potential application of U-statistics in analysis of the next-generation sequencing data and rare variants association studies are discussed.  相似文献   

19.
20.
Pathway analyses of genome-wide association studies aggregate information over sets of related genes, such as genes in common pathways, to identify gene sets that are enriched for variants associated with disease. We develop a model-based approach to pathway analysis, and apply this approach to data from the Wellcome Trust Case Control Consortium (WTCCC) studies. Our method offers several benefits over existing approaches. First, our method not only interrogates pathways for enrichment of disease associations, but also estimates the level of enrichment, which yields a coherent way to promote variants in enriched pathways, enhancing discovery of genes underlying disease. Second, our approach allows for multiple enriched pathways, a feature that leads to novel findings in two diseases where the major histocompatibility complex (MHC) is a major determinant of disease susceptibility. Third, by modeling disease as the combined effect of multiple markers, our method automatically accounts for linkage disequilibrium among variants. Interrogation of pathways from eight pathway databases yields strong support for enriched pathways, indicating links between Crohn''s disease (CD) and cytokine-driven networks that modulate immune responses; between rheumatoid arthritis (RA) and “Measles” pathway genes involved in immune responses triggered by measles infection; and between type 1 diabetes (T1D) and IL2-mediated signaling genes. Prioritizing variants in these enriched pathways yields many additional putative disease associations compared to analyses without enrichment. For CD and RA, 7 of 8 additional non-MHC associations are corroborated by other studies, providing validation for our approach. For T1D, prioritization of IL-2 signaling genes yields strong evidence for 7 additional non-MHC candidate disease loci, as well as suggestive evidence for several more. Of the 7 strongest associations, 4 are validated by other studies, and 3 (near IL-2 signaling genes RAF1, MAPK14, and FYN) constitute novel putative T1D loci for further study.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号