首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
It has been suggested that pathway analysis can complement single-SNP analysis in exploring genomewide association data. Pathway analysis incorporates the available biological knowledge of genes and SNPs and is expected to improve the chances of revealing the underlying genetic architecture of complex traits. Methods for pathway analysis can be classified as competitive (enrichment) or self-contained (association) according to the hypothesis tested. Although association tests are statistically more powerful than enrichment tests they can be difficult to calibrate because biases in analysis accumulate across multiple SNPs or genes. Furthermore, enrichment tests can be more scientifically relevant than association tests, as they detect pathways with relatively more evidence for association than the remaining genes. Here we show how some well known association tests can be simply adapted to test for enrichment, and compare their performance to some established enrichment tests. We propose versions of the Adaptive Rank Truncated Product (ARTP), Tail Strength Measure and Fisher's combination of p-values for testing the enrichment null hypothesis. We compare the behaviour of these proposed methods with the established Hypergeometric Test and Gene-Set Enrichment Analysis (GSEA). The results of the simulation study show that the modified version of the ARTP method has generally the best performance across the situations considered. The methods were also applied for finding enriched pathways for body mass index (BMI) and platelet function phenotypes. The pathway analysis of BMI identified the Vasoactive Intestinal Peptide pathway as significantly associated with BMI. This pathway has been previously reported as associated with BMI and the risk of obesity. The ARTP method was the method that identified the largest number of enriched pathways across all tested pathway databases and phenotypes. The simulation and data application results are in agreement with previous work on association tests and suggests that the ARTP should be preferred for both enrichment and association testing.  相似文献   

3.
Recent genome‐wide association (GWA) studies have identified a number of novel genes/variants predisposing to obesity. However, most GWA studies have focused on individual single‐nucleotide polymorphism (SNPs)/genes with a strong statistical association with a phenotypic trait without considering potential biological interplay of the tested genes. In this study, we performed biological pathway‐based GWA analysis for BMI and body fat mass. We used individual level genotype data generated from 1,000 unrelated US whites that were genotyped for ~500,000 SNPs. Statistical analysis of pathways was performed using a modification of the Gene Set Enrichment Algorithm. A total of 963 pathways extracted from the BioCarta, Kyoto Encyclopedia of Genes and Genomes (KEGG), Ambion GeneAssist, and Gene Ontology (GO) databases were analyzed. Among all of the pathways analyzed, the vasoactive intestinal peptide (VIP) pathway was most strongly associated with fat mass (nominal P = 0.0009) and was the third most strongly associated pathway with BMI (nominal P = 0.0006). After multiple testing correction, the VIP pathway achieved false‐discovery rate (FDR) q values of 0.042 and 0.120 for fat mass and BMI, respectively. Our study is the first to demonstrate that the VIP pathway may play an important role in development of obesity. The study also highlights the importance of pathway‐based GWA analysis in identification of additional genes/variants for complex human diseases.  相似文献   

4.
5.
6.
Genome-wide association studies (GWAS) have identified multiple common variants associated with body mass index (BMI). In this study, we tested 23 genotyped GWAS-significant SNPs (p-value<5*10-8) for longitudinal associations with BMI during childhood (3-17 years) and adulthood (18-45 years) for 658 subjects. We also proposed a heuristic forward search for the best joint effect model to explain the longitudinal BMI variation. After using false discovery rate (FDR) to adjust for multiple tests, childhood and adulthood BMI were found to be significantly associated with six SNPs each (q-value<0.05), with one SNP associated with both BMI measurements: KCTD15 rs29941 (q-value<7.6*10-4). These 12 SNPs are located at or near genes either expressed in the brain (BDNF, KCTD15, TMEM18, MTCH2, and FTO) or implicated in cell apoptosis and proliferation (FAIM2, MAP2K5, and TFAP2B). The longitudinal effects of FAIM2 rs7138803 on childhood BMI and MAP2K5 rs2241423 on adulthood BMI decreased as age increased (q-value<0.05). The FTO candidate SNPs, rs6499640 at the 5 '-end and rs1121980 and rs8050136 downstream, were associated with childhood and adulthood BMI, respectively, and the risk effects of rs6499640 and rs1121980 increased as birth weight decreased. The best joint effect model for childhood and adulthood BMI contained 14 and 15 SNPs each, with 11 in common, and the percentage of explained variance increased from 0.17% and 9.0*10(-6)% to 2.22% and 2.71%, respectively. In summary, this study evidenced the presence of long-term major effects of genes on obesity development, implicated in pathways related to neural development and cell metabolism, and different sets of genes associated with childhood and adulthood BMI, respectively. The gene effects can vary with age and be modified by prenatal development. The best joint effect model indicated that multiple variants with effects that are weak or absent alone can nevertheless jointly exert a large longitudinal effect on BMI.  相似文献   

7.
Vattikuti S  Guo J  Chow CC 《PLoS genetics》2012,8(3):e1002637
We used a bivariate (multivariate) linear mixed-effects model to estimate the narrow-sense heritability (h(2)) and heritability explained by the common SNPs (h(g)(2)) for several metabolic syndrome (MetS) traits and the genetic correlation between pairs of traits for the Atherosclerosis Risk in Communities (ARIC) genome-wide association study (GWAS) population. MetS traits included body-mass index (BMI), waist-to-hip ratio (WHR), systolic blood pressure (SBP), fasting glucose (GLU), fasting insulin (INS), fasting trigylcerides (TG), and fasting high-density lipoprotein (HDL). We found the percentage of h(2) accounted for by common SNPs to be 58% of h(2) for height, 41% for BMI, 46% for WHR, 30% for GLU, 39% for INS, 34% for TG, 25% for HDL, and 80% for SBP. We confirmed prior reports for height and BMI using the ARIC population and independently in the Framingham Heart Study (FHS) population. We demonstrated that the multivariate model supported large genetic correlations between BMI and WHR and between TG and HDL. We also showed that the genetic correlations between the MetS traits are directly proportional to the phenotypic correlations.  相似文献   

8.

Background

Recently mixed linear models are used to address the issue of “missing" heritability in traditional Genome-wide association studies (GWAS). The models assume that all single-nucleotide polymorphisms (SNPs) are associated with the phenotypes of interest. However, it is more common that only a small proportion of SNPs have significant effects on the phenotypes, while most SNPs have no or very small effects. To incorporate this feature, we propose an efficient Hierarchical Bayesian Model (HBM) that extends the existing mixed models to enforce automatic selection of significant SNPs. The HBM models the SNP effects using a mixture distribution of a point mass at zero and a normal distribution, where the point mass corresponds to those non-associative SNPs.

Results

We estimate the HBM using Gibbs sampling. The estimation performance of our method is first demonstrated through two simulation studies. We make the simulation setups realistic by using parameters fitted on the Framingham Heart Study (FHS) data. The simulation studies show that our method can accurately estimate the proportion of SNPs associated with the simulated phenotype and identify these SNPs, as well as adapt to certain model mis-specification than the standard mixed models. In addition, we analyze data from the FHS and the Health and Retirement Study (HRS) to study the association between Body Mass Index (BMI) and SNPs on Chromosome 16, and replicate the identified genetic associations. The analysis of the FHS data identifies 0.3% SNPs on Chromosome 16 that affect BMI, including rs9939609 and rs9939973 on the FTO gene. These two SNPs are in strong linkage disequilibrium with rs1558902 (Rsq =0.901 for rs9939609 and Rsq =0.905 for rs9939973), which has been reported to be linked with obesity in previous GWAS. We then replicate the findings using the HRS data: the analysis finds 0.4% of SNPs associated with BMI on Chromosome 16. Furthermore, around 25% of the genes that are identified to be associated with BMI are common between the two studies.

Conclusions

The results demonstrate that the HBM and the associated estimation algorithm offer a powerful tool for identifying significant genetic associations with phenotypes of interest, among a large number of SNPs that are common in modern genetics studies.  相似文献   

9.
干扰素信号传导通路与其基因组多态性网络模型的建立   总被引:1,自引:0,他引:1  
崔建军  田庚善  田地  曾争 《遗传》2008,30(6):788-794
通过检索Pubmed、Embase数据库, 整合文献资料, 运用Teranode Design Suite(TDS)生物软件, 建立干扰素(IFN)发挥生物学作用的信号传导通路。应用SNP Trawler软件搜索通路中相关基因的SNP信息, 并以属性形式在通路模型中给以显示。结果构建了融合基因的遗传信息尤其是SNP相关信息的IFN发挥生物学作用的信号传导通路的网络模型, 此模型共包含JAK-STAT, MAPK-p38和PI3K 3条主要传导通路, 干扰素通过不同的信号传导通路发挥不同的生物活性, 其中Ⅰ型干扰素通过这3条通路发挥抗病毒、抗细胞增殖及免疫调节等重要作用, 而Ⅱ型干扰素则通过JAK-STAT和MAPK-p38路径发挥生物学作用, Ⅲ型干扰素则仅通过PI3K通路发挥生物学作用; 这3条传导通路包含98个基因和19 693个SNPs, 组成了复杂的基因间相互作用网络。此通路模型的成功建立, 一方面为研究SNP对IFN生物学作用的影响而预测IFN疗效提供理论基础, 另一方面为实现个体化诊疗、发现新的药物作用靶点及开发新的药物奠定基础。  相似文献   

10.
Individuals with fast nicotine metabolism typically smoke more and thus have a greater risk for smoking-induced diseases. Further, the efficacy of smoking cessation pharmacotherapy is dependent on the rate of nicotine metabolism. Our objective was to use nicotine metabolite ratio (NMR), an established biomarker of nicotine metabolism rate, in a genome-wide association study (GWAS) to identify novel genetic variants influencing nicotine metabolism. A heritability estimate of 0.81 (95% CI 0.70–0.88) was obtained for NMR using monozygotic and dizygotic twins of the FinnTwin cohort. We performed a GWAS in cotinine-verified current smokers of three Finnish cohorts (FinnTwin, Young Finns Study, FINRISK2007), followed by a meta-analysis of 1518 subjects, and annotated the genome-wide significant SNPs with methylation quantitative loci (meQTL) analyses. We detected association on 19q13 with 719 SNPs exceeding genome-wide significance within a 4.2 Mb region. The strongest evidence for association emerged for CYP2A6 (min p = 5.77E-86, in intron 4), the main metabolic enzyme for nicotine. Other interesting genes with genome-wide significant signals included CYP2B6, CYP2A7, EGLN2, and NUMBL. Conditional analyses revealed three independent signals on 19q13, all located within or in the immediate vicinity of CYP2A6. A genetic risk score constructed using the independent signals showed association with smoking quantity (p = 0.0019) in two independent Finnish samples. Our meQTL results showed that methylation values of 16 CpG sites within the region are affected by genotypes of the genome-wide significant SNPs, and according to causal inference test, for some of the SNPs the effect on NMR is mediated through methylation. To our knowledge, this is the first GWAS on NMR. Our results enclose three independent novel signals on 19q13.2. The detected CYP2A6 variants explain a strikingly large fraction of variance (up to 31%) in NMR in these study samples. Further, we provide evidence for plausible epigenetic mechanisms influencing NMR.  相似文献   

11.
Many common diseases are accompanied by disturbances in biochemical traits. Identifying the genetic determinants could provide novel insights into disease mechanisms and reveal avenues for developing new therapies. Here, we report a genome-wide association analysis for commonly measured serum and urine biochemical traits. As part of the WTCCC, 500,000 SNPs genome wide were genotyped in 1955 hypertensive individuals characterized for 25 serum and urine biochemical traits. For each trait, we assessed association with individual SNPs, adjusting for age, sex, and BMI. Lipid measurements were further examined in a meta-analysis of genome-wide data from a type 2 diabetes scan. The most promising associations were examined in two epidemiological cohorts. We discovered association between serum urate and SLC2A9, a glucose transporter (p = 2 x 10(-15)) and confirmed this in two independent cohorts, GRAPHIC study (p = 9 x 10(-15)) and TwinsUK (p = 8 x 10(-19)). The odds ratio for hyperuricaemia (defined as urate >0.4 mMol/l) is 1.89 (95% CI = 1.36-2.61) per copy of common allele. We also replicated many genes previously associated with serum lipids and found previously recognized association between LDL levels and SNPs close to genes encoding PSRC1 and CELSR2 (p = 1 x 10(-7)). The common allele was associated with a 6% increase in nonfasting serum LDL. This region showed increased association in the meta-analysis (p = 4 x 10(-14)). This finding provides a potential biological mechanism for the recent association of this same allele of the same SNP with increased risk of coronary disease.  相似文献   

12.
Haw R  Hermjakob H  D'Eustachio P  Stein L 《Proteomics》2011,11(18):3598-3613
Reactome (http://www.reactome.org) is an open-source, expert-authored, peer-reviewed, manually curated database of reactions, pathways and biological processes. We provide an intuitive web-based user interface to pathway knowledge and a suite of data analysis tools. The Pathway Browser is a Systems Biology Graphical Notation-like visualization system that supports manual navigation of pathways by zooming, scrolling and event highlighting, and that exploits PSI Common Query Interface web services to overlay pathways with molecular interaction data from the Reactome Functional Interaction Network and interaction databases such as IntAct, ChEMBL and BioGRID. Pathway and expression analysis tools employ web services to provide ID mapping, pathway assignment and over-representation analysis of user-supplied data sets. By applying Ensembl Compara to curated human proteins and reactions, Reactome generates pathway inferences for 20 other species. The Species Comparison tool provides a summary of results for each of these species as a table showing numbers of orthologous proteins found by pathway from which users can navigate to inferred details for specific proteins and reactions. Reactome's diverse pathway knowledge and suite of data analysis tools provide a platform for data mining, modeling and analysis of large-scale proteomics data sets. This Tutorial is part of the International Proteomics Tutorial Programme (IPTP 8).  相似文献   

13.
Bipolar disorder (BPD) is a complex psychiatric trait with high heritability. Despite efforts through conducting genome-wide association (GWA) studies, the success of identifying susceptibility loci for BPD has been limited, which is partially attributed to the complex nature of its pathogenesis. Pathway-based analytic strategy is a powerful tool to explore joint effects of gene sets within specific biological pathways. Additionally, to incorporate other aspects of genomic data into pathway analysis may further enhance our understanding for the underlying mechanisms for BPD. Patterns of DNA methylation play important roles in regulating gene expression and function. A commonly observed phenomenon, allele-specific methylation (ASM) describes the associations between genetic variants and DNA methylation patterns. The present study aimed to identify biological pathways that are involve in the pathogenesis of BPD while incorporating brain specific ASM information in pathway analysis using two large-scale GWA datasets in Caucasian populations. A weighting scheme was adopted to take ASM information into consideration for each pathway. After multiple testing corrections, we identified 88 and 15 enriched pathways for their biological relevance for BPD in the Genetic Association Information Network (GAIN) and the Wellcome Trust Case Control Consortium dataset, respectively. Many of these pathways were significant only when applying the weighting scheme. Three ion channel related pathways were consistently identified in both datasets. Results in the GAIN dataset also suggest for the roles of extracellular matrix in brain for BPD. Findings from Gene Ontology (GO) analysis exhibited functional enrichment among genes of non-GO pathways in activity of gated channel, transporter, and neurotransmitter receptor. We demonstrated that integrating different data sources with pathway analysis provides an avenue to identify promising and novel biological pathways for exploring the underlying molecular mechanisms for bipolar disorder. Further basic research can be conducted to target the biological mechanisms for the identified genes and pathways.  相似文献   

14.
Large genome-wide association studies (GWAS) have identified many genetic loci associated with risk for myocardial infarction (MI) and coronary artery disease (CAD). Concurrently, efforts such as the National Institutes of Health (NIH) Roadmap Epigenomics Project and the Encyclopedia of DNA Elements (ENCODE) Consortium have provided unprecedented data on functional elements of the human genome. In the present study, we systematically investigate the biological link between genetic variants associated with this complex disease and their impacts on gene function. First, we examined the heritability of MI/CAD according to genomic compartments. We observed that single nucleotide polymorphisms (SNPs) residing within nearby regulatory regions show significant polygenicity and contribute between 59–71% of the heritability for MI/CAD. Second, we showed that the polygenicity and heritability explained by these SNPs are enriched in histone modification marks in specific cell types. Third, we found that a statistically higher number of 45 MI/CAD-associated SNPs that have been identified from large-scale GWAS studies reside within certain functional elements of the genome, particularly in active enhancer and promoter regions. Finally, we observed significant heterogeneity of this signal across cell types, with strong signals observed within adipose nuclei, as well as brain and spleen cell types. These results suggest that the genetic etiology of MI/CAD is largely explained by tissue-specific regulatory perturbation within the human genome.  相似文献   

15.
For most complex traits, results from genome-wide association studies show that the proportion of the phenotypic variance attributable to the additive effects of individual SNPs, that is, the heritability explained by the SNPs, is substantially less than the estimate of heritability obtained by standard methods using correlations between relatives. This difference has been called the “missing heritability”. One explanation is that heritability estimates from family (including twin) studies are biased upwards. Zuk et al. revisited overestimation of narrow sense heritability from twin studies as a result of confounding with non-additive genetic variance. They propose a limiting pathway (LP) model that generates significant epistatic variation and its simple parametrization provides a convenient way to explore implications of epistasis. They conclude that over-estimation of narrow sense heritability from family data (‘phantom heritability’) may explain an important proportion of missing heritability. We show that for highly heritable quantitative traits large phantom heritability estimates from twin studies are possible only if a large contribution of common environment is assumed. The LP model is underpinned by strong assumptions that are unlikely to hold, including that all contributing pathways have the same mean and variance and are uncorrelated. Here, we relax the assumptions that underlie the LP model to be more biologically plausible. Together with theoretical, empirical, and pragmatic arguments we conclude that in outbred populations the contribution of additive genetic variance is likely to be much more important than the contribution of non-additive variance.  相似文献   

16.
17.
We have previously identified and mapped porcine expressed sequence tags (ESTs) derived from genes that are preferentially expressed in liver. The aim of the present study was to identify single nucleotide polymorphisms (SNPs) in porcine genes encoding enzymes in hepatic metabolic pathways and use the SNPs for mapping. Furthermore, these genes, which are involved in utilization and partitioning of nutrients, were examined for their effects on carcass and meat quality traits by linkage analyses. In total, 100 ESTs were screened for SNPs by single strand conformation polymorphism analyses across a diverse panel of animals with a 36% success rate. Twelve of 36 polymorphic loci segregated in a three-generation Duroc x Berlin Miniature Pig (F2) resource population, the DUMI resource population, and were genetically mapped. Interval mapping of the corresponding chromosomes was performed to verify mapping of the genes within quantitative trait loci (QTL) regions detected in this resource population. QTL with genome-wide significance were detected in the vicinity of GNMT, ESTL147 and HGD. These loci therefore are positional candidate genes.  相似文献   

18.
The aim of this study was to investigate a series of single-nucleotide polymorphisms (SNPs) in the genes MC2R, MC3R, MC4R, MC5R, POMC, and ENPP1 for association with obesity. Twenty-five SNPs (2-7 SNPs/gene) were genotyped in 246 Finns with extreme obesity (BMI > or = 40 kg/m2) and in 481 lean subjects (BMI 20-25 kg/m2). Of the obese subjects, 23% had concomitant type 2 diabetes. SNPs and SNP haplotypes were tested for association with obesity and type 2 diabetes. Allele frequencies differed between obese and lean subjects for two SNPs in the ENPP1 gene, rs1800949 (P = 0.006) and rs943003 (P = 0.0009). These SNPs are part of a haplotype (rs1800949 C-rs943003 A), which was observed more frequently in lean subjects compared to obese subjects (P = 0.0007). Weaker associations were detected between the SNPs rs1541276 in the MC5R, rs1926065 in the MC3R genes and obesity (P = 0.04 and P = 0.03, respectively), and between SNPs rs2236700 in the MC5R, rs2118404 in the POMC, rs943003 in the ENPP1 genes and type 2 diabetes (P = 0.03, P = 0.02 and P = 0.02, respectively); these associations did not, however, remain significant after correction for multiple testing. In conclusion, a previously unexplored ENPP1 haplotype composed of SNPs rs1800949 and rs943003 showed suggestive evidence for association with adult-onset morbid obesity in Finns. In this study, we did not find association between the frequently studied ENPP1 K121Q variant, nor SNPs in the MCR or POMC genes and obesity or type 2 diabetes.  相似文献   

19.
Genome-wide association studies (GWAS) are designed to identify the portion of single-nucleotide polymorphisms (SNPs) in genome sequences associated with a complex trait. Strategies based on the gene list enrichment concept are currently applied for the functional analysis of GWAS, according to which a significant overrepresentation of candidate genes associated with a biological pathway is used as a proxy to infer overrepresentation of candidate SNPs in the pathway. Here we show that such inference is not always valid and introduce the program SNP2GO, which implements a new method to properly test for the overrepresentation of candidate SNPs in biological pathways.  相似文献   

20.
Identifying the genetic architecture underlying complex phenotypes is a notoriously difficult problem that often impedes progress in understanding adaptive eco‐evolutionary processes in natural populations. Host–parasite interactions are fundamentally important drivers of evolutionary processes, but a lack of understanding of the genes involved in the host's response to chronic parasite insult makes it particularly difficult to understand the mechanisms of host life history trade‐offs and the adaptive dynamics involved. Here, we examine the genetic basis of gastrointestinal nematode (Trichostrongylus tenuis) burden in 695 red grouse (Lagopus lagopus scotica) individuals genotyped at 384 genome‐wide SNPs. We first use genome‐wide association to identify individual SNPs associated with nematode burden. We then partition genome‐wide heritability to identify chromosomes with greater heritability than expected from gene content, due to harbouring a multitude of additive SNPs with individually undetectable effects. We identified five SNPs on five chromosomes that accounted for differences of up to 556 worms per bird, but together explained at best 4.9% of the phenotypic variance. These SNPs were closely linked to genes representing a range of physiological processes including the immune system, protein degradation and energy metabolism. Genome partitioning indicated genome‐wide heritability of up to 29% and three chromosomes with excess heritability of up to 4.3% (total 8.9%). These results implicate SNPs and novel genomic regions underlying nematode burden in this system and suggest that this phenotype is somewhere between being based on few large‐effect genes (oligogenic) and based on a large number of genes with small individual but large combined effects (polygenic).  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号