Similar literature
 20 similar articles found; search took 15 ms
1.
Genome-wide association studies (GWAS) with hundreds of thousands of single nucleotide polymorphisms (SNPs) are popular strategies to reveal the genetic basis of human complex diseases. Despite many successes of GWAS, it is well recognized that new analytical approaches have to be integrated to achieve their full potential. Starting with a list of SNPs found to be associated with disease in GWAS, here we propose a novel methodology to identify functionally important KEGG pathways through the genes within these pathways, where these genes are obtained from SNP analysis. Our methodology is based on functionalization of important SNPs to identify affected genes and disease-related pathways. We have tested our methodology on the WTCCC Rheumatoid Arthritis (RA) dataset and identified: i) previously known RA-related KEGG pathways (e.g., Toll-like receptor signaling, Jak-STAT signaling, Antigen processing, Leukocyte transendothelial migration and MAPK signaling pathways); ii) additional KEGG pathways (e.g., Pathways in cancer, Neurotrophin signaling, Chemokine signaling pathways) as associated with RA. Furthermore, these newly found pathways included genes which are targets of RA-specific drugs. Whereas GWAS analysis alone identified 14 of the 83 drug target genes, the newly found functionally important KEGG pathways led to the discovery of 25 of the 83 genes known to be used as drug targets for the treatment of RA. Among the previously known pathways, we identified additional genes associated with RA (e.g. Antigen processing and presentation, Tight junction). Importantly, within these pathways, the associations between some of these additionally found genes, such as HLA-C, HLA-G, PRKCQ, PRKCZ, TAP1, TAP2, and RA were verified either by the OMIM database or by literature retrieved from the NCBI PubMed module.
With whole-genome sequencing on the horizon, we show that the full potential of GWAS can be achieved by integrating pathway and network-oriented analysis and prior knowledge from functional properties of a SNP.

2.
Genome-wide association studies (GWAS) have identified loci reproducibly associated with pulmonary diseases; however, the molecular mechanisms underlying these associations are largely unknown. The objectives of this study were to discover genetic variants affecting gene expression in human lung tissue, to refine susceptibility loci for asthma identified in GWAS, and to use the genetics of gene expression and network analyses to find key molecular drivers of asthma. We performed a genome-wide search for expression quantitative trait loci (eQTL) in 1,111 human lung samples. The lung eQTL dataset was then used to inform asthma genetic studies reported in the literature. The top-ranked lung eQTLs were integrated with the GWAS on asthma reported by the GABRIEL consortium to generate a Bayesian gene expression network for discovery of novel molecular pathways underpinning asthma. We detected 17,178 cis- and 593 trans- lung eQTLs, which can be used to explore the functional consequences of loci associated with lung diseases and traits. Some strong eQTLs are also asthma susceptibility loci. For example, rs3859192 on chr17q21 is robustly associated with the mRNA levels of GSDMA (P = 3.55 × 10^-151). The genetic-gene expression network identified the SOCS3 pathway as one of the key drivers of asthma. The eQTLs and gene networks identified in this study are powerful tools for elucidating the causal mechanisms underlying pulmonary disease. This data resource offers much-needed support to pinpoint the causal genes and characterize the molecular function of gene variants associated with lung diseases.

3.
Genome-wide association studies (GWAS) have identified a number of genetic variants associated with lung cancer risk. However, these loci explain only a small fraction of lung cancer heritability, and other variants with weak effects may be missed by the GWAS approach because of the stringent significance level imposed after multiple-comparison correction. In this study, to identify important pathways involved in lung carcinogenesis, we performed a two-stage pathway analysis of lung cancer GWAS in Han Chinese using the gene set enrichment analysis (GSEA) method. Pathways predefined in the BioCarta and KEGG databases were systematically evaluated in the Nanjing study (discovery stage: 1,473 cases and 1,962 controls), and the suggestive pathways were then validated in the Beijing study (replication stage: 858 cases and 1,115 controls). We found that four pathways (achPathway, metPathway, At1rPathway and rac1Pathway) were consistently significant in both studies; the P values for the combined dataset were 0.012, 0.010, 0.022 and 0.005, respectively. These results were stable in sensitivity analyses based on gene definition and gene overlaps between pathways. These findings may provide new insights into the etiology of lung cancer.

4.
5.
6.
Uncovering the underlying genetic component of any disease is key to the understanding of its pathophysiology and may open new avenues for development of therapeutic strategies and biomarkers. In the past several years, there has been an explosion of genome-wide association studies (GWAS) resulting in the discovery of novel candidate genes conferring risk for complex diseases, including neurodegenerative diseases. Despite this success, there still remains a substantial genetic component for many complex traits and conditions that is unexplained by the GWAS findings. Additionally, in many cases, the mechanism of action of the newly discovered disease risk variants is not inherently obvious. Furthermore, a genetic region with multiple genes may be identified via GWAS, making it difficult to discern the true disease risk gene. Several alternative approaches are proposed to overcome these potential shortcomings of GWAS, including the use of quantitative, biologically relevant phenotypes. Gene expression levels represent an important class of endophenotypes. Genetic linkage and association studies that utilize gene expression levels as endophenotypes determined that the expression levels of many genes are under genetic influence. This led to the postulate that there may exist many genetic variants that confer disease risk via modifying gene expression levels. Results from the handful of genetic studies which assess gene expression level endophenotypes in conjunction with disease risk suggest that this combined phenotype approach may both increase the power for gene discovery and lead to an enhanced understanding of their mode of action. This review summarizes the evidence in support of gene expression levels as promising endophenotypes in the discovery and characterization of novel candidate genes for complex diseases, which may also represent a novel approach in the genetic studies of Alzheimer's and other neurodegenerative diseases.

7.
Six genome-wide association studies (GWAS) and several gene expression studies have been conducted for Parkinson's disease (PD). However, only variants in MAPT and SNCA have been consistently replicated. To improve the utility of these approaches, we applied pathway analyses integrating both GWAS and gene expression. The top 5000 SNPs (p<0.01) from a joint analysis of three existing PD GWAS were identified and each assigned to a gene. For gene expression, rather than the traditional comparison of one anatomical region between sets of patients and controls, we identified differentially expressed genes between adjacent Braak regions in each individual and adjusted using average control expression profiles. Over-represented pathways were calculated using a hypergeometric statistical comparison. An integrated, systems meta-analysis of the over-represented pathways combined the expression and GWAS results using Fisher's combined probability test. Four of the top seven pathways from each approach were identical. The top three pathways in the meta-analysis, with their corrected p-values, were axonal guidance (p = 2.8E-07), focal adhesion (p = 7.7E-06) and calcium signaling (p = 2.9E-05). These results suggest that a systems biology (pathway) approach will provide additional insight into the genetic etiology of PD and that these pathways have both biological and statistical support for a role in PD.
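
The two statistical steps named in this abstract, a hypergeometric over-representation test and Fisher's combined probability test, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' pipeline: the function names, argument names and example counts are hypothetical, and the multiple-testing correction mentioned in the abstract is not reproduced.

```python
import math


def hypergeom_enrichment_p(n_universe, n_pathway, n_selected, n_overlap):
    """Upper-tail hypergeometric p-value P(X >= n_overlap).

    n_universe: genes in the background set (hypothetical argument names)
    n_pathway:  background genes that belong to the pathway
    n_selected: genes in the selected list (e.g. GWAS- or expression-derived)
    n_overlap:  selected genes that belong to the pathway
    """
    total = math.comb(n_universe, n_selected)
    upper = min(n_pathway, n_selected)
    tail = sum(
        math.comb(n_pathway, k) * math.comb(n_universe - n_pathway, n_selected - k)
        for k in range(n_overlap, upper + 1)
    )
    return tail / total


def fisher_combined_p(p_values):
    """Fisher's method: X = -2 * sum(ln p_i) follows chi-square with 2k df.

    For even degrees of freedom the chi-square survival function has the
    closed form exp(-x/2) * sum_{i<k} (x/2)^i / i!, so no SciPy is needed.
    """
    k = len(p_values)
    half_x = -sum(math.log(p) for p in p_values)
    return math.exp(-half_x) * sum(half_x ** i / math.factorial(i) for i in range(k))


# Hypothetical counts: 20,000 background genes, 150 in the pathway,
# 1,000 selected genes, 20 of them in the pathway.
p_gwas = hypergeom_enrichment_p(20_000, 150, 1_000, 20)
# Combine with a hypothetical expression-based p-value for the same pathway.
p_meta = fisher_combined_p([p_gwas, 0.01])
```

Fisher's method assumes the combined p-values are independent; whether that holds for GWAS-derived and expression-derived pathway results is a modelling choice, not something this sketch checks.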

8.
9.
10.
Intracranial aneurysm (IA) is a complex genetic disease for which, to date, 10 loci have been identified by linkage. Identification of the risk-conferring genes in the loci has proven difficult, since the regions often contain several hundreds of genes. An approach to prioritize positional candidate genes for further studies is to use gene expression data from diseased and nondiseased tissue. Genes that are not expressed, either in diseased or nondiseased tissue, are ranked as unlikely to contribute to the disease. We demonstrate an approach for integrating expression and genetic mapping data to identify likely pathways involved in the pathogenesis of a disease. We used expression profiles for IAs and nonaneurysmal intracranial arteries (IVs) together with the 10 reported linkage intervals for IA. Expressed genes were analyzed for membership in Kyoto Encyclopedia of Genes and Genomes (KEGG) biological pathways. The 10 IA loci harbor 1,858 candidate genes, of which 1,561 (84%) were represented on the microarrays. We identified 810 positional candidate genes for IA that were expressed in IVs or IAs. Pathway information was available for 294 of these genes and involved 32 KEGG biological function pathways represented on at least 2 loci. A likelihood-based score was calculated to rank pathways for involvement in the pathogenesis of IA. Adherens junction, MAPK, and Notch signaling pathways ranked high. Integration of gene expression profiles with genetic mapping data for IA provides an approach to identify candidate genes that are more likely to function in the pathology of IA.

11.
12.
Li C  Han J  Shang D  Li J  Wang Y  Wang Y  Zhang Y  Yao Q  Zhang C  Li K  Li X 《Gene》2012,503(1):101-109
Most methods for genome-wide association studies (GWAS) focus on discovering a single genetic variant, but the pathogenesis of complex diseases is thought to arise from the joint effect of multiple genetic variants. Information about pathway structure, such as the interactions and distances between gene products within pathways, can help us learn more about the functions and joint effect of genes associated with disease risk. We developed a novel sub-pathway based approach to study the joint effect of multiple genetic variants that are modestly associated with disease. The approach prioritized sub-pathways based on the significance values of single nucleotide polymorphisms (SNPs) and the interactions and distances between gene products within pathways. We applied the method to seven complex diseases. The results showed that our method can efficiently identify statistically significant sub-pathways associated with the pathogenesis of complex diseases. The approach identified sub-pathways that may inform the interpretation of GWAS data.

13.
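Melanoma is a highly invasive malignant skin tumor with a high metastasis rate and poor prognosis. Studying the biological characteristics of melanoma cells is of great importance for the treatment and control of melanoma. In this study, normal melanocytes from C57BL/6J mice and B16 melanoma cells were used as research subjects, and next-generation sequencing was used to analyze the differences in transcriptome expression between the two cell types and to screen for differentially expressed genes, providing a theoretical basis for subsequent research on the mechanism of melanoma formation. The sequencing data were analyzed by fold change and error rate, identifying 1,436 novel mRNAs and 4,086 differentially expressed known mRNAs. GO and KEGG database analyses showed that the differentially expressed mRNAs were involved in 149 regulatory pathways, mainly concentrated in disease regulation, cell cycle regulation and environmental information regulation. qRT-PCR and Western blotting showed that Pdgf-B, Integrin-β1 and Integrin-β5, which regulate cell proliferation and migration, and Mitf, Tyr, Tyrp1 and Tyrp2, which regulate the increase in melanin granules, were expressed at significantly higher levels in B16 cells than in normal melanocytes. The differentially expressed genes obtained in this study provide new candidate genes for subsequent melanoma research.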

14.
15.
The majority of the heritability of coronary artery disease (CAD) remains unexplained, despite recent successes of genome-wide association studies (GWAS) in identifying novel susceptibility loci. Integrating functional genomic data from a variety of sources with a large-scale meta-analysis of CAD GWAS may facilitate the identification of novel biological processes and genes involved in CAD, as well as clarify the causal relationships of established processes. Towards this end, we integrated 14 GWAS from the CARDIoGRAM Consortium and two additional GWAS from the Ottawa Heart Institute (25,491 cases and 66,819 controls) with 1) genetics of gene expression studies of CAD-relevant tissues in humans, 2) metabolic and signaling pathways from public databases, and 3) data-driven, tissue-specific gene networks from a multitude of human and mouse experiments. We not only detected CAD-associated gene networks of lipid metabolism, coagulation, immunity, and additional networks with no clear functional annotation, but also revealed key driver genes for each CAD network based on the topology of the gene regulatory networks. In particular, we found a gene network involved in antigen processing to be strongly associated with CAD. The key driver genes of this network included glyoxalase I (GLO1) and peptidylprolyl isomerase I (PPIL1), which we verified as regulatory by siRNA experiments in human aortic endothelial cells. Our results suggest genetic influences on a diverse set of both known and novel biological processes that contribute to CAD risk. The key driver genes for these networks highlight potential novel targets for further mechanistic studies and therapeutic interventions.

16.
Genome-wide association studies (GWAS) offer an unbiased means to understand the genetic basis of traits by identifying single nucleotide polymorphisms (SNPs) linked to causal variants of complex phenotypes. GWAS have identified a host of susceptibility SNPs associated with many important human diseases, including diseases associated with aging. In an effort to understand the genetics of broad resistance to age-associated diseases (i.e., 'wellness'), we performed a meta-analysis of human GWAS. Toward that end, we compiled 372 GWAS that identified 1775 susceptibility SNPs to 105 unique diseases and used these SNPs to create a genomic landscape of disease susceptibility. This map was constructed by partitioning the genome into 200 kb 'bins' and mapping the 1775 susceptibility SNPs to bins based on their genomic location. Investigation of these data revealed significant heterogeneity of disease association within the genome, with 92% of bins devoid of disease-associated SNPs. In contrast, 10 bins (0.06%) were significantly (P < 0.05) enriched for susceptibility to multiple diseases, 5 of which formed two highly significant peaks of disease association (P < 0.0001). These peaks mapped to the Major Histocompatibility (MHC) locus on 6p21 and the INK4/ARF (CDKN2a/b) tumor suppressor locus on 9p21.3. Provocatively, all 10 significantly enriched bins contained genes linked to either inflammation or cellular senescence pathways, and SNPs near regulators of senescence were particularly associated with diseases of aging (e.g., cancer, atherosclerosis, type 2 diabetes, glaucoma). This analysis suggests that germline genetic heterogeneity in the regulation of immunity and cellular senescence influences the human healthspan.
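
The binning step described in this abstract (partitioning the genome into 200 kb bins and assigning each susceptibility SNP to a bin by position) can be sketched as follows. This is an illustrative sketch only: the record layout `(chromosome, position, disease)`, the function names and the example records are assumptions, and the enrichment significance testing reported in the abstract is not reproduced here.

```python
from collections import defaultdict

BIN_SIZE = 200_000  # 200 kb, as stated in the abstract


def bin_snps(snps, bin_size=BIN_SIZE):
    """Group SNP records into fixed-width genomic bins.

    snps: iterable of (chromosome, position, disease) tuples (assumed layout).
    Returns a mapping (chromosome, bin_index) -> set of diseases seen in that bin.
    Integer division assigns positions 0..bin_size-1 to bin 0, and so on.
    """
    bins = defaultdict(set)
    for chrom, pos, disease in snps:
        bins[(chrom, pos // bin_size)].add(disease)
    return bins


def multi_disease_bins(bins, min_diseases=2):
    """Return the sorted keys of bins associated with at least min_diseases diseases."""
    return sorted(key for key, diseases in bins.items() if len(diseases) >= min_diseases)


# Hypothetical records, for illustration only.
example = [
    ("chr6", 32_400_000, "disease A"),
    ("chr6", 32_450_000, "disease B"),
    ("chr9", 22_000_000, "disease C"),
]
shared = multi_disease_bins(bin_snps(example))
```

Counting distinct diseases per bin is only the first step; deciding which bins are enriched beyond chance requires a null model, which the abstract does not specify in detail.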

17.
Coronary artery disease (CAD) is a complex human disease, involving multiple genes and their nonlinear interactions, which often act in a modular fashion. Genome-wide single nucleotide polymorphism (SNP) profiling provides an effective technique to unravel these underlying genetic interplays or their functional involvements for CAD. This study aimed to identify the susceptible pathways and modules for CAD based on SNP omics. First, the Wellcome Trust Case Control Consortium (WTCCC) SNP datasets of CAD and control samples were used to assess the joint effect of multiple genetic variants at the pathway level, using a logistic kernel machine regression model. Then, an expanded genetic network was constructed by integrating statistical gene–gene interactions involved in these susceptible pathways with their protein–protein interaction (PPI) knowledge. Finally, risk functional modules were identified by decomposition of the network. Of 276 KEGG pathways analyzed, 6 pathways were found to have a significant effect on CAD. Other than glycerolipid metabolism, glycosaminoglycan biosynthesis, and cardiac muscle contraction pathways, three pathways related to other diseases were also revealed, including Alzheimer's disease, non-alcoholic fatty liver disease, and Huntington's disease. A genetic epistatic network of 95 genes was further constructed using the abovementioned integrative approach. Of 10 functional modules derived from the network, 6 have been annotated to phospholipase C activity and cell adhesion molecule binding, which also have known functional involvement in Alzheimer's disease. These findings indicate an overlap of the underlying molecular mechanisms between CAD and Alzheimer's disease, thus providing new insights into the molecular basis for CAD and its molecular relationships with other diseases.

18.
王钰嫣  王子兴  胡耀达  王蕾  李宁  张彪  韩伟  姜晶梅 《遗传》2017,39(8):707-716
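Since the first genome-wide association study (GWAS) was published in 2005, GWAS have steadily advanced our understanding of the genetic mechanisms of disease. Combining systems biology with improved statistical methods is an important route to mining GWAS data in depth. Pathway analysis groups the genetic variants tested in a GWAS into sets according to their biological meaning and analyzes each set as a whole. This helps to detect variants whose individual effects on disease are small but which are related to one another within a pathway, and it makes biological interpretation easier. Pathway analysis has already been applied fairly widely to GWAS data and has produced preliminary results, while the statistical methods behind it continue to develop. This article introduces existing GWAS pathway analysis algorithms that take SNPs directly as their objects, divided by whether they use a kernel function into non-kernel and kernel algorithms. The non-kernel algorithms mainly include gene set enrichment analysis (GSEA) and hierarchical Bayes prioritization (HBP); the kernel algorithms include the linear kernel (LIN), the identity-by-status kernel (IBS) and the scale-invariant kernel (powered exponential kernel). By describing the computational principles, strengths and weaknesses of these methods, we hope to offer better ideas for constructing new algorithms and a reference for choosing research methods in the GWAS field.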
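
Two of the kernels named in this abstract, the linear kernel (LIN) and the IBS kernel, can be illustrated with a short sketch. The formulas below are the forms commonly used in kernel machine association testing and are an assumption here, since the abstract does not state them; genotypes are assumed to be coded as minor-allele counts (0, 1 or 2), and the function names are hypothetical.

```python
def linear_kernel(g_i, g_j):
    """Linear kernel: sum over SNPs of the product of minor-allele counts."""
    if len(g_i) != len(g_j):
        raise ValueError("genotype vectors must have the same length")
    return sum(a * b for a, b in zip(g_i, g_j))


def ibs_kernel(g_i, g_j):
    """IBS kernel: average fraction of alleles shared by state across SNPs.

    For genotypes coded 0/1/2, two individuals share 2 - |a - b| alleles
    at a SNP, so the kernel is sum(2 - |a - b|) / (2 * number_of_snps).
    """
    if len(g_i) != len(g_j):
        raise ValueError("genotype vectors must have the same length")
    if not g_i:
        raise ValueError("genotype vectors must not be empty")
    shared = sum(2 - abs(a - b) for a, b in zip(g_i, g_j))
    return shared / (2 * len(g_i))


def kernel_matrix(genotypes, kernel):
    """Pairwise kernel matrix for a list of genotype vectors (one per individual)."""
    return [[kernel(g_i, g_j) for g_j in genotypes] for g_i in genotypes]
```

In a kernel machine test, a matrix such as `kernel_matrix(genotypes, ibs_kernel)` over the SNPs of one pathway measures pairwise genetic similarity between individuals; the choice of kernel determines which kinds of SNP effects the test is sensitive to.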

19.
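[Aim] Notonecta chinensis, distributed in China and in Okinawa, Japan, is an important aquatic natural-enemy insect that can be used for the biological control of mosquitoes. This study aimed to establish a transcriptome database for N. chinensis and to mine its gene information. [Methods] Transcriptome sequencing, de novo assembly and bioinformatic analysis of N. chinensis were carried out on the Illumina NextSeq500 high-throughput sequencing platform. New SSR molecular markers were screened from the transcriptome unigene data using MISA software, and SSR polymorphism was detected by capillary electrophoresis. [Results] A total of 34,782,282 clean reads were obtained (NCBI SRA accession number: SRR13259254) and assembled into 37,801 unigenes with an N50 of 913 bp. The unigenes were aligned against known databases for functional annotation; 36,474, 32,470, 27,781, 35,079 and 5,638 sequences were annotated in the nr, Swiss-Prot, GO, eggNOG and KEGG databases, respectively. Based on GO annotation, the unigene functions fell into three major categories (biological process, cellular component and molecular function), with relatively large proportions of unigenes involved in cell, cell part and binding. The eggNOG annotation assigned the 37,801 unigenes to 25 gene families, the largest of which was function unknown. KEGG pathway enrichment analysis showed that 5,638 unigenes were annotated to 245 metabolic pathways, with the largest number annotated to the ribosome. In addition, MISA identified 3,124 SSR loci among the 37,801 unigenes in the transcriptome sequencing data (8.26% of all unigenes), with an occurrence frequency of 7.07%. Sixteen SSR loci were selected by PCR screening. In 7 geographic populations of N. chinensis, the polymorphism information content (PIC) values of the three loci NcCF/NcCR, NcKF/NcKR and NcLF/NcLR were 0.870, 0.902 and 0.857, respectively, indicating high polymorphism. [Conclusion] This study successfully obtained transcriptome data for N. chinensis, providing a molecular basis for the analysis of its gene functions. The newly developed SSR markers provide a richer set of candidate molecular markers for genetic diversity analysis, cryptic species identification and genetic map construction in N. chinensis.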
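
The N50 statistic reported for this assembly can be computed from a list of sequence lengths as sketched below. This uses the usual definition (the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembled bases); the function name and the example lengths are hypothetical and are not taken from the study's data.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Sequences are visited from longest to shortest; the N50 is the length
    at which the running total first reaches half of the summed length.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise AssertionError("unreachable for non-empty positive lengths")


# Hypothetical unigene lengths in bp, for illustration only.
example_n50 = n50([2, 3, 4, 5, 6, 7, 8, 9, 10])
```

Comparing `2 * running` with `total` keeps the arithmetic in integers, which avoids floating-point edge cases when the total length is odd.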

20.
Most common diseases are complex, involving multiple genetic and environmental factors and their interactions. In the past decade, genome-wide association studies (GWAS) have successfully identified thousands of genetic variants underlying susceptibility to complex diseases. However, the results from these studies often do not provide evidence on how the variants affect downstream pathways and lead to the disease. Therefore, in the post-GWAS era the greatest challenge lies in combining GWAS findings with additional molecular data to functionally characterize the associations. The advances in various ~omics techniques have made it possible to investigate the effect of risk variants on intermediate molecular levels, such as gene expression, methylation, protein abundance or metabolite levels. As disease aetiology is complex, no single molecular analysis is expected to fully unravel the disease mechanism. Multiple molecular levels can interact and also show plasticity in different physiological conditions, cell types and disease stages. There is therefore a great need for new integrative approaches that can combine data from different molecular levels and can help construct the causal inference from genotype to phenotype. Systems genetics is such an approach; it is used to study genetic effects within the larger scope of systems biology by integrating genotype information with various ~omics datasets as well as with environmental and physiological variables. In this review, we describe this approach and discuss how it can help us unravel the molecular mechanisms through which genetic variation causes disease. This article is part of a Special Issue entitled: From Genome to Function.
