首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.

Background

Complex diseases are associated with altered interactions between thousands of genes. We developed a novel method to identify and prioritize disease genes, which was generally applicable to complex diseases.

Results

We identified modules of highly interconnected genes in disease-specific networks derived from integrating gene-expression and protein interaction data. We examined if those modules were enriched for disease-associated SNPs, and could be used to find novel genes for functional studies. First, we analyzed publicly available gene expression microarray and genome-wide association study (GWAS) data from 13, highly diverse, complex diseases. In each disease, highly interconnected genes formed modules, which were significantly enriched for genes harboring disease-associated SNPs. To test if such modules could be used to find novel genes for functional studies, we repeated the analyses using our own gene expression microarray and GWAS data from seasonal allergic rhinitis. We identified a novel gene, FGF2, whose relevance was supported by functional studies using combined small interfering RNA-mediated knock-down and gene expression microarrays. The modules in the 13 complex diseases analyzed here tended to overlap and were enriched for pathways related to oncological, metabolic and inflammatory diseases. This suggested that this union of the modules would be associated with a general increase in susceptibility for complex diseases. Indeed, we found that this union was enriched with GWAS genes for 145 other complex diseases.

Conclusions

Modules of highly interconnected complex disease genes were enriched for disease-associated SNPs, and could be used to find novel genes for functional studies.  相似文献   

2.
Chen L  Tai J  Zhang L  Shang Y  Li X  Qu X  Li W  Miao Z  Jia X  Wang H  Li W  He W 《Molecular bioSystems》2011,7(9):2547-2553
Understanding the pathogenesis of complex diseases is aided by precise identification of the genes responsible. Many computational methods have been developed to prioritize candidate disease genes, but coverage of functional annotations may be a limiting factor for most of these methods. Here, we introduce a global candidate gene prioritization approach that considers information about network properties in the human protein interaction network and risk transformative contents from known disease genes. Global risk transformative scores were then used to prioritize candidate genes. This method was introduced to prioritize candidate genes for prostate cancer. The effectiveness of our global risk transformative algorithm for prioritizing candidate genes was evaluated according to validation studies. Compared with ToppGene and random walk-based methods, our method outperformed the two other candidate gene prioritization methods. The generality of our method was assessed by testing it on prostate cancer and other types of cancer. The performance was evaluated using standard leave-one-out cross-validation.  相似文献   

3.
Gene prioritization through genomic data fusion   总被引:4,自引:0,他引:4  
The identification of genes involved in health and disease remains a challenge. We describe a bioinformatics approach, together with a freely accessible, interactive and flexible software termed Endeavour, to prioritize candidate genes underlying biological processes or diseases, based on their similarity to known genes involved in these phenomena. Unlike previous approaches, ours generates distinct prioritizations for multiple heterogeneous data sources, which are then integrated, or fused, into a global ranking using order statistics. In addition, it offers the flexibility of including additional data sources. Validation of our approach revealed it was able to efficiently prioritize 627 genes in disease data sets and 76 genes in biological pathway sets, identify candidates of 16 mono- or polygenic diseases, and discover regulatory genes of myeloid differentiation. Furthermore, the approach identified a novel gene involved in craniofacial development from a 2-Mb chromosomal region, deleted in some patients with DiGeorge-like birth defects. The approach described here offers an alternative integrative method for gene discovery.  相似文献   

4.
Increasing consumption of refined carbohydrates is now being recognized as a primary contributor to the development of nutritionally related chronic diseases such as obesity and type 2 diabetes mellitus (T2DM). A data mining approach was used to evaluate the role of carbohydrate metabolic pathway genes in the development of obesity and T2DM. Data from public databases were used to map the position of the carbohydrate metabolic pathway genes to known quantitative trait loci (QTL) for obesity and T2DM and for examining the pathway genes for the presence of sequence and structural genetic variants such as single nucleotide polymorphisms (SNPs) and copy number variants (CNS), respectively. The results demonstrated that a majority of the genes of the carbohydrate metabolic pathways are associated with QTL for obesity and many for T2DM. In addition, some key genes of the pathways also encode non-synonymous SNPs that exhibit significant differences in population frequencies. This study emphasizes the significance of the metabolic pathways genes in the development of disease phenotypes, its differential occurrence across populations and between individuals, and a strategy for interpreting an individuals' risk for disease.  相似文献   

5.
Chronic obstructive pulmonary disease (COPD) is a complex human disease likely influenced by multiple genes, cigarette smoking, and gene-by-smoking interactions, but only severe alpha 1-antitrypsin deficiency is a proven genetic risk factor for COPD. Prior linkage analyses in the Boston Early-Onset COPD Study have demonstrated significant linkage to a key intermediate phenotype of COPD on chromosome 2q. We integrated results from murine lung development and human COPD gene-expression microarray studies with human COPD linkage results on chromosome 2q to prioritize candidate-gene selection, thus identifying SERPINE2 as a positional candidate susceptibility gene for COPD. Immunohistochemistry demonstrated expression of serpine2 protein in mouse and human adult lung tissue. In family-based association testing of 127 severe, early-onset COPD pedigrees from the Boston Early-Onset COPD Study, we observed significant association with COPD phenotypes and 18 single-nucleotide polymorphisms (SNPs) in the SERPINE2 gene. Association of five of these SNPs with COPD was replicated in a case-control analysis, with cases from the National Emphysema Treatment Trial and controls from the Normative Aging Study. Family-based and case-control haplotype analyses supported similar regions of association within the SERPINE2 gene. When significantly associated SNPs in these haplotypic regions were included as covariates in linkage models, LOD score attenuation was observed most markedly in a smokers-only linkage model (LOD 4.41, attenuated to 1.74). After the integration of murine and human microarray data to inform candidate-gene selection, we observed significant family-based association and independent replication of association in a case-control study, suggesting that SERPINE2 is a COPD-susceptibility gene and is likely influenced by gene-by-smoking interaction.  相似文献   

6.
Complex diseases result from contributions of multiple genes that act in concert through pathways. Here we present a method to prioritize novel candidates of disease-susceptibility genes depending on the biological similarities to the known disease-related genes. The extent of disease-susceptibility of a gene is prioritized by analyzing seven features of human genes captured in H-InvDB. Taking rheumatoid arthritis (RA) and prostate cancer (PC) as two examples, we evaluated the efficiency of our method. Highly scored genes obtained included TNFSF12 and OSM as candidate disease genes for RA and PC, respectively. Subsequent characterization of these genes based upon an extensive literature survey reinforced the validity of these highly scored genes as possible disease-susceptibility genes. Our approach, Prioritization ANalysis of Disease Association (PANDA), is an efficient and cost-effective method to narrow down a large set of genes into smaller subsets that are most likely to be involved in the disease pathogenesis.  相似文献   

7.
Chronic obstructive pulmonary disease (COPD) is a major cause of morbidity and mortality worldwide. Irreversible airflow limitation, both progressive and associated with an inflammatory response of the lungs to noxious particles or gases, is a hallmark of the disease. Cigarette smoking is the most important environmental risk factor for COPD, nevertheless, only approximately 20–30% of smokers develop symptomatic disease. Epidemiological studies, case-control studies in relatives of patients with COPD, and twin studies suggest that COPD is a genetically complex disease with environmental factors and many involved genes interacting together. Two major strategies have been employed to identify the genes and the polymorphisms that likely contribute to the development of complex diseases: association studies and linkage analyses. Biologically plausible pathogenetic mechanisms are prerequisites to focus the search for genes of known function in association studies. Protease-antiprotease imbalance, generation of oxidative stress, and chronic inflammation are recognized as the principal mechanisms leading to irreversible airflow obstruction and parenchymal destruction in the lung. Therefore, genes which have been implicated in the pathogenesis of COPD are involved in antiproteolysis, antioxidant barrier and metabolism of xenobiotic substances, inflammatory response to cigarette smoke, airway hyperresponsiveness, and pulmonary vascular remodelling. Significant associations with COPD-related phenotypes have been reported for polymorphisms in genes coding for matrix metalloproteinases, microsomal epoxide hydrolase, glutathione-S-transferases, heme oxygenase, tumor necrosis factor, interleukines 1, 8, and 13, vitamin D-binding protein and β-2-adrenergic receptor (ADRB2), whereas adequately powered replication studies failed to confirm most of the previously observed associations. Genome-wide linkage analyses provide us with a novel tool to identify the general locations of COPD susceptibility genes, and should be followed by association analyses of positional candidate genes from COPD pathophysiology, positional candidate genes selected from gene expression studies, or dense single nucleotide polymorphism panels across regions of linkage. Haplotype analyses of genes with multiple polymorphic sites in linkage disequilibrium, such as the ADRB2 gene, provide another promising field that has yet to be explored in patients with COPD. In the present article we review the current knowledge about gene polymorphisms that have been recently linked to the risk of developing COPD and/or may account for variations in the disease course.  相似文献   

8.
Obesity and related metabolic diseases, such as type 2 diabetes, hypertension and hyperlipidemia are an increasingly prevalent medical and social problem in developed and developing countries. These conditions are associated with increased risk of cardiovascular disease, the leading cause of death. Therefore, it is important to understand the molecular basis underlying obesity and related metabolic diseases in order to develop effective preventive and therapeutic approaches against these conditions. Recently, a family of proteins structurally similar to the angiogenic-regulating factors known as angiopoietins was identified and designated 'angiopoietin-like proteins' (ANGPTLs). Encoded by seven genes, ANGPTL1-7 all possess an N-terminal coiled-coil domain and a C-terminal fibrinogen-like domain, both characteristic of angiopoietins. ANGPTLs do not bind to either the angiopoietin receptor Tie2 or the related protein Tie1, indicating that these ligands function differently from angiopoietins. Like angiopoietins, some ANGPTLs potently regulate angiogenesis, but ANGPTL3, -4 and ANGPTL6/angiopoietin-related growth factor (AGF) directly regulate lipid, glucose and energy metabolism independent of angiogenic effects. Recently, we found that ANGPTL2 is a key adipocyte-derived inflammatory mediator that links obesity to systemic insulin resistance. In this minireview, we focus on the roles of ANGPTL2 and ANGPTL6/AGF in obesity and related metabolic diseases, and discuss the possibility that both could function as molecular targets for the prevention and treatment of obesity and metabolic diseases.  相似文献   

9.
10.
Type 2 diabetes (T2D) is a complex metabolic disease that is more prevalent in ethnic groups such as Mexican Americans, and is strongly associated with the risk factors obesity and insulin resistance. The goal of this study was to perform whole genome gene expression profiling in adipose tissue to detect common patterns of gene regulation associated with obesity and insulin resistance. We used phenotypic and genotypic data from 308 Mexican American participants from the Veterans Administration Genetic Epidemiology Study (VAGES). Basal fasting RNA was extracted from adipose tissue biopsies from a subset of 75 unrelated individuals, and gene expression data generated on the Illumina BeadArray platform. The number of gene probes with significant expression above baseline was approximately 31,000. We performed multiple regression analysis of all probes with 15 metabolic traits. Adipose tissue had 3,012 genes significantly associated with the traits of interest (false discovery rate, FDR ≤ 0.05). The significance of gene expression changes was used to select 52 genes with significant (FDR ≤ 10-4) gene expression changes across multiple traits. Gene sets/Pathways analysis identified one gene, alcohol dehydrogenase 1B (ADH1B) that was significantly enriched (P < 10-60) as a prime candidate for involvement in multiple relevant metabolic pathways. Illumina BeadChip derived ADH1B expression data was consistent with quantitative real time PCR data. We observed significant inverse correlations with waist circumference (2.8 x 10-9), BMI (5.4 x 10-6), and fasting plasma insulin (P < 0.001). These findings are consistent with a central role for ADH1B in obesity and insulin resistance and provide evidence for a novel genetic regulatory mechanism for human metabolic diseases related to these traits.  相似文献   

11.
The pathophysiological mechanisms underlying the development of obesity and metabolic diseases are not well understood. To gain more insight into the genetic mediators associated with the onset and progression of diet-induced obesity and metabolic diseases, we studied the molecular changes in response to a high-fat diet (HFD) by using a mode-of-action by network identification (MNI) analysis. Oligo DNA microarray analysis was performed on visceral and subcutaneous adipose tissues and muscles of male C57BL/6N mice fed a normal diet or HFD for 2, 4, 8, and 12 weeks. Each of these data was queried against the MNI algorithm, and the lists of top 5 highly ranked genes and gene ontology (GO)-annotated pathways that were significantly overrepresented among the 100 highest ranked genes at each time point in the 3 different tissues of mice fed the HFD were considered in the present study. The 40 highest ranked genes identified by MNI analysis at each time point in the different tissues of mice with diet-induced obesity were subjected to clustering based on their temporal patterns. On the basis of the above-mentioned results, we investigated the sequential induction of distinct olfactory receptors and the stimulation of cancer-related genes during the development of obesity in both adipose tissues and muscles. The top 5 genes recognized using the MNI analysis at each time point and gene cluster identified based on their temporal patterns in the peripheral tissues of mice provided novel and often surprising insights into the potential genetic mediators for obesity progression.  相似文献   

12.
The pig is a well-known animal model used to investigate genetic and mechanistic aspects of human disease biology. They are particularly useful in the context of obesity and metabolic diseases because other widely used models (e.g. mice) do not completely recapitulate key pathophysiological features associated with these diseases in humans. Therefore, we established a F2 pig resource population (n = 564) designed to elucidate the genetics underlying obesity and metabolic phenotypes. Segregation of obesity traits was ensured by using breeds highly divergent with respect to obesity traits in the parental generation. Several obesity and metabolic phenotypes were recorded (n = 35) from birth to slaughter (242 ± 48 days), including body composition determined at about two months of age (63 ± 10 days) via dual-energy x-ray absorptiometry (DXA) scanning. All pigs were genotyped using Illumina Porcine 60k SNP Beadchip and a combined linkage disequilibrium-linkage analysis was used to identify genome-wide significant associations for collected phenotypes. We identified 229 QTLs which associated with adiposity- and metabolic phenotypes at genome-wide significant levels. Subsequently comparative analyses were performed to identify the extent of overlap between previously identified QTLs in both humans and pigs. The combined analysis of a large number of obesity phenotypes has provided insight in the genetic architecture of the molecular mechanisms underlying these traits indicating that QTLs underlying similar phenotypes are clustered in the genome. Our analyses have further confirmed that genetic heterogeneity is an inherent characteristic of obesity traits most likely caused by segregation or fixation of different variants of the individual components belonging to cellular pathways in different populations. Several important genes previously associated to obesity in human studies, along with novel genes were identified. Altogether, this study provides novel insight that may further the current understanding of the molecular mechanisms underlying human obesity.  相似文献   

13.
Many cell activities are organized as a network, and genes are clustered into co-expressed groups if they have the same or closely related biological function or they are co-regulated. In this study, based on an assumption that a strong candidate disease gene is more likely close to gene groups in which all members coordinately differentially express than individual genes with differential expression, we developed a novel disease gene prioritization method GroupRank by integrating gene co-expression and differential expression information generated from microarray data as well as PPI network. A candidate gene is ranked high using GroupRank if it is differentially expressed in disease and control or is close to differentially co-expressed groups in PPI network. We tested our method on data sets of lung, kidney, leukemia and breast cancer. The results revealed GroupRank could efficiently prioritize disease genes with significantly improved AUC value in comparison to the previous method with no consideration of co-exprssed gene groups in PPI network. Moreover, the functional analyses of the major contributing gene group in gene prioritization of kidney cancer verified that our algorithm GroupRank not only ranks disease genes efficiently but also could help us identify and understand possible mechanisms in important physiological and pathological processes of disease.  相似文献   

14.
An imbalance between caloric intake and energy expenditure leads to obesity. Obesity is an important risk factor for the development of several metabolic diseases including insulin resistance, metabolic syndrome, type 2 diabetes mellitus, and cardiovascular disease. So, controlling obesity could be effective in the improvement of obesity-related diseases. Various factors are involved in obesity, such as AMP-activated protein kinases (AMPK), silent information regulators, inflammatory mediators, oxidative stress parameters, gastrointestinal hormones, adipokines, angiopoietin-like proteins, and microRNAs. These factors play an important role in obesity by controlling fat metabolism, energy homeostasis, food intake, and insulin sensitivity. AMPK is a heterotrimeric serine/threonine protein kinase known as a fuel-sensing enzyme. The central role of AMPK in obesity makes it an attractive molecule to target obesity and related metabolic diseases. In this review, the critical role of AMPK in obesity and the interplay between AMPK and obesity-associated factors were elaborated.  相似文献   

15.
Li X  Li C  Shang D  Li J  Han J  Miao Y  Wang Y  Wang Q  Li W  Wu C  Zhang Y  Li X  Yao Q 《PloS one》2011,6(6):e21131
One of the challenging problems in the etiology of diseases is to explore the relationships between initiation and progression of diseases and abnormalities in local regions of metabolic pathways. To gain insight into such relationships, we applied the "k-clique" subpathway identification method to all disease-related gene sets. For each disease, the disease risk regions of metabolic pathways were then identified and considered as subpathways associated with the disease. We finally built a disease-metabolic subpathway network (DMSPN). Through analyses based on network biology, we found that a few subpathways, such as that of cytochrome P450, were highly connected with many diseases, and most belonged to fundamental metabolisms, suggesting that abnormalities of fundamental metabolic processes tend to cause more types of diseases. According to the categories of diseases and subpathways, we tested the clustering phenomenon of diseases and metabolic subpathways in the DMSPN. The results showed that both disease nodes and subpathway nodes displayed slight clustering phenomenon. We also tested correlations between network topology and genes within disease-related metabolic subpathways, and found that within a disease-related subpathway in the DMSPN, the ratio of disease genes and the ratio of tissue-specific genes significantly increased as the number of diseases caused by the subpathway increased. Surprisingly, the ratio of essential genes significantly decreased and the ratio of housekeeping genes remained relatively unchanged. Furthermore, the coexpression levels between disease genes and other types of genes were calculated for each subpathway in the DMSPN. The results indicated that those genes intensely influenced by disease genes, including essential genes and tissue-specific genes, might be significantly associated with the disease diversity of subpathways, suggesting that different kinds of genes within a disease-related subpathway may play significantly differential roles on the diversity of diseases caused by the corresponding subpathway.  相似文献   

16.
Bardet-Biedl syndrome (BBS) is a genetically heterogeneous, pleiotropic human disorder characterized by obesity, retinopathy, polydactyly, renal and cardiac malformations, learning disabilities, and hypogenitalism. Eight BBS loci have been mapped, and seven genes have been identified. BBS3 was previously mapped to chromosome 3 by linkage analysis in a large Israeli Bedouin kindred. The rarity of other families mapping to the BBS3 locus has made it difficult to narrow the disease interval sufficiently to identify the gene by positional cloning. We hypothesized that the genomes of model organisms that contained the orthologues to known BBS genes would also likely contain a BBS3 orthologue. Therefore, comparative genomic analysis was performed to prioritize BBS candidate genes for mutation screening. Known BBS proteins were compared with the translated genomes of model organisms to identify a subset of organisms in which these proteins were conserved. By including multiple organisms that have relatively small genome sizes in the analysis, the number of candidate genes was reduced, and a few genes mapping to the BBS3 interval emerged as the best candidates for this disorder. One of these genes, ADP-ribosylation factor-like 6 (ARL6), contains a homozygous stop mutation that segregates completely with the disease in the Bedouin kindred originally used to map the BBS3 locus, identifying this gene as the BBS3 gene. These data illustrate the power of comparative genomic analysis for the study of human disease and identifies a novel BBS gene.  相似文献   

17.
Zhao J  Yang TH  Huang Y  Holme P 《PloS one》2011,6(9):e24306
Many diseases have complex genetic causes, where a set of alleles can affect the propensity of getting the disease. The identification of such disease genes is important to understand the mechanistic and evolutionary aspects of pathogenesis, improve diagnosis and treatment of the disease, and aid in drug discovery. Current genetic studies typically identify chromosomal regions associated specific diseases. But picking out an unknown disease gene from hundreds of candidates located on the same genomic interval is still challenging. In this study, we propose an approach to prioritize candidate genes by integrating data of gene expression level, protein-protein interaction strength and known disease genes. Our method is based only on two, simple, biologically motivated assumptions--that a gene is a good disease-gene candidate if it is differentially expressed in cases and controls, or that it is close to other disease-gene candidates in its protein interaction network. We tested our method on 40 diseases in 58 gene expression datasets of the NCBI Gene Expression Omnibus database. On these datasets our method is able to predict unknown disease genes as well as identifying pleiotropic genes involved in the physiological cellular processes of many diseases. Our study not only provides an effective algorithm for prioritizing candidate disease genes but is also a way to discover phenotypic interdependency, cooccurrence and shared pathophysiology between different disorders.  相似文献   

18.
Identifying the genes involved in venous thromboembolism (VTE) recurrence is important not only for understanding the pathogenesis but also for discovering the therapeutic targets. We proposed a novel prioritization method called Function-Interaction-Pearson (FIP) by creating gene-disease similarity scores to prioritize candidate genes underling VTE. The scores were calculated by integrating and optimizing three types of resources including gene expression, gene ontology and protein-protein interaction. As a result, 124 out of top 200 prioritized candidate genes had been confirmed in literature, among which there were 34 antithrombotic drug targets. Compared with two well-known gene prioritization tools Endeavour and ToppNet, FIP was shown to have better performance. The approach provides a valuable alternative for drug targets discovery and disease therapy.  相似文献   

19.
Recent studies have demonstrated that multiple early-onset diseases have shared risk genes, based on findings from de novo mutations (DNMs). Therefore, we may leverage information from one trait to improve statistical power to identify genes for another trait. However, there are few methods that can jointly analyze DNMs from multiple traits. In this study, we develop a framework called M-DATA (Multi-trait framework for De novo mutation Association Test with Annotations) to increase the statistical power of association analysis by integrating data from multiple correlated traits and their functional annotations. Using the number of DNMs from multiple diseases, we develop a method based on an Expectation-Maximization algorithm to both infer the degree of association between two diseases as well as to estimate the gene association probability for each disease. We apply our method to a case study of jointly analyzing data from congenital heart disease (CHD) and autism. Our method was able to identify 23 genes for CHD from joint analysis, including 12 novel genes, which is substantially more than single-trait analysis, leading to novel insights into CHD disease etiology.  相似文献   

20.
Previous expression quantitative trait loci (eQTL) studies have performed genetic association studies for gene expression, but most of these studies examined lymphoblastoid cell lines from non-diseased individuals. We examined the genetics of gene expression in a relevant disease tissue from chronic obstructive pulmonary disease (COPD) patients to identify functional effects of known susceptibility genes and to find novel disease genes. By combining gene expression profiling on induced sputum samples from 131 COPD cases from the ECLIPSE Study with genomewide single nucleotide polymorphism (SNP) data, we found 4315 significant cis-eQTL SNP-probe set associations (3309 unique SNPs). The 3309 SNPs were tested for association with COPD in a genomewide association study (GWAS) dataset, which included 2940 COPD cases and 1380 controls. Adjusting for 3309 tests (p<1.5e-5), the two SNPs which were significantly associated with COPD were located in two separate genes in a known COPD locus on chromosome 15: CHRNA5 and IREB2. Detailed analysis of chromosome 15 demonstrated additional eQTLs for IREB2 mapping to that gene. eQTL SNPs for CHRNA5 mapped to multiple linkage disequilibrium (LD) bins. The eQTLs for IREB2 and CHRNA5 were not in LD. Seventy-four additional eQTL SNPs were associated with COPD at p<0.01. These were genotyped in two COPD populations, finding replicated associations with a SNP in PSORS1C1, in the HLA-C region on chromosome 6. Integrative analysis of GWAS and gene expression data from relevant tissue from diseased subjects has located potential functional variants in two known COPD genes and has identified a novel COPD susceptibility locus.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号