首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Inferring potential drug indications, for either novel or approved drugs, is a key step in drug development. Previous computational methods in this domain have focused on either drug repositioning or matching drug and disease gene expression profiles. Here, we present a novel method for the large‐scale prediction of drug indications (PREDICT) that can handle both approved drugs and novel molecules. Our method is based on the observation that similar drugs are indicated for similar diseases, and utilizes multiple drug–drug and disease–disease similarity measures for the prediction task. On cross‐validation, it obtains high specificity and sensitivity (AUC=0.9) in predicting drug indications, surpassing existing methods. We validate our predictions by their overlap with drug indications that are currently under clinical trials, and by their agreement with tissue‐specific expression information on the drug targets. We further show that disease‐specific genetic signatures can be used to accurately predict drug indications for new diseases (AUC=0.92). This lays the computational foundation for future personalized drug treatments, where gene expression signatures from individual patients would replace the disease‐specific signatures.  相似文献   

2.
Fröhlich H 《PloS one》2011,6(10):e25364
Diagnostic and prognostic biomarkers for cancer based on gene expression profiles are viewed as a major step towards a better personalized medicine. Many studies using various computational approaches have been published in this direction during the last decade. However, when comparing different gene signatures for related clinical questions often only a small overlap is observed. This can have various reasons, such as technical differences of platforms, differences in biological samples or their treatment in lab, or statistical reasons because of the high dimensionality of the data combined with small sample size, leading to unstable selection of genes. In conclusion retrieved gene signatures are often hard to interpret from a biological point of view. We here demonstrate that it is possible to construct a consensus signature from a set of seemingly different gene signatures by mapping them on a protein interaction network. Common upstream proteins of close gene products, which we identified via our developed algorithm, show a very clear and significant functional interpretation in terms of overrepresented KEGG pathways, disease associated genes and known drug targets. Moreover, we show that such a consensus signature can serve as prior knowledge for predictive biomarker discovery in breast cancer. Evaluation on different datasets shows that signatures derived from the consensus signature reveal a much higher stability than signatures learned from all probesets on a microarray, while at the same time being at least as predictive. Furthermore, they are clearly interpretable in terms of enriched pathways, disease associated genes and known drug targets. In summary we thus believe that network based consensus signatures are not only a way to relate seemingly different gene signatures to each other in a functional manner, but also to establish prior knowledge for highly stable and interpretable predictive biomarkers.  相似文献   

3.
Genes involved in the same function tend to have similar evolutionary histories, in that their rates of evolution covary over time. This coevolutionary signature, termed Evolutionary Rate Covariation (ERC), is calculated using only gene sequences from a set of closely related species and has demonstrated potential as a computational tool for inferring functional relationships between genes. To further define applications of ERC, we first established that roughly 55% of genetic diseases posses an ERC signature between their contributing genes. At a false discovery rate of 5% we report 40 such diseases including cancers, developmental disorders and mitochondrial diseases. Given these coevolutionary signatures between disease genes, we then assessed ERC''s ability to prioritize known disease genes out of a list of unrelated candidates. We found that in the presence of an ERC signature, the true disease gene is effectively prioritized to the top 6% of candidates on average. We then apply this strategy to a melanoma-associated region on chromosome 1 and identify MCL1 as a potential causative gene. Furthermore, to gain global insight into disease mechanisms, we used ERC to predict molecular connections between 310 nominally distinct diseases. The resulting “disease map” network associates several diseases with related pathogenic mechanisms and unveils many novel relationships between clinically distinct diseases, such as between Hirschsprung''s disease and melanoma. Taken together, these results demonstrate the utility of molecular evolution as a gene discovery platform and show that evolutionary signatures can be used to build informative gene-based networks.  相似文献   

4.
5.
Astrocytomas are the most common type of brain tumors, which originate from glial cells, and are classified into specific grades based on their histopathological behavior. To develop precise therapeutic strategies for the disease, it is important to identify the molecular signatures specific to each grade as well as the key factors responsible for the transition from one grade to the next. In this study, we have taken a systems approach to investigate the gene expression profiles of each grades of the disease by mapping shortest paths of gene interaction in each grade and also between one grade and the next. Module core genes govern the topology of these networks and serve as important functional players in each grade as well as help in grade transition events. Shortlisted among these module core genes are well-characterized players of glioma as well as novel molecules (32 grade-specific genes and 15 grade transition–specific genes), which influence important prooncogenic functions but have not been linked to glioma biology yet. These module core genes provide interesting insight into the biology of astrocytic tumors and are potential therapeutic targets for astrocytoma.  相似文献   

6.
7.
The role of cancer‐associated fibroblasts (CAFs) has been thoroughly investigated in tumour microenvironments but not in bladder urothelial carcinoma (BLCA). The cell fraction of CAFs gradually increased with BLCA progression. Weighted gene co‐expression network analysis (WGCNA) revealed a specific gene expression module of CAFs that are relevant to cancer progression and survival status. Fifteen key genes of the module were consistent with a fibroblast signature in single‐cell RNA sequencing, functionally related to the extracellular matrix, and significant in survival analysis and tumour staging. A comparison of the luminal‐infiltrated versus luminal‐papillary subtypes and fibroblast versus urothelial carcinoma cell lines and immunohistochemical data analysis demonstrated that the key genes were specifically expressed in CAFs. Moreover, these genes are highly correlated with previously reported CAF markers. In summary, CAFs play a major role in the progression of BLCA, and the 15 key genes act as BLCA‐specific CAF markers and can predict CAF changes. WGCNA can, therefore, be used to sort CAF‐specific gene set in cancer tissues.  相似文献   

8.
Tsai YS  Aguan K  Pal NR  Chung IF 《PloS one》2011,6(9):e24259
Informative genes from microarray data can be used to construct prediction model and investigate biological mechanisms. Differentially expressed genes, the main targets of most gene selection methods, can be classified as single- and multiple-class specific signature genes. Here, we present a novel gene selection algorithm based on a Group Marker Index (GMI), which is intuitive, of low-computational complexity, and efficient in identification of both types of genes. Most gene selection methods identify only single-class specific signature genes and cannot identify multiple-class specific signature genes easily. Our algorithm can detect de novo certain conditions of multiple-class specificity of a gene and makes use of a novel non-parametric indicator to assess the discrimination ability between classes. Our method is effective even when the sample size is small as well as when the class sizes are significantly different. To compare the effectiveness and robustness we formulate an intuitive template-based method and use four well-known datasets. We demonstrate that our algorithm outperforms the template-based method in difficult cases with unbalanced distribution. Moreover, the multiple-class specific genes are good biomarkers and play important roles in biological pathways. Our literature survey supports that the proposed method identifies unique multiple-class specific marker genes (not reported earlier to be related to cancer) in the Central Nervous System data. It also discovers unique biomarkers indicating the intrinsic difference between subtypes of lung cancer. We also associate the pathway information with the multiple-class specific signature genes and cross-reference to published studies. We find that the identified genes participate in the pathways directly involved in cancer development in leukemia data. Our method gives a promising way to find genes that can involve in pathways of multiple diseases and hence opens up the possibility of using an existing drug on other diseases as well as designing a single drug for multiple diseases.  相似文献   

9.
Discovering drugs that efficiently treat brain diseases has been challenging. Genetic variants that modulate the expression of potential drug targets can be utilized to assess the efficacy of therapeutic interventions. We therefore employed Mendelian Randomization (MR) on gene expression measured in brain tissue to identify drug targets involved in neurological and psychiatric diseases. We conducted a two-sample MR using cis-acting brain-derived expression quantitative trait loci (eQTLs) from the Accelerating Medicines Partnership for Alzheimer’s Disease consortium (AMP-AD) and the CommonMind Consortium (CMC) meta-analysis study (n = 1,286) as genetic instruments to predict the effects of 7,137 genes on 12 neurological and psychiatric disorders. We conducted Bayesian colocalization analysis on the top MR findings (using P<6x10-7 as evidence threshold, Bonferroni-corrected for 80,557 MR tests) to confirm sharing of the same causal variants between gene expression and trait in each genomic region. We then intersected the colocalized genes with known monogenic disease genes recorded in Online Mendelian Inheritance in Man (OMIM) and with genes annotated as drug targets in the Open Targets platform to identify promising drug targets. 80 eQTLs showed MR evidence of a causal effect, from which we prioritised 47 genes based on colocalization with the trait. We causally linked the expression of 23 genes with schizophrenia and a single gene each with anorexia, bipolar disorder and major depressive disorder within the psychiatric diseases and 9 genes with Alzheimer’s disease, 6 genes with Parkinson’s disease, 4 genes with multiple sclerosis and two genes with amyotrophic lateral sclerosis within the neurological diseases we tested. From these we identified five genes (ACE, GPNMB, KCNQ5, RERE and SUOX) as attractive drug targets that may warrant follow-up in functional studies and clinical trials, demonstrating the value of this study design for discovering drug targets in neuropsychiatric diseases.  相似文献   

10.
Colorectal cancer (CRC) is highly heterogeneous leading to variable prognosis and treatment responses. Therefore, it is necessary to explore novel personalized and reproducible prognostic signatures to aid clinical decision‐making. The present study combined large‐scale gene expression profiles and clinical data of 1828 patients with CRC from multi‐centre studies and identified a personalized gene prognostic signature consisting of 46 unique genes (called function‐derived personalized gene signature [FunPGS]) from an integrated statistics and function‐derived perspective. In the meta‐training and multiple independent validation cohorts, the FunPGS effectively discriminated patients with CRC with significantly different prognosis at the individual level and remained as an independent factor upon adjusting for clinical covariates in multivariate analysis. Furthermore, the FunPGS demonstrated superior performance for risk stratification with respect to other recently reported signatures and clinical factors. The complementary value of the molecular signature and clinical factors was further explored, and it was observed that the composite signature called IMCPS greatly improved the predictive performance of survival estimation relative to molecular signatures or clinical factors alone. With further prospective validation in clinical trials, the FunPGS may become a promising and powerful personalized prognostic tool for stratifying patients with CRC in order to achieve an optimal systemic therapy.  相似文献   

11.
Quite a few estrogen receptor (ER)‐positive breast cancer patients receiving endocrine therapy are at risk of disease recurrence and death. ER‐related genes are involved in the progression and chemoresistance of breast cancer. In this study, we identified an ER‐related gene signature that can predict the prognosis of ER‐positive breast cancer patient receiving endocrine therapy. We collected RNA expression profiling from Gene Expression Omnibus database. An ER‐related signature was developed to separate patients into high‐risk and low‐risk groups. Patients in the low‐risk group had significantly better survival than those in the high‐risk group. ROC analysis indicated that this signature exhibited good diagnostic efficiency for the 1‐, 3‐ and 5‐year disease‐relapse events. Moreover, multivariate Cox regression analysis demonstrated that the ER‐related signature was an independent risk factor when adjusting for several clinical signatures. The prognostic value of this signature was validated in the validation sets. In addition, a nomogram was built and the calibration plots analysis indicated the good performance of this nomogram. In conclusion, combining with ER status, our results demonstrated that the ER‐related prognostic signature is a promising method for predicting the prognosis of ER‐positive breast cancer patients receiving endocrine therapy.  相似文献   

12.
The relationship between metabolism reprogramming and neuroblastoma (NB) is largely unknown. In this study, one RNA‐sequence data set (n = 153) was used as discovery cohort and two microarray data sets (n = 498 and n = 223) were used as validation cohorts. Differentially expressed metabolic genes were identified by comparing stage 4s and stage 4 NBs. Twelve metabolic genes were selected by LASSO regression analysis and integrated into the prognostic signature. The metabolic gene signature successfully stratifies NB patients into two risk groups and performs well in predicting survival of NB patients. The prognostic value of the metabolic gene signature is also independent with other clinical risk factors. Nine metabolism‐related long non‐coding RNAs (lncRNAs) were also identified and integrated into the metabolism‐related lncRNA signature. The lncRNA signature also performs well in predicting survival of NB patients. These results suggest that the metabolic signatures have the potential to be used for risk stratification of NB. Gene set enrichment analysis (GSEA) reveals that multiple metabolic processes (including oxidative phosphorylation and tricarboxylic acid cycle, both of which are emerging targets for cancer therapy) are enriched in the high‐risk NB group, and no metabolic process is enriched in the low‐risk NB group. This result indicates that metabolism reprogramming is associated with the progression of NB and targeting certain metabolic pathways might be a promising therapy for NB.  相似文献   

13.
14.
Genome-wide association studies (GWAS) have identified loci reproducibly associated with pulmonary diseases; however, the molecular mechanism underlying these associations are largely unknown. The objectives of this study were to discover genetic variants affecting gene expression in human lung tissue, to refine susceptibility loci for asthma identified in GWAS studies, and to use the genetics of gene expression and network analyses to find key molecular drivers of asthma. We performed a genome-wide search for expression quantitative trait loci (eQTL) in 1,111 human lung samples. The lung eQTL dataset was then used to inform asthma genetic studies reported in the literature. The top ranked lung eQTLs were integrated with the GWAS on asthma reported by the GABRIEL consortium to generate a Bayesian gene expression network for discovery of novel molecular pathways underpinning asthma. We detected 17,178 cis- and 593 trans- lung eQTLs, which can be used to explore the functional consequences of loci associated with lung diseases and traits. Some strong eQTLs are also asthma susceptibility loci. For example, rs3859192 on chr17q21 is robustly associated with the mRNA levels of GSDMA (P = 3.55×10−151). The genetic-gene expression network identified the SOCS3 pathway as one of the key drivers of asthma. The eQTLs and gene networks identified in this study are powerful tools for elucidating the causal mechanisms underlying pulmonary disease. This data resource offers much-needed support to pinpoint the causal genes and characterize the molecular function of gene variants associated with lung diseases.  相似文献   

15.
Understanding the categorization of human diseases is critical for reliably identifying disease causal genes. Recently, genome-wide studies of abnormal chromosomal locations related to diseases have mapped >2000 phenotype–gene relations, which provide valuable information for classifying diseases and identifying candidate genes as drug targets. In this article, a regularized non-negative matrix tri-factorization (R-NMTF) algorithm is introduced to co-cluster phenotypes and genes, and simultaneously detect associations between the detected phenotype clusters and gene clusters. The R-NMTF algorithm factorizes the phenotype–gene association matrix under the prior knowledge from phenotype similarity network and protein–protein interaction network, supervised by the label information from known disease classes and biological pathways. In the experiments on disease phenotype–gene associations in OMIM and KEGG disease pathways, R-NMTF significantly improved the classification of disease phenotypes and disease pathway genes compared with support vector machines and Label Propagation in cross-validation on the annotated phenotypes and genes. The newly predicted phenotypes in each disease class are highly consistent with human phenotype ontology annotations. The roles of the new member genes in the disease pathways are examined and validated in the protein–protein interaction subnetworks. Extensive literature review also confirmed many new members of the disease classes and pathways as well as the predicted associations between disease phenotype classes and pathways.  相似文献   

16.
17.
Whole‐genome or whole‐exome sequencing (WGS/WES) of the affected proband together with normal parents (trio) is commonly adopted to identify de novo germline mutations (DNMs) underlying sporadic cases of various genetic disorders. However, our current knowledge of the occurrence and functional effects of DNMs remains limited and accurately identifying the disease‐causing DNM from a group of irrelevant DNMs is complicated. Herein, we provide a general‐purpose discussion of important issues related to pathogenic gene identification based on trio‐based WGS/WES data. Specifically, the relevance of DNMs to human sporadic diseases, current knowledge of DNM biogenesis mechanisms, and common strategies or software tools used for DNM detection are reviewed, followed by a discussion of pathogenic gene prioritization. In addition, several key factors that may affect DNM identification accuracy and causal gene prioritization are reviewed. Based on recent major advances, this review both sheds light on how trio‐based WGS/WES technologies can play a significant role in the identification of DNMs and causal genes for sporadic diseases, and also discusses existing challenges.  相似文献   

18.
Although triple negative breast cancers (TNBC) are the most aggressive subtype of breast cancer, they currently lack targeted therapies. Because this classification still includes a heterogeneous collection of tumors, new tools to classify TNBCs are urgently required in order to improve our prognostic capability for high risk patients and predict response to therapy. We previously defined a gene expression signature, RKIP Pathway Metastasis Signature (RPMS), based upon a metastasis-suppressive signaling pathway initiated by Raf Kinase Inhibitory Protein (RKIP). We have now generated a new BACH1 Pathway Metastasis gene signature (BPMS) that utilizes targets of the metastasis regulator BACH1. Specifically, we substituted experimentally validated target genes to generate a new BACH1 metagene, developed an approach to optimize patient tumor stratification, and reduced the number of signature genes to 30. The BPMS significantly and selectively stratified metastasis-free survival in basal-like and, in particular, TNBC patients. In addition, the BPMS further stratified patients identified as having a good or poor prognosis by other signatures including the Mammaprint® and Oncotype® clinical tests. The BPMS is thus complementary to existing signatures and is a prognostic tool for high risk ER-HER2- patients. We also demonstrate the potential clinical applicability of the BPMS as a single sample predictor. Together, these results reveal the potential of this pathway-based BPMS gene signature to identify high risk TNBC patients that can respond effectively to targeted therapy, and highlight BPMS genes as novel drug targets for therapeutic development.  相似文献   

19.
We have used a supervised classification approach to systematically mine a large microarray database derived from livers of compound‐treated rats. Thirty‐four distinct signatures (classifiers) for pharmacological and toxicological end points can be identified. Just 200 genes are sufficient to classify these end points. Signatures were enriched in xenobiotic and immune response genes and contain un‐annotated genes, indicating that not all key genes in the liver xenobiotic responses have been characterized. Many signatures with equal classification capabilities but with no gene in common can be derived for the same phenotypic end point. The analysis of the union of all genes present in these signatures can reveal the underlying biology of that end point as illustrated here using liver fibrosis signatures. Our approach using the whole genome and a diverse set of compounds allows a comprehensive view of most pharmacological and toxicological questions and is applicable to other situations such as disease and development.  相似文献   

20.
In complex diseases, various combinations of genomic perturbations often lead to the same phenotype. On a molecular level, combinations of genomic perturbations are assumed to dys-regulate the same cellular pathways. Such a pathway-centric perspective is fundamental to understanding the mechanisms of complex diseases and the identification of potential drug targets. In order to provide an integrated perspective on complex disease mechanisms, we developed a novel computational method to simultaneously identify causal genes and dys-regulated pathways. First, we identified a representative set of genes that are differentially expressed in cancer compared to non-tumor control cases. Assuming that disease-associated gene expression changes are caused by genomic alterations, we determined potential paths from such genomic causes to target genes through a network of molecular interactions. Applying our method to sets of genomic alterations and gene expression profiles of 158 Glioblastoma multiforme (GBM) patients we uncovered candidate causal genes and causal paths that are potentially responsible for the altered expression of disease genes. We discovered a set of putative causal genes that potentially play a role in the disease. Combining an expression Quantitative Trait Loci (eQTL) analysis with pathway information, our approach allowed us not only to identify potential causal genes but also to find intermediate nodes and pathways mediating the information flow between causal and target genes. Our results indicate that different genomic perturbations indeed dys-regulate the same functional pathways, supporting a pathway-centric perspective of cancer. While copy number alterations and gene expression data of glioblastoma patients provided opportunities to test our approach, our method can be applied to any disease system where genetic variations play a fundamental causal role.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号