首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
2.

Background

One of the major goals in gene and protein expression profiling of cancer is to identify biomarkers and build classification models for prediction of disease prognosis or treatment response. Many traditional statistical methods, based on microarray gene expression data alone and individual genes' discriminatory power, often fail to identify biologically meaningful biomarkers thus resulting in poor prediction performance across data sets. Nonetheless, the variables in multivariable classifiers should synergistically interact to produce more effective classifiers than individual biomarkers.

Results

We developed an integrated approach, namely network-constrained support vector machine (netSVM), for cancer biomarker identification with an improved prediction performance. The netSVM approach is specifically designed for network biomarker identification by integrating gene expression data and protein-protein interaction data. We first evaluated the effectiveness of netSVM using simulation studies, demonstrating its improved performance over state-of-the-art network-based methods and gene-based methods for network biomarker identification. We then applied the netSVM approach to two breast cancer data sets to identify prognostic signatures for prediction of breast cancer metastasis. The experimental results show that: (1) network biomarkers identified by netSVM are highly enriched in biological pathways associated with cancer progression; (2) prediction performance is much improved when tested across different data sets. Specifically, many genes related to apoptosis, cell cycle, and cell proliferation, which are hallmark signatures of breast cancer metastasis, were identified by the netSVM approach. More importantly, several novel hub genes, biologically important with many interactions in PPI network but often showing little change in expression as compared with their downstream genes, were also identified as network biomarkers; the genes were enriched in signaling pathways such as TGF-beta signaling pathway, MAPK signaling pathway, and JAK-STAT signaling pathway. These signaling pathways may provide new insight to the underlying mechanism of breast cancer metastasis.

Conclusions

We have developed a network-based approach for cancer biomarker identification, netSVM, resulting in an improved prediction performance with network biomarkers. We have applied the netSVM approach to breast cancer gene expression data to predict metastasis in patients. Network biomarkers identified by netSVM reveal potential signaling pathways associated with breast cancer metastasis, and help improve the prediction performance across independent data sets.  相似文献   

3.
Altered expression and functional roles of the transcribed ultraconserved regions (T-UCRs), as genomic sequences with 100% conservation between the genomes of human, mouse, and rat, in the pathophysiology of neoplasms has already been investigated. Nevertheless, the relevance of the functions for T-UCRs in gastric cancer (GC) is still the subject of inquiry. In the current study, we first used a genome-wide profiling approach to analyze the expression of T-UCRs in GC patients. Then, we constructed a three-component regulatory network and investigated potential diagnostic and prognostic values of the T-UCRs. The Cancer Genome Atlas Stomach Adenocarcinoma (TCGA-STAD) dataset was used as a resource for the RNA-sequencing data. FeatureCounts was utilized to quantify the number of reads mapped to each T-UCR. Differential expression analysis was then conducted using DESeq2. In the following, interactions between T-UCRs, microRNAs (miRNAs), and messenger RNAs (mRNAs) were combined into a three-component network. Enrichment analyses were performed and a protein–protein interaction (PPI) network was constructed. The R Survival package was utilized to identify survival-related significantly differentially expressed T-UCRs (DET-UCRs). Using an in-house cohort of GC tissues, expression of two DET-UCRs was furthermore experimentally verified. Our results showed that several T-UCRs were dysregulated in TCGA-STAD tumoral samples compared to nontumoral counterparts. The three-component network was constructed which composed of DET-UCRs, miRNAs, and mRNAs nodes. Functional enrichment and PPI network analyses revealed important enriched signaling pathways and gene ontologies such as “pathway in cancer” and regulation of cell proliferation and apoptosis. Five T-UCRs were significantly correlated with the overall survival of GC patients. While no expression of uc.232 was observed in our in-house cohort of GC tissues, uc.343 showed an increased expression, although not statistically significant, in gastric tumoral tissues. The constructed three-component regulatory network of T-UCRs in GC presents a comprehensive understanding of the underlying gene expression regulation processes involved in tumor development and can serve as a basis to investigate potential prognostic biomarkers and therapeutic targets.  相似文献   

4.
5.
6.
Colorectal cancer (CRC) is a major cause of morbidity and mortality throughout the world. However, the genetic alterations and molecular mechanism of the early onset CRCs are not fully investigated. The present study aimed to characterize early onset CRC by analyzing its gene expression compared with normal controls and to identify network-based biomarkers of early onset CRC. The gene expression profiles of early onset CRC were downloaded from Gene Expression Omnibus and the differentially expressed genes (DEGs) in CRC patients were identified. Then, a protein–protein interaction (PPI) network was constructed and the clusters in PPI were analyzed by ClusterONE. Furthermore, the gene ontology functional analysis and pathway enrichment analysis were conducted to the modules in PPI network. A systems biology approach integrating microarray data and PPI was further applied to construct a PPI network in CRC. Total 631 DEGs were identified from the early onset CRC compared to healthy controls. These genes were found to be involved in several biological processes, including cell communication, cell proliferation, cell shape and apoptosis. Five functional modules which may play important roles in the initiation of early onset CRC were identified from the PPI network. Functional annotation revealed that these five modules were involved in the pathways of signal transduction, carcinogenesis and metastasis. The hub nodes of these five modules, CDC42, TEX11, QKI, CAV1 and FN1, may serve as the biomarkers of early onset CRC and have the potential to be targets for therapeutic intervention. However, further investigations are still needed to confirm our findings.  相似文献   

7.
Neuropathic pain (NP) is a common pathological pain state with limited effective treatments. This study was designed to identify potential mechanisms and candidate genes using gene expression–based genome-wide association study (eGWAS). All NP-related microarray experiments were obtained from Gene Expression Omnibus and ArrayExpress. Significantly dysregulated genes were identified between experimental and untreated groups, and the number of microarray experiments in which each gene was dysregulated was calculated. Significantly dysregulated genes were ranked according to P values of the chi-square test. Using Gene Ontology and Kyoto Encyclopedia of Genes and Genomes database, we performed functional and pathway enrichment analysis. Protein-protein interaction (PPI) network and module analysis was performed using Cytoscape software. A total of 115 candidate genes were identified from 19 independent microarray experiments by eGWAS based on the Bonferroni threshold ( P < 2.97 × 10 −6). Immune and inflammatory responses, and complement and coagulation cascades, were respectively the most enriched biological process and pathways for candidate genes. The hub genes with highest connectivity in PPI network and two modules Ccl2 and Jun, and Ctss application of the eGWAS methodology can identify mechanisms and candidate genes associated with NP. Our results support the validity and prevalence of inflammatory and immune mechanisms across different NP models, and Ccl2, Jun, and Ctss may be the hub genes for NP.  相似文献   

8.
Abstract

The prognostic, diagnostic and therapeutic value of microRNA (miRNA) expression aberrations in renal fibrosis has been studied in recent years. However, the miRNA expression profiling efforts have led to inconsistent results between the studies. The aim of this study was to perform a meta-analysis on the renal fibrosis miRNA expression profiling studies to identify candidate diagnostic biomarkers. We performed comprehensive literature searches in several databases to identify miRNA expression studies of renal fibrosis in animal models and humans. The miRNAs expression data were extracted from 20 included studies, and both miRNA vote-counting strategy and Robust Rank Aggregation method were utilized to identify significant miRNA meta-signatures. The predicted and validated targets of miRNA meta-signature were obtained by using MultiMiR package in 11 databases. Then a gene set enrichment analysis (KEGG, PANTHER pathways and GO processes) were carried out with GeneCodis web tool to recognize pathways that are most strongly influenced by modified expressions of these miRNAs. We recognized in both meta-analysis approaches a significant miRNA meta-signature of five up-regulated (miR-142-3p, miR-223-3p, miR-21-5p, miR-142-5p and miR-214-3p) and two down-regulated (miR-29c-3p and miR-200a-3p) miRNAs. Enrichment analysis confirmed that miRNA meta-signature cooperatively target functionally related genes in signalling and developmental pathways in renal fibrosis. This meta-analysis identified seven highly significant and consistently dysregulated miRNAs from 20 datasets, as the focus of future investigations to discover their potential influence to renal fibrosis and their clinical utility as biomarkers and/or as therapeutic mediators against chronic kidney disease..  相似文献   

9.
10.
Genome-wide gene expression was comparatively investigated in early-passage rheumatoid arthritis (RA) and osteoarthritis (OA) synovial fibroblasts (SFBs; n = 6 each) using oligonucleotide microarrays; mRNA/protein data were validated by quantitative PCR (qPCR) and western blotting and immunohistochemistry, respectively. Gene set enrichment analysis (GSEA) of the microarray data suggested constitutive upregulation of components of the transforming growth factor (TGF)-beta pathway in RA SFBs, with 2 hits in the top 30 regulated pathways. The growth factor TGF-beta1, its receptor TGFBR1, the TGF-beta binding proteins LTBP1/2, the TGF-beta-releasing thrombospondin 1 (THBS1), the negative effector SkiL, and the smad-associated molecule SARA were upregulated in RA SFBs compared to OA SFBs, whereas TGF-beta2 was downregulated. Upregulation of TGF-beta1 and THBS1 mRNA (both positively correlated with clinical markers of disease activity/severity) and downregulation of TGF-beta2 mRNA in RA SFBs were confirmed by qPCR. TGFBR1 mRNA (only numerically upregulated in RA SFBs) and SkiL mRNA were not differentially expressed. At the protein level, TGF-beta1 showed a slightly higher expression, and the signal-transducing TGFBR1 and the TGF-beta-activating THBS1 a significantly higher expression in RA SFBs than in OA SFBs. Consistent with the upregulated TGF-beta pathway in RA SFBs, stimulation with TGF-beta1 resulted in a significantly enhanced expression of matrix-metalloproteinase (MMP)-11 mRNA and protein in RA SFBs, but not in OA SFBs. In conclusion, RA SFBs show broad, constitutive alterations of the TGF-beta pathway. The abundance of TGF-beta, in conjunction with an augmented mRNA and/or protein expression of TGF-beta-releasing THBS1 and TGFBR1, suggests a pathogenetic role of TGF-beta-induced effects on SFBs in RA, for example, the augmentation of MMP-mediated matrix degradation/remodeling.  相似文献   

11.
Integration of pathway and protein-protein interaction(PPI) data can provide more information that could lead to new biological insights. PPIs are usually represented by a simple binary model, whereas pathways are represented by more complicated models. We developed a series of rules for transforming protein interactions from pathway to binary model, and the protein interactions from seven pathway databases, including PID, Bio Carta, Reactome, Net Path, INOH, SPIKE and KEGG, were transformed based on these rules. These pathway-derived binary protein interactions were integrated with PPIs from other five PPI databases including HPRD, Int Act, Bio GRID, MINT and DIP, to develop integrated dataset(named Path PPI). More detailed interaction type and modification information on protein interactions can be preserved in Path PPI than other existing datasets. Comparison analysis results indicate that most of the interaction overlaps values(OAB) among these pathway databases were less than 5%, and these databases must be used conjunctively. The Path PPI data was provided at http://proteomeview. hupo.org.cn/Path PPI/Path PPI.html.  相似文献   

12.
13.
本研究运用网络药理学和分子对接方法对中药桑白皮治疗糖尿病周围神经病变(DPN)的活性成分、潜在作用靶点和信号通路进行研究,探索桑白皮治疗DPN的可能作用机制。首先从中药系统药理学数据库(TCMSP)筛选出桑白皮的活性成分及靶点基因。通过GeneCards数据库及OMIM数据库筛选出DPN的疾病靶点基因,并用Cytoscape软件构建"药物-有效成分-靶基因-疾病"中药调控网络图。将有效成分靶标与疾病靶标上传到STRING数据库,构建蛋白互作网络图(PPI),并使用R语言对得到的PPI进行核心基因的筛选。运用R语言对关键靶点进行GO富集分析和KEGG通路富集分析。其次从活性成分及靶点基因中根据degree值筛选出前3个关键成分,并将该网络中的基因靶点以degree值高低进行排序,选择前3个核心靶点,然后从RCSB数据库下载相关蛋白的结构,使用Pymol软件去除溶剂分子与配体,使用AutoDock软件进行分子对接。最后通过酶联免疫吸附实验和荧光光谱实验验证网络药理学富集分析的结果。最终预测到31个桑白皮活性成分,312个活性成分相关靶点,120个桑白皮-糖尿病周围神经病变共同有效靶点。活性...  相似文献   

14.
15.
调控通路内基因表达的相关性分析   总被引:1,自引:1,他引:0  
李传星  李霞  郭政  宫滨生  屠康 《遗传》2004,26(6):929-933
本研究从基因表达调控通路的角度分析了基因功能与基因表达之间的关系,利用7套酿酒酵母基因芯片表达谱数据和通路数据库(KEGG和CYGD)所提供的信息,应用我们研制的Genehub软件分析研究了同一基因表达调控通路内的基因在mRNA表达水平上的相关性,共涉及16条通路,495个基因。通过Pearson相关系数和Spearman相关系数两种相似性测度的分析,我们发现有94%(15条)的基因表达调控通路内的基因在大于等于4套的表达谱数据中是共表达的,以上结果从基因表达调控通路的角度,证实了基因功能与基因表达之间存在着一定的相关性。  相似文献   

16.
17.
ABSTRACT: BACKGROUND: In the postgenome era, a prediction of response to treatment could lead to better dose selection for patients in radiotherapy. To identify a radiosensitive gene signature and elucidate related signaling pathways, four different microarray experiments were reanalyzed before radiotherapy. RESULTS: Radiosensitivity profiling data using clonogenic assay and gene expression profiling data from four published microarray platforms applied to NCI-60 cancer cell panel were used. The survival fraction at 2 Gy (SF2, range from 0 to 1) was calculated as a measure of radiosensitivity and a linear regression model was applied to identify genes or a gene set with a correlation between expression and radiosensitivity (SF2). Radiosensitivity signature genes were identified using significant analysis of microarrays (SAM) and gene set analysis was performed using a global test using linear regression model. Using the radiation-related signaling pathway and identified genes, a genetic network was generated. According to SAM, 31 genes were identified as common to all the microarray platforms and therefore a common radiosensitivity signature. In gene set analysis, functions in the cell cycle, DNA replication, and cell junction, including adherence and gap junctions were related to radiosensitivity. The integrin, VEGF, MAPK, p53, JAK-STAT and Wnt signaling pathways were overrepresented in radiosensitivity. Significant genes including ACTN1, CCND1, HCLS1, ITGB5, PFN2, PTPRC, RAB13, and WAS, which are adhesion-related molecules that were identified by both SAM and gene set analysis, and showed interaction in the genetic network with the integrin signaling pathway. CONCLUSIONS: Integration of four different microarray experiments and gene selection using gene set analysis discovered possible target genes and pathways relevant to radiosensitivity. Our results suggested that the identified genes are candidates for radiosensitivity biomarkers and that integrin signaling via adhesion molecules could be a target for radiosensitization.  相似文献   

18.
Recently, computational approaches integrating copy number aberrations (CNAs) and gene expression (GE) have been extensively studied to identify cancer-related genes and pathways. In this work, we integrate these two data sets with protein-protein interaction (PPI) information to find cancer-related functional modules. To integrate CNA and GE data, we first built a gene-gene relationship network from a set of seed genes by enumerating all types of pairwise correlations, e.g. GE-GE, CNA-GE, and CNA-CNA, over multiple patients. Next, we propose a voting-based cancer module identification algorithm by combining topological and data-driven properties (VToD algorithm) by using the gene-gene relationship network as a source of data-driven information, and the PPI data as topological information. We applied the VToD algorithm to 266 glioblastoma multiforme (GBM) and 96 ovarian carcinoma (OVC) samples that have both expression and copy number measurements, and identified 22 GBM modules and 23 OVC modules. Among 22 GBM modules, 15, 12, and 20 modules were significantly enriched with cancer-related KEGG, BioCarta pathways, and GO terms, respectively. Among 23 OVC modules, 19, 18, and 23 modules were significantly enriched with cancer-related KEGG, BioCarta pathways, and GO terms, respectively. Similarly, we also observed that 9 and 2 GBM modules and 15 and 18 OVC modules were enriched with cancer gene census (CGC) and specific cancer driver genes, respectively. Our proposed module-detection algorithm significantly outperformed other existing methods in terms of both functional and cancer gene set enrichments. Most of the cancer-related pathways from both cancer data sets found in our algorithm contained more than two types of gene-gene relationships, showing strong positive correlations between the number of different types of relationship and CGC enrichment -values (0.64 for GBM and 0.49 for OVC). This study suggests that identified modules containing both expression changes and CNAs can explain cancer-related activities with greater insights.  相似文献   

19.
20.
Adrenocortical carcinoma (ACC), a rare malignant neoplasm originating from adrenal cortical cells, has high malignancy and few treatments. Therefore, it is necessary to explore the molecular mechanism of tumorigenesis, screen and verify potential biomarkers, which will provide new clues for the treatment and diagnosis of ACC. In this paper, three gene expression profiles (GSE10927, GSE12368 and GSE90713) were downloaded from the Gene Expression Omnibus (GEO) database. Differentially expressed genes (DEGs) were obtained using the Limma package. Gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways were enriched by DAVID. Protein‐protein interaction (PPI) network was evaluated by STRING database, and PPI network was constructed by Cytoscape. Finally, GEPIA was used to validate hub genes’ expression. Compared with normal adrenal tissues, 74 up‐regulated DEGs and 126 down‐regulated DEGs were found in ACC samples; GO analysis showed that up‐regulated DEGs were enriched in organelle fission, nuclear division, spindle, et al, while down‐regulated DEGs were enriched in angiogenesis, proteinaceous extracellular matrix and growth factor activity; KEGG pathway analysis showed that up‐regulated DEGs were significantly enriched in cell cycle, cellular senescence and progesterone‐mediated oocyte maturation; Nine hub genes (CCNB1, CDK1, TOP2A, CCNA2, CDKN3, MAD2L1, RACGAP1, BUB1 and CCNB2) were identified by PPI network; ACC patients with high expression of 9 hub genes were all associated with worse overall survival (OS). These hub genes and pathways might be involved in the tumorigenesis, which will offer the opportunities to develop the new therapeutic targets of ACC.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号