首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Li BQ  Zhang J  Huang T  Zhang L  Cai YD 《Biochimie》2012,94(9):1910-1917
This paper presents a new method for identifying retinoblastoma related genes by integrating gene expression profile and shortest path in a functional linkage graph. With the existing protein-protein interaction data from STRING, a weighted functional linkage graph is constructed. 119 consistently differentially expressed genes between retinoblastoma and normal retina were obtained from the overlap of two gene expression studies of retinoblastoma. Then the shortest paths between each pair of these 119 genes were determined with Dijkstra's algorithm. Finally, all the genes present on the shortest paths were extracted and ranked according to their betweenness and the 119 shortest genes with a betweenness greater than 100 and with a p-value less than 0.05 were selected for further analysis. We also identified 53 retinoblastoma related miRNAs from published miRNA array data and most of the 238 (119 consistently differentially expressed genes and 119 shortest path genes) retinoblastoma genes were shown to be target genes of these 53 miRNAs. Interestingly, the genes we identified from both the gene expression profiles and the functional protein association network included more cancer genes than did the genes identified from the gene expression profiles alone. In addition, these genes also had greater functional similarity to the reported cancer genes than did the genes identified from the gene expression profiles alone. This study shows promising results and proves the efficiency of the proposed methods.  相似文献   

2.
Relationships among gene expression levels may be associated with the mechanisms of the disease. While identifying a direct association such as a difference in expression levels between case and control groups links genes to disease mechanisms, uncovering an indirect association in the form of a network structure may help reveal the underlying functional module associated with the disease under scrutiny. This paper presents a method to improve the biological relevance in functional module identification from the gene expression microarray data by enhancing the structure of a weighted gene co-expression network using minimum spanning tree. The enhanced network, which is called a backbone network, contains only the essential structural information to represent the gene co-expression network. The entire backbone network is decoupled into a number of coherent sub-networks, and then the functional modules are reconstructed from these sub-networks to ensure minimum redundancy. The method was tested with a simulated gene expression dataset and case-control expression datasets of autism spectrum disorder and colorectal cancer studies. The results indicate that the proposed method can accurately identify clusters in the simulated dataset, and the functional modules of the backbone network are more biologically relevant than those obtained from the original approach.  相似文献   

3.
Li W  Wang R  Yan Z  Bai L  Sun Z 《PloS one》2012,7(3):e33653
A considerable portion of patients with colorectal cancer have a high risk of disease recurrence after surgery. These patients can be identified by analyzing the expression profiles of signature genes in tumors. But there is no consensus on which genes should be used and the performance of specific set of signature genes varies greatly with different datasets, impeding their implementation in the routine clinical application. Instead of using individual genes, here we identified functional multi-gene modules with significant expression changes between recurrent and recurrence-free tumors, used them as the signatures for predicting colorectal cancer recurrence in multiple datasets that were collected independently and profiled on different microarray platforms. The multi-gene modules we identified have a significant enrichment of known genes and biological processes relevant to cancer development, including genes from the chemokine pathway. Most strikingly, they recruited a significant enrichment of somatic mutations found in colorectal cancer. These results confirmed the functional relevance of these modules for colorectal cancer development. Further, these functional modules from different datasets overlapped significantly. Finally, we demonstrated that, leveraging above information of these modules, our module based classifier avoided arbitrary fitting the classifier function and screening the signatures using the training data, and achieved more consistency in prognosis prediction across three independent datasets, which holds even using very small training sets of tumors.  相似文献   

4.
5.
6.
ABSTRACT: BACKGROUND: The use of biological molecular network information for diagnostic and prognostic purposes and elucidation of molecular disease mechanism is a key objective in systems biomedicine. The network of regulatory miRNA-target and functional protein interactions is a rich source of information to elucidate the function and the prognostic value of miRNAs in cancer. The objective of this study is to identify miRNAs that have high influence on target protein complexes in prostate cancer as a case study. This could provide biomarkers or therapeutic targets relevant for prostate cancer treatment. RESULTS: Our findings demonstrate that a miRNA's functional role can be explained by its target protein connectivity within a physical and functional interaction network. To detect miRNAs with high influence on target protein modules, we integrated miRNA and mRNA expression profiles with a sequence based miRNA-target network and human functional and physical protein interactions (FPI). miRNAs with high influence on target protein complexes play a role in prostate cancer progression and are promising diagnostic or prognostic biomarkers. We uncovered several miRNA-regulated protein modules which were enriched in focal adhesion and prostate cancer genes. Several miRNAs such as miR-96, miR-182, and miR-143 demonstrated high influence on their target protein complexes and could explain most of the gene expression changes in our analyzed prostate cancer data set. CONCLUSIONS: We describe a novel method to identify active miRNA-target modules relevant to prostate cancer progression and outcome. miRNAs with high influence on protein networks are valuable biomarkers that can be used in clinical investigations for prostate cancer treatment.  相似文献   

7.
结合基因功能分类体系Gene Ontology筛选聚类特征基因   总被引:3,自引:0,他引:3  
使用两套基因表达谱数据,按各基因的表达值方差,选择表达变异基因对样本聚类,发现一般使用方差较大的前10%的基因作为特征基因,就可以较好地对疾病样本聚类。对不同的疾病,包含聚类信息的特征基因有不同的分布特点。在此基础上,结合基因功能分类体系(Gene Ontology,GO),进一步筛选聚类的特征基因。通过检验在Gene Ontology中的每个功能类中的表达变异基因是否非随机地聚集,寻找疾病相关功能类,再根据相关功能类中的表达变异基因进行聚类分析。实验结果显示:结合基因功能体系进一步筛选表达变异基因作为聚类特征基因,可以保持或提高聚类准确性,并使得聚类结果具有明确的生物学意义。另外,发现了一些可能和淋巴瘤和白血病相关的基因。  相似文献   

8.
To identify molecular alterations in the progression of colorectal carcinoma, we analyzed gene expression profiles of colon cancer cell lines derived from primary and metastatic tumors from a single patient. Of 2280 cDNAs investigated using our in-house microarray, the expression of 6 genes (tumor-associated antigen L6, L-plastin, the human homologue of yeast ribosomal protein S28, the B-cell translocation gene, mitochondrial aspartate-aminotransferase, and HLA-A) increased, while that of 2 genes (keratin 5 and phosphoglucomutase) decreased in metastatic-tumor-derived cells compared with primary-tumor-derived cells. Of these genes, we assessed the L-plastin gene, an actin-bundling protein, at the protein level using a tissue microarray consisting of 58 clinically stratified colorectal cancer specimens. Consistent with our microarray results, the expression of L-plastin was significantly correlated with the progression of cancer staging. Therefore, our results suggest that the L-plastin gene is a potential metastatic marker. In addition, combining cDNA microarrays and tissue arrays, as shown here, is thought to facilitate the rapid characterization of candidate biomarkers.  相似文献   

9.
BackgroundThere is a growing body of evidence associating microRNAs (miRNAs) with human diseases. MiRNAs are new key players in the disease paradigm demonstrating roles in several human diseases. The functional association between miRNAs and diseases remains largely unclear and far from complete. With the advent of high-throughput functional genomics techniques that infer genes and biological pathways dysregulted in diseases, it is now possible to infer functional association between diseases and biological molecules by integrating disparate biological information.ResultsHere, we first used Lasso regression model to identify miRNAs associated with disease signature as a proof of concept. Then we proposed an integrated approach that uses disease-gene associations from microarray experiments and text mining, and miRNA-gene association from computational predictions and protein networks to build functional associations network between miRNAs and diseases. The findings of the proposed model were validated against gold standard datasets using ROC analysis and results were promising (AUC=0.81). Our protein network-based approach discovered 19 new functional associations between prostate cancer and miRNAs. The new 19 associations were validated using miRNA expression data and clinical profiles and showed to act as diagnostic and prognostic prostate biomarkers. The proposed integrated approach allowed us to reconstruct functional associations between miRNAs and human diseases and uncovered functional roles of newly discovered miRNAs.ConclusionsLasso regression was used to find associations between diseases and miRNAs using their gene signature. Defining miRNA gene signature by integrating the downstream effect of miRNAs demonstrated better performance than the miRNA signature alone. Integrating biological networks and multiple data to define miRNA and disease gene signature demonstrated high performance to uncover new functional associations between miRNAs and diseases.  相似文献   

10.
Well-established models of colorectal cancer progression are based on the idea that the disease evolves through a multistep process involving sequential genetic mutations, suggesting that progression through clinically defined stages should correlate with well-defined patterns of gene expression. The majority of studies to date, however, have assessed these processes one gene and one protein at a time. We report the first comprehensive assessment of both gene and protein expression performed in parallel across progressive stages of human colorectal neoplasia. Remarkably, despite the global nature of the gene expression assessment, very few genes could be linked with certainty to specific proteins through currently available annotations. Furthermore, the correlation of expression between identified genes and proteins was poor. Nevertheless, both produced expression signatures that differentiated normal mucosa and nonmalignant adenomas from each other and from the malignant carcinomas and both produced fairly consistent subclasses of the malignancies, suggesting that a molecular staging might be more appropriate provided that these profiles can be tied to clinical outcome. This is potentially important as clinical staging is widely used as a prognostic indicator and used in the decision to pursue adjuvant therapies.  相似文献   

11.

Introduction

Heredity is estimated to cause at least 20% of colorectal cancer. The hereditary nonpolyposis colorectal cancer subset is divided into Lynch syndrome and familial colorectal cancer type X (FCCTX) based on presence of mismatch repair (MMR) gene defects.

Purpose

We addressed the gene expression signatures in colorectal cancer linked to Lynch syndrome and FCCTX with the aim to identify candidate genes and to map signaling pathways relevant in hereditary colorectal carcinogenesis.

Experimental design

The 18 k whole-genome c-DNA-mediated annealing, selection, extension, and ligation (WG-DASL) assay was applied to 123 colorectal cancers, including 39 Lynch syndrome tumors and 37 FCCTX tumors. Target genes were technically validated using real-time quantitative RT-PCR (qRT-PCR) and the expression signature was validated in independent datasets.

Results

Colorectal cancers linked to Lynch syndrome and FCCTX showed distinct gene expression profiles, which by significance analysis of microarrays (SAM) differed by 2188 genes. Functional pathways involved were related to G-protein coupled receptor signaling, oxidative phosphorylation, and cell cycle function and mitosis. qRT-PCR verified altered expression of the selected genes NDUFA9, AXIN2, MYC, DNA2 and H2AFZ. Application of the 2188-gene signature to independent datasets showed strong correlation to MMR status.

Conclusion

Distinct genetic profiles and deregulation of different canonical pathways apply to Lynch syndrome and FCCTX and key targets herein may be relevant to pursue for refined diagnostic and therapeutic strategies in hereditary colorectal cancer.  相似文献   

12.
Previous expression quantitative trait loci (eQTL) studies have performed genetic association studies for gene expression, but most of these studies examined lymphoblastoid cell lines from non-diseased individuals. We examined the genetics of gene expression in a relevant disease tissue from chronic obstructive pulmonary disease (COPD) patients to identify functional effects of known susceptibility genes and to find novel disease genes. By combining gene expression profiling on induced sputum samples from 131 COPD cases from the ECLIPSE Study with genomewide single nucleotide polymorphism (SNP) data, we found 4315 significant cis-eQTL SNP-probe set associations (3309 unique SNPs). The 3309 SNPs were tested for association with COPD in a genomewide association study (GWAS) dataset, which included 2940 COPD cases and 1380 controls. Adjusting for 3309 tests (p<1.5e-5), the two SNPs which were significantly associated with COPD were located in two separate genes in a known COPD locus on chromosome 15: CHRNA5 and IREB2. Detailed analysis of chromosome 15 demonstrated additional eQTLs for IREB2 mapping to that gene. eQTL SNPs for CHRNA5 mapped to multiple linkage disequilibrium (LD) bins. The eQTLs for IREB2 and CHRNA5 were not in LD. Seventy-four additional eQTL SNPs were associated with COPD at p<0.01. These were genotyped in two COPD populations, finding replicated associations with a SNP in PSORS1C1, in the HLA-C region on chromosome 6. Integrative analysis of GWAS and gene expression data from relevant tissue from diseased subjects has located potential functional variants in two known COPD genes and has identified a novel COPD susceptibility locus.  相似文献   

13.
目前微阵列数据分析方法都基于具有相似表达模式的基因可能具有相近的生物学功能这一假设, 而实际上参与同一生物学功能的基因, 在表达时间和空间上是有关联的, 而并非表现为相似模式。利用水稻cDNA微阵列, 对水稻在ABA及干旱、寒冷和高盐胁迫条件下的基因表达进行了研究。选取环境胁迫和ABA应答的相关基因, 采用最短路径法(shortest path), 利用自行编制的计算软件, 在表达模式不直接相关的基因之间构建最短路径。研究表明, 通过分析这些基因的表达数据, 可以发现它们在功能上的关联性, 并对未知基因的功能预测进行了探索, 为构建水稻在ABA和环境胁迫条件下的分子应答网络奠定了基础。  相似文献   

14.
In complex diseases, various combinations of genomic perturbations often lead to the same phenotype. On a molecular level, combinations of genomic perturbations are assumed to dys-regulate the same cellular pathways. Such a pathway-centric perspective is fundamental to understanding the mechanisms of complex diseases and the identification of potential drug targets. In order to provide an integrated perspective on complex disease mechanisms, we developed a novel computational method to simultaneously identify causal genes and dys-regulated pathways. First, we identified a representative set of genes that are differentially expressed in cancer compared to non-tumor control cases. Assuming that disease-associated gene expression changes are caused by genomic alterations, we determined potential paths from such genomic causes to target genes through a network of molecular interactions. Applying our method to sets of genomic alterations and gene expression profiles of 158 Glioblastoma multiforme (GBM) patients we uncovered candidate causal genes and causal paths that are potentially responsible for the altered expression of disease genes. We discovered a set of putative causal genes that potentially play a role in the disease. Combining an expression Quantitative Trait Loci (eQTL) analysis with pathway information, our approach allowed us not only to identify potential causal genes but also to find intermediate nodes and pathways mediating the information flow between causal and target genes. Our results indicate that different genomic perturbations indeed dys-regulate the same functional pathways, supporting a pathway-centric perspective of cancer. While copy number alterations and gene expression data of glioblastoma patients provided opportunities to test our approach, our method can be applied to any disease system where genetic variations play a fundamental causal role.  相似文献   

15.
A limitation of many gene expression analytic approaches is that they do not incorporate comprehensive background knowledge about the genes into the analysis. We present a computational method that leverages the peer-reviewed literature in the automatic analysis of gene expression data sets. Including the literature in the analysis of gene expression data offers an opportunity to incorporate functional information about the genes when defining expression clusters. We have created a method that associates gene expression profiles with known biological functions. Our method has two steps. First, we apply hierarchical clustering to the given gene expression data set. Secondly, we use text from abstracts about genes to (i) resolve hierarchical cluster boundaries to optimize the functional coherence of the clusters and (ii) recognize those clusters that are most functionally coherent. In the case where a gene has not been investigated and therefore lacks primary literature, articles about well-studied homologous genes are added as references. We apply our method to two large gene expression data sets with different properties. The first contains measurements for a subset of well-studied Saccharomyces cerevisiae genes with multiple literature references, and the second contains newly discovered genes in Drosophila melanogaster; many have no literature references at all. In both cases, we are able to rapidly define and identify the biologically relevant gene expression profiles without manual intervention. In both cases, we identified novel clusters that were not noted by the original investigators.  相似文献   

16.
Aberrant DNA methylation patterns have been reported in inflamed tissues and may play a role in disease. We studied DNA methylation and gene expression profiles of purified intestinal epithelial cells from ulcerative colitis patients, comparing inflamed and non-inflamed areas of the colon. We identified 577 differentially methylated sites (false discovery rate <0.2) mapping to 210 genes. From gene expression data from the same epithelial cells, we identified 62 differentially expressed genes with increased expression in the presence of inflammation at prostate cancer susceptibility genes PRAC1 and PRAC2. Four genes showed inverse correlation between methylation and gene expression; ROR1, GXYLT2, FOXA2, and, notably, RARB, a gene previously identified as a tumor suppressor in colorectal adenocarcinoma as well as breast, lung and prostate cancer. We highlight targeted and specific patterns of DNA methylation and gene expression in epithelial cells from inflamed colon, while challenging the importance of epithelial cells in the pathogenesis of chronic inflammation.  相似文献   

17.
Therapies targeting the type I insulin-like growth factor receptor (IGF-1R) have not been developed with predictive biomarkers to identify tumors with receptor activation. We have previously shown that the insulin receptor substrate (IRS) adaptor proteins are necessary for linking IGF1R to downstream signaling pathways and the malignant phenotype in breast cancer cells. The purpose of this study was to identify gene expression profiles downstream of IGF1R and its two adaptor proteins. IRS-null breast cancer cells (T47D-YA) were engineered to express IRS-1 or IRS-2 alone and their ability to mediate IGF ligand-induced proliferation, motility, and gene expression determined. Global gene expression signatures reflecting IRS adaptor specific and primary vs. secondary ligand response were derived (Early IRS-1, Late IRS-1, Early IRS-2 and Late IRS-2) and functional pathway analysis examined. IRS isoforms mediated distinct gene expression profiles, functional pathways, and breast cancer subtype association. For example, IRS-1/2-induced TGFb2 expression and blockade of TGFb2 abrogated IGF-induced cell migration. In addition, the prognostic value of IRS proteins was significant in the luminal B breast tumor subtype. Univariate and multivariate analyses confirmed that IRS adaptor signatures correlated with poor outcome as measured by recurrence-free and overall survival. Thus, IRS adaptor protein expression is required for IGF ligand responses in breast cancer cells. IRS-specific gene signatures represent accurate surrogates of IGF activity and could predict response to anti-IGF therapy in breast cancer.  相似文献   

18.
19.
Emerging evidence indicates that gene products implicated in human cancers often cluster together in “hot spots” in protein-protein interaction (PPI) networks. Additionally, small sub-networks within PPI networks that demonstrate synergistic differential expression with respect to tumorigenic phenotypes were recently shown to be more accurate classifiers of disease progression when compared to single targets identified by traditional approaches. However, many of these studies rely exclusively on mRNA expression data, a useful but limited measure of cellular activity. Proteomic profiling experiments provide information at the post-translational level, yet they generally screen only a limited fraction of the proteome. Here, we demonstrate that integration of these complementary data sources with a “proteomics-first” approach can enhance the discovery of candidate sub-networks in cancer that are well-suited for mechanistic validation in disease. We propose that small changes in the mRNA expression of multiple genes in the neighborhood of a protein-hub can be synergistically associated with significant changes in the activity of that protein and its network neighbors. Further, we hypothesize that proteomic targets with significant fold change between phenotype and control may be used to “seed” a search for small PPI sub-networks that are functionally associated with these targets. To test this hypothesis, we select proteomic targets having significant expression changes in human colorectal cancer (CRC) from two independent 2-D gel-based screens. Then, we use random walk based models of network crosstalk and develop novel reference models to identify sub-networks that are statistically significant in terms of their functional association with these proteomic targets. Subsequently, using an information-theoretic measure, we evaluate synergistic changes in the activity of identified sub-networks based on genome-wide screens of mRNA expression in CRC. Cross-classification experiments to predict disease class show excellent performance using only a few sub-networks, underwriting the strength of the proposed approach in discovering relevant and reproducible sub-networks.  相似文献   

20.

Background

Complex diseases are associated with altered interactions between thousands of genes. We developed a novel method to identify and prioritize disease genes, which was generally applicable to complex diseases.

Results

We identified modules of highly interconnected genes in disease-specific networks derived from integrating gene-expression and protein interaction data. We examined if those modules were enriched for disease-associated SNPs, and could be used to find novel genes for functional studies. First, we analyzed publicly available gene expression microarray and genome-wide association study (GWAS) data from 13, highly diverse, complex diseases. In each disease, highly interconnected genes formed modules, which were significantly enriched for genes harboring disease-associated SNPs. To test if such modules could be used to find novel genes for functional studies, we repeated the analyses using our own gene expression microarray and GWAS data from seasonal allergic rhinitis. We identified a novel gene, FGF2, whose relevance was supported by functional studies using combined small interfering RNA-mediated knock-down and gene expression microarrays. The modules in the 13 complex diseases analyzed here tended to overlap and were enriched for pathways related to oncological, metabolic and inflammatory diseases. This suggested that this union of the modules would be associated with a general increase in susceptibility for complex diseases. Indeed, we found that this union was enriched with GWAS genes for 145 other complex diseases.

Conclusions

Modules of highly interconnected complex disease genes were enriched for disease-associated SNPs, and could be used to find novel genes for functional studies.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号