首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 375 毫秒
1.
High-throughput technologies are now used to generate more than one type of data from the same biological samples. To properly integrate such data, we propose using co-modules, which describe coherent patterns across paired data sets, and conceive several modular methods for their identification. We first test these methods using in silico data, demonstrating that the integrative scheme of our Ping-Pong Algorithm uncovers drug-gene associations more accurately when considering noisy or complex data. Second, we provide an extensive comparative study using the gene-expression and drug-response data from the NCI-60 cell lines. Using information from the DrugBank and the Connectivity Map databases we show that the Ping-Pong Algorithm predicts drug-gene associations significantly better than other methods. Co-modules provide insights into possible mechanisms of action for a wide range of drugs and suggest new targets for therapy.  相似文献   

2.
Network medicine has been applied successfully to elicit the structure of large-scale molecular interaction networks. Its main proponents have claimed that this approach to integrative medical investigation should make it possible to identify functional modules of interacting molecular biological units as well as interactions themselves. This paper takes a significant step in this direction. Based on a large-scale analysis of the nervous system molecular medicine literature, this study analyzes and visualizes the complex structure of associations between diseases on the one hand and all types of molecular substances on the other. From this analysis it then identifies functional co-association groups consisting of several types of molecular substances, each consisting of substances that exhibit a pattern of frequent co-association with similar diseases. These groups in turn exhibit interlinking in a complex pattern, suggesting that such complex interactions between functional molecular modules may play a role in disease etiology. We find that the patterns exhibited by the networks of disease – molecular substance associations studied here correspond well to a number of recently published research results, and that the groups of molecular substances identified by statistical analysis of these networks do appear to be interesting groups of molecular substances that are interconnected in identifiable and interpretable ways. Our results not only demonstrate that networks are a convenient framework to analyze and visualize large-scale, complex relationships among molecular networks and diseases, but may also provide a conceptual basis for bridging gaps in experimental and theoretical knowledge.  相似文献   

3.
4.
Gene clustering by latent semantic indexing of MEDLINE abstracts   总被引:1,自引:0,他引:1  
MOTIVATION: A major challenge in the interpretation of high-throughput genomic data is understanding the functional associations between genes. Previously, several approaches have been described to extract gene relationships from various biological databases using term-matching methods. However, more flexible automated methods are needed to identify functional relationships (both explicit and implicit) between genes from the biomedical literature. In this study, we explored the utility of Latent Semantic Indexing (LSI), a vector space model for information retrieval, to automatically identify conceptual gene relationships from titles and abstracts in MEDLINE citations. RESULTS: We found that LSI identified gene-to-gene and keyword-to-gene relationships with high average precision. In addition, LSI identified implicit gene relationships based on word usage patterns in the gene abstract documents. Finally, we demonstrate here that pairwise distances derived from the vector angles of gene abstract documents can be effectively used to functionally group genes by hierarchical clustering. Our results provide proof-of-principle that LSI is a robust automated method to elucidate both known (explicit) and unknown (implicit) gene relationships from the biomedical literature. These features make LSI particularly useful for the analysis of novel associations discovered in genomic experiments. AVAILABILITY: The 50-gene document collection used in this study can be interactively queried at http://shad.cs.utk.edu/sgo/sgo.html.  相似文献   

5.
IntroductionResearchers worldwide with great endeavor searching and repurpose drugs might be potentially useful in fighting newly emerged coronavirus. These drugs show inhibition but also show side effects and complications too. On December 27, 2020, 80,926,235 cases have been reported worldwide. Specifically, in Pakistan, 471,335 has been reported with inconsiderable deaths.Problem statementIdentification of COVID-19 drugs pathway through drug-gene and gene−gene interaction to find out the most important genes involved in the pathway to deal with the actual cause of side effects beyond the beneficent effects of the drugs.MethodologyThe medicines used to treat COVID-19 are retrieved from the Drug Bank. The drug-gene interaction was performed using the Drug Gene Interaction Database to check the relation between the genes and the drugs. The networks of genes are developed by Gene MANIA, while Cytoscape is used to check the active functional association of the targeted gene. The developed systems cross-validated using the EnrichNet tool and identify drug genes'' concerned pathways using Reactome and STRING.ResultsFive drugs Azithromycin, Bevacizumab, CQ, HCQ, and Lopinavir, are retrieved. The drug-gene interaction shows several genes that are targeted by the drug. Gene MANIA interaction network shows the functional association of the genes like co-expression, physical interaction, predicted, genetic interaction, co-localization, and shared protein domains.ConclusionOur study suggests the pathways for each drug in which targeted genes and medicines play a crucial role, which will help experts in-vitro overcome and deal with the side effects of these drugs, as we find out the in-silico gene analysis for the COVID-19 drugs.  相似文献   

6.
7.
Since operons are unstable across Prokaryotes, it has been suggested that perhaps they re-combine in a conservative manner. Thus, genes belonging to a given operon in one genome might re-associate in other genomes revealing functional relationships among gene products. We developed a system to build networks of functional relationships of gene products based on their organization into operons in any available genome. The operon predictions are based on inter-genic distances. Our system can use different kinds of thresholds to accept a functional relationship, either related to the prediction of operons, or to the number of non-redundant genomes that support the associations. We also work by shells, meaning that we decide on the number of linking iterations to allow for the complementation of related gene sets. The method shows high reliability benchmarked against knowledge-bases of functional interactions. We also illustrate the use of Nebulon in finding new members of regulons, and of other functional groups of genes. Operon rearrangements produce thousands of high-quality new interactions per prokaryotic genome, and thousands of confirmations per genome to other predictions, making it another important tool for the inference of functional interactions from genomic context.  相似文献   

8.
9.
Groupwise functional analysis of gene variants is becoming standard in next-generation sequencing studies. As the function of many genes is unknown and their classification to pathways is scant, functional associations between genes are often inferred from large-scale omics data. Such data types—including protein–protein interactions and gene co-expression networks—are used to examine the interrelations of the implicated genes. Statistical significance is assessed by comparing the interconnectedness of the mutated genes with that of random gene sets. However, interconnectedness can be affected by confounding bias, potentially resulting in false positive findings. We show that genes implicated through de novo sequence variants are biased in their coding-sequence length and longer genes tend to cluster together, which leads to exaggerated p-values in functional studies; we present here an integrative method that addresses these bias. To discern molecular pathways relevant to complex disease, we have inferred functional associations between human genes from diverse data types and assessed them with a novel phenotype-based method. Examining the functional association between de novo gene variants, we control for the heretofore unexplored confounding bias in coding-sequence length. We test different data types and networks and find that the disease-associated genes cluster more significantly in an integrated phenotypic-linkage network than in other gene networks. We present a tool of superior power to identify functional associations among genes mutated in the same disease even after accounting for significant sequencing study bias and demonstrate the suitability of this method to functionally cluster variant genes underlying polygenic disorders.  相似文献   

10.
11.
BackgroundThere is a growing body of evidence associating microRNAs (miRNAs) with human diseases. MiRNAs are new key players in the disease paradigm demonstrating roles in several human diseases. The functional association between miRNAs and diseases remains largely unclear and far from complete. With the advent of high-throughput functional genomics techniques that infer genes and biological pathways dysregulted in diseases, it is now possible to infer functional association between diseases and biological molecules by integrating disparate biological information.ResultsHere, we first used Lasso regression model to identify miRNAs associated with disease signature as a proof of concept. Then we proposed an integrated approach that uses disease-gene associations from microarray experiments and text mining, and miRNA-gene association from computational predictions and protein networks to build functional associations network between miRNAs and diseases. The findings of the proposed model were validated against gold standard datasets using ROC analysis and results were promising (AUC=0.81). Our protein network-based approach discovered 19 new functional associations between prostate cancer and miRNAs. The new 19 associations were validated using miRNA expression data and clinical profiles and showed to act as diagnostic and prognostic prostate biomarkers. The proposed integrated approach allowed us to reconstruct functional associations between miRNAs and human diseases and uncovered functional roles of newly discovered miRNAs.ConclusionsLasso regression was used to find associations between diseases and miRNAs using their gene signature. Defining miRNA gene signature by integrating the downstream effect of miRNAs demonstrated better performance than the miRNA signature alone. Integrating biological networks and multiple data to define miRNA and disease gene signature demonstrated high performance to uncover new functional associations between miRNAs and diseases.  相似文献   

12.
基因逻辑网络研究进展   总被引:1,自引:0,他引:1  
海量生物数据的涌现,使得通过数据分析和理论方法探索生物机理成为理论生物学研究的重要途径.特别是对于基因的复杂的功能系统,建立基因网络这种理论方法的意义更为突出.Bowers在蛋白质相互作用的分析中引入了高阶逻辑关系,从而建立了系统发生谱数据的逻辑分析(LAPP)的系统方法.LAPP和通常建立模型的方法不同,它给出了一个从复杂网络的元素(或部件)的表达数据出发,通过逻辑分析,找到元素之间逻辑关联性的建模方法.这种方法能够从蛋白质表达谱数据出发,利用信息熵的算法发现两种蛋白质对一种蛋白质的联合作用,对于发现蛋白质之间新的作用机理有重要意义.由于涉及功能的基因组通常是一个大的群体构成的系统,因此LAPP方法也是一个生成复杂的基因逻辑网络的方法.基因逻辑网络的建立,方便实现通过逻辑调控进行基因调控的目的.这种方法可以应用在很多方面,如物种进化、肿瘤诊疗等等.系统阐述并分析了LAPP方法,并指出其在方法和应用方面的新进展以及评述.  相似文献   

13.
Combined analysis of the microarray and drug-activity datasets has the potential of revealing valuable knowledge about various relations among gene expressions and drug activities in the malignant cell. In this paper, we apply Bayesian networks, a tool for compact representation of the joint probability distribution, to such analysis. For the alleviation of data dimensionality problem, the huge datasets were condensed using a feature abstraction technique. The proposed analysis method was applied to the NCI60 dataset (http://discover.nci.nih.gov) consisting of gene expression profiles and drug activity patterns on human cancer cell lines. The Bayesian networks, learned from the condensed dataset, identified most of the salient pairwise correlations and some known relationships among several features in the original dataset, confirming the effectiveness of the proposed feature abstraction method. Also, a survey of the recent literature confirms the several relationships appearing in the learned Bayesian network to be biologically meaningful.  相似文献   

14.
Copy number variation (CNV) plays a role in pathogenesis of many human diseases, especially cancer. Several whole genome CNV association studies have been performed for the purpose of identifying cancer associated CNVs. Here we undertook a novel approach to whole genome CNV analysis, with the goal being identification of associations between CNV of different genes (CNV-CNV) across 60 human cancer cell lines. We hypothesize that these associations point to the roles of the associated genes in cancer, and can be indicators of their position in gene networks of cancer-driving processes. Recent studies show that gene associations are often non-linear and non-monotone. In order to obtain a more complete picture of all CNV associations, we performed omnibus univariate analysis by utilizing dCov, MIC, and HHG association tests, which are capable of detecting any type of association, including non-monotone relationships. For comparison we used Spearman and Pearson association tests, which detect only linear or monotone relationships. Application of dCov, MIC and HHG tests resulted in identification of twice as many associations compared to those found by Spearman and Pearson alone. Interestingly, most of the new associations were detected by the HHG test. Next, we utilized dCov''s and HHG''s ability to perform multivariate analysis. We tested for association between genes of unknown function and known cancer-related pathways. Our results indicate that multivariate analysis is much more effective than univariate analysis for the purpose of ascribing biological roles to genes of unknown function. We conclude that a combination of multivariate and univariate omnibus association tests can reveal significant information about gene networks of disease-driving processes. These methods can be applied to any large gene or pathway dataset, allowing more comprehensive analysis of biological processes.  相似文献   

15.
16.
White blood cell (WBC) count is a common clinical measure from complete blood count assays, and it varies widely among healthy individuals. Total WBC count and its constituent subtypes have been shown to be moderately heritable, with the heritability estimates varying across cell types. We studied 19,509 subjects from seven cohorts in a discovery analysis, and 11,823 subjects from ten cohorts for replication analyses, to determine genetic factors influencing variability within the normal hematological range for total WBC count and five WBC subtype measures. Cohort specific data was supplied by the CHARGE, HeamGen, and INGI consortia, as well as independent collaborative studies. We identified and replicated ten associations with total WBC count and five WBC subtypes at seven different genomic loci (total WBC count-6p21 in the HLA region, 17q21 near ORMDL3, and CSF3; neutrophil count-17q21; basophil count- 3p21 near RPN1 and C3orf27; lymphocyte count-6p21, 19p13 at EPS15L1; monocyte count-2q31 at ITGA4, 3q21, 8q24 an intergenic region, 9q31 near EDG2), including three previously reported associations and seven novel associations. To investigate functional relationships among variants contributing to variability in the six WBC traits, we utilized gene expression- and pathways-based analyses. We implemented gene-clustering algorithms to evaluate functional connectivity among implicated loci and showed functional relationships across cell types. Gene expression data from whole blood was utilized to show that significant biological consequences can be extracted from our genome-wide analyses, with effect estimates for significant loci from the meta-analyses being highly corellated with the proximal gene expression. In addition, collaborative efforts between the groups contributing to this study and related studies conducted by the COGENT and RIKEN groups allowed for the examination of effect homogeneity for genome-wide significant associations across populations of diverse ancestral backgrounds.  相似文献   

17.
Since genetic alteration only accounts for 20%–30% in the drug effect-related factors, the role of epigenetic regulation mechanisms in drug response is gradually being valued. However, how epigenetic changes and abnormal gene expression affect the chemotherapy response remains unclear. Therefore, we constructed a variety of mathematical models based on the integrated DNA methylation, gene expression, and anticancer drug response data of cancer cell lines from pan-cancer levels to identify genes whose DNA methylation is associated with drug response and then to assess the impact of epigenetic regulation of gene expression on the sensitivity of anticancer drugs. The innovation of the mathematical models lies in: Linear regression model is followed by logistic regression model, which greatly shortens the calculation time and ensures the reliability of results by considering the covariates. Second, reconstruction of prediction models based on multiple dataset partition methods not only evaluates the model stability but also optimizes the drug-gene pairs. For 368,520 drug-gene pairs with P < 0.05 in linear models, 999 candidate pairs with both AUC ≥ 0.8 and P < 0.05 were obtained by logistic regression models between drug response and DNA methylation. Then 931 drug-gene pairs with 45 drugs and 491 genes were optimized by model stability assessment. Integrating both DNA methylation and gene expression markedly increased predictive power for 732 drug-gene pairs where 598 drug-gene pairs including 44 drugs and 359 genes were prioritized. Several drug target genes were enriched in the modules of the drug-gene-weighted interaction network. Besides, for cancer driver genes such as EGFR, MET, and TET2, synergistic effects of DNA methylation and gene expression can predict certain anticancer drugs’ responses. In summary, we identified potential drug sensitivity-related markers from pan-cancer levels and concluded that synergistic regulation of DNA methylation and gene expression affect anticancer drug response.  相似文献   

18.
19.
The functional relationships and properties of different subtypes of dendritic cells (DC) remain largely undefined. To better characterize these cells, we used global gene analysis to determine gene expression patterns among murine CD11c(high) DC subsets. CD4(+), CD8alpha(+), and CD8alpha(-) CD4(-) (double negative (DN)) DC were purified from spleens of normal C57/BL6 mice and analyzed using Affymetrix microarrays. The CD4(+) and CD8alpha(+) DC subsets showed distinct basal expression profiles differing by >200 individual genes. These included known DC subset markers as well as previously unrecognized, differentially expressed CD Ags such as CD1d, CD5, CD22, and CD72. Flow cytometric analysis confirmed differential expression in nine of nine cases, thereby validating the microarray analysis. Interestingly, the microarray expression profiles for DN cells strongly resembled those of CD4(+) DC, differing from them by <25 genes. This suggests that CD4(+) and DN DC are closely related phylogenetically, whereas CD8alpha(+) DC represent a more distant lineage, supporting the historical distinction between CD8alpha(+) and CD8alpha(-) DC. However, staining patterns revealed that in contrast to CD4(+) DC, the DN subset is heterogeneous and comprises at least two subpopulations. Gene Ontology and literature mining analyses of genes expressed differentially among DC subsets indicated strong associations with immune response parameters as well as cell differentiation and signaling. Such associations offer clues to possible unique functions of the CD11c(high) DC subsets that to date have been difficult to define as rigid distinctions.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号