首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
MOTIVATION: Identification of functional modules in protein interaction networks is a first step in understanding the organization and dynamics of cell functions. To ensure that the identified modules are biologically meaningful, network-partitioning algorithms should take into account not only topological features but also functional relationships, and identified modules should be rigorously validated. RESULTS: In this study we first integrate proteomics and microarray datasets and represent the yeast protein-protein interaction network as a weighted graph. We then extend a betweenness-based partition algorithm, and use it to identify 266 functional modules in the yeast proteome network. For validation we show that the functional modules are indeed densely connected subgraphs. In addition, genes in the same functional module confer a similar phenotype. Furthermore, known protein complexes are largely contained in the functional modules in their entirety. We also analyze an example of a functional module and show that functional modules can be useful for gene annotation. CONTACT: yuan.33@osu.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.  相似文献   

2.
Gene co-expression, in many cases, implies the presence of a functional linkage between genes. Co-expression analysis has uncovered gene regulatory mechanisms in model organisms such as Escherichia coli and yeast. Recently, accumulation of Arabidopsis microarray data has facilitated a genome-wide inspection of gene co-expression profiles in this model plant. An approach using network analysis has provided an intuitive way to represent complex co-expression patterns between many genes. Co-expression network analysis has enabled us to extract modules, or groups of tightly co-expressed genes, associated with biological processes. Furthermore, integrated analysis of gene expression and metabolite accumulation has allowed us to hypothesize the functions of genes associated with specific metabolic processes. Co-expression network analysis is a powerful approach for data-driven hypothesis construction and gene prioritization, and provides novel insights into the system-level understanding of plant cellular processes.  相似文献   

3.
Xu Y  Duanmu H  Chang Z  Zhang S  Li Z  Li Z  Liu Y  Li K  Qiu F  Li X 《Molecular biology reports》2012,39(2):1627-1637
Copy number variations (CNVs) are one type of the human genetic variations and are pervasive in the human genome. It has been confirmed that they can play a causal role in complex diseases. Previous studies of CNVs focused more on identifying the disease-specific CNV regions or candidate genes on these CNV regions, but less on the synergistic actions between genes on CNV regions and other genes. Our research combined the CNVs with related gene co-expression to reconstruct gene co-expression network by using single nucleotide polymorphism microarray datasets and gene microarray datasets of breast cancer, and then extracted the modules which connected densely inside and analyzed the functions of modules. Interestingly, all of these modules’ functions were related to breast cancer according to our enrichment analysis, and most of the genes in these modules have been reported to be involved in breast cancer. Our findings suggested that integrating CNVs and gene co-expressed relations was an available way to analyze the roles of CNV genes and their synergistic genes in breast cancer, and provided a novel insight into the pathological mechanism of breast cancer.  相似文献   

4.
5.
Relationships among gene expression levels may be associated with the mechanisms of the disease. While identifying a direct association such as a difference in expression levels between case and control groups links genes to disease mechanisms, uncovering an indirect association in the form of a network structure may help reveal the underlying functional module associated with the disease under scrutiny. This paper presents a method to improve the biological relevance in functional module identification from the gene expression microarray data by enhancing the structure of a weighted gene co-expression network using minimum spanning tree. The enhanced network, which is called a backbone network, contains only the essential structural information to represent the gene co-expression network. The entire backbone network is decoupled into a number of coherent sub-networks, and then the functional modules are reconstructed from these sub-networks to ensure minimum redundancy. The method was tested with a simulated gene expression dataset and case-control expression datasets of autism spectrum disorder and colorectal cancer studies. The results indicate that the proposed method can accurately identify clusters in the simulated dataset, and the functional modules of the backbone network are more biologically relevant than those obtained from the original approach.  相似文献   

6.
Gene co-expression networks provide an important tool for systems biology studies. Using microarray data from the Array Express database, we constructed an Arabidopsis gene co-expression network, termed At GGM2014, based on the graphical Gaussian model, which contains 102,644 co-expression gene pairs among 18,068 genes. The network was grouped into 622 gene co-expression modules. These modules function in diverse house-keeping, cell cycle, development, hormone response, metabolism, and stress response pathways. We developed a tool to facilitate easy visualization of the expression patterns of these modules either in a tissue context or their regulation under different treatment conditions. The results indicate that at least six modules with tissue-specific expression pattern failed to record modular regulation under various stress conditions. This discrepancy could be best explained by the fact that experiments to study plant stress responses focused mainly on leaves and less on roots, and thus failed to recover specific regulation pattern in other tissues. Overall, the modular structures revealed by our network provide extensive information to generate testable hypotheses about diverse plant signaling pathways. At GGM2014 offers a constructive tool for plant systems biology studies.  相似文献   

7.
We performed a systematic review of genome‐wide gene expression datasets to identify key genes and functional modules involved in the pathogenesis of systemic lupus erythematosus (SLE) at a systems level. Genome‐wide gene expression datasets involving SLE patients were searched in Gene Expression Omnibus and ArrayExpress databases. Robust rank aggregation (RRA) analysis was used to integrate those public datasets and identify key genes associated with SLE. The weighted gene coexpression network analysis (WGCNA) was adapted to identify functional modules involved in SLE pathogenesis, and the gene ontology enrichment analysis was utilized to explore their functions. The aberrant expressions of several randomly selected key genes were further validated in SLE patients through quantitative real‐time polymerase chain reaction. Fifteen genome‐wide gene expression datasets were finally included, which involved a total of 1,778 SLE patients and 408 healthy controls. A large number of significantly upregulated or downregulated genes were identified through RRA analysis, and some of those genes were novel SLE gene signatures and their molecular roles in etiology of SLE remained vague. WGCNA further successfully identified six main functional modules involved in the pathogenesis of SLE. The most important functional module involved in SLE included 182 genes and mainly enriched in biological processes, including defense response to virus, interferon signaling pathway, and cytokine‐mediated signaling pathway. This study identifies a number of key genes and functional coexpression modules involved in SLE, which provides deepening insights into the molecular mechanism of SLE at a systems level and also provides some promising therapeutic targets.  相似文献   

8.
Minguez P  Dopazo J 《PloS one》2011,6(3):e17474
Microarray experiments have been extensively used to define signatures, which are sets of genes that can be considered markers of experimental conditions (typically diseases). Paradoxically, in spite of the apparent functional role that might be attributed to such gene sets, signatures do not seem to be reproducible across experiments. Given the close relationship between function and protein interaction, network properties can be used to study to what extent signatures are composed of genes whose resulting proteins show a considerable level of interaction (and consequently a putative common functional role).We have analysed 618 signatures and 507 modules of co-expression in cancer looking for significant values of four main protein-protein interaction (PPI) network parameters: connection degree, cluster coefficient, betweenness and number of components. A total of 3904 gene ontology (GO) modules, 146 KEGG pathways, and 263 Biocarta pathways have been used as functional modules of reference.Co-expression modules found in microarray experiments display a high level of connectivity, similar to the one shown by conventional modules based on functional definitions (GO, KEGG and Biocarta). A general observation for all the classes studied is that the networks formed by the modules improve their topological parameters when an external protein is allowed to be introduced within the paths (up to the 70% of GO modules show network parameters beyond the random expectation). This fact suggests that functional definitions are incomplete and some genes might still be missing. Conversely, signatures are clearly not capturing the altered functions in the corresponding studies. This is probably because the way in which the genes have been selected in the signatures is too conservative. These results suggest that gene selection methods which take into account relationships among genes should be superior to methods that assume independence among genes outside their functional contexts.  相似文献   

9.
10.
11.
With the tremendous increase of publicly available single-cell RNA-sequencing (scRNA-seq) datasets, bioinformatics methods based on gene co-expression network are becoming efficient tools for analyzing scRNA-seq data, improving cell type prediction accuracy and in turn facilitating biological discovery. However, the current methods are mainly based on overall co-expression correlation and overlook co-expression that exists in only a subset of cells, thus fail to discover certain rare cell types and sensitive to batch effect. Here, we developed independent component analysis-based gene co-expression network inference (ICAnet) that decomposed scRNA-seq data into a series of independent gene expression components and inferred co-expression modules, which improved cell clustering and rare cell-type discovery. ICAnet showed efficient performance for cell clustering and batch integration using scRNA-seq datasets spanning multiple cells/tissues/donors/library types. It works stably on datasets produced by different library construction strategies and with different sequencing depths and cell numbers. We demonstrated the capability of ICAnet to discover rare cell types in multiple independent scRNA-seq datasets from different sources. Importantly, the identified modules activated in acute myeloid leukemia scRNA-seq datasets have the potential to serve as new diagnostic markers. Thus, ICAnet is a competitive tool for cell clustering and biological interpretations of single-cell RNA-seq data analysis.  相似文献   

12.
13.
14.
15.
The development of poultry muscle fibers after hatching is closely related to meat quality and production efficiency. It is necessary to identify functional modules (groups of functionally related genes) related to muscle development at different developmental stages, and to investigate their relationships based on the weighted gene co-expression network analysis (WGCNA) methods. Accordingly, we investigated the co-expression associations between genes related to chicken breast muscle at four different developmental stages (between 2 and 14 weeks of age), and systematically analyzed the network topology in Jinmao Hua chicken. As a result, 2341 differentially expressed genes were identified and subjected to co-expression analysis. Four modules were identified to be related to a particular growth stage for the development of breast muscle. A series of genes with the highest connectivity were identified in the pink (2 weeks), yellow (6 weeks), green (10 weeks) and black modules (14 weeks), respectively, and visualized by Cytoscape. These hub genes (FGF, MAPKAPK5, NRG1, SCD, ACSL1, PPAR etc.) were mainly enriched in 15 pathways, such as MAPK signaling pathway, NRG/ErbB signaling pathway, and insulin signaling pathway. They shared biological functions related to development of breast muscle and adipogenesis. This is the first study of gene network with different stages of muscle development in Jinmao Hua chicken to observe co-expression patterns. It may contribute to the underlied molecular mechanisms of chicken breast muscle development.  相似文献   

16.
MOTIVATION: The diverse microarray datasets that have become available over the past several years represent a rich opportunity and challenge for biological data mining. Many supervised and unsupervised methods have been developed for the analysis of individual microarray datasets. However, integrated analysis of multiple datasets can provide a broader insight into genetic regulation of specific biological pathways under a variety of conditions. RESULTS: To aid in the analysis of such large compendia of microarray experiments, we present Microarray Experiment Functional Integration Technology (MEFIT), a scalable Bayesian framework for predicting functional relationships from integrated microarray datasets. Furthermore, MEFIT predicts these functional relationships within the context of specific biological processes. All results are provided in the context of one or more specific biological functions, which can be provided by a biologist or drawn automatically from catalogs such as the Gene Ontology (GO). Using MEFIT, we integrated 40 Saccharomyces cerevisiae microarray datasets spanning 712 unique conditions. In tests based on 110 biological functions drawn from the GO biological process ontology, MEFIT provided a 5% or greater performance increase for 54 functions, with a 5% or more decrease in performance in only two functions.  相似文献   

17.
18.
Jung KH  Lee J  Dardick C  Seo YS  Cao P  Canlas P  Phetsom J  Xu X  Ouyang S  An K  Cho YJ  Lee GC  Lee Y  An G  Ronald PC 《PLoS genetics》2008,4(8):e1000164
Functional redundancy limits detailed analysis of genes in many organisms. Here, we report a method to efficiently overcome this obstacle by combining gene expression data with analysis of gene-indexed mutants. Using a rice NSF45K oligo-microarray to compare 2-week-old light- and dark-grown rice leaf tissue, we identified 365 genes that showed significant 8-fold or greater induction in the light relative to dark conditions. We then screened collections of rice T-DNA insertional mutants to identify rice lines with mutations in the strongly light-induced genes. From this analysis, we identified 74 different lines comprising two independent mutant lines for each of 37 light-induced genes. This list was further refined by mining gene expression data to exclude genes that had potential functional redundancy due to co-expressed family members (12 genes) and genes that had inconsistent light responses across other publicly available microarray datasets (five genes). We next characterized the phenotypes of rice lines carrying mutations in ten of the remaining candidate genes and then carried out co-expression analysis associated with these genes. This analysis effectively provided candidate functions for two genes of previously unknown function and for one gene not directly linked to the tested biochemical pathways. These data demonstrate the efficiency of combining gene family-based expression profiles with analyses of insertional mutants to identify novel genes and their functions, even among members of multi-gene families.  相似文献   

19.
20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号