Similar articles
 Found 20 similar articles; search time 796 ms
1.
Modular organization of protein interaction networks   Total citations: 6, self-citations: 0, citations by others: 6
MOTIVATION: Accumulating evidence suggests that biological systems are composed of interacting, separable, functional modules. Identifying these modules is essential to understanding the organization of biological systems. RESULTS: In this paper, we present a framework to identify modules within biological networks. In this approach, the concept of degree is extended from the single vertex to the sub-graph, and a formal definition of a module in a network is used. A new agglomerative algorithm was developed to identify modules from the network by combining the new module definition with the relative edge order generated by the Girvan-Newman (G-N) algorithm. A Java program, MoNet, was developed to implement the algorithm. Applying MoNet to the yeast core protein interaction network from the Database of Interacting Proteins (DIP) identified 86 simple modules with sizes larger than three proteins. The modules obtained are significantly enriched in proteins with related biological process Gene Ontology terms. A comparison between the MoNet modules and modules defined by Radicchi et al. (2004) indicates that MoNet modules show stronger co-clustering of related genes and are more robust to ties in betweenness values. Further, the MoNet output retains the adjacency relationships between modules and allows the construction of an interaction web of modules, providing insight into the relationships between different functional modules. Thus, MoNet provides an objective approach to understanding the organization and interactions of biological processes in cellular systems. AVAILABILITY: MoNet is available upon request from the authors.
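
The idea of extending degree from a single vertex to a sub-graph can be illustrated with a minimal Python sketch. It rests on assumptions the abstract does not spell out: the "in-degree" of a sub-graph is taken to be the number of edges with both endpoints inside it, the "out-degree" the number of edges with exactly one endpoint inside it, and a sub-graph is called a module when the former exceeds the latter. The function names and the toy network are hypothetical, and this is not the MoNet implementation.

```python
from typing import Iterable, Set, Tuple

Edge = Tuple[str, str]


def subgraph_degrees(edges: Iterable[Edge], nodes: Set[str]) -> Tuple[int, int]:
    """Return (in_degree, out_degree) of the sub-graph induced by `nodes`.

    in_degree  = edges with both endpoints inside `nodes`
    out_degree = edges with exactly one endpoint inside `nodes`
    """
    inside = 0
    crossing = 0
    for u, v in edges:
        u_in, v_in = u in nodes, v in nodes
        if u_in and v_in:
            inside += 1
        elif u_in or v_in:
            crossing += 1
    return inside, crossing


def is_module(edges: Iterable[Edge], nodes: Set[str]) -> bool:
    """Assumed criterion: more edges inside the sub-graph than leaving it."""
    inside, crossing = subgraph_degrees(edges, nodes)
    return inside > crossing


# Toy undirected network: two triangles joined by the single edge C-D.
EDGES = [
    ("A", "B"), ("B", "C"), ("A", "C"),
    ("C", "D"),
    ("D", "E"), ("E", "F"), ("D", "F"),
]

if __name__ == "__main__":
    print(subgraph_degrees(EDGES, {"A", "B", "C"}))
    print(is_module(EDGES, {"A", "B", "C"}))
    print(is_module(EDGES, {"C", "D"}))
```

Under these assumptions each triangle qualifies as a module, while the bridging pair C and D does not, because most of its edges point outward.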

2.
In biological networks of molecular interactions in a cell, network motifs that are biologically relevant are also functionally coherent, or form functional modules. These functionally coherent modules combine in a hierarchical manner into larger, less cohesive subsystems, thus revealing one of the essential design principles of system-level cellular organization and function: hierarchical modularity. Arguably, hierarchical modularity has not been explicitly taken into consideration by most, if not all, functional annotation systems. As a result, the existing methods often fail to assign a statistically significant functional coherence score to biologically relevant molecular machines. We developed a methodology for hierarchical functional annotation. Given the hierarchical taxonomy of functional concepts (e.g., Gene Ontology) and the association of individual genes or proteins with these concepts (e.g., GO terms), our method assigns a Hierarchical Modularity Score (HMS) to each node in the hierarchy of functional modules; the HMS and its p-value measure the functional coherence of each module in the hierarchy. While existing methods annotate each module with a set of "enriched" functional terms in a bag of genes, our complementary method provides the hierarchical functional annotation of the modules and their hierarchically organized components. A hierarchical organization of functional modules often comes as a by-product of cluster analysis of gene expression data or protein interaction data. Otherwise, our method automatically builds such a hierarchy by directly incorporating the functional taxonomy information into the hierarchy search process and by allowing multi-functional genes to be part of more than one component in the hierarchy. In addition, its underlying HMS scoring metric ensures that the functional specificity of the terms across different levels of the hierarchical taxonomy is properly treated.
We have evaluated our method using Saccharomyces cerevisiae data from KEGG and MIPS databases and several other computationally derived and curated datasets. The code and additional supplemental files can be obtained from http://code.google.com/p/functional-annotation-of-hierarchical-modularity/ (Accessed 2012 March 13).

3.
Xiao Y, Xu C, Xu L, Guan J, Ping Y, Fan H, Li Y, Zhao H, Li X. Gene, 2012, 499(2): 332-338
The development of heart failure (HF) is a complex process that can be initiated by multiple etiologies. Identifying common functional modules associated with HF is a challenging task. Here, we developed a systems method to identify these common functional modules by integrating multiple expression profiles, protein interactions from four species, gene function annotations, and text information. We identified 1439 consistently differentially expressed genes (CDEGs) across HF with different etiologies by applying three meta-analysis methods to multiple HF-related expression profiles. Using a weighted human interaction network constructed by combining interaction data from multiple species, we extracted 60 candidate CDEG modules. We further evaluated the functional relevance of each module by using expression, interaction network, functional annotations, and text information together. Finally, five functional modules with significant biological relevance were identified. We found that almost half of the genes in these modules are hubs in the weighted network, and that these modules can accurately distinguish HF patients from healthy subjects. We also identified many significantly enriched biological processes that contribute to the pathophysiology of HF, including two new ones, RNA splicing and vesicle-mediated protein transport. In summary, we proposed a novel framework to analyze common functional modules related to HF with different etiologies. Our findings provide important insights into the complex mechanism of HF. Further biological experiments are required to validate these novel biological processes.

4.
5.
6.
7.
Chen L, Li W, Zhang L, Wang H, He W, Tai J, Li X, Li X. PLoS ONE, 2011, 6(9): e24495

Background

Disease genes that interact cooperatively play crucial roles in the process of complex diseases, yet how to analyze and represent their associations is still an open problem. Traditional methods have failed to represent direct biological evidence that disease genes associate with each other in the pathogenesis of complex diseases. Molecular networks, regarded as ‘a form of biological systems’, consist of a set of interacting biological modules (functional modules or pathways), and this notion could provide promising insight into this topic.

Methodology/Principal Findings

In this paper, we hypothesized that disease genes might associate by virtue of the associations between biological modules in molecular networks. Then we introduced a novel disease gene interaction pathway representation and analysis paradigm, and managed to identify the disease gene interaction pathway for 61 known disease genes of coronary artery disease (CAD), which contained 46 disease-risk modules and 182 interaction relationships. As demonstrated, disease genes associate through prescribed communication protocols of common biological functions and pathways.

Conclusions/Significance

Our analysis was consistent with our primary hypothesis that disease genes of complex diseases interact with their neighbors in a cooperative manner, associate with each other through shared biological functions and pathways of disease-risk modules, and finally cause dysfunctions of a series of biological processes in molecular networks. We hope our paradigm could be a promising method to identify disease gene interaction pathways for other types of complex diseases, affording additional clues to the pathogenesis of complex diseases.

8.
Many biological and clinical outcomes are based not on single proteins, but on modules of proteins embedded in protein networks. A fundamental question is how the proteins within each module contribute to the overall module activity. Here, we study the modules underlying three representative biological programs related to tissue development, breast cancer metastasis, or progression of brain cancer, respectively. For each case we apply a new method, called Network-Guided Forests, to identify predictive modules together with logic functions which tie the activity of each module to the activity of its component genes. The resulting modules implement a diverse repertoire of decision logic which cannot be captured using the simple approximations suggested in previous work such as gene summation or subtraction. We show that in cancer, certain combinations of oncogenes and tumor suppressors exert competing forces on the system, suggesting that medical genetics should move beyond cataloguing individual cancer genes to cataloguing their combinatorial logic.

9.
Advances in high-throughput 'omic technologies are starting to provide unprecedented insights into how components of biological systems are organized and interact. Key to exploiting these datasets is the definition of the components that comprise the system of interest. Although a variety of knowledge bases exist that capture such information, a major challenge is determining how these resources may be best utilized. Here we present a systematic curation strategy to define a systems-level view of the human extracellular matrix (ECM), a three-dimensional meshwork of proteins and polysaccharides that imparts structure and mechanical stability to tissues. Employing our curation strategy, we define a set of 357 proteins that represent core components of the ECM, together with an additional 524 genes that mediate related functional roles, and construct a map of their physical interactions. Topological properties help identify modules of functionally related proteins, including those involved in cell adhesion, bone formation and blood clotting. Because the ECM plays a major role in cell adhesion, proliferation and morphogenesis, defects in it have been implicated in cancer, atherosclerosis, asthma, fibrosis, and arthritis. We use MeSH annotations to identify modules enriched for specific disease terms, which help strengthen existing gene-disease associations and predict novel ones. Mapping expression and conservation data onto the network reveals modules that evolved in parallel to confer tissue-specific functionality on otherwise broadly expressed units. In addition to demonstrating an effective workflow for defining biological systems, this study crystallizes our current knowledge surrounding the organization of the ECM.
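
Enrichment of a module for an annotation term, as mentioned in the abstract above, is commonly assessed with a one-sided hypergeometric test. The abstract does not say which test the authors used, so the following Python sketch is an illustrative assumption; the function name and the toy counts are hypothetical and are not taken from the study.

```python
from math import comb


def hypergeom_enrichment_p(population: int, annotated: int,
                           module_size: int, hits: int) -> float:
    """One-sided hypergeometric p-value for term enrichment.

    population  : number of genes in the background set
    annotated   : background genes carrying the term
    module_size : number of genes in the module
    hits        : module genes carrying the term

    Returns P(X >= hits) when `module_size` genes are drawn without
    replacement from the background.
    """
    upper = min(annotated, module_size)
    total = comb(population, module_size)
    tail = sum(
        comb(annotated, k) * comb(population - annotated, module_size - k)
        for k in range(hits, upper + 1)
    )
    return tail / total


if __name__ == "__main__":
    # Toy numbers only: 10 background genes, 4 annotated,
    # a module of 3 genes of which 2 are annotated.
    print(hypergeom_enrichment_p(10, 4, 3, 2))
```

In practice the resulting p-values would be corrected for testing many terms and modules; that step is omitted here.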

10.
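Because of drug resistance, the same drug can produce different therapeutic effects in different patients. Identifying key biological markers associated with drug resistance therefore helps clinicians quickly select suitable drugs and prolong patient survival, and is of great importance for drug development and for detailed study of drug mechanisms of action. We first screened esophageal cancer cell lines for lines resistant and sensitive to different drugs and used them to find the genes associated with resistance to each drug; these computationally derived resistance-associated genes...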

11.
If perturbing two genes together has a stronger or weaker effect than expected, they are said to genetically interact. Genetic interactions are important because they help map gene function, and functionally related genes have similar genetic interaction patterns. Mapping quantitative (positive and negative) genetic interactions on a global scale has recently become possible. These data clearly show groups of genes connected by predominantly positive or negative interactions, termed monochromatic groups. These groups often correspond to functional modules, like biological processes or complexes, or connections between modules. However, it is not yet known how these patterns globally relate to known functional modules. Here we systematically study the monochromatic nature of known biological processes using the largest quantitative genetic interaction data set available, which includes fitness measurements for ~5.4 million gene pairs in the yeast Saccharomyces cerevisiae. We find that only 10% of biological processes, as defined by Gene Ontology annotations, and less than 1% of inter-process connections are monochromatic. Further, we show that protein complexes are responsible for a surprisingly large fraction of these patterns. This suggests that complexes play a central role in shaping the monochromatic landscape of biological processes. Altogether, this work shows that both positive and negative monochromatic patterns are found in known biological processes and in their connections, and that protein complexes play an important role in these patterns. The monochromatic processes, complexes and connections we find chart a hierarchical and modular map of sensitive and redundant biological systems in the yeast cell that will be useful for gene function prediction and comparison across phenotypes and organisms. Furthermore, the analysis methods we develop are applicable to other species for which genetic interactions will progressively become more available.
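
The notion of a monochromatic group, a gene set whose internal interactions are predominantly of one sign, can be sketched in a few lines of Python. This is a deliberately simplified illustration: the purity threshold of 0.9, the function name, and the toy scores are assumptions, and the abstract does not describe the statistical procedure the authors actually applied.

```python
from typing import Dict, Optional, Set, Tuple

Pair = Tuple[str, str]


def monochromatic_sign(scores: Dict[Pair, float], genes: Set[str],
                       purity: float = 0.9) -> Optional[str]:
    """Classify a gene set by the sign of its internal interactions.

    `scores` maps a gene pair to a signed interaction score.
    Returns "positive" or "negative" when at least `purity` of the
    non-zero interactions inside `genes` share that sign, else None.
    The threshold is a hypothetical choice, not a published value.
    """
    pos = neg = 0
    for (a, b), score in scores.items():
        if a in genes and b in genes:
            if score > 0:
                pos += 1
            elif score < 0:
                neg += 1
    total = pos + neg
    if total == 0:
        return None  # no measured interactions inside the set
    if pos / total >= purity:
        return "positive"
    if neg / total >= purity:
        return "negative"
    return None


# Toy signed interaction scores (hypothetical values).
SCORES = {
    ("g1", "g2"): -0.4, ("g1", "g3"): -0.2, ("g2", "g3"): -0.3,
    ("g3", "g4"): 0.5, ("g4", "g5"): 0.6, ("g1", "g4"): 0.1,
}

if __name__ == "__main__":
    print(monochromatic_sign(SCORES, {"g1", "g2", "g3"}))
    print(monochromatic_sign(SCORES, {"g1", "g2", "g3", "g4"}))
```

A fixed purity cut-off ignores group size; a real analysis would need a significance test against the background frequency of positive and negative interactions.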

12.
Diabetes mellitus (DM) and breast cancer (BC) can simultaneously occur in the same patient populations, but the molecular relationship between them remains unknown. In this study, we constructed genetic networks and used modularized analysis approaches to investigate the multi‐dimensional characteristics of two diseases and one disease subtype. A text search engine (Agilent Literature Search 2.71) and MCODE software were applied to validate potential subnetworks and to divide the modules, respectively. A total of 793 DM‐related genes, 386 type 2 diabetes (T2DM) genes and 873 BC‐related genes were identified from the Online Mendelian Inheritance in Man database. For DM and BC, a total of 99 overlapping genes, 9 modules, 29 biological processes and 7 pathways were identified. Meanwhile, for T2DM and BC, 56 overlapping genes, 5 modules, 20 biological processes and 12 pathways were identified. Based on the Gene Ontology functional enrichment analysis of the top 10 non‐overlapping modules of the two diseases, 10 biological functions and 5 pathways overlapped between them. The glycosphingolipid and lysosome pathways verified molecular mechanisms of cell death related to both DM and BC. We also identified new biological functions of dopamine receptors and four signalling pathways (Parkinson's disease, Alzheimer's disease, Huntington's disease and long‐term depression) related to both diseases; these warrant further investigation. Our results illustrate the landscape of the novel molecular substructures between DM and BC, which may support a new model for complex disease classification and rational therapies for multiple diseases.

13.
MOTIVATION: MicroRNAs (miRNAs) and mRNAs constitute an important part of gene regulatory networks, influencing diverse biological phenomena. Elucidating closely related miRNAs and mRNAs can be an essential first step towards the discovery of their combinatorial effects on different cellular states. Here, we propose a probabilistic learning method to identify synergistic miRNAs involved in regulating their condition-specific target genes (mRNAs) from multiple information sources, i.e. computationally predicted target genes of miRNAs and their respective expression profiles. RESULTS: We used data sets consisting of miRNA-target gene binding information and expression profiles of miRNAs and mRNAs in human cancer samples. Our method allowed us to detect functionally correlated miRNA-mRNA modules involved in specific biological processes from multiple data sources by using a balanced fitness function and efficient searching over multiple populations. The proposed algorithm found two miRNA-mRNA modules, highly correlated with respect to their expression and biological function. Moreover, the mRNAs included in the same module showed much higher correlations when the related miRNAs were highly expressed, demonstrating our method's ability to find coherent miRNA-mRNA modules. Most members of these modules have been reported to be closely related to cancer. Consequently, our method can provide a primary source of miRNA and target sets presumed to constitute closely related parts of gene regulatory pathways.

14.
15.
16.
Molecular networks in cells are organized into functional modules, where genes in the same module interact densely with each other and participate in the same biological process. Thus, identification of modules from molecular networks is an important step toward a better understanding of how cells function through the molecular networks. Here, we propose a simple, automatic method, called MC(2), to identify functional modules by enumerating and merging cliques in the protein-interaction data from large-scale experiments. Application of MC(2) to the S. cerevisiae protein-interaction data produces 84 modules, whose sizes range from 4 to 69 genes. The majority of the discovered modules are significantly enriched with a highly specific process term (at least 4 levels below the root) and a specific cellular component in the Gene Ontology (GO) tree. The average fraction of genes with the most enriched GO term for all modules is 82% for specific biological processes and 78% for specific cellular components. In addition, the predicted modules are enriched with coexpressed proteins. These modules are found to be useful for annotating unknown genes and uncovering novel functions of known genes. MC(2) is efficient, and takes only about 5 min to identify modules from the current yeast gene interaction network on a typical PC (Intel Xeon 2.5 GHz CPU and 512 MB memory). The CPU time of MC(2) remains affordable (12 h) even when the number of interactions is increased by a factor of 10. MC(2) and its results are publicly available at http://theory.med.buffalo.edu/MC2.
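
The general strategy of enumerating cliques and then merging them can be sketched in Python as follows. This is not the MC(2) implementation: the basic Bron-Kerbosch enumeration, the overlap-based merging rule, the parameters `min_size` and `min_overlap`, and the toy network are all assumptions made for illustration, since the abstract does not specify how cliques are merged.

```python
from typing import Dict, Iterable, List, Set, Tuple


def build_adjacency(edges: Iterable[Tuple[str, str]]) -> Dict[str, Set[str]]:
    """Build an undirected adjacency map from an edge list."""
    adj: Dict[str, Set[str]] = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def maximal_cliques(adj: Dict[str, Set[str]]) -> List[Set[str]]:
    """Enumerate maximal cliques with the basic Bron-Kerbosch recursion."""
    cliques: List[Set[str]] = []

    def expand(r: Set[str], p: Set[str], x: Set[str]) -> None:
        if not p and not x:
            cliques.append(set(r))
            return
        for v in list(p):
            expand(r | {v}, p & adj[v], x & adj[v])
            p = p - {v}
            x = x | {v}

    expand(set(), set(adj), set())
    return cliques


def merge_cliques(cliques: List[Set[str]], min_overlap: float = 0.5) -> List[Set[str]]:
    """Repeatedly merge two sets whose overlap coefficient reaches `min_overlap`.

    The overlap coefficient |A & B| / min(|A|, |B|) is a hypothetical rule.
    """
    modules = [set(c) for c in cliques]
    merged = True
    while merged:
        merged = False
        for i in range(len(modules)):
            for j in range(i + 1, len(modules)):
                a, b = modules[i], modules[j]
                if len(a & b) / min(len(a), len(b)) >= min_overlap:
                    modules[i] = a | b
                    del modules[j]
                    merged = True
                    break
            if merged:
                break
    return modules


def find_modules(edges, min_size: int = 3, min_overlap: float = 0.5) -> List[Set[str]]:
    """Keep maximal cliques of at least `min_size` nodes, then merge them."""
    cliques = [c for c in maximal_cliques(build_adjacency(edges)) if len(c) >= min_size]
    return merge_cliques(cliques, min_overlap)


# Toy network: triangles ABC and BCD share an edge; XYZ hangs off D via D-X.
EDGES = [("A", "B"), ("B", "C"), ("A", "C"), ("B", "D"), ("C", "D"),
         ("X", "Y"), ("Y", "Z"), ("X", "Z"), ("D", "X")]

if __name__ == "__main__":
    for module in find_modules(EDGES):
        print(sorted(module))
```

Maximal clique enumeration is exponential in the worst case, so a sketch like this is only suitable for small or sparse networks; the reported running times apply to the authors' own program, not to this code.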

17.
18.
19.

Background  

The accumulation of high-throughput data greatly promotes computational investigation of gene function in the context of complex biological systems. However, a biological function is not simply controlled by an individual gene, since genes function in a cooperative manner to achieve biological processes. In the study of human diseases, rather than discovering individual disease-related genes, identifying disease-associated pathways and modules has become an essential problem in the field of systems biology.

20.
Classically, the functional consequences of natural selection over genomes have been analyzed as the compound effects of individual genes. The current paradigm for large-scale analysis of adaptation is based on the observed significant deviations of rates of individual genes from neutral evolutionary expectation. This approach, which assumes independence among genes, has not been able to identify biological functions significantly enriched in positively selected genes in individual species. Alternatively, pooling related species has enhanced the search for signatures of selection. However, grouping signatures does not allow testing for adaptive differences between species. Here we introduce the Gene-Set Selection Analysis (GSSA), a new genome-wide approach to test for evidence of natural selection on functional modules. GSSA is able to detect lineage-specific evolutionary rate changes in a notable number of functional modules. For example, in nine mammal and Drosophila genomes, GSSA identifies hundreds of functional modules with significant associations to high and low rates of evolution. Many of the detected functional modules with high evolutionary rates have been previously identified as biological functions under positive selection. Notably, GSSA identifies conserved functional modules with many positively selected genes, which questions whether they are exclusively selected for fitting genomes to environmental changes. Our results agree with previous studies suggesting that adaptation requires positive selection, but not every mutation under positive selection contributes to the adaptive dynamical process of the evolution of species.
