首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 468 毫秒
1.
2.
The DAVID Gene Functional Classification Tool uses a novel agglomeration algorithm to condense a list of genes or associated biological terms into organized classes of related genes or biology, called biological modules. This organization is accomplished by mining the complex biological co-occurrences found in multiple sources of functional annotation. It is a powerful method to group functionally related genes and terms into a manageable number of biological modules for efficient interpretation of gene lists in a network context.  相似文献   

3.
Selecting differentially expressed genes (DEGs) is one of the most important tasks in microarray applications for studying multi-factor diseases including cancers. However, the small samples typically used in current microarray studies may only partially reflect the widely altered gene expressions in complex diseases, which would introduce low reproducibility of gene lists selected by statistical methods. Here, by analyzing seven cancer datasets, we showed that, in each cancer, a wide range of functional modules have altered gene expressions and thus have high disease classification abilities. The results also showed that seven modules are shared across diverse cancers, suggesting hints about the common mechanisms of cancers. Therefore, instead of relying on a few individual genes whose selection is hardly reproducible in current microarray experiments, we may use functional modules as functional signatures to study core mechanisms of cancers and build robust diagnostic classifiers.  相似文献   

4.
MOTIVATION: We propose a method for studying the stability of biomarker lists obtained from functional genomics studies. It is common to adopt resampling methods to tune and evaluate marker-based diagnostic and prognostic systems in order to prevent selection bias. Such caution promotes honest estimation of class prediction, but leads to alternative sets of solutions. In microarray studies, the difference in lists may be bewildering, also due to the presence of modules of functionally related genes. Methods for assessing stability understand the dependency of the markers on the data or on the predictor's type and help selecting solutions. RESULTS: A computational framework for comparing sets of ranked biomarker lists is presented. Notions and algorithms are based on concepts from permutation group theory. We introduce several algebraic indicators and metric methods for symmetric groups, including the Canberra distance, a weighted version of Spearman's footrule. We also consider distances between partial lists and an aggregation of sets of lists into an optimal list based on voting theory (Borda count). The stability indicators are applied in practical situations to several synthetic, cancer microarray and proteomics datasets. The addressed issues are predictive classification, presence of modules, comparison of alternative biomarker lists, outlier removal, control of selection bias by randomization techniques and enrichment analysis. AVAILABILITY: Supplementary Material and software are available at the address http://biodcv.fbk.eu/listspy.html  相似文献   

5.
Wang Q  Sun J  Zhou M  Yang H  Li Y  Li X  Lv S  Li X  Li Y 《Bioinformatics (Oxford, England)》2011,27(11):1521-1528
MOTIVATION: In the functional genomic era, a large number of gene sets have been identified via high-throughput genomic and proteomic technologies. These gene sets of interest are often related to the same or similar disorders or phenotypes, and are commonly presented as differentially expressed gene lists, co-expressed gene modules, protein complexes or signaling pathways. However, biologists are still faced by the challenge of comparing gene sets and interpreting the functional relationships between gene sets into an understanding of the underlying biological mechanisms. RESULTS: We introduce a novel network-based method, designated corrected cumulative rank score (CCRS), which analyzes the functional communication and physical interaction between genes, and presents an easy-to-use web-based toolkit called GsNetCom to quantify the functional relationship between two gene sets. To evaluate the performance of our method in assessing the functional similarity between two gene sets, we analyzed the functional coherence of complexes in functional catalog and identified protein complexes in the same functional catalog. The results suggested that CCRS can offer a significant advance in addressing the functional relationship between different gene sets compared with several other available tools or algorithms with similar functionality. We also conducted the case study based on our method, and succeeded in prioritizing candidate leukemia-associated protein complexes and expanding the prioritization and analysis of cancer-related complexes to other cancer types. In addition, GsNetCom provides a new insight into the communication between gene modules, such as exploring gene sets from the perspective of well-annotated protein complexes. Availability and Implementation: GsNetCom is a freely available web accessible toolkit at http://bioinfo.hrbmu.edu.cn/GsNetCom.  相似文献   

6.
We present a computational approach based on a local search strategy that discovers sets of proteins that preferentially interact with each other. Such sets are referred to as protein communities and are likely to represent functional modules. Preferential interaction between module members is quantified via an analytical framework based on a network null model known as the random graph with given expected degrees. Based on this framework, the concept of local protein community is generalized to that of community of communities. Protein communities and higher-level structures are extracted from two yeast protein interaction data sets and a network of published interactions between human proteins. The high level structures obtained with the human network correspond to broad biological concepts such as signal transduction, regulation of gene expression, and intercellular communication. Many of the obtained human communities are enriched, in a statistically significant way, for proteins having no clear orthologs in lower organisms. This indicates that the extracted modules are quite coherent in terms of function.  相似文献   

7.
8.
The outcome of a functional genomics pipeline is usually a partial list of genomic features, ranked by their relevance in modelling biological phenotype in terms of a classification or regression model. Due to resampling protocols or to a meta-analysis comparison, it is often the case that sets of alternative feature lists (possibly of different lengths) are obtained, instead of just one list. Here we introduce a method, based on permutations, for studying the variability between lists ("list stability") in the case of lists of unequal length. We provide algorithms evaluating stability for lists embedded in the full feature set or just limited to the features occurring in the partial lists. The method is demonstrated by finding and comparing gene profiles on a large prostate cancer dataset, consisting of two cohorts of patients from different countries, for a total of 455 samples.  相似文献   

9.
To take full advantage of high-throughput genetic and physical interaction mapping projects, the raw interactions must first be assembled into models of cell structure and function. PanGIA (for physical and genetic interaction alignment) is a plug-in for the bioinformatics platform Cytoscape, designed to integrate physical and genetic interactions into hierarchical module maps. PanGIA identifies 'modules' as sets of proteins whose physical and genetic interaction data matches that of known protein complexes. Higher-order functional cooperativity and redundancy is identified by enrichment for genetic interactions across modules. This protocol begins with importing interaction networks into Cytoscape, followed by filtering and basic network visualization. Next, PanGIA is used to infer a set of modules and their functional inter-relationships. This module map is visualized in a number of intuitive ways, and modules are tested for functional enrichment and overlap with known complexes. The full protocol can be completed between 10 and 30 min, depending on the size of the data set being analyzed.  相似文献   

10.
There is an urgent need to develop simple and inexpensive methods for monitoring wildlife populations in resource-poor countries. List-based methods have been advocated as simple yet potentially useful biodiversity monitoring tools, and systems have recently been launched in a number of countries to collect species lists. We attempt to advance the use of systematic list-based monitoring by (1) suggesting improvements to the way in which list reporting rates are calculated; (2) assessing the extent to which degrading effort-corrected measures of abundance into simple species lists results in loss of information on population trends; (3) comparing long-term trends in list reporting rates with population trends from a wholly independent monitoring scheme. Daily species lists of birds were derived from regular trapping at a nature reserve in southern England. Most species showed a strong correlation across years between the proportion of lists on which they occurred, adjusted for list length (adjusted list reporting rate; ALRR), and an effort-corrected measure of abundance (captures per unit effort; CPUE). ALRR revealed almost as much about annual variation in abundance as CPUE for all but the most frequently captured species. Long-term (>20 years) trends in ALRRs at the nature reserve were positively correlated with UK national population trends recorded over the same period by an independent, labour-intensive monitoring scheme that counted birds at a large number of widely spread sites. Our results support previous claims that simple species lists could generate data useful for monitoring long-term population trends, particularly where such lists are collected systematically. However, further research on the efficiency of list reporting rates relative to more sophisticated methods is necessary, before list-based methods can be advocated for dedicated monitoring schemes in resource-poor regions.  相似文献   

11.
Xiao Y  Xu C  Xu L  Guan J  Ping Y  Fan H  Li Y  Zhao H  Li X 《Gene》2012,499(2):332-338
The development of heart failure (HF) is a complex process that can be initiated by multiple etiologies. Identifying common functional modules associated with HF is a challenging task. Here, we developed a systems method to identify these common functional modules by integrating multiple expression profiles, protein interactions from four species, gene function annotations, and text information. We identified 1439 consistently differentially expressed genes (CDEGs) across HF with different etiologies by applying three meta-analysis methods to multiple HF-related expression profiles. Using a weighted human interaction network constructed by combining interaction data from multiple species, we extracted 60 candidate CDEG modules. We further evaluated the functional relevance of each module by using expression, interaction network, functional annotations, and text information together. Finally, five functional modules with significant biological relevance were identified. We found that almost half of the genes in these modules are hubs in the weighted network, and that these modules can accurately classify HF patients from healthy subjects. We also identified many significantly enriched biological processes that contribute to the pathophysiology of HF, including two new ones, RNA splicing and vesicle-mediated protein transport. In summary, we proposed a novel framework to analyze common functional modules related to HF with different etiologies. Our findings provide important insights into the complex mechanism of HF. Further biological experimentations should be required to validate these novel biological processes.  相似文献   

12.
The architecture of the network of protein–protein physical interactions in Saccharomyces cerevisiae is exposed through the combination of two complementary theoretical network measures, betweenness centrality and ‘Q-modularity’. The yeast interactome is characterized by well-defined topological modules connected via a small number of inter-module protein interactions. Should such topological inter-module connections turn out to constitute a form of functional coordination between the modules, we speculate that this coordination is occurring typically in a pairwise fashion, rather than by way of high-degree hub proteins responsible for coordinating multiple modules. The unique non-hub-centric hierarchical organization of the interactome is not reproduced by gene duplication-and-divergence stochastic growth models that disregard global selective pressures.  相似文献   

13.
While protein-protein interactions have been studied largely as a network graph without physicality, here we analyze two protein complex data sets of Saccharomyces cerevisiae to relate physical and functional modularity to the network topology. We study for the first time the number of different protein complexes as a function of the protein complex size and find that it follows an exponential decay with a characteristic number of about 7. This reflects the dynamics of complex formation and dissociation in the cell. The analysis of the protein usage by complexes shows an extensive sharing of subunits that is due to the particular organization of the proteome into physical complexes and functional modules. This promiscuity accounts for the high clustering in the protein net-work graph. Our results underscore the need to include the information contained in observed protein complexes into protein network analyses.  相似文献   

14.
MOTIVATION: Given that association and dissociation of protein molecules is crucial in most biological processes several in silico methods have been recently developed to predict protein-protein interactions. Structural evidence has shown that usually interacting pairs of close homologs (interologs) physically interact in the same way. Moreover, conservation of an interaction depends on the conservation of the interface between interacting partners. In this article we make use of both, structural similarities among domains of known interacting proteins found in the Database of Interacting Proteins (DIP) and conservation of pairs of sequence patches involved in protein-protein interfaces to predict putative protein interaction pairs. RESULTS: We have obtained a large amount of putative protein-protein interaction (approximately 130,000). The list is independent from other techniques both experimental and theoretical. We separated the list of predictions into three sets according to their relationship with known interacting proteins found in DIP. For each set, only a small fraction of the predicted protein pairs could be independently validated by cross checking with the Human Protein Reference Database (HPRD). The fraction of validated protein pairs was always larger than that expected by using random protein pairs. Furthermore, a correlation map of interacting protein pairs was calculated with respect to molecular function, as defined in the Gene Ontology database. It shows good consistency of the predicted interactions with data in the HPRD database. The intersection between the lists of interactions of other methods and ours produces a network of potentially high-confidence interactions.  相似文献   

15.
16.
Indirect interactions among species emerge from the complexity of ecological networks and can strongly affect the response of communities to disturbances. To determine these indirect interactions and understand better community dynamics, ecologists focused on the interactions within small sets of species or modules. Thanks to their analytical tractability, modules bring insights on the mechanisms occurring in complex interaction networks. So far, most studies have considered modules with a single type of interaction although numerous species are involved in mutualistic and antagonistic interactions simultaneously. In this study, we analyse the dynamics of a diamond-shaped module with multiple interaction types: two resource species sharing a mutualist and a consumer. We describe the different types of indirect interaction occurring between the resource species and the conditions for a stable coexistence of all species. We show that the nature of indirect interactions between resource species (i.e. apparent facilitation, competition or antagonism), as well as stable coexistence, depend on the species generalism and asymmetry of interactions, or in other words, on the distribution of interaction strengths among species. We further unveil that a balance between mutualistic and antagonistic interactions at the level of resource species favours stable coexistence, and that species are more likely to coexist stably if there is apparent facilitation between the two resource species rather than apparent competition. Our results echo existing knowledge on the trophic diamond-shaped module, and confirm that our understanding of communities combining different interaction types can gain from module analyses.  相似文献   

17.
Ma J  Zhang X  Ung CY  Chen YZ  Li B 《Molecular bioSystems》2012,8(4):1179-1186
Interest in essential genes has arisen recently given their importance in antimicrobial drug development. Although knockouts of essential genes are commonly known to cause lethal phenotypes, there is insufficient understanding on the intermediate changes followed by genetic perturbation and to what extent essential genes correlate to other genes. Here, we characterized the gene knockout effects by using a list of affected genes, termed as 'damage lists'. These damage lists were identified through a refined cascading failure approach that was based on a previous topological flux balance analysis. Using an Escherichia coli metabolic network, we incorporated essentiality information into damage lists and revealed that the knockout of an essential gene mainly affects a large range of other essential genes whereas knockout of a non-essential gene only interrupts other non-essential genes. Also, genes sharing common damage lists tend to have the same essentiality. We extracted 72 core functional modules from the common damage lists of essential genes and demonstrated their ability to halt essential metabolites production. Overall, our network analysis revealed that essential and non-essential genes propagated their deletion effects via distinct routes, conferring mechanistic explanation to the observed lethality phenotypes of essential genes.  相似文献   

18.
Many complex networks such as computer and social networks exhibit modular structures, where links between nodes are much denser within modules than between modules. It is widely believed that cellular networks are also modular, reflecting the relative independence and coherence of different functional units in a cell. While many authors have claimed that observations from the yeast protein–protein interaction (PPI) network support the above hypothesis, the observed structural modularity may be an artifact because the current PPI data include interactions inferred from protein complexes through approaches that create modules (e.g., assigning pairwise interactions among all proteins in a complex). Here we analyze the yeast PPI network including protein complexes (PIC network) and excluding complexes (PEC network). We find that both PIC and PEC networks show a significantly greater structural modularity than that of randomly rewired networks. Nonetheless, there is little evidence that the structural modules correspond to functional units, particularly in the PEC network. More disturbingly, there is no evolutionary conservation among yeast, fly, and nematode modules at either the whole-module or protein-pair level. Neither is there a correlation between the evolutionary or phylogenetic conservation of a protein and the extent of its participation in various modules. Using computer simulation, we demonstrate that a higher-than-expected modularity can arise during network growth through a simple model of gene duplication, without natural selection for modularity. Taken together, our results suggest the intriguing possibility that the structural modules in the PPI network originated as an evolutionary byproduct without biological significance.  相似文献   

19.
Despite widespread acceptance that competition between scleractinian corals and benthic algae is important to the structure of coral reef communities, there is little direct experimental evidence that corals and algae do compete, and very little data on the processes and causality of their interactions. Most available evidence is observational or correlative, with intrinsic risks of confounded causality. This paper reviews and categorises the available evidence, concluding that competition between corals and algae probably is widespread on coral reefs, but also that the interaction varies considerably. Widespread replacement of corals by algae may often indicate coral mortality due to external disturbances, rather than competitive overgrowth, but may lead to competitive inhibition of coral recruitment, with consequences for reef recovery. We list eight specific processes by which corals and algae may affect each other, and suggest life history properties that will influence which of these interactions are possible. We propose a matrix for algal effects on corals, which lists the subset of processes possible for each combination of coral life form and algal functional group. This table provides a preliminary framework for improved understanding and interpretation of coral-algal interactions.  相似文献   

20.
Understanding the functional implications of changes in gene expression, mutations, etc., is the aim of most genomic experiments. To achieve this, several functional profiling methods have been proposed. Such methods study the behaviour of different gene modules (e.g. gene ontology terms) in response to one particular variable (e.g. differential gene expression). In spite to the wealth of information provided by functional profiling methods, a common limitation to all of them is their inherent unidimensional nature. In order to overcome this restriction we present a multidimensional logistic model that allows studying the relationship of gene modules with different genome-scale measurements (e.g. differential expression, genotyping association, methylation, copy number alterations, heterozygosity, etc.) simultaneously. Moreover, the relationship of such functional modules with the interactions among the variables can also be studied, which produces novel results impossible to be derived from the conventional unidimensional functional profiling methods. We report sound results of gene sets associations that remained undetected by the conventional one-dimensional gene set analysis in several examples. Our findings demonstrate the potential of the proposed approach for the discovery of new cell functionalities with complex dependences on more than one variable.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号