共查询到20条相似文献,搜索用时 15 毫秒
1.
Background
MicroRNAs (miRNAs) are a class of endogenous small regulatory RNAs. Identifications of the dys-regulated or perturbed miRNAs and their key target genes are important for understanding the regulatory networks associated with the studied cellular processes. Several computational methods have been developed to infer the perturbed miRNA regulatory networks by integrating genome-wide gene expression data and sequence-based miRNA-target predictions. However, most of them only use the expression information of the miRNA direct targets, rarely considering the secondary effects of miRNA perturbation on the global gene regulatory networks.Results
We proposed a network propagation based method to infer the perturbed miRNAs and their key target genes by integrating gene expressions and global gene regulatory network information. The method used random walk with restart in gene regulatory networks to model the network effects of the miRNA perturbation. Then, it evaluated the significance of the correlation between the network effects of the miRNA perturbation and the gene differential expression levels with a forward searching strategy. Results show that our method outperformed several compared methods in rediscovering the experimentally perturbed miRNAs in cancer cell lines. Then, we applied it on a gene expression dataset of colorectal cancer clinical patient samples and inferred the perturbed miRNA regulatory networks of colorectal cancer, including several known oncogenic or tumor-suppressive miRNAs, such as miR-17, miR-26 and miR-145.Conclusions
Our network propagation based method takes advantage of the network effect of the miRNA perturbation on its target genes. It is a useful approach to infer the perturbed miRNAs and their key target genes associated with the studied biological processes using gene expression data.Electronic supplementary material
The online version of this article (doi:10.1186/1471-2105-15-255) contains supplementary material, which is available to authorized users. 相似文献2.
Jimson W. D’Souza Smitha Reddy Lisa E. Goldsmith Irina Shchaveleva James D. Marks Samuel Litwin Matthew K. Robinson 《PloS one》2014,9(11)
Background
Inappropriate signaling through the epidermal growth factor receptor family (EGFR1/ERBB1, ERBB2/HER2, ERBB3/HER3, and ERBB4/HER4) of receptor tyrosine kinases leads to unregulated activation of multiple downstream signaling pathways that are linked to cancer formation and progression. In particular, ERBB3 plays a critical role in linking ERBB signaling to the phosphoinositide 3-kinase and Akt signaling pathway and increased levels of ERBB3-dependent signaling is also increasingly recognized as a mechanism for acquired resistance to ERBB-targeted therapies.Methods
We had previously reported the isolation of a panel of anti-ERBB3 single-chain Fv antibodies through use of phage-display technology. In the current study scFv specific for domain I (F4) and domain III (A5) were converted into human IgG1 formats and analyzed for efficacy.Results
Treatment of cells with an oligoclonal mixture of the A5/F4 IgGs appeared more effective at blocking both ligand-induced and ligand-independent signaling through ERBB3 than either single IgG alone. This correlated with improved ability to inhibit the cell growth both as a single agent and in combination with other ERBB-targeted therapies. Treatment of NCI-N87 tumor xenografts with the A5/F4 oligoclonal led to a statistically significant decrease in tumor growth rate that was further enhanced in combination with trastuzumab.Conclusion
These results suggest that an oligoclonal antibody mixture may be a more effective approach to downregulate ERBB3-dependent signaling. 相似文献3.
4.
5.
Andrew Schoenrock Bahram Samanfar Sylvain Pitre Mohsen Hooshyar Ke Jin Charles A Phillips Hui Wang Sadhna Phanse Katayoun Omidi Yuan Gui Md Alamgir Alex Wong Fredrik Barren?s Mohan Babu Mikael Benson Michael A Langston James R Green Frank Dehne Ashkan Golshani 《BMC bioinformatics》2014,15(1)
Background
Our knowledge of global protein-protein interaction (PPI) networks in complex organisms such as humans is hindered by technical limitations of current methods.Results
On the basis of short co-occurring polypeptide regions, we developed a tool called MP-PIPE capable of predicting a global human PPI network within 3 months. With a recall of 23% at a precision of 82.1%, we predicted 172,132 putative PPIs. We demonstrate the usefulness of these predictions through a range of experiments.Conclusions
The speed and accuracy associated with MP-PIPE can make this a potential tool to study individual human PPI networks (from genomic sequences alone) for personalized medicine.Electronic supplementary material
The online version of this article (doi:10.1186/s12859-014-0383-1) contains supplementary material, which is available to authorized users. 相似文献6.
Background
Proteins dynamically interact with each other to perform their biological functions. The dynamic operations of protein interaction networks (PPI) are also reflected in the dynamic formations of protein complexes. Existing protein complex detection algorithms usually overlook the inherent temporal nature of protein interactions within PPI networks. Systematically analyzing the temporal protein complexes can not only improve the accuracy of protein complex detection, but also strengthen our biological knowledge on the dynamic protein assembly processes for cellular organization.Results
In this study, we propose a novel computational method to predict temporal protein complexes. Particularly, we first construct a series of dynamic PPI networks by joint analysis of time-course gene expression data and protein interaction data. Then a Time Smooth Overlapping Complex Detection model (TS-OCD) has been proposed to detect temporal protein complexes from these dynamic PPI networks. TS-OCD can naturally capture the smoothness of networks between consecutive time points and detect overlapping protein complexes at each time point. Finally, a nonnegative matrix factorization based algorithm is introduced to merge those very similar temporal complexes across different time points.Conclusions
Extensive experimental results demonstrate the proposed method is very effective in detecting temporal protein complexes than the state-of-the-art complex detection techniques.Electronic supplementary material
The online version of this article (doi:10.1186/1471-2105-15-335) contains supplementary material, which is available to authorized users. 相似文献7.
Background
Discovering causal genetic variants from large genetic association studies poses many difficult challenges. Assessing which genetic markers are involved in determining trait status is a computationally demanding task, especially in the presence of gene-gene interactions.Results
A non-parametric Bayesian approach in the form of a Bayesian neural network is proposed for use in analyzing genetic association studies. Demonstrations on synthetic and real data reveal they are able to efficiently and accurately determine which variants are involved in determining case-control status. By using graphics processing units (GPUs) the time needed to build these models is decreased by several orders of magnitude. In comparison with commonly used approaches for detecting interactions, Bayesian neural networks perform very well across a broad spectrum of possible genetic relationships.Conclusions
The proposed framework is shown to be a powerful method for detecting causal SNPs while being computationally efficient enough to handle large datasets.Electronic supplementary material
The online version of this article (doi:10.1186/s12859-014-0368-0) contains supplementary material, which is available to authorized users. 相似文献8.
Aravind Venkatesan Sushil Tripathi Alejandro Sanz de Galdeano Ward Blondé Astrid L?greid Vladimir Mironov Martin Kuiper 《BMC bioinformatics》2014,15(1)
Background
Network-based approaches for the analysis of large-scale genomics data have become well established. Biological networks provide a knowledge scaffold against which the patterns and dynamics of ‘omics’ data can be interpreted. The background information required for the construction of such networks is often dispersed across a multitude of knowledge bases in a variety of formats. The seamless integration of this information is one of the main challenges in bioinformatics. The Semantic Web offers powerful technologies for the assembly of integrated knowledge bases that are computationally comprehensible, thereby providing a potentially powerful resource for constructing biological networks and network-based analysis.Results
We have developed the Gene eXpression Knowledge Base (GeXKB), a semantic web technology based resource that contains integrated knowledge about gene expression regulation. To affirm the utility of GeXKB we demonstrate how this resource can be exploited for the identification of candidate regulatory network proteins. We present four use cases that were designed from a biological perspective in order to find candidate members relevant for the gastrin hormone signaling network model. We show how a combination of specific query definitions and additional selection criteria derived from gene expression data and prior knowledge concerning candidate proteins can be used to retrieve a set of proteins that constitute valid candidates for regulatory network extensions.Conclusions
Semantic web technologies provide the means for processing and integrating various heterogeneous information sources. The GeXKB offers biologists such an integrated knowledge resource, allowing them to address complex biological questions pertaining to gene expression. This work illustrates how GeXKB can be used in combination with gene expression results and literature information to identify new potential candidates that may be considered for extending a gene regulatory network.Electronic supplementary material
The online version of this article (doi:10.1186/s12859-014-0386-y) contains supplementary material, which is available to authorized users. 相似文献9.
John E Wenskovitch Jr. Leonard A Harris Jose-Juan Tapia James R Faeder G Elisabeta Marai 《BMC bioinformatics》2014,15(1)
Background
Mechanistic models that describe the dynamical behaviors of biochemical systems are common in computational systems biology, especially in the realm of cellular signaling. The development of families of such models, either by a single research group or by different groups working within the same area, presents significant challenges that range from identifying structural similarities and differences between models to understanding how these differences affect system dynamics.Results
We present the development and features of an interactive model exploration system, MOSBIE, which provides utilities for identifying similarities and differences between models within a family. Models are clustered using a custom similarity metric, and a visual interface is provided that allows a researcher to interactively compare the structures of pairs of models as well as view simulation results.Conclusions
We illustrate the usefulness of MOSBIE via two case studies in the cell signaling domain. We also present feedback provided by domain experts and discuss the benefits, as well as the limitations, of the approach.Electronic supplementary material
The online version of this article (doi:10.1186/1471-2105-15-316) contains supplementary material, which is available to authorized users. 相似文献10.
11.
Background
Human T-cell leukemia viruses (HTLV) tend to induce some fatal human diseases like Adult T-cell Leukemia (ATL) by targeting human T lymphocytes. To indentify the protein-protein interactions (PPI) between HTLV viruses and Homo sapiens is one of the significant approaches to reveal the underlying mechanism of HTLV infection and host defence. At present, as biological experiments are labor-intensive and expensive, the identified part of the HTLV-human PPI networks is rather small. Although recent years have witnessed much progress in computational modeling for reconstructing pathogen-host PPI networks, data scarcity and data unavailability are two major challenges to be effectively addressed. To our knowledge, no computational method for proteome-wide HTLV-human PPI networks reconstruction has been reported.Results
In this work we develop Multi-instance Adaboost method to conduct homolog knowledge transfer for computationally reconstructing proteome-wide HTLV-human PPI networks. In this method, the homolog knowledge in the form of gene ontology (GO) is treated as auxiliary homolog instance to address the problems of data scarcity and data unavailability, while the potential negative knowledge transfer is automatically attenuated by AdaBoost instance reweighting. The cross validation experiments show that the homolog knowledge transfer in the form of independent homolog instances can effectively enrich the feature information and substitute for the missing GO information. Moreover, the independent tests show that the method can validate 70.3% of the recently curated interactions, significantly exceeding the 2.1% recognition rate by the HT-Y2H experiment. We have used the method to reconstruct the proteome-wide HTLV-human PPI networks and further conducted gene ontology based clustering of the predicted networks for further biomedical research. The gene ontology based clustering analysis of the predictions provides much biological insight into the pathogenesis of HTLV retroviruses.Conclusions
The Multi-instance AdaBoost method can effectively address the problems of data scarcity and data unavailability for the proteome-wide HTLV-human PPI interaction networks reconstruction. The gene ontology based clustering analysis of the predictions reveals some important signaling pathways and biological modules that HTLV retroviruses are likely to target.Electronic supplementary material
The online version of this article (doi:10.1186/1471-2105-15-245) contains supplementary material, which is available to authorized users. 相似文献12.
Background
Gene prediction is a challenging but crucial part in most genome analysis pipelines. Various methods have evolved that predict genes ab initio on reference sequences or evidence based with the help of additional information, such as RNA-Seq reads or EST libraries. However, none of these strategies is bias-free and one method alone does not necessarily provide a complete set of accurate predictions.Results
We present IPred (Integrative gene Prediction), a method to integrate ab initio and evidence based gene identifications to complement the advantages of different prediction strategies. IPred builds on the output of gene finders and generates a new combined set of gene identifications, representing the integrated evidence of the single method predictions.Conclusion
We evaluate IPred in simulations and real data experiments on Escherichia Coli and human data. We show that IPred improves the prediction accuracy in comparison to single method predictions and to existing methods for prediction combination.Electronic supplementary material
The online version of this article (doi:10.1186/s12864-015-1315-9) contains supplementary material, which is available to authorized users. 相似文献13.
Background
Frontline treatment of small cell lung carcinoma (SCLC) relies heavily on chemotherapeutic agents and radiation therapy. Though SCLC patients respond well to initial cycles of chemotherapy, they eventually develop resistance. Identification of novel therapies against SCLC is therefore imperative.Methods and Findings
We have designed a bioluminescence-based cell viability assay for high-throughput screening of anti-SCLC agents. The assay was first validated via standard pharmacological agents and RNA interference using two human SCLC cell lines. We then utilized the assay in a high-throughput screen using the LOPAC1280 compound library. The screening identified several drugs that target classic cancer signaling pathways as well as neuroendocrine markers in SCLC. In particular, perturbation of dopaminergic and serotonergic signaling inhibits SCLC cell viability.Conclusions
The convergence of our pharmacological data with key SCLC pathway components reiterates the importance of neurotransmitter signaling in SCLC etiology and points to possible leads for drug development. 相似文献14.
Kaushalya C Amarasinghe Jason Li Sally M Hunter Georgina L Ryland Prue A Cowin Ian G Campbell Saman K Halgamuge 《BMC genomics》2014,15(1)
Background
Using whole exome sequencing to predict aberrations in tumours is a cost effective alternative to whole genome sequencing, however is predominantly used for variant detection and infrequently utilised for detection of somatic copy number variation.Results
We propose a new method to infer copy number and genotypes using whole exome data from paired tumour/normal samples. Our algorithm uses two Hidden Markov Models to predict copy number and genotypes and computationally resolves polyploidy/aneuploidy, normal cell contamination and signal baseline shift. Our method makes explicit detection on chromosome arm level events, which are commonly found in tumour samples. The methods are combined into a package named ADTEx (Aberration Detection in Tumour Exome). We applied our algorithm to a cohort of 17 in-house generated and 18 TCGA paired ovarian cancer/normal exomes and evaluated the performance by comparing against the copy number variations and genotypes predicted using Affymetrix SNP 6.0 data of the same samples. Further, we carried out a comparison study to show that ADTEx outperformed its competitors in terms of precision and F-measure.Conclusions
Our proposed method, ADTEx, uses both depth of coverage ratios and B allele frequencies calculated from whole exome sequencing data, to predict copy number variations along with their genotypes. ADTEx is implemented as a user friendly software package using Python and R statistical language. Source code and sample data are freely available under GNU license (GPLv3) at http://adtex.sourceforge.net/.Electronic supplementary material
The online version of this article (doi:10.1186/1471-2164-15-732) contains supplementary material, which is available to authorized users. 相似文献15.
Background
The transmission networks of Plasmodium vivax characterize how the parasite transmits from one location to another, which are informative and insightful for public health policy makers to accurately predict the patterns of its geographical spread. However, such networks are not apparent from surveillance data because P. vivax transmission can be affected by many factors, such as the biological characteristics of mosquitoes and the mobility of human beings. Here, we pay special attention to the problem of how to infer the underlying transmission networks of P. vivax based on available tempo-spatial patterns of reported cases.Methodology
We first define a spatial transmission model, which involves representing both the heterogeneous transmission potential of P. vivax at individual locations and the mobility of infected populations among different locations. Based on the proposed transmission model, we further introduce a recurrent neural network model to infer the transmission networks from surveillance data. Specifically, in this model, we take into account multiple real-world factors, including the length of P. vivax incubation period, the impact of malaria control at different locations, and the total number of imported cases.Principal Findings
We implement our proposed models by focusing on the P. vivax transmission among 62 towns in Yunnan province, People''s Republic China, which have been experiencing high malaria transmission in the past years. By conducting scenario analysis with respect to different numbers of imported cases, we can (i) infer the underlying P. vivax transmission networks, (ii) estimate the number of imported cases for each individual town, and (iii) quantify the roles of individual towns in the geographical spread of P. vivax.Conclusion
The demonstrated models have presented a general means for inferring the underlying transmission networks from surveillance data. The inferred networks will offer new insights into how to improve the predictability of P. vivax transmission. 相似文献16.
17.
18.
Background
Meta-analysis has become a popular approach for high-throughput genomic data analysis because it often can significantly increase power to detect biological signals or patterns in datasets. However, when using public-available databases for meta-analysis, duplication of samples is an often encountered problem, especially for gene expression data. Not removing duplicates could lead false positive finding, misleading clustering pattern or model over-fitting issue, etc in the subsequent data analysis.Results
We developed a Bioconductor package Dupchecker that efficiently identifies duplicated samples by generating MD5 fingerprints for raw data. A real data example was demonstrated to show the usage and output of the package.Conclusions
Researchers may not pay enough attention to checking and removing duplicated samples, and then data contamination could make the results or conclusions from meta-analysis questionable. We suggest applying DupChecker to examine all gene expression data sets before any data analysis step.Electronic supplementary material
The online version of this article (doi:10.1186/1471-2105-15-323) contains supplementary material, which is available to authorized users. 相似文献19.
Haritz Irizar Maider Mu?oz-Culla Matías Sáenz-Cuesta I?aki Osorio-Querejeta Lucía Sepúlveda Tamara Castillo-Trivi?o Alvaro Prada Adolfo Lopez de Munain Javier Olascoaga David Otaegui 《BMC genomics》2015,16(1)