期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

OryzaExpress: an integrated database of gene expression networks and omics annotations in rice 总被引：1，自引：0，他引：1

Hamada K Hongo K Suwabe K Shimizu A Nagayama T Abe R Kikuchi S Yamamoto N Fujii T Yokoyama K Tsuchida H Sano K Mochizuki T Oki N Horiuchi Y Fujita M Watanabe M Matsuoka M Kurata N Yano K 《Plant & cell physiology》2011,52(2):220-229

相似文献

2.

Probabilistic protein function prediction from heterogeneous genome-wide data

Nariai N Kolaczyk ED Kasif S 《PloS one》2007,2(3):e337

Dramatic improvements in high throughput sequencing technologies have led to a staggering growth in the number of predicted genes. However, a large fraction of these newly discovered genes do not have a functional assignment. Fortunately, a variety of novel high-throughput genome-wide functional screening technologies provide important clues that shed light on gene function. The integration of heterogeneous data to predict protein function has been shown to improve the accuracy of automated gene annotation systems. In this paper, we propose and evaluate a probabilistic approach for protein function prediction that integrates protein-protein interaction (PPI) data, gene expression data, protein motif information, mutant phenotype data, and protein localization data. First, functional linkage graphs are constructed from PPI data and gene expression data, in which an edge between nodes (proteins) represents evidence for functional similarity. The assumption here is that graph neighbors are more likely to share protein function, compared to proteins that are not neighbors. The functional linkage graph model is then used in concert with protein domain, mutant phenotype and protein localization data to produce a functional prediction. Our method is applied to the functional prediction of Saccharomyces cerevisiae genes, using Gene Ontology (GO) terms as the basis of our annotation. In a cross validation study we show that the integrated model increases recall by 18%, compared to using PPI data alone at the 50% precision. We also show that the integrated predictor is significantly better than each individual predictor. However, the observed improvement vs. PPI depends on both the new source of data and the functional category to be predicted. Surprisingly, in some contexts integration hurts overall prediction accuracy. Lastly, we provide a comprehensive assignment of putative GO terms to 463 proteins that currently have no assigned function. 相似文献

3.

Molecular interaction networks for the analysis of human disease: Utility,limitations, and considerations

Sarah‐Jane Schramm Vivek Jayaswal Apurv Goel Simone S. Li Yee Hwa Yang Graham J. Mann Marc R. Wilkins 《Proteomics》2013,13(23-24):3393-3405

High‐throughput ‘‐omics’ data can be combined with large‐scale molecular interaction networks, for example, protein–protein interaction networks, to provide a unique framework for the investigation of human molecular biology. Interest in these integrative ‘‐omics’ methods is growing rapidly because of their potential to understand complexity and association with disease; such approaches have a focus on associations between phenotype and “network‐type.” The potential of this research is enticing, yet there remain a series of important considerations. Here, we discuss interaction data selection, data quality, the relative merits of using data from large high‐throughput studies versus a meta‐database of smaller literature‐curated studies, and possible issues of sociological or inspection bias in interaction data. Other work underway, especially international consortia to establish data formats, quality standards and address data redundancy, and the improvements these efforts are making to the field, is also evaluated. We present options for researchers intending to use large‐scale molecular interaction networks as a functional context for protein or gene expression data, including microRNAs, especially in the context of human disease. 相似文献

4.

Comparison of pattern detection methods in microarray time series of the segmentation clock

Dequéant ML Ahnert S Edelsbrunner H Fink TM Glynn EF Hattem G Kudlicki A Mileyko Y Morton J Mushegian AR Pachter L Rowicka M Shiu A Sturmfels B Pourquié O 《PloS one》2008,3(8):e2856

相似文献

5.

Partial least squares regression, support vector machine regression, and transcriptome-based distances for prediction of maize hybrid performance with gene expression data

Fu J Falke KC Thiemann A Schrag TA Melchinger AE Scholten S Frisch M 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》2012,124(5):825-833

相似文献

6.

Filtering genetic variants and placing informative <Emphasis Type="Italic">priors</Emphasis> based on putative biological function

Stefanie?Friedrichs D?rthe?Malzahn Elizabeth?W.?Pugh Marcio?Almeida Xiao?Qing?Liu Julia?N.?Bailey Email author 《BMC genetics》2016,17(Z2):S8

High-density genetic marker data, especially sequence data, imply an immense multiple testing burden. This can be ameliorated by filtering genetic variants, exploiting or accounting for correlations between variants, jointly testing variants, and by incorporating informative priors. Priors can be based on biological knowledge or predicted variant function, or even be used to integrate gene expression or other omics data. Based on Genetic Analysis Workshop (GAW) 19 data, this article discusses diversity and usefulness of functional variant scores provided, for example, by PolyPhen2, SIFT, or RegulomeDB annotations. Incorporating functional scores into variant filters or weights and adjusting the significance level for correlations between variants yielded significant associations with blood pressure traits in a large family study of Mexican Americans (GAW19 data set). Marker rs218966 in gene PHF14 and rs9836027 in MAP4 significantly associated with hypertension; additionally, rare variants in SNUPN significantly associated with systolic blood pressure. Variant weights strongly influenced the power of kernel methods and burden tests. Apart from variant weights in test statistics, prior weights may also be used when combining test statistics or to informatively weight p values while controlling false discovery rate (FDR). Indeed, power improved when gene expression data for FDR-controlled informative weighting of association test p values of genes was used. Finally, approaches exploiting variant correlations included identity-by-descent mapping and the optimal strategy for joint testing rare and common variants, which was observed to depend on linkage disequilibrium structure. 相似文献

7.

Cardiac function-related gene expression profiles in human atrial myocytes

Ohki-Kaneda R Ohashi J Yamamoto K Ueno S Ota J Choi YL Koinuma K Yamashita Y Misawa Y Fuse K Ikeda U Shimada K Mano H 《Biochemical and biophysical research communications》2004,320(4):1328-1336

To obtain insights into the molecular pathogenesis of heart failure in humans, we have analyzed the expression profiles of>12,000 genes in a total of 17 human specimens of right atrial myocytes. From this large data set, we here tried to identify gene clusters, expression level of which is correlated precisely with clinical parameter values of cardiac function. We could reveal that cardiac myocytes with normal sinus rhythm were clearly differentiated, in the point of view of gene expression, from those with atrial fibrillation. Further, an expression profile-based prediction of arrhythmia by a newly developed "weighted-distance method" could efficiently diagnose our samples. We could even construct calculation formulae for the values of left ventricular ejection fraction based on the expression level of selected genes. To our best knowledge, this is the first report to indicate that pumping ability of heart can be predicted by any measures of atrium. 相似文献

8.

Genome-wide matching of genes to cellular roles using guilt-by-association models derived from single sample analysis

JA Klomp KA Furge 《BMC research notes》2012,5(1):370

相似文献

9.

Multi-omics network-based functional annotation of unknown Arabidopsis genes

Thomas Depuydt Klaas Vandepoele 《The Plant journal : for cell and molecular biology》2021,108(4):1193-1212

相似文献

10.

Accurate cancer phenotype prediction with AKLIMATE,a stacked kernel learner integrating multimodal genomic data and pathway knowledge

Vladislav Uzunangelov Christopher K. Wong Joshua M. Stuart 《PLoS computational biology》2021,17(4)

Advancements in sequencing have led to the proliferation of multi-omic profiles of human cells under different conditions and perturbations. In addition, many databases have amassed information about pathways and gene “signatures”—patterns of gene expression associated with specific cellular and phenotypic contexts. An important current challenge in systems biology is to leverage such knowledge about gene coordination to maximize the predictive power and generalization of models applied to high-throughput datasets. However, few such integrative approaches exist that also provide interpretable results quantifying the importance of individual genes and pathways to model accuracy. We introduce AKLIMATE, a first kernel-based stacked learner that seamlessly incorporates multi-omics feature data with prior information in the form of pathways for either regression or classification tasks. AKLIMATE uses a novel multiple-kernel learning framework where individual kernels capture the prediction propensities recorded in random forests, each built from a specific pathway gene set that integrates all omics data for its member genes. AKLIMATE has comparable or improved performance relative to state-of-the-art methods on diverse phenotype learning tasks, including predicting microsatellite instability in endometrial and colorectal cancer, survival in breast cancer, and cell line response to gene knockdowns. We show how AKLIMATE is able to connect feature data across data platforms through their common pathways to identify examples of several known and novel contributors of cancer and synthetic lethality. 相似文献

11.

Unveiling network-based functional features through integration of gene expression into protein networks

Mahdi Jalili Tom Gebhardt Olaf Wolkenhauer Ali Salehzadeh-Yazdi 《生物化学与生物物理学报:疾病的分子基础》2018,1864(6):2349-2359

Decoding health and disease phenotypes is one of the fundamental objectives in biomedicine. Whereas high-throughput omics approaches are available, it is evident that any single omics approach might not be adequate to capture the complexity of phenotypes. Therefore, integrated multi-omics approaches have been used to unravel genotype–phenotype relationships such as global regulatory mechanisms and complex metabolic networks in different eukaryotic organisms. Some of the progress and challenges associated with integrated omics studies have been reviewed previously in comprehensive studies. In this work, we highlight and review the progress, challenges and advantages associated with emerging approaches, integrating gene expression and protein-protein interaction networks to unravel network-based functional features. This includes identifying disease related genes, gene prioritization, clustering protein interactions, developing the modules, extract active subnetworks and static protein complexes or dynamic/temporal protein complexes. We also discuss how these approaches contribute to our understanding of the biology of complex traits and diseases. This article is part of a Special Issue entitled: Cardiac adaptations to obesity, diabetes and insulin resistance, edited by Professors Jan F.C. Glatz, Jason R.B. Dyck and Christine Des Rosiers. 相似文献

12.

GeneMANIA Cytoscape plugin: fast gene function predictions on the desktop 总被引：1，自引：0，他引：1

Montojo J Zuberi K Rodriguez H Kazi F Wright G Donaldson SL Morris Q Bader GD 《Bioinformatics (Oxford, England)》2010,26(22):2927-2928

The GeneMANIA Cytoscape plugin brings fast gene function prediction capabilities to the desktop. GeneMANIA identifies the most related genes to a query gene set using a guilt-by-association approach. The plugin uses over 800 networks from six organisms and each related gene is traceable to the source network used to make the prediction. Users may add their own interaction networks and expression profile data to complement or override the default data. Availability and Implementation: The GeneMANIA Cytoscape plugin is implemented in Java and is freely available at http://www.genemania.org/plugin/. 相似文献

13.

Association of feature gene expression with structural fingerprints of chemical compounds

Li Y Tu K Zheng S Wang J Li Y Hao P Li X 《Journal of bioinformatics and computational biology》2011,9(4):503-519

相似文献

14.

Molecular basis of the differences between normal and tumor tissues of gastric cancer 总被引：1，自引：0，他引：1

Yang S Shin J Park KH Jeung HC Rha SY Noh SH Yang WI Chung HC 《Biochimica et biophysica acta》2007,1772(9):1033-1040

To be able to describe the differences between the normal and tumor tissues of gastric cancer at a molecular level would be essential in the study of the disease. We investigated the gene expression pattern in the two types of tissues from gastric cancer by performing expression profiling of 86 tissues on 17K complementary DNA microarrays. To select for the differentially expressed genes, class prediction algorithm was employed. For predictor selection, samples were first divided into a training (n=58), and a test set (n=28). A group of 894 genes was selected by a t-test in a training set, which was used for cross-validation in the training set and class (normal or tumor) prediction in the test set. Smaller groups of 894 genes were individually tested for their ability to correctly predict the normal or tumor samples based on gene expression pattern. The expression ratios of the 5 genes chosen from microarray data can be validated by real time RT-PCR over 6 tissue samples, resulting in a high level of correlation, individually or combined. When a representative predictor set of 92 genes was examined, pathways of 'focal adhesion' (with gene components of THBS2, PDGFD, MAPK1, COL1A2, COL6A3), 'ECM-receptor interaction' pathway (THBS2, COL1A2, COL6A3, FN1) and 'TGF-beta signaling' (THBS2, MAPK1, INHBA) represent some of the main differences between normal and tumor of gastric cancer at a molecular level. 相似文献

15.

Prediction of Candidate Primary Immunodeficiency Disease Genes Using a Support Vector Machine Learning Approach 总被引：1，自引：0，他引：1

Shivakumar Keerthikumar Sahely Bhadra Kumaran Kandasamy Rajesh Raju Y.L. Ramachandra Chiranjib Bhattacharyya Kohsuke Imai Osamu Ohara Sujatha Mohan Akhilesh Pandey 《DNA research》2009,16(6):345-351

Screening and early identification of primary immunodeficiency disease (PID) genes is a major challenge for physicians. Many resources have catalogued molecular alterations in known PID genes along with their associated clinical and immunological phenotypes. However, these resources do not assist in identifying candidate PID genes. We have recently developed a platform designated Resource of Asian PDIs, which hosts information pertaining to molecular alterations, protein–protein interaction networks, mouse studies and microarray gene expression profiling of all known PID genes. Using this resource as a discovery tool, we describe the development of an algorithm for prediction of candidate PID genes. Using a support vector machine learning approach, we have predicted 1442 candidate PID genes using 69 binary features of 148 known PID genes and 3162 non-PID genes as a training data set. The power of this approach is illustrated by the fact that six of the predicted genes have recently been experimentally confirmed to be PID genes. The remaining genes in this predicted data set represent attractive candidates for testing in patients where the etiology cannot be ascribed to any of the known PID genes. 相似文献

16.

Clustering gene expression data based on predicted differential effects of GV interaction

Pan HY Zhu J Han DF 《基因组蛋白质组与生物信息学报(英文版)》2005,3(1):36-41

Microarray has become a popular biotechnology in biological and medical research. However, systematic and stochastic variabilities in microarray data are expected and unavoidable, resulting in the problem that the raw measurements have inherent “noise” within microarray experiments. Currently, logarithmic ratios are usually analyzed by various clustering methods directly, which may introduce bias interpretation in identifying groups of genes or samples. In this paper, a statistical method based on mixed model approaches was proposed for microarray data cluster analysis. The underlying rationale of this method is to partition the observed total gene expression level into various variations caused by different factors using an ANOVA model, and to predict the differential effects of GV （gene by variety） interaction using the adjusted unbiased prediction （AUP） method. The predicted GV interaction effects can then be used as the inputs of cluster analysis. We illustrated the application of our method with a gene expression dataset and elucidated the utility of our approach using an external validation. 相似文献

17.

A bioinformatics tool for linking gene expression profiling results with public databases of microRNA target predictions 总被引：1，自引：0，他引：1

Creighton CJ Nagaraja AK Hanash SM Matzuk MM Gunaratne PH 《RNA (New York, N.Y.)》2008,14(11):2290-2296

相似文献

18.

Quality Measures for Gene Expression Biclusters

Beatriz Pontes Ral Girldez Jess S. Aguilar-Ruiz 《PloS one》2015,10(3)

An noticeable number of biclustering approaches have been proposed proposed for the study of gene expression data, especially for discovering functionally related gene sets under different subsets of experimental conditions. In this context, recognizing groups of co-expressed or co-regulated genes, that is, genes which follow a similar expression pattern, is one of the main objectives. Due to the problem complexity, heuristic searches are usually used instead of exhaustive algorithms. Furthermore, most of biclustering approaches use a measure or cost function that determines the quality of biclusters. Having a suitable quality metric for bicluster is a critical aspect, not only for guiding the search, but also for establishing a comparison criteria among the results obtained by different biclustering techniques. In this paper, we analyse a large number of existing approaches to quality measures for gene expression biclusters, as well as we present a comparative study of them based on their capability to recognize different expression patterns in biclusters. 相似文献

19.

Understanding communication signals during mycobacterial latency through predicted genome-wide protein interactions and boolean modeling

Hegde SR Rajasingh H Das C Mande SS Mande SC 《PloS one》2012,7(3):e33893

相似文献

20.

CARMO: a comprehensive annotation platform for functional exploration of rice multi‐omics data

下载免费PDF全文

Jiawei Wang Meifang Qi Jian Liu Yijing Zhang 《The Plant journal : for cell and molecular biology》2015,83(2):359-374

相似文献