ENCAPP: elastic-net-based prognosis prediction and biomarker discovery for human cancers |
| |
Authors: | Jishnu Das Kaitlyn M Gayvert Florentina Bunea Marten H Wegkamp Haiyuan Yu |
| |
Affiliation: | .Department of Biological Statistics and Computational Biology, Cornell University, 335 Weill Hall, Ithaca, NY 14853 USA ;.Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY 14853 USA ;.Tri-Institutional Training Program in Computational Biology and Medicine, New York, NY 10065 USA ;.Department of Statistical Science, Cornell University, Ithaca, NY 14853 USA |
| |
Abstract: | BackgroundWith the explosion of genomic data over the last decade, there has been a tremendous amount of effort to understand the molecular basis of cancer using informatics approaches. However, this has proven to be extremely difficult primarily because of the varied etiology and vast genetic heterogeneity of different cancers and even within the same cancer. One particularly challenging problem is to predict prognostic outcome of the disease for different patients.ResultsHere, we present ENCAPP, an elastic-net-based approach that combines the reference human protein interactome network with gene expression data to accurately predict prognosis for different human cancers. Our method identifies functional modules that are differentially expressed between patients with good and bad prognosis and uses these to fit a regression model that can be used to predict prognosis for breast, colon, rectal, and ovarian cancers. Using this model, ENCAPP can also identify prognostic biomarkers with a high degree of confidence, which can be used to generate downstream mechanistic and therapeutic insights.ConclusionENCAPP is a robust method that can accurately predict prognostic outcome and identify biomarkers for different human cancers.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-015-1465-9) contains supplementary material, which is available to authorized users. |
| |
Keywords: | Cancer genomics Gene expression Protein interaction network Prognosis prediction Elastic net |
|
|