首页 | 本学科首页   官方微博 | 高级检索  
     


Survival prediction in patients with colon adenocarcinoma via multiomics data integration using a deep learning algorithm
Authors:Jiudi Lv  Junjie Wang  Xiujuan Shang  Fangfang Liu  Shixun Guo
Affiliation:1.Department of General Surgery Three, Xinxiang Central Hospital, No. 56 Jinsui Avenue, Xinxiang, Henan 453000, China;2.Department of Oncology Medicine Three, Xinxiang Central Hospital, No. 56 Jinsui Avenue, Xinxiang, Henan 453000, China;3.Severe Medical Section, Xinxiang Central Hospital, No. 56 Jinsui Avenue, Xinxiang, Henan 453000, China
Abstract:The present study proposed a deep learning (DL) algorithm to predict survival in patients with colon adenocarcinoma (COAD) based on multiomics integration. The survival-sensitive model was constructed using an autoencoder for DL implementation based on The Cancer Genome Atlas (TCGA) data of patients with COAD. The autoencoder framework was compared with PCA, NMF, t-SNE, and univariable Cox-PH model for identifying survival-related features. The prognostic robustness of the inferred survival risk groups was validated using three independent confirmation cohorts. Differential expression analysis, Pearson’s correlation analysis, construction of miRNA–target gene network, and function enrichment analysis were performed. Two risk groups with significant survival differences were identified in TCGA set using the autoencoder-based model (log-rank P-value = 5.51e−07). The autoencoder framework showed superior performance compared with PCA, NMF, t-SNE, and the univariable Cox-PH model based on the C-index, log-rank P-value, and Brier score. The robustness of the classification model was successfully verified in three independent validation sets. There were 1271 differentially expressed genes, 10 differentially expressed miRNAs, and 12 hypermethylated genes between the survival risk groups. Among these, miR-133b and its target genes (GNB4, PTPRZ1, RUNX1T1, EPHA7, GPM6A, BICC1, and ADAMTS5) were used to construct a network. These genes were significantly enriched in ECM–receptor interaction, focal adhesion, PI3K–Akt signaling pathway, and glucose metabolism-related pathways. The risk subgroups obtained through a multiomics data integration pipeline using the DL algorithm had good robustness. miR-133b and its target genes could be potential diagnostic markers. The results would assist in elucidating the possible pathogenesis of COAD.
Keywords:deep learning   methylation   miRNA   multi-omics data
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号