Extracting a few functionally reproducible biomarkers to build robust subnetwork-based classifiers for the diagnosis of cancer |
| |
Authors: | Lin Zhang Shan Li Chunxiang Hao Guini Hong Jinfeng Zou Yuannv Zhang Pengfei Li Zheng Guo |
| |
Affiliation: | 1. Bioinformatics Centre, Key Laboratory for NeuroInformation of Ministry of Education and School of Life Science and Technology, School of Life Science, University of Electronic Science and Technology of China, Chengdu, 610054, China;2. College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China |
| |
Abstract: | In microarray-based case–control studies of a disease, people often attempt to identify a few diagnostic or prognostic markers amongst the most significant differentially expressed (DE) genes. However, the reproducibility of DE genes identified in different studies for a disease is typically very low. To tackle the problem, we could evaluate the reproducibility of DE genes across studies and define robust markers for disease diagnosis using disease-associated protein–protein interaction (PPI) subnetwork. Using datasets for four cancer types, we found that the most significant DE genes in cancer exhibit consistent up- or down-regulation in different datasets. For each cancer type, the 5 (or 10) most significant DE genes separately extracted from different datasets tend to be significantly coexpressed and closely connected in the PPI subnetwork, thereby indicating that they are highly reproducible at the PPI level. Consequently, we were able to build robust subnetwork-based classifiers for cancer diagnosis. |
| |
Keywords: | DE, differentially expressed PPI, protein&ndash protein interaction SAM, significance analysis of microarray FDR, false discovery rate POD, percentage of overlapping deregulations PO, percentage of overlap PON, percentage of overlap in the PPI network SVM, support vector machine RFE, recursive feature elimination |
本文献已被 ScienceDirect 等数据库收录! |
|