Integrating Prior Knowledge in Multiple Testing under Dependence with Applications to Detecting Differential DNA Methylation |
| |
Authors: | Kuan Pei Fen Chiang Derek Y |
| |
Affiliation: | Department of Biostatistics and Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, North Carolina 27599, U.S.A. Department of Genetics and Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, North Carolina 27599, U.S.A. |
| |
Abstract: | Summary DNA methylation has emerged as an important hallmark of epigenetics. Numerous platforms including tiling arrays and next generation sequencing, and experimental protocols are available for profiling DNA methylation. Similar to other tiling array data, DNA methylation data shares the characteristics of inherent correlation structure among nearby probes. However, unlike gene expression or protein DNA binding data, the varying CpG density which gives rise to CpG island, shore and shelf definition provides exogenous information in detecting differential methylation. This article aims to introduce a robust testing and probe ranking procedure based on a nonhomogeneous hidden Markov model that incorporates the above‐mentioned features for detecting differential methylation. We revisit the seminal work of Sun and Cai (2009, Journal of the Royal Statistical Society: Series B (Statistical Methodology) 71 , 393–424) and propose modeling the nonnull using a nonparametric symmetric distribution in two‐sided hypothesis testing. We show that this model improves probe ranking and is robust to model misspecification based on extensive simulation studies. We further illustrate that our proposed framework achieves good operating characteristics as compared to commonly used methods in real DNA methylation data that aims to detect differential methylation sites. |
| |
Keywords: | CpG island False discovery rate Kernel density estimation Microarray Nonhomogeneous hidden Markov model Semiparametric model |
本文献已被 PubMed 等数据库收录! |
|