首页 | 本学科首页   官方微博 | 高级检索  
   检索      


Powerful SNP-Set Analysis for Case-Control Genome-wide Association Studies
Authors:Michael C Wu  Michael P Epstein  Stephen J Chanock  Xihong Lin
Institution:1 Department of Biostatistics, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
2 Department of Biostatistics, Harvard School of Public Health, Boston, MA 02115, USA
3 Department of Epidemiology, Harvard School of Public Health, Boston, MA 02115, USA
4 Department of Human Genetics, Emory University, Atlanta, GA 30322, USA
5 Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA
Abstract:GWAS have emerged as popular tools for identifying genetic variants that are associated with disease risk. Standard analysis of a case-control GWAS involves assessing the association between each individual genotyped SNP and disease risk. However, this approach suffers from limited reproducibility and difficulties in detecting multi-SNP and epistatic effects. As an alternative analytical strategy, we propose grouping SNPs together into SNP sets on the basis of proximity to genomic features such as genes or haplotype blocks, then testing the joint effect of each SNP set. Testing of each SNP set proceeds via the logistic kernel-machine-based test, which is based on a statistical framework that allows for flexible modeling of epistatic and nonlinear SNP effects. This flexibility and the ability to naturally adjust for covariate effects are important features of our test that make it appealing in comparison to individual SNP tests and existing multimarker tests. Using simulated data based on the International HapMap Project, we show that SNP-set testing can have improved power over standard individual-SNP analysis under a wide range of settings. In particular, we find that our approach has higher power than individual-SNP analysis when the median correlation between the disease-susceptibility variant and the genotyped SNPs is moderate to high. When the correlation is low, both individual-SNP analysis and the SNP-set analysis tend to have low power. We apply SNP-set analysis to analyze the Cancer Genetic Markers of Susceptibility (CGEMS) breast cancer GWAS discovery-phase data.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号