首页 | 本学科首页   官方微博 | 高级检索  
   检索      


Extending rare-variant testing strategies: analysis of noncoding sequence and imputed genotypes
Authors:Zawistowski Matthew  Gopalakrishnan Shyam  Ding Jun  Li Yun  Grimm Sara  Zöllner Sebastian
Institution:1 Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA
2 Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USA
3 Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA
4 Department of Biostatistics, University of North Carolina, Chapel Hill, NC 27599, USA
5 Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, NC 27599, USA
6 Bioinformatics Program, University of Michigan, Ann Arbor, MI 48109, USA
7 Department of Psychiatry, University of Michigan, Ann Arbor, MI 48109, USA
Abstract:Next Generation Sequencing Technology has revolutionized our ability to study the contribution of rare genetic variation to heritable traits. However, existing single-marker association tests are underpowered for detecting rare risk variants. A more powerful approach involves pooling methods that combine multiple rare variants from the same gene into a single test statistic. Proposed pooling methods can be limited because they generally assume high-quality genotypes derived from deep-coverage sequencing, which may not be available. In this paper, we consider an intuitive and computationally efficient pooling statistic, the cumulative minor-allele test (CMAT). We assess the performance of the CMAT and other pooling methods on datasets simulated with population genetic models to contain realistic levels of neutral variation. We consider study designs ranging from exon-only to whole-gene analyses that contain noncoding variants. For all study designs, the CMAT achieves power comparable to that of previously proposed methods. We then extend the CMAT to probabilistic genotypes and describe application to low-coverage sequencing and imputation data. We show that augmenting sequence data with imputed samples is a practical method for increasing the power of rare-variant studies. We also provide a method of controlling for confounding variables such as population stratification. Finally, we demonstrate that our method makes it possible to use external imputation templates to analyze rare variants imputed into existing GWAS datasets. As proof of principle, we performed a CMAT analysis of more than 8 million SNPs that we imputed into the GAIN psoriasis dataset by using haplotypes from the 1000 Genomes Project.
Keywords:
本文献已被 ScienceDirect PubMed 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号