Accurate haplotype imputation with individualized ancestry-adjusted reference panels
Authors:Qing Song  Wei Xu  Wenzhi Li  Shaohua He  Jiankang Liu  Guangming Wang  Li Ma
Institution:1. Center of Big Data and Bioinformatics, First Affiliated Hospital of Medical School, Xi'an Jiaotong University, No. 277 Yanta Xi Street, Xi'an, Shaanxi 710061, China;2. Cardiovascular Research Institute, Department of Medicine, Morehouse School of Medicine, 720 Westview Drive SW, Atlanta, GA 30310, USA;3. 4DGENOME, 2360 Elon Way, Decatur, GA 30033, USA;4. Shapiro Cardiovascular Center, Brigham and Women's Hospital, Harvard Medical School, 75 Francis St., Boston, MA 02115, USA;5. Genetic Test Center, First Affiliated Hospital of Dali University, Dali City, Yunnan 671000, China
Abstract:Accurate data imputation requires ethnicity-matched reference panels. However, recent admixture has created mosaic human genomes in which different chromosomal segments have different ethnic backgrounds, so no single-ethnicity reference panel can be fully matched for imputation. In this study, we explored a novel strategy for imputation: we created an individualized mosaic reference panel for each person according to his/her ethnic background at each genomic locus. We tested this strategy on datasets with 70% missing values on haplotypes and 50% missing values on genotypes. Imputation with mosaic reference panels consistently yielded high accuracy and outperformed the other strategies. With the mosaic reference panels, the imputation accuracy was 98.8 ± 0.1% (CEU), 98.7 ± 0.1% (YRI), 98.5 ± 0.1% (CHB), 98.6 ± 0.1% (ASW), 97.3 ± 0.1% (MKK) and 98.2 ± 0.1% (MXL). Mosaic reference panels will be one option for missing-value imputation in the big data era.
Keywords:Imputation  African-American  Minority population  Reference  Big data
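Illustration:The idea of a per-locus, ancestry-matched reference panel can be sketched in a few lines of Python. This is a toy illustration only, not the authors' method: the abstract does not describe the imputation algorithm, so the sketch assumes that a local-ancestry label is already available for each locus and fills each missing allele with the most frequent allele among the matched reference haplotypes. The function names (mosaic_panel, impute, accuracy) and the tiny CEU/YRI panels are hypothetical.

```python
from collections import Counter

def mosaic_panel(local_ancestry, panels):
    """For each locus, collect the reference alleles from the population
    assigned to that locus (assumed local-ancestry labels)."""
    return [[hap[i] for hap in panels[pop]]
            for i, pop in enumerate(local_ancestry)]

def impute(haplotype, panel_by_locus):
    """Toy imputation: replace each missing allele (None) with the most
    frequent reference allele at that locus."""
    out = []
    for allele, ref_alleles in zip(haplotype, panel_by_locus):
        if allele is None:
            allele = Counter(ref_alleles).most_common(1)[0][0]
        out.append(allele)
    return out

def accuracy(truth, masked, imputed):
    """Fraction of masked sites whose imputed allele matches the truth."""
    sites = [i for i, a in enumerate(masked) if a is None]
    return sum(truth[i] == imputed[i] for i in sites) / len(sites)

# Hypothetical reference haplotypes (alleles coded 0/1) over four loci.
panels = {
    "CEU": [[0, 0, 1, 1], [0, 1, 1, 1], [0, 0, 1, 0]],
    "YRI": [[1, 1, 0, 0], [1, 1, 0, 1], [1, 0, 0, 0]],
}
truth = [0, 0, 0, 0]
masked = [0, None, None, 0]          # two alleles hidden for testing

# Admixed individual: first two loci CEU-like, last two YRI-like.
mosaic = mosaic_panel(["CEU", "CEU", "YRI", "YRI"], panels)
single = mosaic_panel(["CEU"] * 4, panels)   # single-ethnicity panel

acc_mosaic = accuracy(truth, masked, impute(masked, mosaic))
acc_single = accuracy(truth, masked, impute(masked, single))
print(acc_mosaic, acc_single)
```

In this constructed example the mosaic panel recovers both hidden alleles, while the single CEU panel gets the YRI-derived locus wrong. Practical imputation tools model linkage between neighbouring loci rather than treating each locus independently as this sketch does.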
This article is indexed in ScienceDirect and other databases.