Accurate haplotype imputation with individualized ancestry-adjusted reference panels |
| |
Authors: | Qing Song Wei Xu Wenzhi Li Shaohua He Jiankang Liu Guangming Wang Li Ma |
| |
Institution: | 1. Center of Big Data and Bioinformatics, First Affiliated Hospital of Medical School, Xi''an Jiaotong University, No. 277 Yanta Xi Street, Xi''an, Shaanxi 710061, China;2. Cardiovascular Research Institute, Department of Medicine, Morehouse School of Medicine, 720 Westview Drive SW, Atlanta, GA 30310, USA;3. 4DGENOME, 2360 Elon Way, Decatur, GA 30033, USA;4. Shapiro Cardiovascular Center, Brigham and Women''s Hospital, Harvard Medical School, 75 Francis St., Boston MA02115, USA;5. Genetic Test Center, First Affiliated Hospital of Dali University, Dali City, Yunnan 671000, China |
| |
Abstract: | Accurate data imputation requires ethnicity-matched reference panels. However, recent admixtures have created mosaic human genomes, different chromosomal segments have different ethnic backgrounds, so it is impossible for a single-ethnicity reference panel to be the matched for data imputation. In this study, we explored a novel strategy for imputation. We created individualized mosaic reference panel for each person according to his/her ethnic backgrounds at each genomic locus. We examined on datasets with 70% missing values on haplotypes and 50% missing values on genotypes. Results showed that the imputation with mosaic references steadily yielded high imputation accuracy and outperforms the other strategies. With the mosaic reference panels, the imputation accuracy was 98.8 ± 0.1% (CEU), 98.7 ± 0.1% (YRI), 98.5 ± 0.1% (CHB), 98.6 ± 0.1% (ASW), 97.3 ± 0.1% (MKK) and 98.2 ± 0.1% (MXL). Mosaic reference panel will be one option for future missing value imputation in big data era. |
| |
Keywords: | Imputation African-American Minority population Reference Big data |
本文献已被 ScienceDirect 等数据库收录! |
|