Assembly errors cause false tandem duplicate regions in the chicken (Gallus gallus) genome sequence |
| |
Authors: | Qu Zhang Niclas Backström |
| |
Institution: | 1. Department of Human Evolutionary Biology, Harvard University, 11 Divinity Avenue, Cambridge, MA, 02138, USA 3. Pioneer Hi-Bred International, A DuPont Business, Johnston, IA, 50131, USA 2. Department of Evolutionary Biology, Evolutionary Biology Centre (EBC), Uppsala University, Norbyv?gen 18D, 752 36, Uppsala, Sweden
|
| |
Abstract: | The complexity of eukaryote genomes makes assembly errors inevitable in the process of constructing reference genomes. Next-generation sequencing (NGS) could provide an efficient way to validate previously assembled genomes. Here, we exploited NGS data to interrogate the chicken reference genome and identified 35 pairs of nearly identical regions with >99.5 % sequence similarity and a median size of 109 kb. Several lines of evidence, including read depth, the composition of junction sequences, and sequence similarity, suggest that these regions present genome assembly errors and should be excluded from forthcoming genomic studies. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|