Current challenges and solutions of de novo assembly |
| |
Authors: | Xingyu Liao, Min Li, You Zou, Fang-Xiang Wu, Yi Pan, Jianxin Wang |
| |
Affiliation: | 1. School of Computer Science and Engineering, Central South University, Changsha 410083, China; 2. Division of Biomedical Engineering, University of Saskatchewan, Saskatchewan, S7N 5A9, Canada; 3. Department of Computer Science, Georgia State University, Atlanta, GA 30302, USA |
| |
Abstract: | Background: Next-generation sequencing (NGS) technologies have fostered an unprecedented proliferation of high-throughput sequencing projects and a concomitant development of novel algorithms for the assembly of short reads. However, numerous technical and computational challenges in de novo assembly remain, although many new ideas and solutions have been proposed to tackle them in both experimental and computational settings. Results: In this review, we first briefly introduce some of the major challenges faced by NGS sequence assembly. We then analyze the characteristics of various sequencing platforms and their impact on assembly results. After that, we classify de novo assemblers according to their frameworks (overlap graph-based, de Bruijn graph-based and string graph-based), and describe the characteristics of each assembly tool and the scenarios to which it is suited. Next, we describe in detail the solutions to the main challenges of de novo assembly of next-generation sequencing data, single-cell sequencing data and single-molecule sequencing (SMS) data. Finally, we discuss the application of SMS long reads in solving problems encountered in NGS assembly. Conclusions: This review not only gives an overview of the latest methods and developments in assembly algorithms, but also provides guidelines for determining the optimal assembly algorithm for a given type of input sequencing data. |
| |
Keywords: | next-generation sequencing; single-cell sequencing; single-molecule sequencing; de novo assembly algorithms |