首页 | 本学科首页   官方微博 | 高级检索  
   检索      


An efficient algorithm for DNA fragment assembly in MapReduce
Authors:Baomin Xu  Jin GaoChunyan Li
Institution:a School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China
b Department of Computer Science, Tangshan Normal University, Tangshan 063000, China
Abstract:Fragment assembly is one of the most important problems of sequence assembly. Algorithms for DNA fragment assembly using de Bruijn graph have been widely used. These algorithms require a large amount of memory and running time to build the de Bruijn graph. Another drawback of the conventional de Bruijn approach is the loss of information. To overcome these shortcomings, this paper proposes a parallel strategy to construct de Bruijin graph. Its main characteristic is to avoid the division of de Bruijin graph. A novel fragment assembly algorithm based on our parallel strategy is implemented in the MapReduce framework. The experimental results show that the parallel strategy can effectively improve the computational efficiency and remove the memory limitations of the assembly algorithm based on Euler superpath. This paper provides a useful attempt to the assembly of large-scale genome sequence using Cloud Computing.
Keywords:Eulerian superpath problem  Parallel assembly  Fragment assembly  MapReduce
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号