首页 | 本学科首页   官方微博 | 高级检索  
   检索      


Compressing DNA sequence databases with coil
Authors:W Timothy J White  Michael D Hendy
Institution:(1) Allan Wilson Centre for Molecular Ecology and Evolution, Massey University, Palmerston North, New Zealand
Abstract:

Background  

Publicly available DNA sequence databases such as GenBank are large, and are growing at an exponential rate. The sheer volume of data being dealt with presents serious storage and data communications problems. Currently, sequence data is usually kept in large "flat files," which are then compressed using standard Lempel-Ziv (gzip) compression – an approach which rarely achieves good compression ratios. While much research has been done on compressing individual DNA sequences, surprisingly little has focused on the compression of entire databases of such sequences. In this study we introduce the sequence database compression software coil.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号