首页 | 本学科首页   官方微博 | 高级检索  
     

一种有效的重复序列识别算法
引用本文:李冬冬,王正志,倪青山. 一种有效的重复序列识别算法[J]. 生物信息学, 2005, 3(4): 163-166,174
作者姓名:李冬冬  王正志  倪青山
作者单位:国防科技大学机电工程与自动化学院自动控制系,湖南,长沙,410073
摘    要:重复序列的分析是基因组研究中的一个重要课题,进行这一研究的基础则是从基因组序列中快速有效地找出其中的重复序列。一种投影拼接算法,即利用随机投影获得候选片断集合,利用片断拼接对候选片断进行拼接,以发现基因组中的重复序列。分析了算法的计算复杂度,构造了半仿真测试数据,对算法的测试结果表明了其有效性。

关 键 词:重复序列  随机投影  拼接
文章编号:1672-5565(2005)-04-163-05
收稿时间:2004-06-02
修稿时间:2005-05-25

An effective algorithm for repeat sequence finding
LI Dong-dong,WANG Zheng-zhi,NI Qing-shan. An effective algorithm for repeat sequence finding[J]. Chinese Journal of Bioinformatics, 2005, 3(4): 163-166,174
Authors:LI Dong-dong  WANG Zheng-zhi  NI Qing-shan
Abstract:Analysis of repeat sequences is an important subject for genomic research.To do this analysis,we have to find all unknown repeat sequences from the whole sequence first.In this paper,we propose a novel projection-assemble algorithm to find these repeats.The algorithm employs random projection algorithm to obtain a candidature segment set,and employs exhaust search algorithm searching each pairs of segments to find potential linkage between them and then assemble them together.The complexity of our projection-assemble algorithm is nearly linear to the length of genomic sequence,and its memory usage is an exponential function to a parameter,which is relative to the length of genomic sequence.But that isn't a serious problem,because to sequences with length up to several decades of millions,this parameter can be set as a constant.We construct a test dataset to examine our algorithm,and the results show that it can find the repeat segments effectively.
Keywords:repeat sequences  random projection  assemble  
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号