首页 | 本学科首页   官方微博 | 高级检索  
   检索      


Segmented K-mer and its application on similarity analysis of mitochondrial genome sequences
Authors:Hong-Jie Yu
Institution:Department of Mathematics, School of Science, Anhui Science and Technology University, Fengyang, Anhui 233100, China; Intelligent Computing Laboratory, Hefei Institute of Intelligent Machines, Chinese Academy of Sciences, P.O. Box 1130, Hefei, Anhui 230031, China; Department of Automation, University of Science and Technology of China, Hefei, China
Abstract:K-mer-based approach has been widely used in similarity analyses so as to discover similarity/dissimilarity among different biological sequences. In this study, we have improved the traditional K-mer method, and introduce a segmented K-mer approach (s-K-mer). After each primary sequence is divided into several segments, we simultaneously transform all these segments into corresponding K-mer-based vectors. In this approach, it is vital how to determine the optimal combination of distance metric with the number of K and the number of segments, i.e., (K?, s?, and d?). Based on the cascaded feature vectors transformed from s? segmented sequences, we analyze 34 mammalian genome sequences using the proposed s-K-mer approach. Meanwhile, we compare the results of s-K-mer with those of traditional K-mer. The contrastive analysis results demonstrate that s-K-mer approach outperforms the traditionally K-mer method on similarity analysis among different species.
Keywords:A  adenosine  bp  base pair(s)  C  cytidine  CV  composition vector  FV  feature vector  G  guanosine  mt  mitochondria  MEGA  molecular evolutionary genetics analysis  MSA  multiple sequence alignments  nt  nucleotide(s)  Pdist  pair-wise distance  s-K-mer  segmented K-mer  T  thymidine
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号