
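Digital Coding of the Genetic Codons and DNA Sequences in High-Dimensional Space
Cite this article: CHEN Wei-chang, CHEN Zhi-hua, CHEN Zhi-yi, WANG Zi-qiang, QIU Hong-xia. Digital coding of the genetic codons and DNA sequences in high-dimensional space[J]. Acta Biophysica Sinica (生物物理学报), 2000, 16(4): 760-768
Authors: CHEN Wei-chang  CHEN Zhi-hua  CHEN Zhi-yi  WANG Zi-qiang  QIU Hong-xia
Affiliations: 1. Laboratory of Biophysics, China-Japan Friendship Institute of Clinical Medical Sciences, Beijing 100029
2. Laboratory of Biochemistry and Molecular Biology, China-Japan Friendship Institute of Clinical Medical Sciences, Beijing 100029
3. National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100080
Funding: Supported by the National Natural Science Foundation of China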
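Abstract: Binary digital coding is the most fundamental coding scheme in information science. Encoding the 4 bases (C, T, A, G) in binary with the 4 digits 0 (00), 1 (01), 2 (10), and 3 (11) gives 24 possible coding combinations, of which 8 satisfy the base-complementarity rule; these 8 are topologically equivalent. The coding pattern ordered by the molecular weight of the bases, 0123/CTAG, is the most suitable pattern. Encoding the character sequence of DNA in binary numbers has the following advantages: 1) it reduces information redundancy and improves coding efficiency; 2) it can encode the structure, functional group … of the bases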

Keywords: Digital coding  DNA sequence  Genetic code  High-dimensional space
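
The two counts quoted in the abstract (24 coding combinations, 8 of them complementary) can be checked with a short enumeration. This is a minimal sketch and not the authors' procedure. It assumes that the "base-complementarity rule" means complementary bases (C/G and T/A) receive bitwise-complementary 2-bit codes; that reading is an assumption, chosen because it reproduces the count of 8. The names `BASES`, `COMPLEMENT`, and `complementary_codings` are hypothetical.

```python
from itertools import permutations

BASES = "CTAG"  # order used by the 0123/CTAG pattern in the abstract
COMPLEMENT = {"C": "G", "G": "C", "T": "A", "A": "T"}  # Watson-Crick pairs


def complementary_codings():
    """Return every base-to-digit coding whose codes respect complementarity.

    Assumption (not stated in the abstract): a coding fits the rule when the
    2-bit code of a base is the bitwise complement of its partner's code.
    """
    valid = []
    for digits in permutations(range(4)):  # all 4! = 24 assignments
        coding = dict(zip(BASES, digits))
        if all(coding[b] ^ 0b11 == coding[COMPLEMENT[b]] for b in BASES):
            valid.append(coding)
    return valid


all_codings = list(permutations(range(4)))
valid = complementary_codings()
print(len(all_codings), len(valid))
```

Under this reading the 0123/CTAG pattern is one of the surviving codings: C = 00 pairs with G = 11, and T = 01 pairs with A = 10.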

DIGITAL CODING OF THE GENETIC CODONS AND DNA SEQUENCES IN HIGH DIMENSION SPACE
CHEN Wei-chang, CHEN Zhi-hua, CHEN Zhi-yi, WANG Zi-qiang, QIU Hong-xia. DIGITAL CODING OF THE GENETIC CODONS AND DNA SEQUENCES IN HIGH DIMENSION SPACE[J]. Acta Biophysica Sinica, 2000, 16(4): 760-768
Authors:CHEN Wei-chang  CHEN Zhi-hua  CHEN Zhi-yi  WANG Zi-qiang  QIU Hong-xia
Abstract: Binary digital coding is the most fundamental coding scheme in information science. The 4 nucleotide bases (C, T, A, G) can be encoded with the 4 digits 0 (00), 1 (01), 2 (10), and 3 (11) in 24 possible ways. Of these 24 patterns, only 8 satisfy the complementarity rule of the nucleotide bases, and these 8 are topologically equivalent. The pattern that follows the order of molecular weight, 0123/CTAG, is proposed as the best coding pattern for the nucleotide bases. Compared with character coding of DNA sequences, binary digital coding has the following advantages: (1) it reduces the redundancy of the information coding and improves coding efficiency; (2) properties of the nucleotide bases, such as structure, functional group, complementary relationship, and strong versus weak hydrogen bonding, can also be encoded; (3) the digital codes of DNA sequences are ordered and can be arranged uniquely by magnitude; (4) the symmetry of the DNA digital coding agrees with the symmetry of the degeneracy of the genetic codons, and the degeneracy rule of topological connectivity of the genetic codons can also be derived; (5) the digital code of any DNA tandem repeat is easily obtained; (6) using the Hamming distance in high-dimensional space, the information distance between multiple DNA sequences, as well as their conjunctive spaces, can be determined, which may be of great importance for bioinformatics; (7) the digital codes of DNA sequences are convenient for mathematical and logical operations and may have a great impact on DNA biocomputers.
Keywords: Digital coding  DNA sequence  Genetic code  High dimension space  Hamming distance  Biocomputer
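
Several of the listed advantages (ordering by magnitude, tandem repeats, Hamming distance) can be illustrated with the 0123/CTAG pattern. The following is a minimal sketch under stated assumptions, not the implementation from the paper: it assumes each base occupies 2 bits, that a sequence is read as one binary number with the first base in the highest bits, and that the Hamming distance is counted over bits between equal-length sequences. The function names `encode`, `to_bits`, and `hamming` are hypothetical.

```python
# 0123/CTAG pattern from the abstract: C=00, T=01, A=10, G=11
CODE = {"C": 0b00, "T": 0b01, "A": 0b10, "G": 0b11}


def encode(seq):
    """Read a DNA string as one integer, 2 bits per base, first base highest."""
    value = 0
    for base in seq:
        value = (value << 2) | CODE[base]
    return value


def to_bits(seq):
    """Fixed-width bit string; keeps leading C (00) digits that an int drops."""
    return "".join(format(CODE[base], "02b") for base in seq)


def hamming(seq_a, seq_b):
    """Bitwise Hamming distance between two equal-length sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have equal length")
    return bin(encode(seq_a) ^ encode(seq_b)).count("1")


print(to_bits("CTAG"), encode("CTAG"))      # bit string and its integer value
print(sorted(["GA", "CT", "AT"], key=encode))  # unique ordering by magnitude
print(to_bits("TA" * 3))                    # code of a tandem repeat (TA)3
print(hamming("CTAG", "GTAC"))              # C<->G flips both bits, twice
```

Because complementary bases differ in both bits under this pattern, swapping a base for its complement contributes a distance of 2, while a C/T or A/G swap contributes 1. The integer form alone does not record sequence length, so `to_bits` or an explicit length is needed whenever leading C bases matter.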
This article is indexed in CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.