首页 | 本学科首页   官方微博 | 高级检索  
     


A relationship between GC content and coding-sequence length
Authors:José L. Oliver  Antonio Marín
Affiliation:(1) Departamento de Genética, Instituto de Biotecnologfa, Facultad de Ciencias, Universidad de Granada, E-18071 Granada, Spain;(2) Departamento de Genética, Facultad de Biologfa, Universidad de Sevilla, Aptdo. 1095, E-41080 Sevilla, Spain
Abstract:Since base composition of translational stop codons (TAG, TAA, and TGA) is biased toward a low G+C content, a differential density for these termination signals is expected in random DNA sequences of different base compositions. The expected length of reading frames (DNA segments of sense codons flanked by in-phase stop codons) in random sequences is thus a function of GC content. The analysis of DNA sequences from several genome databases stratified according to GC content reveals that the longest coding sequences—exons in vertebrates and genes in prokaryotes—are GC-rich, while the shortest ones are GC-poor. Exon lengthening in GC-rich vertebrate regions does not result, however, in longer vertebrate proteins, perhaps because of the lower number of exons in the genes located in these regions. The effects on coding-sequence lengths constitute a new evolutionary meaning for compositional variations in DNA GC content. Correspondence to: J. L. Oliver
Keywords:Base composition  Stop-codon density  Coding-sequence length  Compositional heterogeneity
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号