20 similar articles found; search time 0 ms
1.
2.
3.
Krishnamurthy Subramanian, Bryan Payne, Felix Feyertag, David Alvarez-Ponce 《Molecular Biology and Evolution》2022,39(8)
We present the Codon Statistics Database, an online database that contains codon usage statistics for all the species with reference or representative genomes in RefSeq (over 15,000). The user can search for any species and access two sets of tables. One set lists, for each codon, the frequency, the Relative Synonymous Codon Usage, and whether the codon is preferred. Another set of tables lists, for each gene, its GC content, Effective Number of Codons, Codon Adaptation Index, and frequency of optimal codons. Equivalent tables can be accessed for (1) all nuclear genes, (2) nuclear genes encoding ribosomal proteins, (3) mitochondrial genes, and (4) chloroplast genes (if available in the relevant assembly). The user can also search for any taxonomic group (e.g., “primates”) and obtain a table comparing all the species in the group. The database is free to access without registration at http://codonstatsdb.unr.edu.
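As an illustration of the per-codon statistic mentioned in this abstract, the following is a minimal sketch of Relative Synonymous Codon Usage (RSCU): the observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally. It assumes the standard genetic code and a single in-frame coding sequence; the function name `rscu` and the example sequence are hypothetical and are not taken from the database.

```python
from collections import Counter, defaultdict

# Standard genetic code, built in TCAG order (assumption: nuclear code, table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def rscu(cds):
    """Return {codon: RSCU} for every amino acid observed in an in-frame CDS."""
    seq = cds.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    counts = Counter(seq[i:i + 3] for i in range(0, usable, 3))

    # Group sense codons into synonymous families, one per amino acid.
    families = defaultdict(list)
    for codon, aa in CODON_TABLE.items():
        if aa != "*":
            families[aa].append(codon)

    result = {}
    for family in families.values():
        total = sum(counts[c] for c in family)
        if total == 0:
            continue  # amino acid absent from this sequence
        expected = total / len(family)  # count under equal usage
        for c in family:
            result[c] = counts[c] / expected
    return result


values = rscu("GCTGCTGCTGCC")  # four alanine codons: 3 x GCT, 1 x GCC
print(values["GCT"], values["GCC"], values["GCA"])
```

An RSCU above 1 means the codon is used more often than expected under equal usage, and the values within one synonymous family sum to the family size.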
4.
5.
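Taking plant outward-rectifying potassium channel (K+ channel outward rectifier, KCO) genes as the subject, the codon usage patterns of 75 plant KCO genes were analyzed with CodonW to explore those patterns and the possible factors that influence codon usage. The results show that differences in base composition (r = 0.961, P < 0.01) and natural selection (r = 0.568, P < 0.01) are the main factors influencing codon usage, and that highly expressed genes strongly prefer codons ending in G or C. Twenty-six codons, including UUC and CUC and all ending in G/C, were identified as the optimal codons of highly expressed plant KCO genes.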
6.
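Synonymous codon usage bias in the indica rice cultivar 93-11 Total citations: 13, self-citations: 2, citations by others: 13
Using the whole-genome sequence of the indica rice cultivar 93-11 and the corresponding EST data, several factors affecting synonymous codon usage were analyzed in detail. The expression level (mRNA abundance) of 93-11 genes is highly significantly correlated with three indices of synonymous codon bias, CAI, CPP and ENC (r = 0.227**, 0.145** and -0.147**), indicating that the more highly a gene is expressed, the more non-random its synonymous codon usage. Gene length is highly significantly negatively correlated with CAI and CPP (r = -0.413** and -0.480**) and highly significantly positively correlated with ENC (r = 0.210**), suggesting that shorter genes have higher transcriptional activity. The G+C content of the coding region contributes far more to synonymous codon bias than mRNA abundance or gene length: its correlation coefficients with CAI, CPP and ENC reach 0.877**, 0.832** and -0.740**, respectively. At the start of the coding region, the four bases A, T, C and G show a clear three-base periodic oscillation, with the strongest bias at the three sites (+4, +5 and +6) of the first codon downstream of ATG, from which it is inferred that strong natural selection pressure acts on these three sites. The first identification of 25 optimal codons in 93-11 will help guide transgenic work in rice.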
7.
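Base composition and codon usage bias in the coding regions of pseudorabies virus genes Total citations: 6, self-citations: 0, citations by others: 6
Because the G+C content of pseudorabies virus (PRV) is as high as 74%, no strain has yet had its whole genome sequenced. A statistical analysis of base composition and codon usage in the coding sequences of 68 known PRV genes revealed very strong codon usage bias. The overall G+C content at the third codon position across all 68 PRV coding regions is 96.24%, reaching 99.52% in the UL48 gene. PRV genes favor GC-rich codons, especially codons ending in C or G. In addition, the neighborhoods of genes with large variation in G+C content, such as UL48, UL40, UL14 and IE180, coincide with the known replication origins of the PRV genome. When the PRV genes were divided into six classes by function, genes with the same or similar functions showed similar codon usage patterns. The relative synonymous codon usage (RSCU) of regulatory genes differed significantly from that of other genes: in regulatory genes, the RSCU values of C-ending codons far exceed those of the other synonymous codons. Finally, multivariate analysis of differences in amino acid composition showed that PRV genes with different functions are distributed differently on the correspondence analysis plot, suggesting that codon usage patterns in PRV genes may be related to gene function.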
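The third-position G+C content reported for the PRV coding regions can be computed as in the sketch below. This is only an illustration under stated assumptions: the function name `gc3` and the toy sequence are hypothetical, every complete codon is counted (including start and stop codons), and the stricter GC3s variant that excludes Met, Trp and stop codons is not implemented here.

```python
def gc3(cds):
    """Fraction of G or C at the third position of each complete codon."""
    seq = cds.upper().replace("U", "T")
    # Indices 2, 5, 8, ... exist only for complete codons, so a trailing
    # partial codon is skipped automatically.
    third_positions = seq[2::3]
    if not third_positions:
        raise ValueError("sequence contains no complete codon")
    gc_count = sum(base in "GC" for base in third_positions)
    return gc_count / len(third_positions)


# Toy example: codons ATG, GCC, AAG, TAA have third bases G, C, G, A.
print(gc3("ATGGCCAAGTAA"))
```

Applied to a set of coding sequences, the same ratio can be taken over all third positions pooled together to obtain an overall figure like the one quoted in the abstract.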
8.
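Chalcone synthase (CHS) is widespread in plants; it is an important enzyme in anthocyanin formation and further catalyzes the production of flavonoids. In this study, CodonW and the EMBOSS online tools were used to analyze the codon usage bias of the Korean pine chalcone synthase gene CHS, which was compared with the CHS genes of 24 other plant species, including eastern white pine, and with model plant genomes. This provides a basis for understanding the codon usage bias of the Korean pine CHS gene and for selecting a suitable expression system. The results show that the effective number of codons (ENC) and the GC content of the Korean pine CHS coding region are 48.92 and 0.548, respectively; G+C content exceeds A+T content, and codons ending in A/T are preferred. In most plants the CHS gene has higher G+C than A+T content and codons ending in C/G are preferred. Cluster analysis shows that the codon usage bias of Korean pine is most similar to that of Masson pine and Japanese red pine. Analysis of codon usage frequencies suggests that Escherichia coli and Arabidopsis thaliana may be the better hosts for genetic transformation and heterologous expression of Korean pine CHS.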
9.
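Analysis of factors influencing codon usage in the Streptococcus pneumoniae genome Total citations: 5, self-citations: 2, citations by others: 5
The complete genome sequence of Streptococcus pneumoniae has been determined and was recently published. The genome sequence was analyzed in detail to study codon usage patterns and the factors that influence codon usage. Highly expressed genes use cytosine (C) at the third codon position significantly more often than lowly expressed genes, whereas lowly expressed genes tend to use purines at the third position. Gene expression level is significantly correlated with the first axis of the correspondence analysis (R = 0.86). Comparing the codon usage patterns of the high-expression and low-expression gene groups shows that expression level has a significant effect on codon usage. The G+C composition of a gene is significantly correlated with its expression level (R = 0.44) and with the first axis of the correspondence analysis (R = 0.5), and significantly affects both expression level and codon usage. Analyses of GC skew, protein hydrophobicity and gene length show that genes of different lengths differ in expression level, GC content and GC3s. The results indicate that natural selection acting on expression level and the base composition of genes are the main factors shaping codon usage in S. pneumoniae, and that gene length has some effect.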
10.
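This paper studies the codon usage patterns of the elongation factor 1 alpha gene in representative groups of Ascomycota. The results show that the codon usage bias of this gene is closely related to nucleotide base composition and is also affected by other selective pressures. Statistical analysis revealed the codon composition and coding characteristics of this gene across the ascomycete groups; in the pattern of synonymous codon choice, members of the class Saccharomycetes show a rather distinctive preference. Cluster analysis based on codon usage divergence reflects the taxonomic position of most groups reasonably well, but the degree of variation in codon bias differs within the individual classes.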
11.
Xuhua Xia 《Genetics》2015,199(2):573-579
Two alternative hypotheses attribute different benefits to codon-anticodon adaptation. The first assumes that protein production is rate limited by both initiation and elongation and that codon-anticodon adaptation would result in higher elongation efficiency and more efficient and accurate protein production, especially for highly expressed genes. The second claims that protein production is rate limited only by initiation efficiency but that improved codon adaptation and, consequently, increased elongation efficiency have the benefit of increasing ribosomal availability for global translation. To test these hypotheses, a recent study engineered a synthetic library of 154 genes, all encoding the same protein but differing in degrees of codon adaptation, to quantify the effect of differential codon adaptation on protein production in Escherichia coli. The surprising conclusion that “codon bias did not correlate with gene expression” and that “translation initiation, not elongation, is rate-limiting for gene expression” contradicts the conclusion reached by many other empirical studies. In this paper, I resolve the contradiction by reanalyzing the data from the 154 sequences. I demonstrate that translation elongation accounts for about 17% of total variation in protein production and that the previous conclusion is due to the use of a codon adaptation index (CAI) that does not account for the mutation bias in characterizing codon adaptation. The effect of translation elongation becomes undetectable only when translation initiation is unrealistically slow. A new index of translation elongation, I_TE, is formulated to facilitate studies on the efficiency and evolution of the translation machinery.
12.
We present an expression measure of a gene, devised to predict the level of gene expression from relative codon bias (RCB). There are a number of measures currently in use that quantify codon usage in genes. Based on the hypothesis that gene expressivity and codon composition are strongly correlated, RCB has been defined to provide an intuitively meaningful measure of the extent of codon preference in a gene. We outline a simple approach to assess the strength of RCB (RCBS) in genes as a guide to their likely expression levels and illustrate this with an analysis of the Escherichia coli (E. coli) genome. Our efforts to quantitatively predict gene expression levels in E. coli met with a high level of success. Surprisingly, we observe a strong correlation between RCBS and protein length, indicating natural selection in favour of shorter genes being expressed at a higher level. The agreement of our result with high protein abundances, microarray data and radioactive data demonstrates that the genomic expression profile available in our method can be applied in a meaningful way to the study of cell physiology and also for more detailed studies of particular genes of interest.
13.
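Studying the patterns and causes of codon usage bias at the genomic level, and analyzing the selective pressures acting during evolution, is of great importance in genomics research. This article reviews the methods proposed to date for quantifying codon usage bias and the principles behind them. Current research has found that some methods for quantifying codon bias are limited by incompletely annotated reference sets of highly expressed genes, that different codon positions are affected differently by mutation and selection, and that GC content and purine content contribute differently at different codon positions. The outlook for quantification methods is therefore as follows: methods are needed that require no prior knowledge of a reference gene set, that account for the background nucleotide composition at different positions, and that also take gene expression level into account. Finally, the currently available tools and databases for quantifying codon usage bias are summarized.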
14.
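The synonymous codon usage bias of the thermophilic archaeon Aeropyrum pernix K1 was compared with that of two phylogenetically related crenarchaea, Pyrobaculum aerophilum str. IM2 and Sulfolobus acidocaldarius DSM 639. The results show that codon bias in A. pernix K1 is weak and is highly correlated with GC3s. The codon usage patterns of the three crenarchaea are evolutionarily conserved. Compared with the influence of gene function, codon usage bias in these crenarchaea is determined more by species. A. pernix K1, P. aerophilum str. IM2 and S. acidocaldarius DSM 639 live in different extreme environments, and it is speculated that these environments determine their patterns of codon usage bias. In addition, no difference in codon usage bias was found between the sense and antisense strands of these genomes. The degree of codon bias in A. pernix K1 and S. acidocaldarius DSM 639 is highly correlated with gene expression level, whereas no such pattern was found in the genome of P. aerophilum str. IM2.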
15.
An Evaluation of Measures of Synonymous Codon Usage Bias Total citations: 14, self-citations: 0, citations by others: 14
Synonymous codons are not generally used at equal frequencies, and this trend is observed for most genes and organisms. Several
methods have been proposed and used to estimate the degree of the nonrandom use of the different synonymous codons. The estimates
obtained by these methods, however, show different levels of both precision and dispersion when coding regions of a finite
number of codons are under analysis. Here, we present a study, based on computer simulation, of how the different methods
proposed to evaluate the nonrandom use of synonymous codons are affected by the length of the coding region analyzed. The
results show that some of these methods are heavily influenced by the number of codons and that the comparison of codon usage
bias between coding regions of different lengths shows a methodological bias under different conditions of nonrandom use of
synonymous codons. The study of the dispersion of the estimates obtained by the different methods gives, on the other hand,
an indication of the methods to be applied to compare values of codon usage bias among coding regions of equivalent length.
Received: 10 September 1997 / Accepted: 23 March 1998
16.
The development of codon bias indices (CBIs) remains an active field of research due to their myriad applications in computational biology. Recently, the relative codon usage bias (RCBS) was introduced as a novel CBI able to estimate codon bias without using a reference set. The results of this new index when applied to Escherichia coli and Saccharomyces cerevisiae led the authors of the original publications to conclude that natural selection favours higher expression and enhanced codon usage optimization in short genes. Here, we show that this conclusion was flawed and based on the systematic oversight of an intrinsic bias for short sequences in the RCBS index and of biases in the small data sets used for validation in E. coli. Furthermore, we reveal how the RCBS can be corrected to produce useful results and how its underlying principle, which we here term relative codon adaptation (RCA), can be made into a powerful reference-set-based index that directly takes into account the genomic base composition. Finally, we show that RCA outperforms the codon adaptation index (CAI) as a predictor of gene expression when operating on the CAI reference set and that this improvement is significantly larger when analysing genomes with high mutational bias.
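For readers unfamiliar with the reference-set-based CAI that this abstract uses as a baseline, the sketch below computes it in its commonly described form: each codon receives a relative adaptiveness w (its count in the reference set divided by the count of the most used synonymous codon), and the CAI of a gene is the geometric mean of w over its codons. The function names, the toy reference set and the pseudocount of 0.5 for codons absent from the reference are assumptions made for illustration; this is not the RCA index proposed in the abstract.

```python
import math
from collections import Counter, defaultdict

# Standard genetic code, built in TCAG order (assumption: nuclear code, table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def split_codons(cds):
    """Split an in-frame CDS into complete codons, ignoring a partial tail."""
    seq = cds.upper().replace("U", "T")
    return [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]


def relative_adaptiveness(reference_cds_list, pseudocount=0.5):
    """Return {codon: w} from a reference set of highly expressed genes."""
    counts = Counter(c for cds in reference_cds_list for c in split_codons(cds))
    families = defaultdict(list)
    for codon, aa in CODON_TABLE.items():
        families[aa].append(codon)

    w = {}
    for aa, family in families.items():
        if aa == "*" or len(family) == 1:
            continue  # stop codons, Met and Trp carry no synonymous choice
        # Assumed pseudocount keeps log(w) finite for unobserved codons.
        # Side effect: a family never seen in the reference gets w = 1 throughout.
        adjusted = {c: counts[c] or pseudocount for c in family}
        top = max(adjusted.values())
        for c in family:
            w[c] = adjusted[c] / top
    return w


def cai(cds, w):
    """Geometric mean of w over the scorable codons of a CDS."""
    logs = [math.log(w[c]) for c in split_codons(cds) if c in w]
    if not logs:
        raise ValueError("sequence contains no scorable codons")
    return math.exp(sum(logs) / len(logs))


weights = relative_adaptiveness(["GCTGCTGCTGCC"])  # toy reference set
print(cai("GCTGCT", weights), cai("GCTGCC", weights))
```

Because w depends only on counts in the reference set, this form of the index takes no account of background base composition, which is the limitation the abstract attributes to CAI.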
17.
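To investigate the features of the complete chloroplast genome of Polygonatum kingianum and its codon usage bias, young leaves were sequenced with second-generation sequencing technology, and the complete chloroplast genome sequence was obtained after assembly and annotation. The SSR loci, phylogeny and codon bias of the chloroplast genome were then analyzed with MISA, EMBOSS, CodonW and other software. The complete chloroplast genome of P. kingianum is 155 852 bp long with an average GC content of 37.7%; the large single-copy (LSC) and small single-copy regions are 84 633 and 18 525 bp long, respectively, and the inverted repeat region is 26 347 bp long. A total of 132 genes were annotated, including 86 protein-coding genes, 38 tRNA genes and 8 rRNA genes. The chloroplast genome contains 69 SSR loci, the vast majority of which are mononucleotide repeats of the A/T type. Phylogenetic analysis shows that P. kingianum is closely related to P. tessellatum, which may be linked to their geographic distribution. Codon bias analysis indicates that codon usage in the chloroplast genome is shaped more by natural selection than by mutation; nine optimal codons were identified. The analysis of the genetic structure, phylogenetic position and codon bias of the P. kingianum chloroplast genome thus provides a theoretical basis for chloroplast genetic engineering.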
18.
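Analysis of factors influencing codon usage in the Burkholderia mallei genome Total citations: 1, self-citations: 0, citations by others: 1
Codon usage in the genome of Burkholderia mallei ATCC 23344 is influenced by multiple factors. Based on the complete genome sequence of this bacterium, this study used multivariate statistical analysis and correspondence analysis to examine codon usage patterns across the whole genome and the factors that influence them. The results show that gene expression level is the main factor influencing codon usage; the base composition of coding regions, protein hydrophobicity and gene length also have some effect, but less than expression level. By comparing codon usage in highly and lowly expressed genes, 21 codons, including GCG and CUC, were identified as the major preferred codons of B. mallei. These results provide a theoretical basis and practical guidance for studies of codon usage in B. mallei, molecular-level studies of species evolution, prediction of unknown genes in the genome, identification of open reading frames, expression of functional genes, and development of a glanders vaccine.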
19.
Brian Charlesworth 《Genetics》2013,194(4):955-971
Genomic traits such as codon usage and the lengths of noncoding sequences may be subject to stabilizing selection rather than purifying selection. Mutations affecting these traits are often biased in one direction. To investigate the potential role of stabilizing selection on genomic traits, the effects of mutational bias on the equilibrium value of a trait under stabilizing selection in a finite population were investigated, using two different mutational models. Numerical results were generated using a matrix method for calculating the probability distribution of variant frequencies at sites affecting the trait, as well as by Monte Carlo simulations. Analytical approximations were also derived, which provided useful insights into the numerical results. A novel conclusion is that the scaled intensity of selection acting on individual variants is nearly independent of the effective population size over a wide range of parameter space and is strongly determined by the logarithm of the mutational bias parameter. This is true even when there is a very small departure of the mean from the optimum, as is usually the case. This implies that studies of the frequency spectra of DNA sequence variants may be unable to distinguish between stabilizing and purifying selection. A similar investigation of purifying selection against deleterious mutations was also carried out. Contrary to previous suggestions, the scaled intensity of purifying selection with synergistic fitness effects is sensitive to population size, which is inconsistent with the general lack of sensitivity of codon usage to effective population size.
20.
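Analysis of genetic codon usage frequency in species from different sections of Populus Total citations: 1, self-citations: 0, citations by others: 1
The degeneracy of the genetic code leads to codon preferences that differ among species. Understanding the codon usage characteristics of different species can guide the modification of foreign genes before they are introduced, enabling their efficient expression. Poplar is one of the most widely cultivated afforestation tree species in the world and has become a model plant for research on forest tree genetic engineering. In this study, high-frequency codon analysis was applied to the protein-coding sequences (CDS) of four poplar species: quaking aspen (P. tremuloides), Chinese white poplar (P. tomentosa), eastern cottonwood (P. deltoides) and black cottonwood (P. trichocarpa). The relative frequency of synonymous codons (RFSC) was calculated and the high-frequency codons of the four species were determined. Although codon usage differs somewhat among poplar species, the preferred codons differ very little, and the great majority are shared. Preferred codons differ for only a few amino acids, such as Pro, Thr and Cys. This commonality suggests that a foreign gene designed with the preferred codons of any one poplar species can also be used in the other species.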