首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
To investigate the genomic patterns of influenza A virus subtypes, such as H3N2, H9N2, and H5N1, we collected 1842 sequences of the hemagglutinin and neuraminidase genes from the NCBI database and parsed them into 7 categories: accession number, host species, sampling year, country, subtype, gene name, and sequence. The sequences that were isolated from the human, avian, and swine populations were extracted and stored in a MySQL database for intensive analysis. The GC content and relative synonymous codon usage (RSCU) values were calculated using JAVA codes. As a result, correspondence analysis of the RSCU values yielded the unique codon usage pattern (CUP) of each subtype and revealed no extreme differences among the human, avian, and swine isolates. H5N1 subtype viruses exhibited little variation in CUPs compared with other subtypes, suggesting that the H5N1 CUP has not yet undergone significant changes within each host species. Moreover, some observations may be relevant to CUP variation that has occurred over time among the H3N2 subtype viruses isolated from humans. All the sequences were divided into 3 groups over time, and each group seemed to have preferred synonymous codon patterns for each amino acid, especially for arginine, glycine, leucine, and valine. The bioinformatics technique we introduce in this study may be useful in predicting the evolutionary patterns of pandemic viruses.  相似文献   

2.
Analysis of synonymous codon usage in H5N1 virus and other influenza A viruses   总被引:11,自引:0,他引:11  
Zhou T  Gu W  Ma J  Sun X  Lu Z 《Bio Systems》2005,81(1):77-86
In this study, we calculated the codon usage bias in H5N1 virus and performed a comparative analysis of synonymous codon usage patterns in H5N1 virus, five other evolutionary related influenza A viruses and a influenza B virus. Codon usage bias in H5N1 genome is a little slight, which is mainly determined by the base compositions on the third codon position. By comparing synonymous codon usage patterns in different viruses, we observed that the codon usage pattern of H5N1 virus is similar with other influenza A viruses, but not influenza B virus, and the synonymous codon usage in influenza A virus genes is phylogenetically conservative, but not strain-specific. Synonymous codon usage in genes encoded by different influenza A viruses is genus conservative. Compositional constraints could explain most of the variation of synonymous codon usage among these virus genes, while gene function is also correlated to synonymous codon usages to a certain extent. However, translational selection and gene length have no effect on the variations of synonymous codon usage in these virus genes.  相似文献   

3.
A novel subtype of influenza A virus 09H1N1 has rapidly spread across the world. Evolutionary analyses of this virus have revealed that 09H1N1 is a triple reassortant of segments from swine, avian and human influenza viruses. In this study, we investigated factors shaping the codon usage bias of 09H1N1 and carried out cluster analysis of 60 strains of influenza A virus from different subtypes based on their codon usage bias. We discovered that more preferentially used codons of 09H1N1 are A-ended or U-ended...  相似文献   

4.
The pandemic of 1918 was caused by an H1N1 influenza A virus, which is a negative strand RNA virus; however, little is known about the nature of its direct ancestral strains. Here we applied a broad genetic and phylogenetic analysis of a wide range of influenza virus genes, in particular the PB1 gene, to gain information about the phylogenetic relatedness of the 1918 H1N1 virus. We compared the RNA genome of the 1918 strain to many other influenza strains of different origin by several means, including relative synonymous codon usage (RSCU), effective number of codons (ENC), and phylogenetic relationship. We found that the PB1 gene of the 1918 pandemic virus had ENC values similar to the H1N1 classical swine and human viruses, but different ENC values from avian as well as H2N2 and H3N2 human viruses. Also, according to the RSCU of the PB1 gene, the 1918 virus grouped with all human isolates and "classical" swine H1N1 viruses. The phylogenetic studies of all eight RNA gene segments of influenza A viruses may indicate that the 1918 pandemic strain originated from a H1N1 swine virus, which itself might be derived from a H1N1 avian precursor, which was separated from the bulk of other avian viruses in toto a long time ago. The high stability of the RSCU pattern of the PB1 gene indicated that the integrity of RNA structure is more important for influenza virus evolution than previously thought.  相似文献   

5.
A型流感病毒NS1基因密码子去优化改造引起病毒毒力减弱   总被引:1,自引:0,他引:1  
根据A型流感病毒密码子使用偏嗜性,选取稀有密码子对A/Puerto Rico/8/34(H1N1)病毒NS1基因内部110个氨基酸区域进行密码子同义突变改造,并全基因合成NS基因,利用反向遗传操作技术拯救出含有密码子去优化NS1基因的重组病毒(deoNS)。体外细胞噬斑形成实验和病毒生长曲线证明该病毒在MDCK细胞内的感染和复制能力比野生型病毒低约1000倍;BALB/c小鼠体内致病力实验证明deoNS病毒不能引起小鼠发病和死亡,该病毒在小鼠肺内的复制滴度比野生型病毒低100~1000倍。本研究探索了通过基因组密码子去优化改造途径降低A型流感病毒毒力的可行性,首次证明流感病毒NS1基因密码子去优化同义突变可以降低病毒毒力,为流感减毒活疫苗的研究提供了新的思路。  相似文献   

6.
A novel subtype of influenza A virus 09H1N1 has rapidly spread across the world. Evolutionary analyses of this virus have revealed that 09H1N1 is a triple reassortant of segments from swine, avian and human influenza viruses. In this study, we investigated factors shaping the codon usage bias of 09H1N1 and carried out cluster analysis of 60 strains of influenza A virus from different subtypes based on their codon usage bias. We discovered that more preferentially used codons of 09H1N1 are A-ended or U-ended, and the intra-genomic codon usage bias of 09H1N1 is quite low. Base composition constraint, dinucleotide biases and translational selection are the main factors influencing the codon usage bias of 09H1N1. At the genome level, we find that the codon usage bias of 09H1N1 is similar to H1N1 (A/swine/Kansas/77778/2007H1N1), H9N2 from Asia, H1N2 from Asia and North America and H3N2 from North America. Our results provide insight for understanding the processes governing evolution, regulation of gene expression, and revealing the evolution of 09H1N1.  相似文献   

7.
流感病毒基因的密码子偏好性及聚类分析   总被引:1,自引:0,他引:1  
徐利娟  钟金城  陈智华  穆松 《生物信息学》2010,8(2):175-179,186
流行性感冒病毒是一种造成人类及动物患流行性感冒的RNA病毒,它造成急性上呼吸道感染,并由空气迅速传播,在世界各地常有周期性的大流行。根据该病毒的基因组CDS序列,探讨了基因组序列密码子的使用模式和特性,并进行了病毒间的聚类分析。结果表明:流感病毒的G+C含量均低于A+U含量,偏向使用以A、U结尾的密码子的程度比使用以G、C结尾的较高,CUG、UCA、AGU、AGC、AGA、AGG、GUG、CCA、ACA、GGA、GCA、AUU、UGA、CAU、CAA、AAU、AAA、GAA等18个密码子为流感病毒共有的偏好性密码子,且以A结尾的居多,尤其偏爱AGA、GGA。聚类结果表明首先亚洲流感病毒H2N2和香港流感病毒H2N2聚为一类,亚洲流感病毒H1N1和俄罗斯流感病毒H1N1聚为一类,1997年和2003年~2004年发生的人禽流感聚为一类,说明它们的密码子使用的偏好性相似;而2009年爆发的甲型H1N1流感和任何一个流感的距离都比较远,说明甲型H1N1流感病毒是一种新型的病毒,不同于以往任何一种流感病毒。  相似文献   

8.
Two species of the DNA virus Torque teno sus virus (TTSuV), TTSuV1 and TTSuV2, have become widely distributed in pig-farming countries in recent years. In this study, we performed a comprehensive analysis of synonymous codon usage bias in 41 available TTSuV2 coding sequences (CDS), and compared the codon usage patterns of TTSuV2 and TTSuV1. TTSuV codon usage patterns were found to be phylogenetically conserved. Values for the effective number of codons (ENC) indicated that the overall extent of codon usage bias in both TTSuV2 and TTSuV1 was not significant, the most frequently occurring codons had an A or C at the third codon position. Correspondence analysis (COA) was performed and TTSuV2 and TTSuV1 sequences were located in different quadrants of the first two major axes. A plot of the ENC revealed that compositional constraint was the major factor determining the codon usage bias for TTSuV2. In addition, hierarchical cluster analysis of 41 TTSuV2 isolates based on relative synonymous codon usage (RSCU) values suggested that there was no association between geographic distribution and codon bias of TTSuV2 sequences. Finally, the comparison of RSCU for TTSuV2, TTSuV1 and the corresponding host sequence indicated that the codon usage pattern of TTSuV2 was similar to that of TTSuV1. However the similarity was low for each virus and its host. These conclusions provide important insight into the synonymous codon usage pattern of TTSuV2, as well as better understangding of the molecular evolution of TTSuV2 genomes.  相似文献   

9.
Analysis of codon usage pattern is important to understand the genetic and evolutionary characteristics of genomes. We have used bioinformatic approaches to analyze the codon usage bias (CUB) of the genes located in human Y chromosome. Codon bias index (CBI) indicated that the overall extent of codon usage bias was low. The relative synonymous codon usage (RSCU) analysis suggested that approximately half of the codons out of 59 synonymous codons were most frequently used, and possessed a T or G at the third codon position. The codon usage pattern was different in different genes as revealed from correspondence analysis (COA). A significant correlation between effective number of codons (ENC) and various GC contents suggests that both mutation pressure and natural selection affect the codon usage pattern of genes located in human Y chromosome. In addition, Y-linked genes have significant difference in GC contents at the second and third codon positions, expression level, and codon usage pattern of some codons like the SPANX genes in X chromosome.  相似文献   

10.
Dengue is the most common arthropod-borne viral (Arboviral) illness in humans. The genetic features concerning the codon usage of dengue virus (DENV) were analyzed by the relative synonymous codon usage, the effective number of codons and the codon adaptation index. The evolutionary distance between DENV and the natural hosts (Homo sapiens, Pan troglodytes, Aedes albopictus and Aedes aegypti) was estimated by a novel formula. Finally, the synonymous codon usage preference for the translation initiation region of this virus was also analyzed. The result indicates that the general trend of the 59 synonymous codon usage of the four genotypes of DENV are similar to each other, and this pattern has no link with the geographic distribution of the virus. The effect of codon usage pattern of Aedes albopictus and Aedes aegypti on the formation of codon usage of DENV is stronger than that of the two primates. Turning to the codon usage preference of the translation initiation region of this virus, some codons pairing to low tRNA copy numbers in the two primates have a stronger tendency to exist in the translation initiation region than those in the open reading frame of DENV. Although DENV, like other RNA viruses, has a high mutation to adapt its hosts, the regulatory features about the synonymous codon usage have been ‘branded’ on the translation initiation region of this virus in order to hijack the translational mechanisms of the hosts.  相似文献   

11.
Li Y  Wang C  Cheng X  Wu T  Zhang C 《Bio Systems》2011,104(1):42-47
Three very virulent infectious bursal disease virus (vvIBDV) strains were isolated from a single farm and shown to be phylogenetically related to the vvIBDV isolate UK661. In this study, a comparative analysis of the synonymous codon usage in the hypervariable region of theVP2 (vVP2) gene of the vvIBDV strains was done on viruses serially passaged in chicken embryos. Sequencing demonstrated that codons change during the serial passage in the vVP2 gene of the viruses. Nine codon mutations resulted in amino acids changes. The amino acid changes were I256V, I296L 6in isolate XA1989, A222P, I242V, Q253H, I256V in isolate XA1998, and Q253H, I256V, I296L in isolate XA2004. Three of the nine amino acid changes occurred at residue 256. The codons of the amino acids A232, N233, I234, T269, T283 and H338 changed to the synonymous codons in XA1989 after the 16th passage, in XA1998 after the 24th passage and in XA2004 22nd passage viruses. These mutations change the key amino acid residues Q253H and I256V in the domains which are essential for its virulence, and the synonymous codons were observed compared to classical virulent IBDV. The results indicated that the codon changes during the serial passage comprised of synonymous codon usage in the vVP2 gene of IBDV, and this synonymous codon bias was correlated with pathotypes. The extent of synonymous codon usage bias in the IBDV-vVP2 gene maybe influence the gene expression level and secondary structure of protein as well as hydrophobicity, therefore the results provide useful perspectives for evolution and understanding of the pathogenesis of IBDV.  相似文献   

12.
The helicase gene of Autographa californica multiple nucleopolyhedrovirus (AcMNPV) is not only involved in viral DNA replication, but also plays a role in viral host range. To identify the codon usage bias of helicase of AcMNPV, the codon usage bias of helicase was especially studies in AcMNPV and 41 reference strains of baculoviruses by calculating the codon adaptation index (CAI), effective number of codon (ENc), relative synonymous codon usage (RSCU), and other indices. The helicase of baculovirus is less biased (mean ENc?=?50.539?>?40; mean CAI?=?0.246). AcMNPV helicase has a strong bias toward the synonymous codons with G and C at the third codon position (GC3s?=?53.6%). The plot of GC3s against ENc values revealed that GC compositional constraints are the main factor that determines the codon usage bias of major of helicase. Several indicators supported that the codon usage pattern of helicase is mainly subject to mutation pressure. Analysis of variation in codon usage and amino acid composition indicated AcMNPV helicase shows the significant preference for one or more postulated codons for each amino acid. A cluster analysis based on RSCU values suggested that AcMNPV is evolutionarily closer to members of group I alphabaculovirus. Comparison of the codon usage pattern among E. coli, yeast, mouse, human and AcMNPV showed that yeast is a suitable expression system for AcMNPV helicase. AcMNPV helicase shows weak codon usage bias. This study may help in elucidating the functional mechanism of AcMNPV helicase and the evolution of baculovirus helicases.  相似文献   

13.
紫花苜蓿叶绿体基因组密码子偏好性分析   总被引:1,自引:0,他引:1  
喻凤  韩明 《广西植物》2021,41(12):2069-2076
为分析紫花苜蓿叶绿体基因组密码子偏好性的使用模式,该文以紫花苜蓿叶绿体基因组中筛选到的49条蛋白质编码序列为研究对象,利用CodonW、CUSP、CHIPS、SPSS等软件对其密码子的使用模式和偏好性进行研究。结果表明:(1)紫花苜蓿叶绿体基因的第3位密码子的平均GC含量为26.44%,有效密码子数(ENC)在40.6~51.41之间,多数密码子的偏好性较弱。(2)相对同义密码子使用度(RSCU)分析发现,RSCU>1 的密码子数目有30个,以A、U结尾的有29个,说明了紫花苜蓿叶绿体基因组A或U出现的频率较高。(3)中性分析发现,GC3与 GC12的相关性不显著,表明密码子偏性主要受自然选择的影响; ENC-plot 分析发现一部分基因落在曲线的下方及周围,表明突变也影响了部分密码子偏性的形成。此外,有17个密码子被鉴定为紫花苜蓿叶绿体基因组的最优密码子。紫花苜蓿叶绿体基因组的密码子偏好性可能受自然选择和突变的共同作用。该研究将为紫花苜蓿叶绿体基因工程的开展和目标性状的遗传改良奠定基础。  相似文献   

14.
In the present study, we examined GC nucleotide composition, relative synonymous codon usage (RSCU), effective number of codons (ENC), codon adaptation index (CAI) and gene length for 308 prokaryotic mechanosensitive ion channel (MSC) genes from six evolutionary groups: Euryarchaeota, Actinobacteria, Alphaproteobacteria, Betaproteobacteria, Firmicutes, and Gammaproteobacteria. Results showed that: (1) a wide variation of overrepresentation of nucleotides exists in the MSC genes; (2) codon usage bias varies considerably among the MSC genes; (3) both nucleotide constraint and gene length play an important role in shaping codon usage of the bacterial MSC genes; and (4) synonymous codon usage of prokaryotic MSC genes is phylogenetically conserved. Knowledge of codon usage in prokaryotic MSC genes may benefit from the study of the MSC genes in eukaryotes in which few MSC genes have been identified and functionally analysed.  相似文献   

15.
《Genomics》2021,113(4):2177-2188
The prevailing COVID-19 pandemic has drawn the attention of the scientific community to study the evolutionary origin of Severe Acute Respiratory Syndrome Corona Virus 2 (SARS-CoV-2). This study is a comprehensive quantitative analysis of the protein-coding sequences of seven human coronaviruses (HCoVs) to decipher the nucleotide sequence variability and codon usage patterns. It is essential to understand the survival ability of the viruses, their adaptation to hosts, and their evolution.The current analysis revealed a high abundance of the relative dinucleotide (odds ratio), GC and CT pairs in the first and last two codon positions, respectively, as well as a low abundance of the CG pair in the last two positions of the codon, which might be related to the evolution of the viruses. A remarkable level of variability of GC content in the third position of the codon among the seven coronaviruses was observed. Codons with high RSCU values are primarily from the aliphatic and hydroxyl amino acid groups, and codons with low RSCU values belong to the aliphatic, cyclic, positively charged, and sulfur-containing amino acid groups. In order to elucidate the evolutionary processes of the seven coronaviruses, a phylogenetic tree (dendrogram) was constructed based on the RSCU scores of the codons. The severe and mild categories CoVs were positioned in different clades. A comparative phylogenetic study with other coronaviruses depicted that SARS-CoV-2 is close to the CoV isolated from pangolins (Manis javanica, Pangolin-CoV) and cats (Felis catus, SARS(r)-CoV). Further analysis of the effective number of codon (ENC) usage bias showed a relatively higher bias for SARS-CoV and MERS-CoV compared to SARS-CoV-2. The ENC plot against GC3 suggested that the mutational bias might have a role in determining the codon usage variation among candidate viruses.A codon adaptability study on a few human host parasites (from different kingdoms), including CoVs, showed a diverse adaptability pattern. SARS-CoV-2 and SARS-CoV exhibit relatively lower but similar codon adaptability compared to MERS-CoV.  相似文献   

16.
M Bulmer 《Nucleic acids research》1990,18(10):2869-2873
The effect of neighbouring bases on the usage of synonymous codons in genes with low codon usage bias in yeast and E. coli is examined. The codon adaptation index is employed to identify a group of genes in each organism with low codon usage bias, which are likely to be weakly expressed. A similar pattern is found in complementary sequences with respect to synonymous usage of A vs G or of U vs C. It is suggested that this may reflect an effect of context on mutation rates in weakly expressed genes.  相似文献   

17.
目前,有关同义密码子使用偏性对蛋白质折叠的影响研究中,样本蛋白均来源于不同的物种。考虑到同义密码子使用偏性的物种差异性,选取枯草杆菌的核蛋白为研究对象。首先,将每条核蛋白按二级结构截取为α螺旋片段、β折叠片段和无规卷曲(α-β混合)片段,并计算其蛋白质折叠速率。然后,整理每个片段相应的核酸序列信息,计算其同义密码子使用度。在此基础上,分析枯草芽孢杆菌核蛋白的同义密码子使用偏性与蛋白质折叠速率的相关性。发现对于不同二级结构的肽链片段,都有部分密码子的使用偏性与其对应的肽链折叠速率显著相关。进一步分析发现,与肽链片段折叠速率显著相关的密码子绝大部分为枯草杆菌全序列或核蛋白序列的每一组同义密码子中使用度最高的密码子。结果表明,在蛋白质的折叠过程中,枯草芽孢杆菌的同义密码子使用偏性起着重要作用。  相似文献   

18.
Mycoplasma bovis is a major pathogen causing arthritis, respiratory disease and mastitis in cattle. A better understanding of its genetic features and evolution might represent evidences of surviving host environments. In this study, multiple factors influencing synonymous codon usage patterns in M. bovis (three strains’ genomes) were analyzed. The overall nucleotide content of genes in the M. bovis genome is AT-rich. Although the G and C contents at the third codon position of genes in the leading strand differ from those in the lagging strand (p<0.05), the 59 synonymous codon usage patterns of genes in the leading strand are highly similar to those in the lagging strand. The over-represented codons and the under-represented codons were identified. A comparison of the synonymous codon usage pattern of M. bovis and cattle (susceptible host) indicated the independent formation of synonymous codon usage of M. bovis. Principal component analysis revealed that (i) strand-specific mutational bias fails to affect the synonymous codon usage pattern in the leading and lagging strands, (ii) mutation pressure from nucleotide content plays a role in shaping the overall codon usage, and (iii) the major trend of synonymous codon usage has a significant correlation with the gene expression level that is estimated by the codon adaptation index. The plot of the effective number of codons against the G+C content at the third codon position also reveals that mutation pressure undoubtedly contributes to the synonymous codon usage pattern of M. bovis. Additionally, the formation of the overall codon usage is determined by certain evolutionary selections for gene function classification (30S protein, 50S protein, transposase, membrane protein, and lipoprotein) and translation elongation region of genes in M. bovis. The information could be helpful in further investigations of evolutionary mechanisms of the Mycoplasma family and heterologous expression of its functionally important proteins.  相似文献   

19.
In this study codon usage bias of all experimentally known genes of Lactococcus lactis has been analyzed. Since Lactococcus lactis is an AT rich organism, it is expected to occur A and/or T at the third position of codons and detailed analysis of overall codon usage data indicates that A and/or T ending codons are predominant in this organism. However, multivariate statistical analyses based both on codon count and on relative synonymous codon usage (RSCU) detect a large number of genes, which are supposed to be highly expressed are clustered at one end of the first major axis, while majority of the putatively lowly expressed genes are clustered at the other end of the first major axis. It was observed that in the highly expressed genes C and T ending codons are significantly higher than the lowly expressed genes and also it was observed that C ending codons are predominant in the duets of highly expressed genes, whereas the T endings codons are abundant in the quartets. Abundance of C and T ending codons in the highly expressed genes suggest that, besides, compositional biases, translational selection are also operating in shaping the codon usage variation among the genes in this organism as observed in other compositionally skewed organisms. The second major axis generated by correspondence analysis on simple codon counts differentiates the genes into two distinct groups according to their hydrophobicity values, but the same analysis computed with relative synonymous codon usage values could not discriminate the genes according to the hydropathy values. This suggests that amino acid composition exerts constraints on codon usage in this organism. On the other hand the second major axis produced by correspondence analysis on RSCU values differentiates the genes into two groups according to the synonymous codon usage for cysteine residues (rarest amino acids in this organism), which is nothing but a artifactual effect induced by the RSCU values. Other factors such as length of the genes and the positions of the genes in the leading and lagging strand of replication have practically no influence in the codon usage variation among the genes in this organism.  相似文献   

20.
糜子叶绿体基因组密码子使用偏性的分析   总被引:2,自引:0,他引:2       下载免费PDF全文
密码子使用偏性(CUB)是生物体重要的进化特征,对研究物种进化、基因功能以及外源基因表达等具有重要科学意义。本研究利用糜子(Panicum miliaceum L.)叶绿体基因组中筛选出的53条蛋白编码序列,对其密码子使用模式及偏性进行了分析。结果表明,糜子叶绿体基因的有效密码子数(ENC)在37.14~61之间,多数密码子的偏性较弱。相对同义密码子使用度(RSCU)分析发现,RSCU > 1的密码子有32个,其中28个以A、U结尾,表明第3位密码子偏好使用A和U碱基。中性分析发现,GC3与GC12的相关性不显著,回归曲线斜率为0.2129,表明密码子偏性主要受到自然选择的影响;而ENC-plot分析发现大部分基因落在曲线的上方及周围,表明突变也影响了密码子偏性的形成。进一步的对应性分析发现,第1轴为主要影响因素,解释了17.92%的差异,其与ENC、GC3S值的相关性均达到显著水平,但与CBI、GCall不相关。最后,9个密码子被鉴定为糜子叶绿体基因组的最优密码子,糜子叶绿体基因组的密码子使用偏性可能受选择和突变共同作用。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号