首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Abstract

Norovirus GII.4 variants, a genotype in genogroup II belonging to the genus Norovirus, is a single-strand positive sense RNA containing three open reading frames (ORF1, ORF2 and ORF3) and is the most important pathogen causing nonbacterial gastroenteritis outbreaks. By using bioinformatic softwares such as Codon W, SPSS and so on, a total of 292 strains of the viruses isolated from 1974 to 2016 were analyzed for nucleotide composition and synonymous codon usage in each ORF. The result shows that it is enriched for A over the other bases in nucleotide composition, G behind the other bases in the 3rd site of all synonymous codons in the three ORFs. The patterns of nucleotide composition and codon bias of ORF2 are similar to those of ORF3 and different from those of ORF1. There are generally UpA motif and CpG motif in the codons with the lowest proportion. Correspondence analysis indicates that the codon usage may be changing over a certain time period for ORF1 in 2006 and 2012, ORF2 in 2012, and ORF3 in 2013. ENC (effective number of codons) plot and other analyses indicate that both natural selection and mutational pressure play partly roles in the ORFs, but natural selection is more important for ORF2 and ORF3. Besides, we also found all optimal codons in the ORFs. The study provides a basic understanding of the mechanism for norovirus GII.4 codon usage bias. Abbreviations ORF Open Reading Frame

ENC Effective Number of Codons

COA correspondence analysis

RSCU Relative Synonymous Codon Usage

CAI Codon Adaptation Index

CBI Codon Bias Index

Fop frequency of optimal codons

L_sym number of synonymous codons

L_aa length amino acids

GRAVY grand average of hydropathicity

Aroma aromaticity

Communicated by Ramaswamy H. Sarma  相似文献   

2.
The complete nucleotide sequence of plasmid pAP4 isolated from Acetobacter pasteurianus 2374T has been determined. Plasmid pAP4 was analysed and found to be 3,870 bp in size with a G+C content of 50.1%. Computer assisted analysis of sequence data revealed 2 possible ORFs with typical promoter regions. ORF1 codes for a protein responsible for kanamycin resistance similar with Tn5 transposone, ORF2 encodes a resistance to ampicillin identical with Tn3 transposone. Plasmid has in A. pasteurianus five copies and in E. coli DH1 about 30 copies per chromosome and it segregation stability in both strains is very high. Based on the data on replication region, plasmid does not code for a replication protein and origin region is similar with ColE1-like plasmid.  相似文献   

3.
We have sequenced and genetically characterized comF, a Bacillus subtilis competence locus, previously identified by Tn917 transposon insertion mutagenesis. Expression of the locus, in which three open reading frames (ORFs) were found, is driven by a single σ;A-like promoter in front of comFORF1 and is dependent on early regulatory competence genes and only expressed in competence medium. The predicted amino acid sequences of two of the ORFs showed similarities to known proteins in the Gen-Bank and SwissProt databases: ComFORF1 is similar to an extensive family of ATP-dependent RNA/DNA helicases with closer similarity to the DEAD protein subfamily and to the PriA protein in Escherichia coli. The latter is a DNA translocase/helicase required for primosome assembly at the replication fork of phage φX174. ComFORF3 is 22% identical to Com101, a protein required for genetic competence in Haemophilus influenzae, a naturally competent Gram-negative bacterium. In-frame comFORF1 deletions were 1000-fold deficient in transformability compared to the wild type, whereas disruptions of the other two ORFs were only five- to 10-fold lower. These observations allow us to hypothesize that the ComFORF1 late gene product plays an essential role during the binding and uptake events involved in Bacillus subtilis transformation.  相似文献   

4.
Summary A broad-spectrum mercury resistance locus (mer) from a spontaneous chloramphenicol-sensitive (Cms), arginine auxotrophic (Arg) mutant of Streptomyces lividan 1326 was isolated on a 6 kb DNA fragment by shotgun cloning into the mercury-sensitive derivative S. lividans TK64 using the vector pIJ702. The mer genes form part of a very large amplifiable DNA sequence present in S. lividans 1326. This element was amplified to about 20 copies per chromosome in the Cms Arg mutant and was missing from strains like S. lividans TK64, cured for the plasmid SLP3. DNA sequence analysis of a 5 kb region encompassing the whole region required for broad-spectrum mercury resistance revealed six open reading frames (ORFs) transcribed in opposite directions from a common intercistronic region. The protein sequences predicted from the two ORFs transcribed in one direction showed a high degree of similarity to mercuric reductase and organomercurial lyase from other gram-negative and gram-positive sources. Few, if any, similarities were found between the predicted polypeptide sequences of the other four ORFs and other known proteins.  相似文献   

5.
Studies of gonococcal pilus biogenesis are fundamental to understanding organelle structure/function relationships and identifying new approaches to controlling disease. This area of research is also relevant to elucidating the basic mechanisms of outer membrane translocation of macromolecules, which requires components highly related to those involved in type IV pilus expression. Previous studies have shown that products of several ancillary pil genes are required for organelle biogenesis but of these only PilQ, a member of the GspD protein family, is a component of the outer membrane. DNA sequencing of the region upstream of pilQ revealed the presence of two open reading frames (ORFs) whose deduced polypeptides shared significant identities with proteins required for pilus expression in Pseudomonas aeruginosa and Pseudomonas syringae, the genes for which are arrayed upstream of a gene encoding a PilQ homologue. Gonococcal mutants bearing transposon insertions in these ORFs were non-piliated and failed to express pilus-associated phenotypes, and the corresponding genes were designated pilO and pilP. The piliation defects in the mutants could not be ascribed to polarity on distal pilQ expression as shown by direct measurement of PilQ antigen in those backgrounds and the use of a novel technique to create tandem duplications in the gonococcus (Gc) genome. As predicted by the presence of a consensus lipoprotein signal sequence, PilP expressed in both Escherichia coli and Gc could be labelled with [3H]-palmitic acid. PilP? as well as PilQ? mutants shed PilC, a protein which facilitates pilus assembly and is implicated in epithelial cell adherence, in a soluble form. Combined with the finding that levels of multimerized PilQ were greatly reduced in PilP? mutants, the results suggest that PilP is required for PilQ function and that PilQ and PilC may interact during the terminal stages of pilus biogenesis. The findings also support the hypothesis that the Gc PilQ multimer corresponds to a physiologically relevant form of the protein required for pilus biogenesis.  相似文献   

6.
The chemical mutagen ethylmethanesulphonate (EMS) has been used to generate mutants of Erwinia carotovora subspecies carotovora which are defective in the secretion of pectinases (Pel) and cellulases (Cel) but unaltered for protease (Prt) secretion. Such mutants, called Out? still synthesize Pel and Cel but these enzymes accumulate within the periplasm. Cosmid clones carrying wild-type E. carotovra ssp. carotovora DNA, identified by their ability to restore the Out+ phenotype when transferred to some Out? mutants, were classified into six complementation groups using cosmids and cosmid derivatives. Analysis of the nucleotide sequence of a 12.7 kb DNA fragment, encompassing complementing cosmid inserts, revealed a coding capacity for 13 potential open reading frames (ORFs), and these were designated outC-outO. Some of the out gene products were visualized using a T7 gene 10 expression system. The predicted Out proteins are highly similar to components of extracellular enzyme secretion systems from a diverse range of eubacteria including Erwinia chrysanthemi, Klebsiella oxytoca, Aeromonas hydrophila, Pseudomonas aeruginosa and Xanthomonas campestris. Lower levels of similarity exist between Ecc Out proteins and components of macromolecular trafficking systems from Bacillus subtilis, Haemophilus influenzae, Agrobacterium tumefaciens, Yersinia pestis and a protein involved in the morphogenesis of filamentous bacteriophages such as M13.  相似文献   

7.
8.
The nucleotide sequence of a 4,539 bp fragment of Bacillus stearothermophilus V mediating tellurite resistance in Escherichia coli was determined. Four ORFs of more than 150 amino acids encoding polypeptides of 244, 258, 308, and 421 residues were found in the restriction fragment. E. coli cells harboring a recombinant plasmid containing the Ter determinant express, when challenged with tellurite, a 32 kDa protein with an amino terminal sequence identical to the ten first residues of the 308 ORF. This ORF shows great similarity with the cysteine synthase gene (cysK) of a number of organisms. Recombinant clones carrying the active cysK gene have minimal inhibitory concentrations to K2TeO3 that were tenfold higher than those determined for the host strain or that of clones carrying ORFs 244, 258, and 421. Introduction of the B. stearothermophilus V cysK gene into a cysK strain of Salmonella typhimurium LT2 resulted in complementation of the mutation as well as transfer of tellurite resistance. Received: 28 March 2001 / Accepted: 17 April 2001  相似文献   

9.
A metagenomic fosmid library was constructed using a genomic DNA mixture extracted from the gut microflora of abalone. The library gave an alginate lyase positive clone (AlyDW) harboring a 31.7-kbp insert. The AlyDW insert consisted of 22 open reading frames (ORFs). The deduced amino acid sequences of ORFs 11–13 were similar to those of known alginate lyase genes, which are found adjacent in the genome of Klebsiella pneumoniae subsp. aerogenes, Vibrio splendidus, and Vibrio sp. belonging to the phylum Gammaproteobacteria. Among the three recombinant proteins expressed from the three ORFs, alginate lyase activity was only observed in the recombinant protein (AlyDW11) coded by ORF 11. The expressed protein (AlyDW11) had the highest alginate lyase activity at pH 7.0 and 45°C in the presence of 1 mM AgNO3. The alginate lyase activity of ORF 11 was confirmed to be endolytic by thin-layer chromatography. AlyDW11 preferred poly(β-d-mannuronate) as a substrate over poly(α-l-guluronate). AlyDW11 contained three highly conserved regions, RSEL, QIH, and YFKAGVYNQ, which may act to stabilize the three-dimensional conformation and function of the alginate lyase.  相似文献   

10.
The distributions and population densities of large benthic foraminifers (LBFs) were investigated on reef flats of the Majuro Atoll, Marshall Islands. Annual sediment production by foraminifers was estimated based on population density data. Predominant LBFs were Calcarina and Amphistegina, and the population densities of these foraminifers varied with location and substratum type on reef flats. Both foraminifers primarily attached to macrophytes, particularly turf-forming algae, and were most abundant on an ocean reef flat (ORF) and in an inter-island channel near windward, sparsely populated islands. Calcarina density was higher on windward compared to leeward sides of ORFs, whereas Amphistegina density was similar on both sides of ORFs. These foraminifers were more common on the ocean side relative to the lagoon side of reef flats around a windward reef island, and both were rare or absent in nearshore zones around reef islands and on an ORF near windward, densely populated islands. Foraminiferal production rates varied with the degree to which habitats were subject to water motion and human influences. Highly productive sites (>103 g CaCO3 m−2 year−1) included an ORF and an inter-island channel near windward, sparsely populated islands, and a seaward area of a reef flat with no reef islands. Low-productivity sites (<10 g CaCO3 m−2 year−1) included generally nearshore zones of lagoonal reef flats, leeward ORFs, and a windward ORF near densely populated islands. These results suggest that the distribution and production of LBFs were largely influenced by a combination of natural environmental factors, including water motion, water depth, elevation relative to the lowest tidal level at spring tide, and the distribution of suitable substratum. The presence of reef islands may limit the distribution and production of foraminifers by altering water circulation in nearshore environments. Furthermore, increased anthropogenic factors (population and activities) may adversely affect foraminiferal distribution and production.  相似文献   

11.
A 24-kb plasmid with 21 open reading frames (ORFs) was newly isolated from Corynebacterium glutamicum ATCC 14997 and named pCGR2. Three of its ORFs were indispensable for stable autonomous replication of pCGR2 in C. glutamicum: in the absence of selective pressure, deletion derivatives of pCGR2 containing the three ORFs showed stability in C. glutamicum for over 50 generations. The first of these ORFs encoded replicase repA whose gene product revealed high amino acid sequence similarity to corresponding gene products of C. glutamicum pCG1-family plasmids in general, and to that of pTET3 plasmid repA in particular. The other two ORFs were located upstream of repA and exhibited high sequence similarity to pTET3 parA and parB, respectively. Interestingly, plasmids based on the pCGR2 were compatible not only with those based on different family plasmids (pBL1, pCASE1) but also with those based on pCG1-family plasmid. Plasmids comprising pCGR2 repA showed a copy number of four in C. glutamicum. The number increased to 240 upon introduction of a mutation within the repA origin of the putative promoter for counter-transcribed RNA. This 60-fold increase in copy number should immensely contribute towards enhanced expression of desired genes in C. glutamicum.  相似文献   

12.
He-T sequences are a complex repetitive family of DNA sequences in Drosophila that are associated with telomeric regions, pericentromeric heterochromatin, and the Y chromosome. A component of the He-T family containing open reading frames (ORFs) is described. These ORF-containing elements within the He-T family are designated T-elements, since hybridization in situ with the polytene salivary gland chromosomes results in detectable signal exclusively at the chromosome tips. One T-element that has been sequenced includes ORFs of 1,428 and 1,614 bp. The ORFs are overlapping but one nucleotide out of frame with respect to each other. The longer ORF contains cysteine-histidine motifs strongly resembling nucleic acid binding domains of gag-like proteins, and the overall organization of the T-element ORFs is reminiscent of LINE elements. The T-elements are transcribed and appear to be conserved in Drosophila species related to D. melanogaster. The results suggest that T-elements may play a role in the structure and/or function of telomeres.by W. Hennig  相似文献   

13.
Forty-one open reading frames (ORFs) were identified in a 32-kb DNA fragment of alkaliphilic Bacillus sp. C-125. A similarity search using the BSORF database found 37 ORFs with significant sequence similarity to B. subtilis RNA polymerase subunits, elongation factor G, elongation factor Tu, and ribosomal proteins. Each ORF product showed more than 70% identity to those of B. subtilis. Gene organization in the region of str, S10, spc, and the α cluster was highly conserved among three strains, C-125, B. subtilis, and B. stearothermophilus.  相似文献   

14.
A clone isolated from a Drosophila auraria heat-shock cDNA library presents two long, antiparallel, coupled (LAC) open reading frames (ORFs). One strand ORF is 1,929 nucleotides long and exhibits great identity (87.5% at the nucleotide level and 94% at the amino acid level) with the hsp70 gene copies of D. melanogaster, while the second strand ORF, in antiparallel in-frame register arrangement, is 1,839 nucleotides long and exhibits 32% identity with a putative, recently identified, NAD+-dependent glutamate dehydrogenase (NAD+-GDH). The overlap of the two ORFs is 1,824 nucleotides long. Computational analysis shows that this LAC ORF arrangement is conserved in other hsp70 loci in a wide range of organisms, raising questions about possible evolutionary benefits of such a peculiar genomic organization.Correspondence to: Z.G. Scouras  相似文献   

15.
Cauliflower mosaic virus (CaMV) is a plant pararetrovirus with a double-stranded DNA genome. It is the type member of the genus Caulimovirus in the family Caulimoviridae. CaMV is transmitted by sap inoculation and in nature by aphids in a semi-persistent manner. To investigate the patterns and timescale of CaMV migration and evolution, we sequenced and analyzed the genomes of 67 isolates of CaMV collected mostly in Greece, Iran, Turkey, and Japan together with nine published sequences. We identified the open-reading frames (ORFs) in the genomes and inferred their phylogeny. After removing recombinant sequences, we estimated the substitution rates, divergence times, and phylogeographic patterns of the virus populations. We found that recombination has been a common feature of CaMV evolution, and that ORFs I–V have a different evolutionary history from ORF VI. The ORFs have evolved at rates between 1.71 and 5.81×10−4 substitutions/site/year, similar to those of viruses with RNA or ssDNA genomes. We found four geographically confined lineages. CaMV probably spread from a single population to other parts of the world around 400–500 years ago, and is now widely distributed among Eurasian countries. Our results revealed evidence of frequent gene flow between populations in Turkey and those of its neighboring countries, with similar patterns observed for Japan and the USA. Our study represents the first report on the spatial and temporal spread of a plant pararetrovirus.  相似文献   

16.
Thysanoplusia orichalcea multiple nucleopolyhedrovirus (ThorMNPV) has high virulence to Trichoplusia ni and Pseudoplusia includens larvae, with a potential for biological control of insect pests. The genome of ThorMNPV was sequenced and found to be 132,978 bp, with a G+C content of 37.9%. There are 145 predicted open reading frames (ORFs), encoding proteins of 50 or more amino acid residues with minimal overlap. Of the 145 ORFs, 141 appeared to be homologous to those of Autographa californica MNPV (AcMNPV). In comparison to AcMNPV, 9 ORFs of AcMNPV were absent in ThorMNPV, including the superoxide dismutase (sod) gene.  相似文献   

17.
A new plasmid designated pAsa6 from an Aeromonas salmonicida subsp. salmonicida strain isolated from diseased turbot has been characterized. pAsa6 consists of 18536 bp, has a G+C content of 53.8% and encodes 20 predicted open-reading frames (ORFs). Eight ORFs showed homology to transposases, of which six are complete and two are partial IS sequences. Two ORFs showed homology to replication proteins, and six ORFs showed homology to hypothetical proteins. Two ORFs are truncated homologs of putative A. salmonicida sulfatases. Two genes, aopH and sycH encode homologs of an effector protein for which a role in fish colonization by A. salmonicida has been previously reported, and its chaperone, respectively. The results of filter conjugation experiments suggested that pAsa6 is not mobilizable, as it failed to be conjugally-transferred to several species of marine bacteria tested. All the ORFs of pAsa6 with the exception of four copies of a IS1 transposase gene, have a counterpart in the recently sequenced 155-kb A. salmonicida plasmid pAsa5, suggesting either that pAsa6 is a derivative of pAsa5, or that pAsa5 is the result of the fusion of a pAsa6-like plasmid and a larger plasmid of ca. 135-kb. The pAsa6-encoded repA and aopH genes could be PCR-amplified from strains lacking pAsa6, suggesting presence of a large, possibly pAsa5-like plasmid that was not detected on agarose gels, or the existence of chromosome-integrated plasmid sequences. This study demonstrates that genomic locations for the aopH gene different to pAsa5 or pAsa5-like plasmids exist in A. salmonicida.  相似文献   

18.
19.
We determined and analyzed the Shigella flexneri serotype 5 (pSF5) and S. dysenteriae serotype 1 (pSD1) virulence plasmid genomes. The total length of pSF5 is 136513 bp, including 165 open reading frames (ORFs). Of these ORFs, 133 were identified and 32 of those had no significant homology to proteins with known functions. The length of pSD1 is 182545 bp, including 224 ORFs, of which we identified 181. The remaining 43 ORFs were not significantly homologous to proteins with known functions. The insertion sequence (IS) elements are 53787 bp in pSF5, and 49616 bp in pSD1, which represents 39.4% and 27.1% of the genome, respectively. There are 22 IS element types in pSF5 and pSD1, among which we report ISEc8 and ISSbo6 for the first time in the Shigella virulence plasmid. Compared to pCP301, there are a large number of deleted genes and gene inversions in both pSF5 and pSD1. The ipa-mxi-spa locus in pSF5 is completely absent, and the genes related to the O-antigen biosynthesis are partially missing. In contrast, the above genes in pSD1 are integral, with the exception of virF. The whole genome analysis of the two plasmids shows that the loss of genes related to gene invasion or regulation also obliterates the ability of pPF5 and pSD1 to bind Congo red (Crb). Whether these genes determine the Crb function requires continued investigation. These authors contributed equally to this work.  相似文献   

20.
The rulAB operon of Pseudomonas spp. confers fitness traits on the host and has been suggested to be a hotspot for insertion of mobile elements that carry avirulence genes. Here, for the first time, we show that rulB on plasmid pWW0 is a hotspot for the active site‐specific integration of related integron‐like elements (ILEs) found in six environmental pseudomonads (strains FH1–FH6). Integration into rulB on pWW0 occurred at position 6488 generating a 3 bp direct repeat. ILEs from FH1 and FH5 were 9403 bp in length and contained eight open reading frames (ORFs), while the ILE from FH4 was 16 233 bp in length and contained 16 ORFs. In all three ILEs, the first 5.1 kb (containing ORFs 1–4) were structurally conserved and contained three predicted site‐specific recombinases/integrases and a tetR homologue. Downstream of these resided ORFs of the ‘variable side’ with structural and sequence similarity to those encoding survival traits on the fitness enhancing plasmid pGRT1 (ILEFH1 and ILEFH5) and the NR‐II virulence region of genomic island PAGI‐5 (ILEFH4). Collectively, these ILEs share features with the previously described type III protein secretion system effector ILEs and are considered important to host survival and transfer of fitness enhancing and (a)virulence genes between bacteria.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号