首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
T K Frey  L D Marr 《Gene》1988,62(1):85-99
The sequence of the 3' 4508 nucleotides (nt) of the genomic RNA of the Therien strain of rubella virus (RV) was determined for cDNA clones. The sequence contains a 3189-nt open reading frame (ORF) which codes for the structural proteins C, E2 and E1. C is predicted to have a length of 300 amino acids (aa). The N-terminal half of the C protein is highly basic and hydrophilic in nature, and is putatively the region of the protein which interacts with the virion RNA. At the C terminus of the C protein is a stretch of 20 hydrophobic aa which also serves as the signal sequence for E2, indicating that the cleavage of C from the polyprotein precursor may be catalyzed by signalase in the lumen of the endoplasmic reticulum. E2 is 282 aa in length and contains four potential N-linked glycosylation sites and a putative transmembrane domain near its C terminus. The sequence of E1 has been previously described [Frey et al., Virology 154 (1986) 228-232]. No homology could be detected between the amino acid sequence of the RV structural proteins and the amino acid sequence of the alphavirus structural proteins. From the position of a region of 30 nt in the RV genomic sequence which exhibited significant homology with the sequence in the alphavirus genome at which subgenomic RNA synthesis is initiated, the RV subgenomic RNA is predicted to be 3346 nt in length and the nontranslated region from the 5' end of the subgenomic RNA to the structural protein ORF is predicted to be 98 nt. In a different translation frame beginning at the 5' end of the RV nt sequence reported here is a 1407 nt ORF which is the C terminal region of the nonstructural protein ORF. This ORF overlaps the structural protein ORF by 149 nt. A low level of homology could be detected between the predicted amino acid sequence of the C-terminus of the RV nonstructural protein ORF and the replicase proteins of several positive RNA viruses of animals and plants, including nsp4 of the alphaviruses, the protein encoded by the C-terminal region of the alphavirus nonstructural ORF. However, the overall homology between RV and the alphaviruses in this region of the genome was only 18%, indicating that these two genera of the Togavirus family are only distantly related. Intriguingly, there is a 2844-nt ORF present in the negative polarity orientation of the RV sequence which could encode a 928-aa polyprotein.  相似文献   

2.
Hepatitis E virus (HEV) is a major human pathogen in much of the developing world. It is a plus-strand RNA virus with a 7.2-kb polyadenylated genome consisting of three open reading frames, ORF1, ORF2, and ORF3. Of these, ORF2 encodes the major capsid protein of the virus and ORF3 encodes a small protein of unknown function. Using the yeast three-hybrid system and traditional biochemical techniques, we have studied the RNA binding activities of ORF2 and ORF3, two proteins encoded in the 3' structural part of the genome. Since the genomic RNA from HEV has been postulated to contain secondary structures at the 5' and 3' ends, we used these two terminal regions, besides other regions within the genome, in this study. Experiments were designed to test for interactions between the genomic RNA fusion constructs with ORF2 and ORF3 hybrid proteins in a yeast cellular environment. We show here that the ORF2 protein contains RNA binding activity. The ORF2 protein specifically bound the 5' end of the HEV genome. Deletion analysis of this protein showed that its RNA binding activity was lost when deletions were made beyond the N-terminal 111 amino acids. Finer mapping of the interacting RNA revealed that a 76-nucleotide (nt) region at the 5' end of the HEV genome was responsible for binding the ORF2 protein. This 76-nt region included the 51-nt HEV sequence, conserved across alphaviruses. Our results support the requirement of this conserved sequence for interaction with ORF2 and also indicate an increase in the strength of the RNA-protein interaction when an additional 44 bases downstream of this 76-nt region were included. Secondary-structure predictions and the location of the ORF2 binding region within the HEV genome indicate that this interaction may play a role in viral encapsidation.  相似文献   

3.
4.
We report the results from sequence analysis and expression studies of the gastroenteritis agent astrovirus serotype 1. We have cloned and sequenced 5,944 nucleotides (nt) of the estimated 7.2-kb RNA genome and have identified three open reading frames (ORFs). ORF-3, at the 3' end, is 2,361 nt in length and is fully encoded in both the genomic and subgenomic viral RNAs. Expression of ORF-3 in vitro yields an 87-kDa protein that is immunoprecipitated with a monoclonal antibody specific for viral capsids. This protein comigrates with an authentic 87-kDa astrovirus protein immunoprecipitated from infected cells, indicating that this region encodes a viral structural protein. The adjacent upstream ORF (ORF-2) is 1,557 nt in length and contains a viral RNA-dependent RNA polymerase motif. The viral RNA-dependent RNA polymerase motifs from four astrovirus serotypes are compared. Partial sequence (2,018 nt) of the most 5' ORF (ORF-1) reveals a 3C-like serine protease motif. The ORF-1 sequence is incomplete. These results indicate that the astrovirus genome is organized with nonstructural proteins encoded at the 5' end and structural proteins at the 3' end. ORF-2 has no start methionine and is in the -1 frame compared with ORF-1. We present sequence evidence for a ribosomal frameshift mechanism for expression of the viral polymerase.  相似文献   

5.
Higher-order cis-acting RNA replication structures have been identified in the 3'- and 5'-terminal untranslated regions (UTRs) of a bovine coronavirus (BCoV) defective interfering (DI) RNA. The UTRs are identical to those in the viral genome, since the 2.2-kb DI RNA is composed of only the two ends of the genome fused between an internal site within the 738-nucleotide (nt) 5'-most coding region (the nsp1, or p28, coding region) and a site just 4 nt upstream of the 3'-most open reading frame (ORF) (the N gene). The joined ends of the viral genome in the DI RNA create a single continuous 1,635-nt ORF, 288 nt of which come from the 738-nt nsp1 coding region. Here, we have analyzed features of the 5'-terminal 288-nt portion of the nsp1 coding region within the continuous ORF that are required for DI RNA replication. We observed that (i) the 5'-terminal 186 nt of the nsp1 coding region are necessary and sufficient for DI RNA replication, (ii) two Mfold-predicted stem-loops within the 186-nt sequence, named SLV (nt 239 to 310) and SLVI (nt 311 to 340), are supported by RNase structure probing and by nucleotide covariation among closely related group 2 coronaviruses, and (iii) SLVI is a required higher-order structure for DI RNA replication based on mutation analyses. The function of SLV has not been evaluated. We conclude that SLVI within the BCoV nsp1 coding region is a higher-order cis-replication element for DI RNA and postulate that it functions similarly in the viral genome.  相似文献   

6.
The periplasmic flagellum of Treponema phagedenis consists of the flagellar filament and hook-basal body. We report here a characterization of the hook gene and flagellar hook of T. phagedenis, and in the process of this analysis we found evidence that the hook polypeptide is likely cross-linked in situ. A T. phagedenis genomic library was screened with a Treponema pallidum antiserum, and the DNA segments from several positive plaques were subcloned and sequenced. DNA sequencing of two overlapping segments revealed a 1,389-nucleotide (nt) open reading frame (ORF) with a deduced amino acid sequence that was 36% identical to that of FlgE, the hook polypeptide of Salmonella typhimurium. This gene was designated T. phagedenis flgE. Beginning at 312 nt downstream from flgE was a partial ORF of 486 nt with a deduced amino acid sequence that was 33% identical to that of MotA of Bacillus subtilis, a polypeptide that enables flagellar rotation. Upstream of flgE, separated by 39 nt, was a partial (291-nt) ORF with a deduced amino acid sequence that was homologous to that of ORF8, a polypeptide of unknown function located in an operon encoding polypeptides involved in motility of B. subtilis. The T. phagedenis flgE gene was cloned into an Escherichia coli protein expression plasmid, and the purified recombinant protein was used to prepare a FlgE antiserum. Western blots (immunoblots) of whole-cell lysates probed with this antiserum revealed a 55-kDa polypeptide and a ladder of polypeptide bands with increasing molecular masses. T. phagedenis hooks were then isolated and purified, and electron microscopic analysis revealed that the morphology of the hooks resembled that in other bacteria. The hooks were slightly curved and had an average length of 69 +/- 8 nm and a diameter of 23 +/- 1 nm. Sodium dodecyl sulfate-polyacrylamide gel electrophoresis and Western blots of purified hook preparations using the FlgE antiserum also revealed a polypeptide ladder, suggesting that the hooks are composed of a covalently cross-linked polypeptide.  相似文献   

7.
T M Hohn  P D Beremand 《Gene》1989,79(1):131-138
The trichodiene synthase gene (Tox5) has been isolated from the fungus Fusarium sporotrichioides, and its nucleotide (nt) sequence determined. A lambda gt11 library of F. sporotrichioides DNA was screened with antiserum against trichodiene synthase (TS). DNA fragments were isolated which encode a portion of the Tox5 gene. In subsequent screening of the library we employed one of these DNAs as a probe and identified several recombinant phage containing the entire Tox5 gene. The gene consists of a 1182-nt open reading frame (ORF) which contains a 60-nt intron and specifies a Mr 43,999 protein. The deduced amino acid sequence of the ORF was identical to sequences determined for several CNBr peptides from purified TS. Southern and Northern analyses indicated that the Tox5 gene is present in a single copy and is transcribed into an mRNA of about 1450 nt. Upstream from the start codon, 'TATA'-like sequences and a short repeated sequence resembling the 'CCAAT' box were observed. The primary structure described for TS is the first such report for a member of the terpene cyclase group of enzymes.  相似文献   

8.
Nucleotide sequence of the traD region in the Escherichia coli F sex factor   总被引:11,自引:0,他引:11  
M B Jalajakumari  P A Manning 《Gene》1989,81(2):195-202
The complete nucleotide sequence has been determined of a 3635-bp region, extending from the HpaI site in traT, at F coordinate 90.3 kb, to beyond the end of traD, of the F sex factor plasmid of Escherichia coli K-12. This region contains the C-terminal coding part of traT and the entire traD gene. An open reading frame (ORF) of 2148 bp within the sequence confirms that traD encodes an 81.4-kDa cytoplasmic membrane protein. The TraD protein has several regions with an unusually high pI (greater than 10), suggesting that they may correspond to the DNA-binding domains. Several other ORFs were detected within the region including the gene (ORF1) for a 26.3-kDa protein and ORF2, probably corresponding to traI, which continues to the end of the sequence. An ORF for an 8.5-kDa protein preceded by an excellent promoter and ribosome-binding site is present in the region following traD but on the opposite strand. This promoter is thought to correspond to the major RNA polymerase binding site in this region, implying that traI does not have its own promoter. The lack of a typical terminator following traD and ORF1 and the translational coupling provided by overlapping stop and start codons is consistent with this conclusion.  相似文献   

9.
S G Kim  C A Batt 《Gene》1991,98(1):95-100
A genetic element which is conserved in the genomes of numerous Lactococcus lactis bacteriophage isolates has been identified and its nucleotide sequence determined. Approximately 95-99% of all L. lactis bacteriophages collected over a period of six years from two geographically distinct sources carry this conserved DNA fragment. Genetic variation in other regions of the genomes of these bacteriophages is exhibited by changes in the overall restriction patterns. The complete nt sequence for a 1.6-kb region from nine independent L. lactis bacteriophage isolates was determined and only five changes in the nt sequence were observed within a span of 1536 bp. This region has a single large 1356-bp open reading frame (ORF) coding for a 51-kDa protein. Three out of the five changes occur in a 187-bp region, 5' to this large ORF. The two additional changes are found within the 1356-bp ORF, which results in two amino acid substitutions that do not, however, change the net charge of the protein. The encoded protein is extremely charged and shares some homology with yeast translation initiation factor. In addition, there is a potential zinc-binding domain within this protein, similar to those observed in genes from bacteriophages T4 and T7.  相似文献   

10.
11.
H K Lamb  C F Roberts  A R Hawkins 《Gene》1992,112(2):219-224
A sequence of 3299 nt, contiguous with the previously sequenced quinate permease-encoding (qutD) gene and encompassing the dehydroshikimate dehydratase-encoding (qutC) gene, has been determined. Northern-blot analysis detected (i) a quinate-inducible mRNA of the expected size for the qutC gene, and (ii) a quinate-inducible mRNA of 1.45 kb divergently transcribed away from qutC towards qutD. Computer-aided sequence analysis identified an ORF of 1047 nt corresponding to the qutC gene encoding dehydroshikimate dehydratase. In addition, a genetically uncharacterized 1188-nt gene, designated qutH and containing a putative intron of 61 nt, was identified between qutC and qutD. The inferred protein sequence encoded by qutH contains a putative 'zinc cluster' motif and has a low (16%) but significant similarity with the DNA-directed DNA polymerase of hepatitis B virus. The results are interpreted as being consistent with the view that the qutH gene encodes a DNA-binding protein, possibly involved in the regulation of genes essential for the utilisation of protocatechuic acid.  相似文献   

12.
13.
14.
The L-A double-stranded RNA virus of Saccharomyces cerevisiae encodes its major coat protein (80 kDa) and a minor single-stranded RNA binding protein (180 kDa) that has immunological cross-reactivity with the major coat protein. The sequence of L-A cDNA clones revealed two open reading frames (ORF), ORF1 and ORF2. These two reading frames overlap by 130 base pairs and ORF2 is in the -1 reading frame with respect to ORF1. Although the major coat protein of the viral particles is encoded by ORF1, the 180-kDa protein is derived from the entire double-stranded RNA genome by fusing ORF1 and ORF2, probably by a -1 translational frameshift. Within the overlapping region is a sequence similar to that producing a -1 frameshift by "simultaneous slippage" in retroviruses. The coding sequence of ORF2 shows a pattern characteristic of viral RNA-dependent RNA polymerases of icosahedral (+)-strand RNA viruses. Thus, the 180-kDa protein is analogous to gag-pol fusion proteins.  相似文献   

15.
Using the vector pGEM-4-blue, a 4,251-base-pair DNA fragment containing the gene for the surface (S)-layer protein of Bacillus sphaericus 2362 was cloned into Escherichia coli. Determination of the nucleotide sequence indicated an open reading frame (ORF) coding for a protein of 1,176 amino acids with a molecular size of 125 kilodaltons (kDa). A protein of this size which reacted with antibody to the 122-kDa S-layer protein of B. sphaericus was detected in cells of E. coli containing the recombinant plasmid. Analysis of the deduced amino acid sequence indicated a highly hydrophobic N-terminal region which had the characteristics of a leader peptide. The first amino acid of the N-terminal sequence of the 122-kDa S-layer protein followed the predicted cleavage site of the leader peptide in the 125-kDa protein. A sequence characteristic of promoters expressed during vegetative growth was found within a 177-base-pair region upstream from the ORF coding for the 125-kDa protein. This putative promoter may account for the expression of this gene during the vegetative growth of B. sphaericus and E. coli. The gene for the 125-kDa protein was followed by an inverted repeat characteristic of terminators. Downstream from this gene (11.2 kilobases) was an ORF coding for a putative 80-kDa protein having a high sequence similarity to the 125-kDa protein. Evidence was presented indicating that this gene is cryptic.  相似文献   

16.
17.
18.
Burkholderia cepacia AC1100 utilizes 2,4,5-trichlorophenoxyacetic acid (2,4,5-T) as a sole source of carbon and energy. PT88 is a chromosomal deletion mutant of B. cepacia AC1100 and is unable to grow on 2,4,5-T. The nucleotide sequence of a 5.5-kb chromosomal fragment from B. cepacia AC1100 which complemented PT88 for growth on 2,4,5-T was determined. The sequence revealed the presence of six open reading frames, designated ORF1 to ORF6. Five polypeptides were produced when this DNA region was under control of the T7 promoter in Escherichia coli; however, no polypeptide was produced from the fourth open reading frame, ORF4. Homology searches of protein sequence databases were performed to determine if the proteins involved in 2,4,5-T metabolism were similar to other biodegradative enzymes. In addition, complementation studies were used to determine which genes were essential for the metabolism of 2,4,5-T. The first gene of the cluster, ORF1, encoded a 37-kDa polypeptide which was essential for complementation of PT88 and showed significant homology to putative trans-chlorodienelactone isomerases. The next gene, ORF2, was necessary for complementation and encoded a 47-kDa protein which showed homology to glutathione reductases. ORF3 was not essential for complementation; however, both the 23-kDa protein encoded by ORF3 and the predicted amino acid sequence of ORF4 showed homology to glutathione S-transferases. ORF5, which encoded an 11-kDa polypeptide, was essential for growth on 2,4,5-T, but the amino acid sequence did not show homology to those of any known proteins. The last gene of the cluster, ORF6, was necessary for complementation of PT88, and the 32-kDa protein encoded by this gene showed homology to catechol and chlorocatechol-1,2-dioxygenases.  相似文献   

19.
The nucleotide (nt) sequence of a DNA segment containing the majority of a gene cloned from Bacillus thuringiensis DSIR517 encoding a 130 kDa insecticidal crystal protein has been determined. Sequence analysis reveals an open reading frame (ORF) of 3453 nt. The ATG initiation codon, which is preceded by a potential ribosome-binding site sequence, was confirmed by N-terminal amino acid sequencing. The ORF extends beyond the 3' terminus of the cloned fragment; however, the high degree of homology between the deduced amino acid sequence of this ORF and other Cry proteins suggests the clone lacks only five C-terminal amino acids. Making this assumption, the ORF of 3468 nt encodes a protein of 1156 amino acids with an estimated molecular mass of 129700 Da. Analysis of the deduced amino acid sequence reveals a number of features characteristic of Cry proteins. Alignment of the Cry 517 protein sequence with other Cry proteins suggests it is most closely related to the cryIA-E genes but sufficiently different to form a new cryI gene subclass.  相似文献   

20.
《Gene》1996,173(2):189-194
A protein present on the cell surface of Bacillus licheniformis (Bl) NM105 was identified as an S-layer (OlpA in this paper), a protein present on many bacterial cell surfaces. Purification, SDS-PAGE and isoelectrofocusing showed one 94-kDa, slightly acidic (pI 6.5) protein band (defined as OlpA). The pure protein OlpA, has a tetragonal symmetry of its morphological subunits. Following Edman degradation, three 17-mer oligodeoxyribonucleotide (oligo) probes corresponding to the N-terminal sequence of OlpA were synthesized and used for gene cloning. The nucleotide (nt) sequence of the cloned gene (olpA) showed an ORF and encoded an 874 amino acid (aa) protein. In the promoter region of olpA, there appear to be -10 and -35 ∂A-binding sites, as well as -10 and -35 regions specific for∂H. The existence of these two potential promoters suggests that OlpA would be produced during both the vegetative and sporulating stages of growth. The ribosome-binding site (RBS) sequence perfectly matched its consensus sequence, suggesting a high efficiency of translation of olpA. A typical 29-aa leader peptide, characteristic of secretory proteins in Bacilli, is present in the OlpA pre-protein sequence. In olpA, there are two stem-loop structures in tandem, downstream from the stop codon. These stem-loops are probably involved in prolonged olpA expression, by extending the half life of the mRNA.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号