Similar literature
Found 20 similar articles (search time: 250 ms)
1.
T K Frey, L D Marr. Gene, 1988, 62(1): 85-99
The sequence of the 3' 4508 nucleotides (nt) of the genomic RNA of the Therien strain of rubella virus (RV) was determined for cDNA clones. The sequence contains a 3189-nt open reading frame (ORF) which codes for the structural proteins C, E2 and E1. C is predicted to have a length of 300 amino acids (aa). The N-terminal half of the C protein is highly basic and hydrophilic in nature, and is putatively the region of the protein which interacts with the virion RNA. At the C terminus of the C protein is a stretch of 20 hydrophobic aa which also serves as the signal sequence for E2, indicating that the cleavage of C from the polyprotein precursor may be catalyzed by signalase in the lumen of the endoplasmic reticulum. E2 is 282 aa in length and contains four potential N-linked glycosylation sites and a putative transmembrane domain near its C terminus. The sequence of E1 has been previously described [Frey et al., Virology 154 (1986) 228-232]. No homology could be detected between the amino acid sequence of the RV structural proteins and the amino acid sequence of the alphavirus structural proteins. From the position of a region of 30 nt in the RV genomic sequence which exhibited significant homology with the sequence in the alphavirus genome at which subgenomic RNA synthesis is initiated, the RV subgenomic RNA is predicted to be 3346 nt in length and the nontranslated region from the 5' end of the subgenomic RNA to the structural protein ORF is predicted to be 98 nt. In a different translation frame beginning at the 5' end of the RV nt sequence reported here is a 1407 nt ORF which is the C terminal region of the nonstructural protein ORF. This ORF overlaps the structural protein ORF by 149 nt. 
A low level of homology could be detected between the predicted amino acid sequence of the C-terminus of the RV nonstructural protein ORF and the replicase proteins of several positive RNA viruses of animals and plants, including nsp4 of the alphaviruses, the protein encoded by the C-terminal region of the alphavirus nonstructural ORF. However, the overall homology between RV and the alphaviruses in this region of the genome was only 18%, indicating that these two genera of the Togavirus family are only distantly related. Intriguingly, there is a 2844-nt ORF present in the negative polarity orientation of the RV sequence which could encode a 928-aa polyprotein.

2.
Model-building studies of Inovirus: genetic variations on a geometric theme (cited by: 1; self-citations: 0; citations by others: 1)
Inovirus (filamentous bacteriophage) is a simple system for studying the rules by which protein primary structure (amino acid sequence) controls secondary and higher order structure, and thereby function. The virus occurs naturally as a number of different strains with similar secondary and higher order structure, but the protein subunit that assembles to form the virion coat has quite different primary structures in different virus strains. Despite these differences in primary structure, the subunits of all strains have much the same size, about 50 residues, which are distributed by type in much the same way into three domains of primary structure: a collection of acidic residues in the N-terminal region, a hydrophobic domain of about 19 residues near the middle, and a collection of basic residues near the C-terminus. Each subunit can be closely approximated by an alpha-helix with its long axis roughly parallel to the fibre axis, sloping from large to small radius in the virion and interleaving between subunits in the next turn or level. The acidic residues near the N-terminus of the subunit face outwards on the virion surface, and explain the low isoelectric point of the virion; the basic residues near the C-terminus face inwards, where they neutralize the charge on the DNA at the core of the virion; and the hydrophobic central domain is involved in interactions which bind neighbouring subunits. Detailed X-ray fibre diffraction analysis of one strain gives the subunit structure. Comparative model-building studies of different strains illustrate the common structural principles.

3.
The sequence of 5400 bases corresponding to the 5'-terminal half of the Murray Valley encephalitis virus genome has been determined. The genome contains a 5' non-coding region of about 97 nucleotides, followed by a single continuous open reading frame that encodes the structural proteins followed by the non-structural proteins. Amino acid sequence homology between the Murray Valley encephalitis and yellow fever (Rice et al., 1985) polyproteins is 42% over the region sequenced. The start points of the various Murray Valley encephalitis virus-coded proteins have been assigned on the basis of this homology and a consistent set of potential proteolytic cleavage sites identified, the sequences of which are similar in Murray Valley encephalitis and yellow fever. The deduced Murray Valley encephalitis gene order is 5'-C-prM(M)-E-NS1-ns2a-ns2b-NS3-3'. The genome organization of Murray Valley encephalitis and yellow fever appears to be identical and the sizes of the predicted virus-coded proteins similar between the two viruses. Both viruses encode a basic capsid protein followed by three glycoproteins; the glycoproteins appear to have the conventional topology of N terminus outside with a C-terminal membrane-spanning domain. There are conserved glycosylation sites in prM, the precursor to the M protein of the virion, and in NS1, a non-structural protein of uncertain function. The glycosylation sites in E, the major envelope protein of the virion, are not conserved as to position. We predict the existence, in flavivirus-infected cells, of two small, hydrophobic peptides, ns2a and ns2b, which show only limited amino acid sequence homology. Finally, about half of the amino acid sequence of NS3 has been obtained; NS3 is a hydrophilic non-structural protein that shows 55% amino acid sequence similarity between Murray Valley encephalitis and yellow fever over the region sequenced and is probably involved in RNA replication.

4.
The assembly intermediates of the Salmonella bacteriophage P22 are well defined but the molecular interactions between the subunits that participate in its assembly are not. The first stable intermediate in the assembly of the P22 virion is the procapsid, a preformed protein shell into which the viral genome is packaged. The procapsid consists of an icosahedrally symmetric shell of 415 molecules of coat protein, a dodecameric ring of portal protein at one of the icosahedral vertices through which the DNA enters, and approximately 250 molecules of scaffolding protein in the interior. Scaffolding protein is required for assembly of the procapsid but is not present in the mature virion. In order to define regions of scaffolding protein that contribute to the different aspects of its function, truncation mutants of the scaffolding protein were expressed during infection with scaffolding deficient phage P22, and the products of assembly were analyzed. Scaffolding protein amino acids 1-20 are not essential, since a mutant missing them is able to fully complement scaffolding deficient phage. Mutants lacking 57 N-terminal amino acids support the assembly of DNA containing virion-like particles; however, these particles have at least three differences from wild-type virions: (i) a less than normal complement of the gene 16 protein, which is required for DNA injection from the virion, (ii) a fraction of the truncated scaffolding protein was retained within the virions, and (iii) the encapsidated DNA molecule is shorter than the wild-type genome. Procapsids assembled in the presence of a scaffolding protein mutant consisting of only the C-terminal 75 amino acids contained the portal protein, but procapsids assembled with the C-terminal 66 did not, suggesting portal recruitment function for the region about 75 amino acids from the C terminus. Finally, scaffolding protein amino acids 280 through 294 constitute its minimal coat protein binding site.

5.
6.
The E protein is a multifunctional membrane protein of SARS-CoV (cited by: 1; self-citations: 0; citations by others: 1)
The E (envelope) protein is the smallest structural protein in all coronaviruses and is the only viral structural protein in which no variation has been detected. We conducted genome sequencing and phylogenetic analyses of SARS-CoV. Based on genome sequencing, we predicted the E protein is a transmembrane (TM) protein characterized by a TM region with strong hydrophobicity and α-helix conformation. We identified a segment (NH2-L-Cys-A-Y-Cys-Cys-N-COOH) in the carboxyl-terminal region of the E protein that appears to form three disulfide bonds with another segment of corresponding cysteines in the carboxyl-terminus of the S (spike) protein. These bonds point to a possible structural association between the E and S proteins. Our phylogenetic analyses of the E protein sequences in all published coronaviruses place SARS-CoV in an independent group in Coronaviridae and suggest a non-human animal origin.

7.
The complete nucleotide sequences of the genomes of the type 2 (P712, Ch, 2ab) and type 3 (Leon 12a1b) poliovirus vaccine strains were determined. Comparison of the sequences with the previously established genome sequence of type 1 (LS-c, 2ab) poliovirus vaccine strain revealed that 71% of the nucleotides in the genome RNAs were common, that the 5' and 3' termini of the genomes were highly homologous, and that more than 80% of the nucleotide differences in the coding region occurred in the third letter position of in-phase codons, resulting in a low frequency of amino acid difference. These results strongly suggested that the serotypes of poliovirus derived from a common prototype. A comparison of the amino acid sequences predicted from the genome sequences showed highest variation in the capsid protein region, whereas non-structural proteins are highly conserved. Initiation of polyprotein synthesis occurs in all three strains more than 740 nucleotides downstream from the 5' end. An analysis of the non-coding region suggests that small peptides that could potentially originate from this region are conserved. The amino acid sequences immediately surrounding the cleavage signals, however, show a higher than average degree of variation. The analysis of the amino acid sequences of the capsid protein VP1 of all serotypes has led to the prediction of potential antigenic sites on the virion involved in neutralization.

8.
The Vpr gene product of human immunodeficiency virus type 1 is a virion-associated protein that is important for efficient viral replication in nondividing cells such as macrophages. At the cellular level, Vpr is primarily localized in the nucleus when expressed in the absence of other viral proteins. Incorporation of Vpr into viral particles requires a determinant within the p6 domain of the Gag precursor polyprotein Pr55gag. In the present study, we have used site-directed mutagenesis to identify a domain(s) of Vpr involved in virion incorporation and nuclear localization. Truncations of the carboxyl (C)-terminal domain, rich in basic residues, resulted in a less stable Vpr protein and in the impairment of both virion incorporation and nuclear localization. However, introduction of individual substitution mutations in this region did not impair Vpr nuclear localization and virion incorporation, suggesting that this region is necessary for the stability and/or optimal protein conformation relevant to these Vpr functions. In contrast, the substitution mutations within the amino (N)-terminal region of Vpr that is predicted to adopt an alpha-helical structure (extending from amino acids 16 to 34) impaired both virion incorporation and nuclear localization, suggesting that this structure may play a pivotal role in modulating both of these biological properties. These results are in agreement with a recent study showing that the introduction of proline residues in this predicted alpha-helical region abolished Vpr virion incorporation, presumably by disrupting this secondary structure (S. Mahalingam, S. A. Khan, R. Murali, M. A. Jabbar, C. E. Monken, R. G. Collman, and A. Srinivasan, Proc. Natl. Acad. Sci. USA 92:3794-3798, 1995). 
Interestingly, our results show that two Vpr mutants harboring single amino acid substitutions (L to F at position 23 [L23F] and A30F) on the hydrophobic face of the predicted helix coded for relatively stable proteins that retained their ability to translocate to the nucleus but exhibited dramatic reduction in Vpr incorporation, suggesting that this hydrophobic face might mediate protein-protein interactions required for Vpr virion incorporation but not nuclear localization. Furthermore, a single mutation (E25K) located on the hydrophilic face of this predicted alpha-helical structure affected not only virion incorporation but also nuclear localization of Vpr. The differential impairment of Vpr nuclear localization and virion incorporation by mutations in the predicted N-terminal alpha-helical region suggests that this region of Vpr plays a role in both of these biological functions of Vpr.

9.
The S2 gene nucleotide sequences of prototype strains of the three reovirus serotypes were determined to gain insight into the structure and function of the S2 translation product, virion core protein sigma 2. The S2 sequences of the type 1 Lang, type 2 Jones, and type 3 Dearing strains are 1,331 nucleotides in length and contain a single large open reading frame that could encode a protein of 418 amino acids, corresponding to sigma 2. The deduced sigma 2 amino acid sequences of these strains are very conserved, being identical at 94% of the sequence positions. Predictions of sigma 2 secondary structure and hydrophobicity suggest that the protein has a two-domain structure. A larger domain is suggested to be formed from the amino-terminal three-fourths of sigma 2 sequence, which is separated from a smaller carboxy-terminal domain by a turn-rich hinge region. The carboxy-terminal domain includes sequences that are more hydrophilic than those in the rest of the protein and contains sequences which are predicted to form an alpha-helix. A region of striking similarity was found between amino acids 354 and 374 of sigma 2 and amino acids 1008 and 1031 of the beta subunit of the Escherichia coli DNA-dependent RNA polymerase. We suggest that the regions with similar sequence in sigma 2 and the beta subunit form amphipathic alpha-helices which may play a related role in the function of each protein. We have also performed experiments to further characterize the double-stranded RNA-binding activity of sigma 2 and found that the capacity to bind double-stranded RNA is a property of the sigma 2 protein of prototype strains and of the S2 mutant tsC447.

10.
We report the DNA sequence of the valS gene from Bacillus stearothermophilus and the predicted amino acid sequence of the valyl-tRNA synthetase encoded by the gene. The predicted primary structure is for a protein of 880 amino acids with a molecular mass of 102,036. The molecular mass and amino acid composition of the expressed enzyme are in close agreement with those values deduced from the DNA sequence. Comparison of the predicted protein sequence with known protein sequences revealed a considerable homology with the isoleucyl-tRNA synthetase of Escherichia coli. The two enzymes are identical in some 20-25% of their amino acid residues, and the homology is distributed approximately evenly from N-terminus to C-terminus. There are several regions which are highly conserved between the valyl- and isoleucyl-tRNA synthetases. In one of these regions, 15 of 20 amino acids are identical, and in another, 10 of 14 are identical. The valyl-tRNA synthetase also contains a region HLGH (His-Leu-Gly-His) near its N-terminus equivalent to the consensus HIGH (His-Ile-Gly-His) sequence known to participate in the binding of ATP in the tyrosyl-tRNA synthetase. This is the first example of extensive homology found between two different aminoacyl-tRNA synthetases.

11.
D-dopachrome tautomerase (D-DT) shares amino acid sequence similarity, structural architecture and biological activity with the cytokine MIF. Recent studies show that the two protein homologs also bind to the same cell surface receptor, CD74, to activate the ERK1/2 pathway that ultimately leads to pro-inflammatory and pro-survival gene expression. We recently showed that RTL1000 and DRa1-MOG-35-55, two biological drugs with potent anti-inflammatory properties that treat experimental autoimmune encephalomyelitis (EAE) in mice, bind to the cell surface receptor CD74 with high affinity and compete with MIF for binding to the same regions of CD74. Computational modeling of MIF and RTL1000 binding interactions with CD74 predicted the presence of three CD74 binding regions for each MIF homotrimer. Through a similar approach we have now expanded our work to study the D-DT (MIF-2) interaction with CD74 that is mainly defined by three elements scattered throughout the disordered regions of the interacting molecules. The model predicted: (a) a hydrophobic cradle between CD74 and D-DT consisting of N-terminal tyrosine residues of three CD74 monomers arranged in a planar alignment interacts with aromatic amino acid residues located in the disordered D-DT C-terminus; (b) a triad consisting of the E103 residue on one D-DT monomer in close contact with R179 and S181 on one chain of the CD74 trimer forms an intermolecular salt bridge; and (c) amino acid residues on the C-terminus random coil of CD74 chain C form a long interacting area of ∼500 Å² with a disordered region of D-DT chain B. These three binding elements were also present in MIF/CD74 binding interactions, with involvement of identical or highly similar amino acid residues in each MIF homotrimer that partner with the exact same residues in CD74. Topologically, however, the location of the three CD74 binding regions of the D-DT homotrimer differs substantially from that of the three MIF binding regions.
This key difference in orientation appears to derive from a sequence insertion in D-DT that topologically limits binding to only one CD74 molecule per D-DT homotrimer, in contrast to predicted binding of up to three CD74 molecules per MIF homotrimer. These results have implications for the manner in which D-DT and MIF compete with each other for binding to the CD74 receptor and for the relative potency of DRa1-MOG-35-55 and RTL1000 for competitive inhibition of D-DT and MIF binding and activation through CD74.

12.
Wzz is a membrane protein that determines the chain length distribution of the O-antigen lipopolysaccharide by an unknown mechanism. Wzz proteins consist of two transmembrane helices separated by a large periplasmic loop. The periplasmic loop of Escherichia coli K-12 Wzz (244 amino acids from K65 to A308) was purified and found to be a monomer with an extended conformation, as determined by gel filtration chromatography and analytical ultracentrifugation. Circular dichroism showed that the loop has a 60% helical content. The Wzz periplasmic loop also contains three regions with predicted coiled coils. To probe the function of the predicted coiled coils, we constructed amino acid replacement mutants of the E. coli K-12 Wzz protein, which were designed so that the coiled coils could be separated without compromising the helicity of the individual molecules. Mutations in one of the regions, spanning amino acids 108 to 130 (region I), were associated with a partial defect in O-antigen chain length distribution, while mutants with mutations in the region spanning amino acids 209 to 223 (region III) did not have an apparent functional defect. In contrast, mutations in the region spanning amino acids 153 to 173 (region II) eliminated the Wzz function. This phenotype was associated with protein instability, most likely due to conformational changes caused by the amino acid replacements, which was confirmed by limited trypsin proteolysis. Additional mutagenesis based on a three-dimensional model of region I demonstrated that the amino acids implicated in function are all located at the same face of a predicted α-helix, suggesting that a coiled coil actually does not exist in this region. Together, our results suggest that the regions predicted to be coiled coils are important for Wzz function because they maintain the native conformation of the protein, although the existence of coiled coils could not be demonstrated experimentally.

13.
Gloeobacter violaceus PCC 7421 is a unique cyanobacterium that has no thylakoids and whose genome has been sequenced [Y. Nakamura, T. Kaneko, S. Sato, M. Mimuro, H. Miyashita, T. Tsuchiya, S. Sasamoto, A. Watanabe, K. Kawashima, Y. Kishida, C. Kiyokawa, M. Kohara, M. Matsumoto, A. Matsuno, N. Nakazaki, S. Shimpo, C. Takeuchi, M. Yamada, S. Tabata, Complete Genome Structure of Gloeobacter violaceus PCC 7421, a cyanobacterium that lacks thylakoids. DNA Research 10 (2003) 137-145]. Phycobilisomes of G. violaceus were isolated and analyzed by SDS-PAGE followed by N-terminal sequencing. Three rod-linker subunits (CpeC, CpeD and CpeE) were identified as predicted from the genome sequence. The cpcC1 and cpcC2 genes at ordered locus names (OLN) glr0950 and gll3219, encoding phycocyanin-associated linker proteins from G. violaceus, are 56 and 55 amino acids longer at the N-terminus than the open reading frame proposed in the genome. The two amino acid extensions showed a 66% identity to one another. Also, the N-terminal extensions of these sequences were similar to domains in both the rod-capping-linker protein CpcD2 and to the C-terminus domain of the phycoerythrin-associated linker protein CpeC. These domains are not only unusual in their N-terminal location, but are unusual in that they are more closely related in sequence similarity to the C-terminus domain of the phycoerythrin-associated linker, CpeC of G. violaceus, than to the C-terminus domain of phycocyanin-associated linker CpcC in other cyanobacteria. These linker proteins with unique special domains are indicators of the unusual structure of the phycobilisomes of G. violaceus.

14.
Various adenovirus E1a proteins, including 13S protein, 12S protein and three other derivatives of 13S protein with deletions were expressed in Saccharomyces cerevisiae. Both the C-terminal 67 residues and the 13S unique domain are required for the nuclear targeting in yeast. The N-terminus containing multiple functional domains appears to be involved in the G1 arrest of diploid yeast and two other regions, the region containing amino acid residues between 122 and 139, and the 67 residues of the C-terminus are required for the lethal effect on haploid yeast. The latter effect, however, is dependent on strains. Thus, the yeast system may be utilized for functional dissection of E1a protein by further analyzing metabolic consequences.

15.
Two acidic domains of the Potato leafroll virus (PLRV) coat protein, separated by 55 amino acids and predicted to be adjacent surface features on the virion, were the focus of a mutational analysis. Eleven site-directed mutants were generated from a cloned infectious cDNA of PLRV and delivered to plants by Agrobacterium-mediated mechanical inoculation. Alanine substitutions of any of the three amino acids of the sequence EWH (amino acids 170 to 172) or of D177 disrupted the ability of the coat protein to assemble stable particles and the ability of the viral RNA to move systemically in four host plant species. Alanine substitution of E109, D173, or E176 reduced the accumulation of virus in agrobacterium-infiltrated tissues, the efficiency of systemic infection, and the efficiency of aphid transmission relative to wild-type virus, but the mutations did not affect virion stability. A structural model of the PLRV capsid predicted that the amino acids critical for virion assembly were located within a depression at the center of a coat protein trimer. The other amino acids that affected plant infection and/or aphid transmission were predicted to be located around the perimeter of the depression. PLRV virions play key roles in phloem-limited virus movement in plant hosts as well as in transport and persistence in the aphid vectors. These results identified amino acid residues in a surface-oriented loop of the coat protein that are critical for virus assembly and stability, systemic infection of plants, and movement of virus through aphid vectors.

16.
The complete amino acid sequence of a structural protein isolated from pharate cuticle of the locust Locusta migratoria was determined. The protein has an unusual amino acid composition: 42% of the residues are alanine and only 14 of the 20 common amino acid residues are present. The primary structure consists of regions enriched in particular amino acid residues. The N-terminal region and a region close to the C-terminus are enriched in glycine. The rest of the protein is dominated by alanine, except for two short regions enriched in hydrophilic residues. Almost all the proline residues are situated in the alanine-rich regions in a conserved sequence 'A-A-P-A/V'. An internal duplication has taken place covering most of the protein except for the glycine-rich regions. Owing to the unusual features of the protein a combination of automated Edman degradations and plasma-desorption m.s. was used to determine the complete sequence. The protein does not show sequence homology to other proteins, but proteins divided into regions enriched in the same kind of amino acid residues have been isolated from other insect structures.

17.
18.
Tobamoviruses, mostly isolated from solanaceous plants, may represent ancient virus lineages that have codiverged with their hosts. Recently completed nucleotide sequences of six nonsolanaceous tobamoviruses allowed assessment of the codivergence hypothesis and support a third subgroup within tobamoviruses. The genomic sequences of 12 tobamoviruses and the partial sequences of 11 others have been analyzed. Comparisons of the predicted protein sequences revealed three clusters of tobamoviruses, corresponding to those infecting solanaceous species (subgroup 1), those infecting cucurbits and legumes (subgroup 2), and those infecting crucifers. The orchid-infecting odontoglossum ringspot tobamovirus was associated with subgroup 1 genomes by its coat and movement protein sequences, but with the crucifer-pathogenic tobamoviruses by the remainder of its genome, suggesting that it is the progeny of a recombinant. For four of five genomic regions, subgroup 1 and 3 genomes were equidistant from a subgroup 2 genome chosen for comparison, suggesting uniform rates of evolution. A phylogenetic tree of plant families based on the tobamoviruses they harbor was congruent with that based on rubisco sequences but had a different root, suggesting that codivergence was tempered by rare events of viruses of one family colonizing another family. The proposed subgroup 3 viruses probably have an origin of virion assembly in the movement protein gene, a large (25-codon) overlap of movement and coat protein open reading frames, and a comparably shorter genome. Codon-position-dependent base compositions and codon prevalences suggested that the coat protein frame of the overlap region was ancestral. Bootstrapped parsimony analysis of the nucleotides in the overlap region and of the sequences translated from the -1 frame (the subgroup 3 movement protein frame) of this region produced trees inconsistent with those deduced from other regions.
The results are consistent with a model in which a no-overlap or short-overlap organization was ancestral. Despite encoding of subgroup 2 and 3 movement protein C-termini by nonhomologous nucleotides, weak similarities between their amino acid sequences suggested convergent sequence evolution.

19.
20.
The structure analysis and antigenicity study of the N protein of SARS-CoV (cited by: 2; self-citations: 0; citations by others: 2)
The Coronaviridae family is characterized by a nucleocapsid that is composed of the genome RNA molecule in combination with the nucleoprotein (N protein) within a virion. The most striking physicochemical feature of the N protein of SARS-CoV is that it is a typical basic protein with a high predicted pI and high hydrophilicity, which is consistent with its function of binding to the ribophosphate backbone of the RNA molecule. The predicted high extent of phosphorylation of the N protein on multiple candidate phosphorylation sites demonstrates that it would be related to important functions, such as RNA-binding and localization to the nucleolus of host cells. Subsequent study shows that there is an SR-rich region in the N protein and this region might be involved in the protein-protein interaction. The abundant antigenic sites predicted in the N protein, as well as experimental evidence with synthesized polypeptides, indicate that the N protein is one of the major antigens of the SARS-CoV. Compared with o
