Similar articles
 20 similar articles found (search time: 15 ms)
1.
Complete sequence determination of gene 18 encoding the tail sheath protein was carried out mainly by the Maxam-Gilbert method. Approximately 40 peptides contained in a tryptic digest and a lysyl endopeptidase digest of gp 18 were isolated by reversed-phase high-performance liquid chromatography. All the peptides were identified along the nucleotide sequence of gene 18 based on the amino acid compositions. These peptides cover 88% of the total primary structure. Furthermore, the amino acid sequences of 9 of the 40 peptides were determined by a gas-phase protein sequencer; one of them turned out to be the N-terminal one. The C-terminal peptide in the tryptic digest was isolated from the unadsorbed fraction of affinity chromatography on immobilized anhydrotrypsin and the amino acid sequence was also determined. Thus, the complete primary structure of gp 18 was determined; it has 658 amino acid residues and a molecular weight of 71,160. This article was presented during the proceedings of the International Conference on Macromolecular Structure and Function, held at the National Defence Medical College, Tokorozawa, Japan, December 1985.

2.
Nucleotide sequence of the gp70 gene of murine retrovirus MCF 247.   Total citations: 23 (self-citations: 19, citations by others: 4)
We determined the nucleotide sequence and predicted the amino acid sequence of the gp70 gene of MCF 247, a recombinant murine retrovirus isolated from an AKR mouse. Information specifying the first 286 amino acids of the protein was probably derived from the presumptive nonecotropic parent of MCF 247, whereas the C-terminal 154 amino acids were probably derived from the ecotropic parent Akv. The nonecotropic sequences at the amino terminus of MCF 247 show only 38% homology, at the amino acid level, to those of Akv. In contrast, these sequences are strikingly similar (99% homologous) to those reported for another MCF virus, Moloney MCF, which was isolated from a BALB/c mouse. Moloney MCF also has ecotropic-derived sequences encoding the C-terminal portion of its gp70 protein; however, the recombination event that introduced these sequences occurs 213 nucleotides further towards the C terminus of gp70 than it does in MCF 247.

3.
Amino acid sequence of chitinase from Streptomyces erythraeus   Total citations: 2 (self-citations: 0, citations by others: 2)
The amino acid sequence of chitinase from Streptomyces erythraeus was determined by the conventional method. The amino acid sequences of tryptic peptides of the reduced and S-carboxymethylated protein were determined. The tryptic peptides were aligned by overlapping the amino acid sequences of chymotryptic peptides, lysyl endopeptidase peptides and cyanogen bromide fragments. S. erythraeus chitinase consists of 290 amino acid residues with a molecular weight of 30,400 and has two disulfide bridges at Cys(45)-Cys(89) and Cys(265)-Cys(272). The enzyme has no significant homology with other chitinases, lysozymes, and other proteins.

4.
We describe two approaches to cloning and over-expressing gene 42 of bacteriophage T4, which encodes the early enzyme deoxycytidylate hydroxymethylase. In Bochum a library of sonicated fragments of wild-type phage DNA cloned into M13mp18 was screened with clones known to contain parts of gene 42. Two overlapping fragments, each of which contained one end of the gene, were cleaved at a HincII site and joined, to give a fragment containing the entire gene. In Corvallis a 1.8-kb fragment of cytosine-substituted DNA, believed to contain the entire gene, was cloned into pUC18 and shown to express the enzyme at low level. The cloned fragment bore an amber mutation in gene 42. From the DNA sequence of gene 42, the cloned gene was converted to the wild-type allele by site-directed mutagenesis. Both gene-42-containing fragments were cloned into the pT7 expression system and found to be substantially overexpressed. dCMP hydroxymethylase purified from one of the over-expressing strains had a turnover number similar to that of the enzyme isolated earlier from infected cells. In addition, the N-terminal 20 amino acid residues matched precisely the sequence predicted from the gene sequence. The amino acid sequence of gp42 bears considerable homology with that of thymidylate synthase of either host or T4 origin. The gene 42 nucleotide sequences of bacteriophages T2 and T6 were determined and found to code for amino acid sequences nearly identical to that of T4 gp42.

5.
The nucleotide sequence of the yeast MEL1 gene.   Total citations: 13 (self-citations: 1, citations by others: 12)
The complete nucleotide sequence of the MEL1 gene of the yeast, Saccharomyces cerevisiae, encoding alpha-galactosidase was determined. The nucleotide sequence contains an open reading frame of 1413 bp encoding a protein of 471 amino acids. Comparison with the known N-terminal amino acid sequence of the mature secreted protein indicated that alpha-galactosidase is synthesized as a precursor with an N-terminal signal sequence of 18 amino acids. The general features of this signal peptide resemble those of other yeast signal peptides. Molecular weight of the mature alpha-galactosidase polypeptide deduced from the nucleotide sequence is 50.049 kd. The 5' regulatory region has sequences in common with other yeast genes regulated by the GAL4-protein.

6.
The amino acid sequence of ribosomal protein S18 from Bacillus stearothermophilus has been completely determined by automated sequence analysis of the intact protein as well as of peptides derived from digestion with Staphylococcus aureus protease at pH 4.0 and cleavage with cyanogen bromide. The carboxy-terminal region was verified by both amino acid analyses of chymotryptic peptides and by mass spectrometry from the terminal region. The protein contains 77 amino acid residues and has an Mr of 8838. Comparison of this sequence with the sequences of the S18 proteins from tobacco and liverwort chloroplasts and E. coli shows a relatively high similarity, ranging from 42 to 55% identical residues with the B. stearothermophilus S18 protein. The regions of homology common to all four proteins consist of several positively charged sections spanning the entire length of the protein.

7.
The envelope proteins of mouse mammary tumor virus (MMTV) are synthesized from a subgenomic 24S mRNA as a 75,000-dalton glycosylated precursor polyprotein which is eventually processed to the mature glycoproteins gp52 and gp36. In vivo synthesis of this env precursor in the presence of the core glycosylation inhibitor tunicamycin yielded a precursor of approximately 61,000 daltons (P61env). However, a 67,000-dalton protein (P67env) was obtained from cell-free translation with the MMTV 24S mRNA as the template. To determine whether the portion of the protein cleaved from P67env to give P61env was removed from the NH2-terminal end of P67env and as such would represent a leader sequence, the NH2-terminal amino acid sequence of the terminal peptide gp52 was determined. Glutamic acid, and not methionine, was found to be the amino-terminal residue of gp52, indicating that the cleaved portion was derived from the NH2-terminal end of P67env. The NH2-terminal amino acid sequences of gp52's from endogenous and exogenous C3H MMTVs were determined through 46 residues and found to be identical. However, amino acid composition and type-specific gp52 radioimmunoassays from MMTVs grown in heterologous cells indicated primary structure differences between gp52's of the two viruses. The nucleic acid sequence of cloned MMTV DNA fragments (J. Majors and H. E. Varmus, personal communication) in conjunction with the NH2-terminal sequence of gp52 allowed localization of the env gene in the MMTV genome. Nucleotides coding for the NH2 terminus of gp52 begin approximately 0.8 kilobase to the 3' side of the single EcoRI cleavage site. Localization of the env gene at that point agrees with the proposed gene order -gag-pol-env- and also allows sufficient coding potential for the glycoprotein precursor without extending into the long terminal repeat.

8.
The phagocyte NADPH oxidase is a multicomponent membrane-bound electron transport chain that catalyzes the reduction of O2 to superoxide. Cytochrome b558, the terminal electron donor to O2, is an integral membrane heterodimer containing 91- and 22-kDa subunits (gp91-phox and p22-phox, respectively). Synthetic peptides, whose amino acid sequences correspond to a gp91-phox carboxyl-terminal domain, inhibit superoxide production by blocking assembly of the oxidase from membrane and cytosol components. In this study, we examined the amino acid sequence requirements of a series of synthetic truncated gp91-phox peptides for inhibition of human neutrophil NADPH oxidase activation. RGVHFIF, corresponding to gp91-phox residues 559-565, was the minimum sequence capable of inhibiting superoxide generation. Contributions of individual amino acids to overall RGVHFIF inhibitory activity were determined by comparing the abilities of alanine-substituted RGVHFIF peptides to inhibit superoxide production. Substitution of alanine for arginine, valine, isoleucine, or either of the phenylalanines (but not glycine or histidine) within RGVHFIF resulted in loss of inhibitory activity. Synthetic gp91-phox carboxyl-terminal peptides are likely to be competitive inhibitors of the corresponding carboxyl-terminal domain of native gp91-phox by virtue of amino acid identity. We conclude that properties of arginine, valine, isoleucine, and phenylalanine side chains within an RGVHFIF-containing domain of gp91-phox contribute significantly to cytochrome b558-mediated activation of the oxidase.

9.
The aspA gene of Escherichia coli W, which encodes aspartase, was cloned into the plasmid vector pBR322. The nucleotide sequences of aspA and its flanking regions were determined. The aspA gene encodes a protein with a molecular weight of 52,224 consisting of 477 amino acid residues. The amino acid sequence of the protein predicted from the nucleotide sequence was consistent with those of the NH2- and COOH-terminal regions and also with the amino acid composition of the purified aspartase determined previously. Potential promoter and terminator sequences for aspA were also found in the determined sequence.

10.
C. Grabau, J. E. Cronan Jr. Nucleic Acids Research, 1986, 14(13): 5449-5460
The entire nucleotide sequence of the poxB (pyruvate oxidase) gene of Escherichia coli K-12 has been determined by the dideoxynucleotide (Sanger) sequencing of fragments of the gene cloned into a phage M13 vector. The gene is 1716 nucleotides in length and has an open reading frame which encodes a protein of Mr 62,018. This open reading frame was shown to encode pyruvate oxidase by alignment of the amino acid sequences deduced for the amino and carboxy termini and several internal segments of the mature protein with sequences obtained by amino acid sequence analysis. The deduced amino acid sequence of the oxidase was not unusually rich in hydrophobic sequences despite the peripheral membrane location and lipid binding properties of the protein. The codon usage of the oxidase gene was typical of a moderately expressed protein. The deduced amino acid sequence shares homology with the large subunits of the acetohydroxy acid synthase isozymes I, II, and III, encoded by the ilvB, ilvG, and ilvI genes of E. coli.

11.
The mature envelope glycoproteins of mouse mammary tumor virus (gp52 and gp36) were isolated by reversed-phase high-pressure liquid chromatography. The N-terminal amino acid sequence of gp36 was determined for 28 residues. The C-terminal amino acid sequences of gp52 and gp36 were determined by carboxypeptidase digestion. The N-terminal amino acid sequence of gp52 has been reported previously (L. O. Arthur et al., J. Virol. 41:414-422, 1982). These data were aligned with the predicted amino acid sequence of the env gene product obtained by translation of the DNA sequence (S. M. S. Redmond and C. Dickson, Eur. Mol. Biol. Org. J. 2:125-131, 1983). The amino acid sequences of the mature viral proteins were in agreement with the predicted amino acid sequence of the env gene product over the regions of alignment. This alignment showed the sites of proteolytic cleavages of the env gene product leading to the mature viral envelope glycoproteins. The N-terminal amino acid sequence of gp52 starts at residue 99 of the predicted structure, indicating proteolytic cleavage of a signal peptide. A dipeptide (Lys-Arg) is excised between the C-terminus of gp52 and the N-terminus of gp36. The C-terminal amino acid sequence of gp36 is identical to the sequence predicted by the codons immediately preceding the termination codon for the env gene product. The data show that there is no proteolytic processing at the C-terminus of the murine mammary tumor virus env gene product and that the env gene coding region extends into the long terminal repeat.

12.
The amino-terminal amino acid sequences of gp85 and gp37, the envelope glycoproteins of Rous sarcoma virus (RSV), were determined. Alignment of these sequences with the amino acid sequence predicted from the complete nucleotide sequence of the Prague strain of RSV, subgroup C (PR-C), has allowed us to delineate the env gene-coding region of this virus. The coding sequences for gp85 and gp37 have been placed in an open reading frame that extends from nucleotide 5045 to nucleotide 6862 and predict sizes of 341 amino acids (36,962 molecular weight) for gp85 and 198 amino acids (21,566 molecular weight) for gp37. Carbohydrate makes a significant contribution to the observed molecular weights of these polypeptides--the amino acid sequence contains 14 potential glycosylation sites (Asn-X-Ser/Thr) in gp85 and two in gp37. Experiments aimed at estimating the number of carbohydrate side chains yielded results consistent with most or all of these sites being occupied. Although an initiation codon is located early (codon 4) in the open reading frame, it is likely that splicing yields an mRNA on which translation initiates at the same AUG as that of the gag gene to produce a nascent polypeptide in which gp85 is preceded by a 62-amino-acid-long leader peptide. This leader contains the hydrophobic sequence (signal sequence) necessary for translocation across the endoplasmic reticulum and is completely removed from the env gene product during translation. The polyprotein precursor, Pr95env, is cleaved to gp85 and gp37 at the carboxyl side of the basic sequence:-Arg-Arg-Lys-Arg-. gp85 is attached through a disulphide linkage to gp37, and although the positions of the cysteines involved in this linkage are not known, the presence of a 27-amino-acid-long hydrophobic region at the carboxy-terminus of gp37 is consistent with its role as a membrane anchor for the viral glycoprotein complex. 
The location of host range variable regions with respect to the possible tertiary structure of the complex is discussed.

13.
Recombinant clones expressing antigenic determinants of the 18-kDa protein antigen from Mycobacterium leprae recognized by the L5 monoclonal antibody were isolated from a lambda gt11 expression library and their nucleotide sequences determined. All clones expressed the M. leprae-specific determinant as part of a large fusion protein with Escherichia coli beta-galactosidase. The deduced amino acid sequence of the coding region indicated that all the lambda gt11 recombinant clones contained an incomplete M. leprae gene sequence representing the carboxy-terminal two-thirds (111 amino acids) of the 18-kDa gene and coding for a peptide of m.w. 12,432. Subsequent isolation and sequencing of a 3.2kb BamHI-PstI DNA fragment from a genomic M. leprae cosmid library permitted the deduction of the complete 148 amino acid sequence with a predicted m.w. of 16,607. A second open reading frame 560 bases downstream from the 18-kDa coding sequence was found to code for a putative protein of 137 amino acids (m.w. = 15,196). Neither this nor the 18-kDa amino acid sequence displayed any significant homologies with any proteins in the GENBANK, EMBL, or NBRF data bases. Crude lysates from recombinant lambda gt11 clones expressing part of the 18-kDa protein have been reported to stimulate the proliferation of some M. leprae-specific helper T cell clones. Thus, it is significant that the complete 18-kDa sequence contains five short peptides predicted to be possible helper T cell antigenic epitopes based on their propensity to form amphipathic helices. Although three of these occur within the 111 amino acid carboxy-terminal peptide expressed by lambda gt11 clones, the most highly amphipathic peptide is found in the amino-terminal region not present in the lambda gt11 recombinants.

14.
The nucleotide sequence of the cellulase gene celC, encoding endoglucanase C of Clostridium thermocellum, has been determined. The coding region of 1032 bp was identified by comparison with the N-terminal amino acid (aa) sequence of endoglucanase C purified from Escherichia coli. The ATG start codon is preceded by an AGGAGG sequence typical of ribosome-binding sites in Gram-positive bacteria. The derived amino acid sequence corresponds to a protein of Mr 40,439. Amino acid analysis and apparent Mr of endoglucanase C are consistent with the amino acid sequence as derived from the DNA sequencing data. A proposed N-terminal 21-aa residue leader (signal) sequence differs from other prokaryotic signal peptides and is non-functional in E. coli. Most of the protein bears no resemblance to the endoglucanases A, B, and D of the same organism. However, a short region of homology between endoglucanases A and C was identified, which is similar to the established active sites of lysozymes and to related sequences of fungal cellulases.

15.
The gene for L-lactate dehydrogenase (LDH) (EC 1.1.1.27) of Thermus caldophilus GK24 was cloned in Escherichia coli using synthetic oligonucleotides as hybridization probes. The nucleotide sequence of the cloned DNA was determined. The primary structure of the LDH was deduced from the nucleotide sequence. The deduced amino acid sequence agreed with the NH2-terminal and COOH-terminal sequences previously reported and the determined amino acid sequences of the peptides obtained from trypsin-digested T. caldophilus LDH. The LDH comprised 310 amino acid residues and its molecular mass was determined to be 32,808. On alignment of the whole amino acid sequences, the T. caldophilus LDH showed about 40% identity with the Bacillus stearothermophilus, Lactobacillus casei and dogfish muscle LDHs. The T. caldophilus LDH gene was expressed with the E. coli lac promoter in E. coli, which resulted in the production of the thermophilic LDH. The gene for the T. caldophilus LDH showed more than 40% identity with those for the human and mouse muscle LDHs on alignment of the whole nucleotide sequences. The G + C content of the coding region for the T. caldophilus LDH was 74.1%, which was higher than that of the chromosomal DNA (67.2%). The G + C contents in the first, second and third positions of the codons used were 77.7%, 48.1% and 95.5% respectively. The high G + C content in the third base caused extremely non-random codon usage in the LDH gene. About half (48.7%) the codons in the LDH gene started with G, and hence there were relatively high contents of Val, Ala, Glu and Gly in the LDH. The contents of Pro, Arg, Ala and Gly, which have high G + C contents in their codons, were also high. Rare codons with U or A as the third base were sometimes used to avoid the TCGA sequence, the recognition site for the restriction endonuclease, TaqI. Two TCGA sequences were found only in the sequence of CTCGAG (XhoI site) in the sequenced region of the T. caldophilus DNA. 
There were three segments with similar sequences in the two 5' non-coding regions, probably the promoter and ribosome-binding regions, of the genes for the T. caldophilus LDH and the Thermus thermophilus 3-isopropylmalate dehydrogenase.

16.
Antisense amino acids are amino acids which can be translated from the corresponding anti-codons of a sense amino acid. Antisense peptides encoded by the noncoding DNA strand have a tendency to interact with each other. We have demonstrated that antisense peptide sequences are present intramolecularly, and these may contribute to the folding and maintenance of the tertiary structure of a protein. T20 is a synthetic peptide with an amino acid sequence in the gp41 of HIV-1 and has been demonstrated to be a potent inhibitor of HIV-1 infection. We searched for intramolecular peptide sequences which are antisense to portions of T20. A synthetic peptide (TA-1L) consisting of amino acids 84 to 97 of gp160, which contains an antisense peptide sequence (TA-1) to T20, was shown to inhibit HIV-1(IIIB) infection of MT-4 cells. Interaction of these antisense peptides could be involved in sustaining HIV-1 infectivity. The TA-1L site, which exists in the C1 domain of gp160, is highly homologous among strains of HIV-1, especially at TA-1 and in the amino acids flanking the C terminus. Although the TA-1 sites of 18 out of 30 HIV-1 strains were antisense to the T20 region, those of the remaining 12 strains, including HIV-1(MN), were not. However, TA-1L inhibited infection by HIV-1(MN), which has no antisense peptide in T20 corresponding to TA-1, although the inhibitory effect was weaker. TA-1L may thus also interfere with the gp160 interaction with CD4, which has an antisense sequence to TA-1.

17.
The DNA encoding the elastase of Pseudomonas aeruginosa IFO 3455 was cloned, and its complete nucleotide sequence was determined. When the cloned gene was ligated to pUC18, the Escherichia coli expression vector, bacteria carrying the gene exhibited high levels of both elastase activity and elastase antigens. The amino acid sequence, deduced from the nucleotide sequence, revealed that the mature elastase consisted of 301 amino acids with a relative molecular mass of 32,926 daltons. The amino acid composition predicted from the DNA sequence was quite similar to the chemically determined composition of purified elastase reported previously. We also observed nucleotide sequence encoding a signal peptide and "pro" sequence consisting of 197 amino acids upstream from the mature elastase protein gene. The amino acid sequence analysis revealed that both the N-terminal sequence of the purified elastase and the N-terminal side sequences of the C-terminal tryptic peptide as well as the internal lysyl peptide fragment were completely identical to the deduced amino acid sequences. The pattern of identity of amino acid sequences was quite evident in the regions that include structurally and functionally important residues of Bacillus subtilis thermolysin.

18.
Complete amino acid sequence of protein B   Total citations: 4 (self-citations: 0, citations by others: 4)
The complete amino acid sequence of protein B (= CAMP factor) of Streptococcus agalactiae has been determined. The sequence data were obtained mainly by manual sequencing of peptides derived from digestion with lysyl-peptidase, clostripain and Staphylococcus aureus protease and by solid phase sequencing of cyanogen bromide fragments. The protein contains 226 amino acids and has an Mr of 25,263. The sequence was compared with sequences of other Fc-binding proteins and partial sequence homology was found between protein B and the Fc-binding region of protein A.

19.
The complete amino acid sequence of ribonuclease U1 (RNase U1), a guanine-specific ribonuclease from a fungus, Ustilago sphaerogena, was determined by conventional protein sequencing, using peptide fragments obtained by several enzymatic cleavages of the performic acid-oxidized protein. The oxidized protein was first cleaved by trypsin and the resulting peptides were purified and their amino acid sequences were determined. These tryptic peptides were aligned with the aid of overlapping peptides isolated from a chymotryptic digest of the oxidized protein. The amino acid sequence thus deduced was further confirmed by isolation and analysis of peptides obtained by digestion of the oxidized protein with lysyl endopeptidase. The location of the disulfide bonds was deduced by isolation and analysis of cystine-containing peptides from a chymotryptic digest of heat-denatured RNase U1. These results showed that the protein is composed of a single polypeptide chain of 105 amino acid residues cross-linked by two disulfide bonds, having a molecular weight of 11,235, and that the NH2-terminus is blocked by a pyroglutamate residue. It has an overall homology with other guanine-specific or related ribonucleases, and shows 48% identity with RNase T1 and 38% identity with RNase U2.

20.
Amino acid sequence of rabbit apolipoprotein E   Total citations: 1 (self-citations: 0, citations by others: 1)
The complete amino acid sequence of rabbit apolipoprotein E (apoE) was determined by generating three sets of peptides using cyanogen bromide, endoproteinase AspN, and Staphylococcus aureus V8 protease to cleave the protein. Through twenty cycles of sequence analysis on the whole protein, glutamic acid was identified as the N-terminal residue of rabbit apoE; the C-terminus of the protein was identified as glutamine. Based on the sequence of 294 amino acid residues determined by protein structure analysis, the molecular weight of rabbit apoE was determined to be 33,684. The protein sequence differed from the cDNA inferred sequence in 19 positions, only one of which could be attributed to microheterogeneity. The corrected amino acid sequence of rabbit apoE shares 80% homology with the human apoE sequence, 4% greater homology than that inferred from the cDNA sequence. The great similarity in the amino acid sequences of human and rabbit apoE suggests that their physical and physiological properties may also be similar. This homology and the relative ease with which apoE is isolated from rabbit plasma make it possible to conduct some in vitro experiments with the rabbit apoprotein that would have direct relevance to human apoE, but would be difficult or impossible with the human counterpart because of the quantity of protein required.
