首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The coding region of the alpha-amylase inhibitor (HaimII) gene from the producing strain Streptomyces griseosporeus YM-25 was localized on an 800-base-pair DNA segment. The nucleotide sequence of a 1,191-base-pair region including the HaimII gene was determined by the dideoxy-chain termination method. The nucleotide sequence data predicted an open reading frame of 363 base pairs starting with an ATG initiation codon and ending with a TGA translational stop codon. The amino acid sequence deduced from the nucleotide sequence indicated that the presumptive pre-HaimII protein extends 37 amino acids to the amino terminus and 6 amino acids to the carboxyl terminus of the mature HaimII protein. The pre-HaimII protein is believed to be processed both during and after secretion. Two forms of the inhibitor, which have a higher molecular weight than that of the HaimII protein isolated from S. griseosporeus, were partially purified from the culture filtrate of Streptomyces lividans containing the cloned HaimII gene.  相似文献   

2.
The nucleotide (nt) sequence of a DNA segment containing the majority of a gene cloned from Bacillus thuringiensis DSIR517 encoding a 130 kDa insecticidal crystal protein has been determined. Sequence analysis reveals an open reading frame (ORF) of 3453 nt. The ATG initiation codon, which is preceded by a potential ribosome-binding site sequence, was confirmed by N-terminal amino acid sequencing. The ORF extends beyond the 3' terminus of the cloned fragment; however, the high degree of homology between the deduced amino acid sequence of this ORF and other Cry proteins suggests the clone lacks only five C-terminal amino acids. Making this assumption, the ORF of 3468 nt encodes a protein of 1156 amino acids with an estimated molecular mass of 129700 Da. Analysis of the deduced amino acid sequence reveals a number of features characteristic of Cry proteins. Alignment of the Cry 517 protein sequence with other Cry proteins suggests it is most closely related to the cryIA-E genes but sufficiently different to form a new cryI gene subclass.  相似文献   

3.
Double-stranded DNA derived from influenza B virus genome RNA segment 8, which codes for the NS1 and NS2 proteins, was constructed by hybridization of full-length cDNA copies of RNA segment 8 and of the NS1 mRNA. This DNA was cloned in plasmid pBR322 and sequenced. The NS1 mRNA (approximately 1,080 viral nucleotides) contains nonviral nucleotides at its 5' end and is capable of coding for a protein of 281 amino acids. Sequencing of the NS2 mRNA has shown that it contains an interrupted sequence of 655 nucleotides and is most likely synthesized by a splicing mechanism. The first approximately 75 virus-specific nucleotides at the 5' end of the NS2 mRNA are the same as are found at the 5' -end of the NS1 mRNA. This region contains the initiation codon for protein synthesis and coding information for 10 amino acids common to the two proteins. The approximately 350-nucleotide body region of the NS2 mRNA can be translated in the +1 reading frame, and the sequence indicates that the NS1 and NS2 protein-coding regions overlap by 52 amino acids translated from different reading frames. Thus, between the influenza A and B viruses, the organization of the NS1 and NS2 mRNAs and the sizes of the NS2 mRNA and protein are conserved despite the larger size of the influenza B virus RNA segment, NS1 mRNA, and NS1 protein.  相似文献   

4.
A cDNA containing the coding region for the complete amino acid sequence of wound-induced proteinase Inhibitor I from tomato leaves was constructed in the plasmid pUC9 and characterized. The open reading frame codes for a protein of 111 amino acids. This deduced amino acid sequence revealed the presence of a 42-amino acid N-terminal sequence that is not found in the native protein. This sequence appears to contain a 23-amino acid segment typical of a signal sequence followed by a 19-amino acid sequence containing 9 charged amino acids. The 42-amino acid sequence is apparently lost during maturation to the native Inhibitor I and represents 38% of the translated protein. The Inhibitor I amino acid sequence contains 71% identity with potato tuber Inhibitor I sequence and 35% identity with an inhibitor from the leech.  相似文献   

5.
pOMD29 is a hybrid protein containing the NH2-terminal topogenic sequence of a bitopic, integral protein of the outer mitochondrial membrane in yeast, OMM70, fused to dihydrofolate reductase. The topogenic sequence consists of two structural domains: an NH2-terminal basic region (amino acids 1-10) and an apolar region which is the predicted transmembrane segment (amino acids 11-29). The transmembrane segment alone was capable of targeting and inserting the hybrid protein into the outer membrane of intact mitochondria from rat heart in vitro. The presence of amino acids 1-10 enhanced the rate of import, and this increased rate depended, in part, on the basic amino acids located at positions 2, 7, and 9. Deletion of a large portion of the transmembrane segment (amino acids 16-29) resulted in a protein that exhibited negligible import in vitro. Insertion of pOMD29 into the outer membrane was not competed by import of excess precursor protein destined for the mitochondrial matrix, indicating that the two proteins may have different rate-limiting steps during import. We propose that the structural domains within amino acids 1-29 of pOMD29 cooperate to form a signal-anchor sequence, the characteristics of which suggest a model for proper sorting to the mitochondrial outer membrane.  相似文献   

6.
Rat apolipoprotein E mRNA. Cloning and sequencing of double-stranded cDNA   总被引:21,自引:0,他引:21  
A 900-base pair clone corresponding to rat liver apolipoprotein E (apo-E) mRNA, and containing a 3'-terminal poly(A) segment, was identified from a library of rat liver cDNA clones in the plasmid pBR322 by specific hybrid selection and translation of mRNA. A restriction endonuclease DNA fragment from this recombinant plasmid was used to clone the 5'-terminal region of the apo-E mRNA by primed synthesis of cDNA. A portion of the double-stranded cDNA corresponding to the 3'-terminal region of apo-E mRNA was subcloned into the bacteriophage M13mp7 and employed as a template for the synthesis of a radioactively labeled, cDNA hybridization probe. This cDNA probe was used in a RNA-blot hybridization assay that showed the length of the apo-E mRNA to be about 1200 nucleotides. The hybridization assay also demonstrated that apo-E mRNA is present in rat intestine, but at about a 100-fold lower level than that of the rat liver. The nucleotide sequence of rat liver apo-E mRNA was determined from the cloned, double-stranded cDNAs. The amino acid sequence of rat liver apo-E was inferred from the nucleotide sequence, which showed that the mRNA codes for a precursor protein of 311 amino acids. A comparison to the NH2-terminal amino acid sequence of rat plasma apo-E indicated that the first 18 amino acids of the primary translation product are not present in the mature protein and are probably removed during co-translational processing. The coding region was flanked by a 3'-untranslated region of 109 nucleotides, which contained a characteristic AAUAAA sequence that ended 13 nucleotides from a 3'-terminal poly(A) segment. At the 5'-terminal region of the mRNA, 23 nucleotides of an untranslated region were also determined. The inferred amino acid sequence of mature rat apo-E, which contains 293 amino acids, was compared to the amino acid sequence of human apo-E, which contains 299 amino acids. Using an alignment that permitted a maximum homology of amino acids, it was found that overall, 69% of the amino acid positions are identical in both proteins. The amino acid identities are clustered in two broad domains separated by a short region of nonhomology, an NH2-terminal domain of 173 residues where 80% are identical, and a COOH-terminal domain of 84 residues where 70% are identical. These two domains may be associated with specific functional roles in the protein.  相似文献   

7.
The nucleotide sequence of part of the tra region of R100 including traJ and traY was determined, and the products of several tra genes were identified. The nucleotide sequence of traJ, encoding a protein of 223 amino acids, showed poor homology with the corresponding segments of other plasmids related to R100, but the deduced amino acid sequences showed low but significant homology. The first four amino acids at the N-terminal region of the TraJ protein were not essential for positive regulation of expression of traY, the first gene of the traYZ operon. The nucleotide sequence of traY shows that this gene may use TTG as the initiation codon and that it encodes a protein of 75 amino acids. Analysis of the traY gene product, which was obtained as the fusion protein with beta-galactosidase, showed that the N-terminal region of the product has an amino acid sequence identical to that deduced from the assigned frame but lacks formylmethionine. traY of plasmid F, which encodes a larger protein than the TraY protein of R100, is thought to use ATG as an initiation codon. However, a TTG initiation codon was found in the preceding region of the previously assigned traY coding frame of F. Interestingly, when translation of traY of F was initiated from TTG, the amino acid sequence homologous to the TraY protein of R100 appeared in tandem in the TraY protein of F. This may suggest that traY of F has undergone duplication of a gene like the traY gene of R100.  相似文献   

8.
The nucleotide sequence of the xynZ gene, encoding the extracellular xylanase Z of Clostridium thermocellum, was determined. The putative xynZ gene was 2,511 base pairs long and encoded a polypeptide of 837 amino acids. A region of 60 amino acids containing a duplicated segment of 24 amino acids was found between residues 429 and 488 of xylanase Z. This region was strongly similar to the conserved domain found at the carboxy-terminal ends of C. thermocellum endoglucanases A, B, and D. Deletions removing up to 508 codons from the 5' end of the gene did not affect the activity of the encoded polypeptide, showing that the active site was located in the C-terminal half of the protein and that the conserved region was not involved in catalysis. Expression of xylanase activity in Escherichia coli was increased up to 220-fold by fusing fragments containing the 3' end of the gene with the start of lacZ present in pUC19. An internal translational initiation site which was efficiently recognized in E. coli was tentatively identified 470 codons downstream from the actual start codon.  相似文献   

9.
10.
Mutations in the ATP-binding cassette transporter A1 (ABCA1) transporter are associated with Tangier disease and a defect in cellular cholesterol efflux. The amino terminus of the ABCA1 transporter has two putative in-frame translation initiation sites, 60 amino acids apart. A cluster of hydrophobic amino acids form a potentially cleavable signal sequence in this 60-residue extension. We investigated the functional role of this extension and found that it was required for stable protein expression of transporter constructs containing any downstream transmembrane domains. The extension directed transporter translocation across the ER membrane with an orientation that resulted in glycosylation of amino acids immediately distal to the signal sequence. Neither the native signal sequence nor a green fluorescent protein tag, fused at the amino terminus, was cleaved from ABCA1. The green fluorescent protein fusion protein had efflux activity comparable with wild type ABCA1 and demonstrated a predominantly plasma membrane distribution in transfected cells. These data establish a requirement for the upstream 60 amino acids of ABCA1. This region contains an uncleaved signal anchor sequence that positions the amino terminus in a type II orientation leading to the extracellular presentation of an approximately 600-amino acid loop in which loss-of-function mutations cluster in Tangier disease patients.  相似文献   

11.
Nucleotide sequence of a cellulase gene of Bacillus subtilis   总被引:8,自引:0,他引:8  
The nucleotide sequence of an endolytic cellulase gene of Bacillus subtilis was determined and compared with the NH2-terminal amino acid sequence of the purified enzyme. The mature protein appeared to be extended by a signal sequence of 36 amino acids. The putative AUG initiation codon was preceded by a sigma 43-type promoter of B. subtilis and an AAGGAGG sequence, typical of procaryotic ribosomal binding sites. Partial homology of amino acid sequences was found between B. subtilis cellulase and an alkalophilic Bacillus cellulase.  相似文献   

12.
13.
A human umbilical vein endothelial cell cDNA library in lambda gt11 was screened for expression of thrombomodulin antigens with affinity-purified rabbit polyclonal anti-thrombomodulin immunoglobulin G (IgG) and mouse monoclonal anti-human thrombomodulin IgG. Among 7 million recombinant clones screened, 12 were recognized by both antibodies. Two of these, lambda HTm10 and lambda HTm12, were shown to encode thrombomodulin by comparison of the amino acid sequence deduced from the nucleotide sequence to the amino acid sequence determined directly from tryptic peptides of thrombomodulin. Thrombomodulin mRNA was estimated to be 3.7 kilobases in length by Northern blot analysis of endothelial cell and placental poly(A)+ RNA. Thrombomodulin mRNA was not detected in human brain, HepG2 hepatoma cells, or the monocytic U937 cell line. Additional cDNA clones were selected by hybridization with the 1.2-kilobase insert of lambda HTm10. One isolate, lambda HTm15, contained a 3693 base pair cDNA insert with an apparent 5'-noncoding region of 146 base pairs, an open reading frame of 1725 base pairs, a stop codon, a 3'-noncoding region of 1779 base pairs, and a poly(A) tail of 40 base pairs. The cDNA sequence encodes a 60.3-kDa protein of 575 amino acids. The predicted protein sequence includes a signal peptide of approximately 21 amino acids, an amino-terminal ligand-binding domain of approximately 223 amino acids, an epidermal growth factor (EGF) homology region of 236 amino acids, a serine/threonine-rich segment of 34 amino acids, a membrane-spanning domain of 23 amino acids, and a cytoplasmic tail of 38 amino acids. The EGF-homology region consists of six tandemly repeated EGF-like domains.(ABSTRACT TRUNCATED AT 250 WORDS)  相似文献   

14.
Abnormal beta-hexosaminidase beta chain cDNA clones were isolated from a library constructed from cultured fibroblasts of a patient with a juvenile form of Sandhoff disease (genetic beta-hexosaminidase A and B deficiency). Sequence analysis of a cDNA clone isolated from these fibroblasts contained an extra 24-base segment between exons 12 and 13. This segment was identified as the 3' terminus of intron 12. The remainder of the coding sequence was completely normal. The same 24-base insertion was found in four additional clones by sequencing. Restriction mapping analysis of seven other clones was consistent with the presence of the same 24-base intron 12 segment. This insertion is inframe and adds 8 amino acids between amino acids 491 and 492 of the primary sequence of the normal enzyme protein. It is located only 5 amino acids away from a possible glycosylation site. The finding is consistent with the slightly larger than normal size of the beta subunit precursor protein observed by immunoprecipitation. No normally spliced mRNA was detected. Gene amplification by the polymerase chain reaction and subsequent sequencing of genomic DNA indicated that the patient was a compound heterozygote. In one allele, there was a single nucleotide transition from normal G to A at 26 bases from the 3' terminus of intron 12. This mutation generates a consensus sequence for the 3' splice site for an intron, CAG/G, and thus explains the abnormal mRNAs that retain 24 bases of the 3' terminus of intron 12. The intron 12 and flanking exons 12 and 13 sequences were normal in the other allele, which is a priori also genetically abnormal. The other mutant allele therefore is likely to be of an mRNA-negative type.  相似文献   

15.
Inosine-5'-monophosphate dehydrogenase, a key enzyme in the regulation of guanine nucleotide biosynthesis, was purified to homogeneity; and a polyclonal antibody directed against the purified protein was used to isolate human and Chinese hamster IMP dehydrogenase cDNA clones. These clones were sequenced and found to contain an open reading frame of a protein containing 514 amino acids. A sequence of 35 amino acids obtained by analysis of the purified protein is identical to a segment of the protein sequence deduced from the IMP dehydrogenase cDNA. The molecular mass of the deduced protein is 56 kDa, which is the observed molecular mass of the purified protein and of the immunoprecipitated in vitro translation product. Comparison of the protein sequences deduced from the human and Chinese hamster cDNA clones indicates only eight amino acid differences, suggesting that IMP dehydrogenase is a highly conserved protein.  相似文献   

16.
17.
18.
A cotton genomic clone containing a 17.4-kb DNA segment was found to encompass a palmitoyl-acyl carrier protein (ACP) thioesterase (Fat B1) gene. The gene spans 3.6 kb with six exons and five introns, and is apparently the first plant FatB acyl-ACP thioesterase gene to be completely sequenced. The six exons are identical in nucleotide sequence to the open reading frame of the corresponding cDNA, and would encode a preprotein of 413 amino acids. The preprotein can clearly be identified as a FatB acyl-ACP thioesterase from its similarity to the deduced amino acid sequences of other FatB thioesterase preproteins. A 5'-flanking region of 914 bp was sequenced, with the potential TATA basal promoter 324 bp upstream from the ATG initiation codon. The 5'-flanking sequence also has a putative CAAT box and two presumptive basic region helixloop-helix (bHLH) elements with the consensus motif CANNTG (termed an E box), implicated as being a positive regulatory element in seed-specific gene expression.  相似文献   

19.
The cDNA clones encoding the precursor form of glycinin A3B4 subunit have been identified from a library of soybean cotyledonary cDNA clones in the plasmid pBR322 by a combination of differential colony hybridizations, and then by immunoprecipitation of hybrid-selected translation product with A3-mono-specific antiserum. A recombinant plasmid, designated pGA3B41425, from one of six clones covering codons for the NH2-terminal region of the subunit was sequenced, and the amino acid sequence was inferred from the nucleotide sequence, which showed that the mRNA codes for a precursor protein of 516 amino acids. Analysis of this cDNA also showed that it contained 1786 nucleotides of mRNA sequence with a 5'-terminal nontranslated region of 46 nucleotides, a signal peptide region corresponding to 24 amino acids, an A3 acidic subunit region corresponding to 320 amino acids followed by a B4 basic subunit region corresponding to 172 amino acids, and a 3'-terminal nontranslated region of 192 nucleotides, which contained two characteristic AAUAAA sequences that ended 110 nucleotides and 26 nucleotides from a 3'-terminal poly(A) segment, respectively. Our results confirm that glycinin is synthesized as precursor polypeptides which undergo post-translational processing to form the nonrandom polypeptide pairs via disulfide bonds. The inferred amino acid sequence of the mature basic subunit, B4, was compared to that of the basic subunit of pea legumin, Leg Beta, which contained 185 amino acids. Using an alignment that permitted a maximum homology of amino acids, it was found that overall 42% of the amino acid positions are identical in both proteins. These results led us to conclude that both storage proteins have a common ancestor.  相似文献   

20.
A cDNA clone encoding the high affinity Ca2+-binding protein (HACBP) of rabbit skeletal muscle sarcoplasmic reticulum was isolated and sequenced. The cDNA encoded a protein of 418 amino acids, but a comparison of the deduced amino acid sequence with the NH2-terminal amino acid sequence of the purified protein indicates that a 17-residue NH2-terminal signal sequence was removed during synthesis. This was confirmed by studies of in vitro translation of mRNA encoding the protein. Structural predictions did not reveal any potential transmembrane segments in the protein. The COOH-terminal sequence of the high affinity Ca2+-binding protein, Lys-Asp-Glu-Leu, is the same as that proposed to be an endoplasmic reticulum retention signal (Munro, S., and Pelham, H. R. B. (1987) Cell 48, 899-907). All of these characteristics suggest that the protein is localized in the lumen of the sarcoplasmic reticulum. The mature protein of Mr 46,567 contains 109 acidic and 52 basic amino acids. Structural predictions suggest that the first half of the molecule forms a globular domain of 8 anti-parallel beta-strands with a helix-turn-helix motif at the extreme NH2 terminus. The next one-third of the sequence is proline-rich. This segment can be subdivided into a charged region which contains a 17-amino acid repeat, followed by a proline, serine, and threonine-rich segment extending from Pro-246 to Thr-316. Thirty-seven acidic residues are clustered within 56 amino acids at the COOH terminus of the protein. Although the protein binds 1 mol of Ca2+/mol with high affinity, no "EF-hand" consensus sequence was observed in the protein. The acidic COOH terminus, however, could account for the low affinity, high capacity Ca2+ binding observed in the protein. In agreement with other involved laboratories, we have chosen the name calreticulin for the protein.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号