Similar literature
1.
A novel member of the proteinase Inhibitor I family having a trypsin inhibitor specificity was isolated from the fruit of the wild tomato species Lycopersicon peruvianum (L.) Mill. (LA 107) and characterized. The protein is among the isoinhibitors of Inhibitor I that comprise 50% of the soluble proteins in the fruit of this wild species of tomato. A cDNA corresponding to the inhibitor protein and mRNA was isolated and characterized. The Inhibitor I mRNA represented 0.06% of the poly(A) RNA and gene copy number reconstruction experiments gave an estimate of two to four genes/haploid genome. The open reading frame of the cDNA codes for a protein of 111 amino acids having a 42-amino acid prepropolypeptide. The NH2-terminal sequence of the first 21 amino acids of the purified Inhibitor I protein confirmed that the cDNA was identical to the protein. The amino acid sequence of the L. peruvianum fruit Inhibitor I exhibits 74% identity with the wound-inducible Inhibitor I from tomato leaves. Whereas all previously identified members of the Inhibitor I family have either Met, Leu, or Asp at the P1 site and can inhibit enzymes such as chymotrypsin, subtilisin, and elastase, the fruit Inhibitor I possesses Lys at the P1 position. Thus, this is the first member of the extensive Inhibitor I family from plants and animals that exhibits trypsin inhibitory specificity. The presence of this inhibitor in wild tomato fruit may reflect a functional role to protect the tissues against herbivory.

2.
A cDNA containing the complete amino acid-coding region of wound-induced tomato Inhibitor II was constructed in the plasmid pUC9. The open reading frame codes for 148 amino acids including a 25-amino acid signal sequence preceding the N-terminal lysine of the mature Inhibitor II. The Inhibitor II sequence exhibits two domains, one domain having a trypsin inhibitory site and the other a chymotrypsin inhibitory site, apparently evolved from a smaller gene by a process of gene duplication and elongation. The amino acid sequence of tomato leaf Inhibitor II exhibits homology with two small proteinase inhibitors isolated from potato tuber and an inhibitor from eggplant. The small potato tuber inhibitors are homologous with 33 amino acids of the N-terminal domain and 19 amino acids from the C-terminal domain. Two identical nucleotide sequences of Inhibitor II cDNA in the 3' noncoding region were present that were also found in an Inhibitor I cDNA. These include an atypical polyadenylation signal, AATAAG, and a 10-base palindromic sequence, CATTATAATG, for which no function is yet known.

3.
A human pancreas-specific protein (PASP), previously characterized as a serum marker for acute pancreatitis and pancreatic graft rejection, has been identified as pancreatic procarboxypeptidase B (PCPB). cDNAs encoding PASP/PCPB were isolated from a human pancreas cDNA library using a combination of nucleic acid hybridization screening and immunoscreening with antisera raised against native PASP. The deduced amino acid sequence of PASP/PCPB cDNA predicts the translation of a 416-amino acid preproenzyme with a 15-amino acid signal/leader peptide and a 95-amino acid activation peptide. The proenzyme portion of this protein has 76% identity with rat PCPB and 84% identity with bovine carboxypeptidase B. DNA and RNA blot analyses indicate that human PCPB mRNA (1,400 nucleotides) is transcribed from a single locus in the human genome in a tissue-specific fashion. N-terminal sequencing of native PASP and the specific immunoreactivity of bacterially expressed PASP/PCPB with native PASP antibodies confirm the identification of PASP as human pancreatic PCPB.

4.
5.
The grasshopper neuropeptides adipokinetic hormone (AKH) I and II were among the first of an extensive family of structurally similar arthropod hormones and neuroregulators to be isolated and sequenced. This paper reports the cloning of cDNAs derived from the unusually small mRNAs (550 bases) which code for the precursors of AKH I and II from Schistocerca nitans. Sequence analysis of the cDNAs indicates that AKH I and II are derived from small precursor proteins (63 and 61 amino acids) which are 55% identical in amino acid sequence. Each contains a 22-amino acid hydrophobic leader sequence followed by the AKH I or II sequence and an additional 28-amino acid carboxyl-terminal peptide of unknown function. Significant homology at the nucleic acid level (64% identity) is confined to the coding region of the mRNA sequences. Preliminary DNA blot analyses suggest that a single gene codes for each, and that the genes for AKH I and II may be linked. Genomic blots from various tissues fail to suggest that the high level of expression of AKH in the corpora cardiaca is due to tissue specific gene amplification.

6.
Genomic blotting of restriction fragments of Russet Burbank DNA indicated that at least 6 copies of Inhibitor I are present in the tetraploid potato genome. A library of potato genes in bacteriophage was screened for the presence of Inhibitor I genes using a wound-inducible tomato Inhibitor I cDNA as a hybridization probe. One phage with an insert of 13.1 kb was isolated that hybridized most strongly with the probe. A 4.2 kb EcoRI fragment containing the gene was isolated from the clone and a 2.2 kb region was sequenced that included about 800 bp of both the 5' and 3' regions. The gene contained two introns of 479 and 417 bp respectively, and the splice junctions were typical of other eukaryotic genes. Putative TATAA and CAAT boxes were identified. The nucleotide sequence, when compared with a wound-inducible tomato Inhibitor I cDNA, exhibited over 90% identity. The gene codes for a prepro-Inhibitor I protein of 96 amino acids. The putative pre-sequence of 19 amino acids differs in only one residue from that of tomato Inhibitor I. The potato pro-sequence, however, is lacking a tetrapeptide that is found in the tomato pro-sequence in the region of pro-peptide processing. This deletion, together with a substitution of a Gln for a Leu (4 residues toward the N terminus), provides an explanation for the differences at the N termini between tomato and potato Inhibitor I natural proteins by providing different processing sites in the two pro-inhibitors. Thus, amino acid sequence differences between the N termini of tomato and potato Inhibitor I are easily explained by the mutational events. The different proposed pro-processing sites of the tomato and potato inhibitors suggest that a processing protease may be present in the vacuole with a specificity for Asn-X and Gln-X bonds. This is Scientific Paper No. 7493, Project 1791, College of Agriculture and Home Economics Research Center, Washington State University.

7.
Sequence analysis of short fragments resulting from trypsin digestion of the thermolabile shrimp alkaline phosphatase (SAP) from Northern shrimp Pandalus borealis formed the basis for amplification of its encoding cDNA. The predicted protein sequence was recognized as containing the consensus alkaline phosphatase motif comprising the active site of this protein family. Protein sequence homology searches identified several eukaryote alkaline phosphatases with which the 475-amino acid SAP polypeptide shares 45% amino acid sequence identity. Residues for potential metal binding seem to be conserved in these proteins. The predicted 54-kDa molecular mass of SAP is smaller than previously reported, but is consistent with our recent SDS-PAGE analysis of the native protein. Compared to its homologs, the shrimp enzyme has a surplus of negatively charged amino acids, while the relative number of prolines is lower and the frequency of aromatic residues is higher than in mesophilic counterparts.

8.
Several cDNA clones encoding a 46-kDa collagen-binding glycoprotein (gp46) from rat skeletal myoblasts were isolated and sequenced. The cDNA encoded a 17-amino acid signal peptide and a 400-amino acid mature protein, containing three potential N-linked oligosaccharide attachment sites. The cDNA sequence of gp46 shows 93% identity in the coding region with J6, a retinoic acid-inducible gene coding for a protein of unknown function described from embryonal carcinoma F9 cells. The first 41 NH2-terminal amino acids of the predicted J6 sequence are, however, different from the gp46 sequence as a result of a 7-base pair insertion in the gp46 cDNA. In addition, the NH2-terminal amino acid sequence of hsp47, a collagen-binding protein found in chick embryo fibroblasts, shows 64% identity to gp46 over 36 residues. Interestingly, this alignment begins 10 residues inward from the first amino acid in the mature form of gp46. A significant sequence similarity was observed between gp46 and members of the serine protease inhibitor (serpin) family. Unlike other serpins, however, gp46 is both a heat shock and a collagen-binding protein and is localized to the lumen of the endoplasmic reticulum, as suggested by the presence of the RDEL sequence at the COOH terminus. This sequence is similar to other proposed endoplasmic reticulum retention signals.

9.
10.
A TBLASTN search of the Drosophila melanogaster expressed sequence tag (EST) database with the amino acid sequence of human UDP-N-acetylglucosamine:alpha-3-D-mannoside beta-1,2-N-acetylglucosaminyltransferase I (GnT I, EC 2.4.1.101) as probe yielded a clone (GM01211) with 56% identity over 36 carboxy-terminal amino acids. A 550 base pair (bp) probe derived from the EST clone was used to screen a Drosophila cDNA library in lambda-ZAP II and two cDNAs lacking a start ATG codon were obtained. 5'-Rapid amplification of cDNA ends (5'-RACE) yielded a 2828 bp cDNA containing a full-length 1368 bp open reading frame encoding a 456 amino acid protein with putative N-terminal cytoplasmic (5 residues) and hydrophobic transmembrane (20 residues) domains. The protein showed 52% amino acid sequence identity to human GnT I. This cDNA, truncated to remove the N-terminal hydrophobic domain, was expressed in the baculovirus/Sf9 system as a secreted protein containing an N-terminal (His)6 tag. Protein purified by adsorption to and elution from nickel beads converted Man alpha1-6(Man alpha1-3)Man beta-octyl (M3-octyl) to Man alpha1-6(GlcNAc beta1-2Man alpha1-3)Man beta-octyl. The Km values (0.7 and 0.03 mM for M3-octyl and UDP-GlcNAc respectively), temperature optimum (37 degrees C), pH optimum (pH 5 to 6) and divalent cation requirements (Mn > Fe, Mg, Ni > Ba, Ca, Cd, Cu) were similar to mammalian GnT I. TBLASTN searches of the Berkeley Drosophila Genome Project database with the Drosophila GnT I cDNA sequence as probe allowed localization of the gene to chromosomal region 2R; 57A9. Comparison of the cDNA and genomic DNA sequences allowed the assignment of seven exons and six introns; all introns showed GT-AG splice site consensus sequences. This is the first insect GnT I gene to be cloned and expressed.

11.
Two forms of small, interstitial proteoglycans have been isolated from bovine articular cartilage and have different core proteins, based on NH2-terminal analysis and peptide mapping (Choi, H. U., Johnson, T. L., Pal, S., Tang, L-H., Rosenberg, L. C., and Neame, P. J. (1989) J. Biol. Chem. 264, 2876-2884). These proteoglycans have been called PG I and PG II. Since they were first described, they have also been called "biglycan" (PG I), "decorin," and "DS-PG" (PG II). This report describes the primary structure of PG I from bovine articular cartilage. The protein core consists of 331 amino acids with a molecular mass of 37,280 Da. The amino acid sequence shows 55% identity to the cDNA-derived sequence of PG II from bovine bone. There are four discrete domains in the amino acid sequence. Domain 1, at the NH2 terminus (approximately 23 amino acids), contains two sites of attachment of dermatan sulfate, both of which match the consensus sequence of Asp/Glu-X-X-Ser-Gly-hydrophobic. Neither of these sites is substituted to 100% with glycosaminoglycan in native PG I. Domain 2, near the NH2 terminus and containing approximately 28 amino acids, has a cysteine pattern similar to a domain near the COOH terminus of mouse metallothionein and contains at least one disulfide bond (between the first and fourth cysteine residues). The majority of the core protein of PG I (domain 3) is a leucine-rich domain containing ten repeating units (approximately 231 amino acids). Patthy ((1987) J. Mol. Biol. 198, 567-577) has shown that for PG II, the majority of domain 3 shows considerable similarity to leucine-rich alpha 2-glycoprotein (LRG) from serum. Domain 2 of PG I or PG II also has an analog in LRG, in that it has two cysteines in a similar place. The major motif in the PG I described here, in PG II and in LRG, is a series of leucine-rich repeats. PG I and PG II both contain 10 leucine-rich repeats which are 14 amino acids long and which are somewhat irregularly spaced, while LRG contains 9 leucine-rich repeats spaced 10 amino acids apart. Other proteins which contain leucine repeats are the platelet glycoprotein Ib, which is involved in platelet adherence to subendothelium (eight repeats in the alpha chain and two in the beta chain), the protein encoded by the Toll gene (involved in lateral and ventral spatial organization in Drosophila) and chaoptin (a protein involved in Drosophila photoreceptor morphogenesis). (ABSTRACT TRUNCATED AT 400 WORDS)

12.
Chitin-binding proteins are present in a wide range of plant species, including both monocots and dicots, even though these plants contain no chitin. To investigate the relationship between in vitro antifungal and insecticidal activities of chitin-binding proteins and their unknown endogenous functions, the stinging nettle lectin (Urtica dioica agglutinin, UDA) cDNA was cloned using a synthetic gene as the probe. The nettle lectin cDNA clone contained an open reading frame encoding 374 amino acids. Analysis of the deduced amino acid sequence revealed a 21-amino acid putative signal sequence and the 86 amino acids encoding the two chitin-binding domains of nettle lectin. These domains were fused to a 19-amino acid "spacer" domain and a 244-amino acid carboxyl extension with partial identity to a chitinase catalytic domain. The authenticity of the cDNA clone was confirmed by deduced amino acid sequence identity with sequence data obtained from tryptic digests, RNA gel blot, and polymerase chain reaction analyses. RNA gel blot analysis also showed the nettle lectin message was present primarily in rhizomes and inflorescence (with immature seeds) but not in leaves or stems. Chitinase enzymatic activity was found when the chitinase-like domain alone or the chitinase-like domain with the chitin-binding domains were expressed in Escherichia coli. This is the first example of a chitin-binding protein with both a duplication of the 43-amino acid chitin-binding domain and a fusion of the chitin-binding domains to a structurally unrelated domain, the chitinase domain.

13.
The sequence of the gene from Nocardia sp. NRRL 5646 encoding GTP cyclohydrolase I (GCH), gch, and its adjacent regions was determined. The open reading frame of Nocardia gch contains 684 nucleotides, and the deduced amino acid sequence represents a protein of 227 amino acid residues with a calculated molecular mass of 24,563 Da. The uncommon start codon TTG was identified by matching the N-terminal amino acid sequence of purified Nocardia GCH with the deduced amino acid sequence. A likely ribosomal binding site was identified 9 bp upstream of the translational start site. The 3' flanking region encodes a peptide that shares high homology with dihydropteroate synthases. Nocardia GCH has 73 and 60% identity to the proteins encoded by the putative gch of Mycobacterium tuberculosis and Streptomyces coelicolor, respectively. Nocardia GCH was highly expressed in Escherichia coli cells carrying a pHAT10-based expression vector, and moderately expressed in Mycobacterium smegmatis cells carrying a pSMT3-based expression vector. Enterokinase digestion of recombinant Nocardia GCH, and in-gel digestion of Nocardia GCH and recombinant GCH followed by MALDI-TOF-MS analysis, confirmed that the actual subunit size of the enzyme was 24.5 kDa. Thus, we conclude that the active form of native Nocardia GCH is a decamer. Our earlier, incorrect conclusion that the native enzyme was an octamer arose from the anomalous SDS-PAGE migration of the subunit.
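The relationship between the reading-frame length and the residue count quoted above can be checked with a little arithmetic. The sketch below is illustrative only: the function name is hypothetical, and the reading that the 684-nucleotide figure includes the stop codon is an inference from the numbers (684 / 3 = 228 codons, one more than the 227 residues reported), not something the abstract states.

```python
def residues_from_orf(orf_nt: int, includes_stop: bool = True) -> int:
    """Return the number of amino acid residues encoded by an ORF.

    orf_nt        -- ORF length in nucleotides (must be a multiple of 3)
    includes_stop -- assumption: whether the quoted length counts the stop codon
    """
    if orf_nt % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_nt // 3
    # A stop codon occupies one codon but contributes no residue.
    return codons - 1 if includes_stop else codons


# 684 nt -> 228 codons -> 227 residues if the stop codon is counted in the ORF.
print(residues_from_orf(684))
```

Abstracts differ on whether a quoted reading-frame length counts the stop codon, which is why the flag is exposed rather than hard-coded.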

14.
The leukocyte function-associated molecule 1 (LFA-1, CD11a/CD18) is a membrane glycoprotein which functions in cell-cell adhesion by heterophilic interaction with intercellular adhesion molecule 1 (ICAM-1). LFA-1 consists of an alpha subunit (Mr = 180,000) and a beta subunit (Mr = 95,000). We report the molecular biology and protein sequence of the alpha subunit. Overlapping cDNAs containing 5,139 nucleotides were isolated using an oligonucleotide specified by tryptic peptide sequence. The mRNA of 5.5 kb is expressed in lymphoid and myeloid cells but not in a bladder carcinoma cell line. The protein has a 1,063-amino acid extracellular domain, a 29-amino acid transmembrane region, and a 53-amino acid cytoplasmic tail. The extracellular domain contains seven repeats. Repeats V-VII are in tandem and contain putative divalent cation binding sites. LFA-1 has significant homology to the members of the integrin superfamily, having 36% identity with the Mac-1 and p150,95 alpha subunits and 28% identity with other integrin alpha subunits. An insertion of approximately 200 amino acids is present in the NH2-terminal region of LFA-1. This "inserted/interactive" or I domain is also present in the p150,95 and Mac-1 alpha subunits but is absent from other integrin alpha subunits sequenced to date. The I domain has striking homology to three repeats in human von Willebrand factor, two repeats in chicken cartilage matrix protein, and a region of complement factor B. These structural features indicate a bipartite evolution from the integrin family and from an I domain family. These features may also correspond to relevant functional domains.

15.
The complete cDNA sequence of bovine coagulation factor V.
Lack of availability of a primary structure for bovine factor V has hindered detailed analysis of a vast majority of structure-function correlations on this molecule. To determine the primary structure of bovine factor V, we used liver mRNA as a template for the synthesis of three cDNA libraries. The sequences of seven overlapping cDNA clones infer two bovine factor V variants. Variant 1 results in a 6910-basepair (bp) cDNA including 103 bp of 5'-untranslated sequence, 6633 bp of coding sequence and 171 bp of 3'-untranslated sequence with a putative polyadenylation site. Variant 2 differs only in the size of the coding sequence (6618 bp). The open reading frame translates to factor V consisting of 2211 (or 2206) amino acids including a 28-amino acid signal peptide. Comparison of the amino acid sequences with human factor Va reveals 84% identity for the heavy and 86% for the light chains. In contrast, the B domain (connecting region) exhibits only 59% identity relative to the human molecule. The bovine B domain contains two repeats of a 14-amino acid structure that is contained only once in the human sequence. Bovine factor V lacks one of the nine amino acid repeats and one of the 17 amino acid repeats present in the human B domain. Factor V has little homology to the factor VIII molecule in the B domain. The 17-amino acid repeat missing in bovine factor V allows identification of an 18-amino acid sequence that is homologous to the B domain of human factor VIII. These 18 amino acids may either constitute the unique vestige of a divergent evolution between the B domains of factors V and VIII or reveal the convergent evolution toward a critical epitope involved in the activation of both procofactors.
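Percent-identity figures like those quoted above are computed from a pairwise alignment. The following is a minimal sketch of one common convention, counting identical residues over the aligned columns that contain no gap. The function name and the two short sequences are hypothetical, and the abstract does not say which denominator its authors used, so this is an illustration of the idea rather than a reconstruction of their analysis.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over gap-free columns of a pairwise alignment.

    Both inputs must already be aligned (equal length, gaps marked with `gap`).
    Assumption: columns containing a gap are excluded from the denominator;
    other conventions divide by the full alignment length or the shorter sequence.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b) if x != gap and y != gap]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


# Hypothetical toy alignment: 7 gap-free columns, 5 of them identical.
print(round(percent_identity("MKT-LLVA", "MRTALLIA"), 1))
```

Because the denominator changes the result, identity values from different papers are only directly comparable when the same convention was used.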

16.
17.
Pure human inactive renin. Evidence that native inactive renin is prorenin
To clarify contradicting observations on the identity of inactive renin and prorenin, inactive renin was completely purified from native human chorion laeve and the culture medium of human chorion cells. A 720,000-fold purification with 14% recovery was achieved from chorion laeve in 6 steps, including immunoaffinity chromatography on a monoclonal antibody to human renin coupled to Protein A-Sepharose CL-4B. A 3,100-fold purification with 40% recovery was achieved from chorion culture medium in 4 steps, including immunoaffinity chromatography. Inactive renin purified from the two different sources migrated as a single protein band with the same molecular weight of 47,000 on sodium dodecyl sulfate-polyacrylamide gel electrophoresis and consisted of multiple components that could be resolved by isoelectric focusing. Both had the same pI values which shifted downward upon activation by trypsin; however, relative peak heights were different between the two preparations. The purified inactive renin from chorion laeve was completely inactive and did not bind to pepstatin-aminohexyl-Sepharose; however, that from chorion culture medium was partially active and completely bound to the pepstatin gel, indicating that each molecule is partially activated. Trypsin-activated inactive renins from both sources were identical with human renal renin in terms of pH optimum and Km. Specific activities of trypsin-activated inactive renin from chorion laeve and chorion culture medium were 529 Goldblatt units/mg of protein and 449 Goldblatt units/mg of protein, respectively. Amino acid sequence analysis of both of the purified inactive renin preparations demonstrated a leucine residue at the amino terminus. The sequence of 11 additional amino acids was identical in both and agreed with that predicted from the base sequence of the renin gene. These findings indicate that preprorenin is converted to prorenin following removal of a 23-amino acid signal peptide and that the native inactive renin, whose amino acid sequence commences with Leu-Pro-Thr..., is prorenin.

18.
A full-length cDNA clone encoding osteocalcin from the bullfrog, Rana catesbeiana (bone Gla-protein, BGP) has been isolated, and the complete coding sequence for the 100-amino-acid pre-pro-osteocalcin protein was determined. The amino acid sequence of Rana catesbeiana osteocalcin, especially the mature 49-amino acid sequence, is closer to the mammalian than to the fish, Sparus osteocalcin. Rana mature osteocalcin has a similarity of 67% with human or 59% with rat osteocalcin, and only 42% with fish mature osteocalcin. The 51-amino-acid pre-pro-peptide contains the expected hydrophobic leader sequence and the dibasic Arg-Arg sequence preceding the NH2-terminal Ser of the mature 49-amino-acid Rana osteocalcin. The pro-peptide sequence also contains the expected motif of polar and hydrophobic residues, which targets vitamin K-dependent gamma-carboxylation of three specific Glu residues at positions 17, 21, and 24 in the mature protein. At the native protein expression levels, extraction from Rana cortical bone in the presence of protease inhibitor cocktail resulted in the isolation of two distinct forms of osteocalcin, P-1 and P-2, with a 3:2 distribution. Using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF-MS) and amino acid sequence analysis of the N-terminal domain, we confirmed that P-1 is the intact 49-residue osteocalcin with N-terminal SNLRNAVFG., and that P-2 lacks four amino acids from the N-terminus, (NAVFG.). These results demonstrate the existence of a form of osteocalcin lacking four N-terminal amino acids in Rana bone, and that mature Rana osteocalcins remained highly conserved in their molecular evolution, especially with respect to the conservation of the C-terminal domain (residues 14-49).

19.
20.
DNA clones encoding chicken T-protein of the glycine cleavage system were isolated from chicken liver lambda gt10 cDNA libraries. Three overlapping clones provided an open reading frame of 1176 nucleotides that predicts a polypeptide of 392 amino acids (M(r) 42,056) comprised of a 16-residue mitochondrial targeting sequence and a 376-residue mature protein (M(r) 40,292). The amino acid sequence predicted for the mature protein showed 67% identity with that of bovine T-protein. A cDNA encoding mature T-protein was constructed, and the nucleotide sequence just downstream of the initiation codon was modified without amino acid substitution to reduce the free energy of formation for the folded mRNA. Expression plasmids containing these cDNA variants produced large amounts of T-protein in Escherichia coli, while very low expression was observed with a plasmid containing wild type cDNA. Enzymatically active T-protein was obtained when the expression was conducted at 30 degrees C with 25 microM isopropyl-1-thio-beta-D-galactopyranoside. Under the full inducing condition (at 37 degrees C and 1 mM inducer), the expressed T-protein was recovered as insoluble and inactive protein. The recombinant T-protein was purified to near homogeneity with a yield of about 30%. Apparent molecular weight on sodium dodecyl sulfate-polyacrylamide gel electrophoresis is approximately 40,000, similar to the size of T-protein purified from chicken liver. NH2-terminal amino acid sequence analysis (9 residues) revealed 100% identity with chicken T-protein determined chemically. The kinetic properties of the recombinant T-protein resembled those of the native chicken T-protein.
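Changing a nucleotide sequence "without amino acid substitution", as described above, relies on the redundancy of the genetic code: most amino acids are encoded by several synonymous codons. The sketch below illustrates that idea with the standard genetic code. The function names and the short DNA strings are hypothetical and are not taken from the chicken T-protein cDNA, and the sketch does not attempt the mRNA folding-energy calculation mentioned in the abstract.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (third base fastest).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def synonymous_codons(codon: str) -> list[str]:
    """List the other codons that encode the same amino acid as `codon`."""
    aa = CODON_TABLE[codon]
    return sorted(c for c, a in CODON_TABLE.items() if a == aa and c != codon)


# Hypothetical example: two different DNA sequences, one protein sequence.
original = "ATGGCTGGT"
recoded = "ATGGCAGGA"
print(translate(original), translate(recoded))
print(synonymous_codons("GCT"))
```

In practice a recoding step would choose among the synonymous codons using some scoring of predicted mRNA structure; that scoring is outside the scope of this sketch.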
