首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The nucleotide sequence of the glg B gene, coding for branching enzyme (EC 2.4.1.18), was elucidated. It consists of 2181 base pairs specifying a protein of 727 amino acids. The deduced amino acid sequence was consistent with the amino acid analysis that was obtained with the pure protein as well as with the molecular weight determined from sodium dodecyl sulfate-gel electrophoresis. The deduced amino acid sequence was also consistent with the amino-terminal amino acid sequence and the amino acid sequence analysis of various peptides obtained from CNBr degradation of purified branching enzyme.  相似文献   

2.
Spermine binding protein (SBP) is a rat ventral prostate protein that binds various polyamines, and the level of this protein and its mRNA is regulated by androgens. Previously, the cDNA for SBP was cloned and sequenced and an amino acid sequence deduced from the cDNA. Data from cloned and sequenced and an amino acid sequence deduced from the cDNA. Data from partial amino acid sequencing of the purified protein were consistent with the amino acid sequence deduced from the cDNA. However, the amino terminus of the protein was blocked, and therefore, direct protein sequence information confirming the cDNA reading frame of this region could not be obtained by Edman degradation. We have now employed an integrated approach using fast atom bombardment mass spectrometry, tandem mass spectrometry, and conventional sequencing methodologies to establish the amino-terminal sequence of the protein and to identify an amino acid sequence (35 residues) present in the purified protein but missing from the amino acid sequence deduced from cDNA clones for this protein. The missing piece of cDNA corresponds to an exon found in mouse genomic clones for a protein similar to rat SBP. Therefore, the cDNA clones for rat SBP may represent splicing variants that lack the sequence information of one exon. The blocked amino terminus of the protein was identified as 5-oxopyrrolidine-2-carboxylic acid. Mass spectrometry also provided evidence regarding glycosylation of the protein. The first of two potential glycosylation sites clearly carries carbohydrate; the second site is, at most, only partially glycosylated.  相似文献   

3.
The amino acid sequence of human beta-microseminoprotein   总被引:2,自引:0,他引:2  
The complete amino acid sequence of beta-microseminoprotein of human seminal plasma was determined by automated Edman degradation of the protein and peptides which were obtained by enzymatic cleavage with trypsin, chymotrypsin and Staphylococcus aureus V8 proteinase. The carboxyl-terminal sequence of the protein was established with the aid of carboxypeptidase A. The amino acid sequence of this protein proved to be as follows: (sequence; see text) Thus, beta-microseminoprotein consisting of 93 amino acid residues has a molecular mass of 10 652 Da. The linear structure of this protein represents the first complete amino acid sequence of a sperm-coating protein specific to human seminal plasma.  相似文献   

4.
5.
The nucleotide and partial amino acid sequence of toxic shock syndrome toxin-1   总被引:37,自引:0,他引:37  
The nucleotide sequence of toxic shock syndrome toxin-1 (TSST-1) has been determined. In addition, one-third of the predicted amino acid sequence was confirmed by amino acid sequence analysis of cyanogen bromide-generated TSST-1 protein fragments. The DNA sequencing results identified a 708-base pair open reading frame starting with an ATG, 7 base pairs downstream from a Shine-Dalgarno sequence, and terminating at a UAA stop codon. Amino acid analysis of the intact protein defined the NH2 terminus of the mature protein and located the cleavage point for the signal peptide (Ala/Ser). The signal peptide contained the first 40 amino acids and had characteristic structural similarities with other bacterial signal peptides. The coding sequence of the mature protein was 585 base pairs (194 amino acids) in length, and the molecular weight of the predicted protein was 22,049. This is in good agreement with the previously reported molecular weight of TSST-1 (22,000), as determined by sodium dodecyl sulfate-polyacrylamide gel electrophoresis. NH2-terminal amino acid sequence analysis performed on isolated TSST-1 CNBr fragments determined the position of the peptides in the TSST-1 sequence and verified the predicted amino acid sequence in those positions. Computer analyses of the amino acid sequence showed that TSST-1 has little or no sequence homology with biologically related toxins, streptococcal pyrogenic exotoxin A, and staphylococcal enterotoxins B and C.  相似文献   

6.
A cDNA containing the coding region for the complete amino acid sequence of wound-induced proteinase Inhibitor I from tomato leaves was constructed in the plasmid pUC9 and characterized. The open reading frame codes for a protein of 111 amino acids. This deduced amino acid sequence revealed the presence of a 42-amino acid N-terminal sequence that is not found in the native protein. This sequence appears to contain a 23-amino acid segment typical of a signal sequence followed by a 19-amino acid sequence containing 9 charged amino acids. The 42-amino acid sequence is apparently lost during maturation to the native Inhibitor I and represents 38% of the translated protein. The Inhibitor I amino acid sequence contains 71% identity with potato tuber Inhibitor I sequence and 35% identity with an inhibitor from the leech.  相似文献   

7.
8.
We have determined the nucleotide sequence of the uvrA gene of Escherichia coli. The coding region of the gene is 2820 base pairs which specifies a protein of 940 amino acids and Mr = 103,874. The polypeptide sequence predicted from the DNA sequence was confirmed by analyzing the UvrA protein: the sequence of the first 7 NH2-terminal amino acids as well as the amino acid composition of the pure protein agreed with those predicted from the nucleotide sequence. By comparing the sequence of UvrA protein to the amino acid sequences of other ATPases, we found that two regions in the UvrA protein, separated from one another by about 600 amino acids, have the highly conserved G-X4-GKT(S)-X6-I(V) sequence found at the active sites of many, but not all, ATPases. Our findings suggest that UvrA protein may have two ATP binding sites.  相似文献   

9.
cDNA molecular cloning of Geotrichum candidum lipase   总被引:6,自引:0,他引:6  
The cDNA clone of Geotrichum candidum (Geo.) lipase was isolated from the Geo. cDNA library by colony hybridization using 32P-labeled oligonucleotides corresponding to a partial amino acid sequence of this enzyme. The nucleotide sequence of the cDNA determined by the dideoxy chain terminating method included some partial amino acid sequences determined by Edman degradation, and the overall amino acid composition deduced from the cDNA coincided with that from amino acid analysis of this protein. The cloned cDNA coded a protein of 554 amino acids and a hydrophobic signal sequence of 19 amino acids. Geo. lipase contained the -Gly-X-Ser-X-Gly- sequence which is believed to form part of the interfacial lipid recognition site.  相似文献   

10.
CAP18 is a novel 18 kDa cationic protein [pI approximately 10] originally purified from rabbit granulocytes using as an assay the agglutination of lipopolysaccharide (LPS) coated erythrocytes. cDNA clones encoding CAP18 were isolated from a rabbit bone marrow cDNA library using a PCR generated oligonucleotide probe derived from the N-terminal amino acid sequence. The deduced amino acid sequence reveals a putative signal sequence of 29 amino acids and a mature protein of 142 amino acid residues. The predicted size of the encoded protein is 16.6 kDa with a pI of 10. There are no N-linked glycosylation sites. The CAP18 sequence bears no homology with other known LPS-binding proteins including human bacterial permeability increasing protein (BPI)(1) and rabbit LPS binding protein (LBP)(2).  相似文献   

11.
The primary structure of rat ribosomal protein L37a   总被引:5,自引:0,他引:5  
The amino acid sequence of rat ribosomal protein L37a was deduced from the sequence of nucleotides in recombinant cDNAs isolated in Yamagata and in Chicago and confirmed from the NH2-terminal amino acid sequence of the protein. Ribosomal protein L37a contains 91 amino acids (the NH2-terminal methionine is removed after translation of the mRNA) and has a molecular mass of 10143 Da.  相似文献   

12.
The major auxin-binding protein from maize coleoptiles was purified to homogeneity. The protein has an apparent mol. wt of 22 kd and binds 1-naphthylacetic acid with a KD of 2.40 x 10(-7) M. Additional antigenically related proteins, present in very low amounts, could be demonstrated in maize coleoptiles using immunodetection. Extensive protein sequence analysis of the major auxin-binding protein allowed the construction of several synthetic oligonucleotide probes which were used to isolate a cDNA coding for this protein. The cDNA corresponds to a mRNA with a 3'-poly(A)+ sequence and a single, long open reading frame of 603 bases. The open reading frame, starting 34 residues from the 5' end of the cDNA, predicts a 21,990 Dalton protein of 201 amino acids. Comparison of this deduced amino acid sequence with the partial amino acid sequences of purified auxin-binding protein, revealed a perfect match, involving a total of 53 amino acid residues. The primary amino acid sequence includes a 38-amino-acid-long N-terminal hydrophobic leader sequence which could represent a signal for translocation of this protein to the endoplasmic reticulum. An additional signal is located at the C-terminal end, consisting of the amino acids KDEL known to be responsible for preventing secretion of proteins from the lumen of the endoplasmic reticulum in eucaryotic cells. The primary sequence contains a N-glycosylation site (-asp133-thr-thr-). This site was found to be glycosylated by a high-mannose-type oligosaccharide.(ABSTRACT TRUNCATED AT 250 WORDS)  相似文献   

13.
Cloning and sequence analysis of a DNA complementary to the mRNA expressed in undifferentiated mouse F9 teratocarcinoma stem cells but disappearing rapidly after treatment with a tumor-promoting phorbol ester revealed it to be a 1.9 kilobase pairs-long cDNA encoding a protein of 323 amino acid residues. Computer-assisted analyses of the deduced amino acid sequence indicated that this protein contains a typical hydrophobic signal peptide consisting of 33 amino acid residues and six putative membrane-spanning segments. The deduced amino acid sequence, as a whole, bears no significant sequence homology to any previously described protein.  相似文献   

14.
The complete amino acid sequence of human plasma apolipoprotein C-III (apoC-III) isolated from normal subjects is described. ApoC-III is a linear polypeptide chain of 79 amino acids. Tryptic digestion of intact apoC-III produced 5 major peptides, while tryptic digestion of the citraconylated protein yielded two peptides. The complete amino acid sequence of apoC-III was determined by the automated Edman degradation of the intact protein as well as the various tryptic peptides. Phenylthiohydantoin amino acids were identified by high-performance liquid chromatography and chemical ionization mass spectrometry. The amino acid sequence of apoC-III isolated from normolipidemic subjects is identical to the apoC-III sequence derived from the cDNA sequence and differs at 4 positions from the previously reported sequence of apoC-III derived from a patient with type V hyperlipoproteinemia.  相似文献   

15.
The complete amino acid sequence of the single polypeptide chain of human complex-forming glycoprotein heterogeneous in charge (protein HC) isolated from a single individual is reported with the supporting data. The primary structure was determined by automatic degradation of the intact chain and of fragments obtained by chemical and enzymatic degradations of the native or reduced and S-carboxymethylated protein. The polypeptide chain of protein HC contained 182 amino acid residues with a calculated molecular weight of 20,621. No amino acid sequence variability was found and such variability can therefore not explain the great charge heterogeneity of protein HC in a single individual. The amino acid sequence of protein HC was nearly identical to the one reported for human alpha 1-microglobulin in a research communication but contained 15 additional residues.  相似文献   

16.
Based on the conserved amino acid sequence (DLKPEN) of serine-threonine protein kinase from several fungi, a degenerate primer was designed and synthesized. Total RNA was isolated from the thermophilic fungus Thermomyces lanuginosus. Using RACE-PCR, full-length cDNA of a putative serine-threonine protein kinase gene was cloned from T. lanuginosus. The full-length cDNA of T. lanuginosus protein kinase was 2551 bp and contained an 1806 bp open reading frame encoding a putative protein kinase precursor of 601 amino acid residues. Sequencing analysis showed that the cloned cDNA of T. lanuginosus had consensus protein kinase sequences. Conservative amino acid subdomains which most serine-threonine kinases contain can be found in the deduced amino acid sequence of T. lanuginosus putative protein kinase. Comparison results showed that the deduced amino acid sequence of T. lanuginosus putative protein kinase was highly homologous to that of Neurospora crassa dis1-suppressing protein kinase Dsk1. The putative protein kinase contained three arginine/serine-rich (SR) regions and two transmembrane domains. These showed that it might be a novel putative serine-threonine protein kinase.  相似文献   

17.
We isolated a 38 kDa ssDNA-binding protein from the unicellular cyanobacterium Synechococcus sp. strain PCC 6301 and determined its N-terminal amino acid sequence. A genomic clone encoding the 38 kDa protein was isolated by using a degenerate oligonucleotide probe based on the amino acid sequence. The nucleotide sequence and predicted amino acid sequence revealed that the 38 kDa protein is 306 amino acids long and homologous to the nuclear-encoded 370 amino acid chloroplast ribosomal protein CS1 of spinach (48% identity), therefore identifying it as ribosomal protein (r-protein) S1. Cyanobacterial and chloroplast S1 proteins differ in size from Escherichia coli r-protein S1 (557 amino acids). This provides an additional evidence that cyanobacteria are closely related to chloroplasts. The Synechococcus gene rps1 encoding S1 is located 1.1 kb downstream from psbB, which encodes the photosystem 11 P680 chlorophyll a apoprotein. An open reading frame encoding a potential protein of 168 amino acids is present between psbB and rps1 and its deduced amino acid sequence is similar to that of E. coli hypothetical 17.2 kDa protein. Northern blot analysis showed that rps1 is transcribed as a monocistronic mRNA.  相似文献   

18.
A cDNA clone encoding 55-kDa multifunctional, thyroid hormone binding protein of rabbit skeletal muscle sarcoplasmic reticulum was isolated and sequenced. The cDNA encoded a protein of 509 amino acids, and a comparison of the deduced amino acid sequence with the NH2-terminal amino acid sequence of the purified protein indicates that an 18-residue NH2-terminal signal sequence was removed during synthesis. The deduced amino acid sequence of the rabbit muscle clone suggested that this protein is related to human liver thyroid hormone binding protein, rat liver protein disulfide isomerase, human hepatoma beta-subunit of prolyl 4-hydroxylase and hen oviduct glycosylation site binding protein. The protein contains two repeated sequences Trp-Cys-Gly-His-Cys-Lys proposed to be in the active sites of protein disulfide isomerase. Northern blot analysis showed that the mRNA encoding rabbit skeletal muscle form of the protein is present in liver, kidney, brain, fast- and slow-twitch skeletal muscle, and in the myocardium. In all tissues the cDNA reacts with mRNA of 2.7 kilobases in length. The 55-kDa multifunctional thyroid hormone binding protein was identified in isolated sarcoplasmic reticulum vesicles using a monoclonal antibody specific to the 55-kDa thyroid hormone binding protein from rat liver endoplasmic reticulum. The mature protein of Mr 56,681 contains 95 acidic and 61 basic amino acids. The COOH-terminal amino acid sequence of the protein is highly enriched in acidic residues with 17 of the last 29 amino acids being negatively charged. Analysis of hydropathy of the mature protein suggests that there are no potential transmembrane segments. The COOH-terminal sequence of the protein, Arg-Asp-Glu-Leu (RDEL), is similar to but different from that proposed to be an endoplasmic reticulum retention signal; Lys-Asp-Glu-Leu (KDEL) (Munro, S., and Pelham, H.R.B. (1987) Cell 48, 899-907). This variant of the retention signal may function in a similar manner to the KDEL sequence, to localize the protein to the sarcoplasmic or endoplasmic reticulum. The positively charged amino acids Lys and Arg may thus interchange in this retention signal.  相似文献   

19.
We report for the first time the complete amino acid sequence for the growth hormone dependent insulin-like growth factor binding protein (IGFBP-3) in the rat. A human IGFBP-3 clone was generated using the polymerase chain reaction (PCR) and used to screen a rat liver cDNA library. cDNA clones of the rat IGFBP-3 were isolated and the full amino acid sequence deduced. The sequence begins with a putative, 26 amino acid signal peptide followed by a 265 amino acid binding protein. The amino acid sequence is over 80% homologous with the equivalent human IGFBP-3 form and shows complete conservation of 18 cysteine residues that are clustered at the amino and carboxy ends of the protein. IGFBP-3 is the binding subunit of the major circulating IGFBP in the rat, and hence the availability of precise structural data and cDNA probes provides an important opportunity for a detailed study of the control of IGFBP-3 synthesis at the level of gene expression.  相似文献   

20.
T Isobe  T Okuyama 《FEBS letters》1985,182(2):389-392
The amino acid sequence of bovine brain micro glutamic acid-rich protein was determined by analysis of tryptic and Trimeresurus flavoviridis protease peptides of the molecule. The protein comprised 82 amino acid residues and has an Mr of 8992. The established sequence was highly homologous (90% identity) to the sequence of C-terminal 82 residues of the neurofilament 68-kDa protein from porcine spinal cord; there are differences of 8 residues which could be species-specific amino acid substitutions. This indicates that the micro glutamic acid-rich protein may arise by a restricted proteolysis of the neurofilament 68-kDa protein, with the break occuring toward the C-terminus.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号