首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Genomic DNA sequence for human C-reactive protein   总被引:12,自引:0,他引:12  
The gene for the prototype acute phase reactant, C-reactive protein, has been isolated from two lambda phage libraries containing inserted human DNA fragments using synthetic oligonucleotide probes. Nucleotide sequence analysis indicates that after coding for a signal peptide of 18 amino acids and the first two amino acids of the mature protein, there is an intron of 278 base pairs followed by the nucleotide sequence for the remaining 204 amino acids. The intron is unusual in that it contains on the positive strand a poly(A) stretch 16 nucleotides long and a poly(GT) region 30 nucleotides long which could adopt the Z-form of DNA. The nucleotide sequence reported here confirms the amino acid sequence of mature C-reactive protein as originally reported except that it codes for an additional 19 amino acids beginning at position 62. Thus DNA sequence analysis predicts that the mature protein consists of 206 amino acids rather than 187 as originally reported. The mRNA cap site is located 104 nucleotides from the start of the signal peptide and there is a 3' noncoding region 1.2 kilobase pairs in length. The gene has a typical promoter containing the sequences TATAAAT and CAAT 29 and 81 base pairs upstream, respectively, of the cap site.  相似文献   

2.
A biochemical approach to identify proteins with high affinity for choline-containing pneumococcal cell walls has allowed the localization, cloning and sequencing of a gene (lytC ) coding for a protein that degrades the cell walls of Streptococcus pneumoniae. The lytC gene is 1506 bp long and encodes a protein (LytC) of 501 amino acid residues with a predicted M r of 58 682. LytC has a cleavable signal peptide, as demonstrated when the mature protein (about 55 kDa) was purified from S. pneumoniae. Biochemical analyses of the pure, mature protein proved that LytC is a lysozyme. Combined cell fractionation and Western blot analysis showed that the unprocessed, primary product of the lytC gene is located in the pneumococcal cytoplasm whereas the processed, active form of LytC is tightly bound to the cell envelope. In vivo experiments demonstrated that this lysozyme behaves as a pneumococcal autolytic enzyme at 30 degrees C. The DNA region encoding the 253 C-terminal amino acid residues of LytC has been cloned and expressed in Escherichia coli. The truncated protein exhibits a low, but significant, choline-independent lysozyme activity, which suggests that this polypeptide adopts an active conformation. Self-alignment of the N-terminal part of the deduced amino acid sequence of LytC revealed the presence of 11 repeated motifs. These results strongly suggest that the lysozyme reported here has changed the general building plan characteristic of the choline-binding proteins of S. pneumoniae and its bacteriophages, i.e. the choline-binding domain and the catalytic domain are located, respectively, at the N-terminal and the C-terminal moieties of LytC. This work illustrates the natural versatility exhibited by the pneumococcal genes coding for choline-binding proteins to fuse separated catalytic and substrate-binding domains and create new and functional mature proteins.  相似文献   

3.
We have cloned a DNA that is complementary to the messenger RNA that encodes porcine pancreatic elastase 1 from pancreas using rat pancreatic elastase 1 cDNA as a probe. This complementary DNA contains the entire protein coding region of 798 nucleotides which encodes an elastase of 266 amino acids, and 22 and 136 nucleotides of the 5' and 3'-untranslated sequences. When this deduced amino acid sequence was compared with known amino acid sequences, a carboxy-terminal 240 amino acids long peptide was found to be identical with a mature form of porcine pancreatic elastase 1, except for two amino acids. The porcine enzyme contains the same number of amino acid residues as the rat enzyme, and their amino acid sequences are 85% homologous. Taking the above findings together, we conclude that the cloned cDNA encodes a mature enzyme of 240 amino acids including a leader and activation peptide of 26 amino acids. We expressed the cloned porcine pancreatic elastase 1 cDNA in E. coli as a lac-fused protein. The resulting fused protein showed enzymatic activity and immunoreactivity toward anti-elastase serum.  相似文献   

4.
5.
6.
We have isolated and sequenced a cDNA clone which contains the entire coding sequence of the precursor to a subunit of wheat phosphoribulokinase (PRKase). (The enzyme is a homodimer). The cDNA contains 1533 bp and has an open reading frame of 1212 nucleotides. This encodes a protein with an amino-terminal transit sequence of 53 amino acids, while the part that forms the mature protein contains 351 amino acids and has a molecular weight of 39,200 daltons. A comparison of the wheat amino acid sequence with that already known for the mature protein of spinach reveals that there are identical residues in 86% of the positions but their transit peptides differ substantially from one another. The mature wheat and spinach proteins are identical in a segment of over 50 amino acids near the amino-terminus which is the region believed to be involved in ATP binding and in regulation by light of the catalytic activity of the enzyme. We further demonstrate that the expression of PRKase mRNA in wheat leaves is regulated in a developmental, tissue-specific and light dependent manner. We also show that the light-induced increase in the steady-state levels of this mRNA is dependent on the developmental stage of the leaf.  相似文献   

7.
The gene for Aspergillus niger glucose oxidase (EC 1.1.3.4) has been cloned from both cDNA and genomic libraries using oligonucleotide probes derived from the amino acid sequences of peptide fragments of the enzyme. The mature enzyme consists of 583 amino acids and is preceded by a 22-amino acid presequence. No intervening sequences are found within the coding region. The enzyme contains 3 cysteine residues and 8 potential sites for N-linked glycosylation. The protein shows 26% identity with alcohol oxidase of Hansenuela polymorpha, and the N terminus has a sequence homologous with the AMP-binding region of other flavoenzymes such as p-hydroxybenzoate hydroxylase and glutathione reductase. Recombinant yeast expression plasmids have been constructed containing a hybrid yeast alcohol dehydrogenase II-glyceraldehyde-3-phosphate dehydrogenase promoter, either the yeast alpha-factor pheromone leader or the glucose oxidase presequence, and the mature glucose oxidase coding sequence. When transformed into yeast, these plasmids direct the synthesis and secretion of between 75 and 400 micrograms/ml of active glucose oxidase. Analysis of the yeast-derived enzymes shows that they are of comparable specific activity and have more extensive N-linked glycosylation than the A. niger protein.  相似文献   

8.
Complete nucleotide sequence of ovine alpha-lactalbumin mRNA   总被引:1,自引:0,他引:1  
The nucleotide sequence of ovine alpha-lactalbumin mRNA has been determined by chemical sequencing of two cDNA recombinant plasmids and a primer extension product. Ovine alpha-lactalbumin mRNA contains 723 nucleotides (excluding the poly(A) tail), with a 5' non-coding region of 26 nucleotides, followed by the 426 nucleotides of the coding region which determines a sequence signal of 19 amino acid residues and the 123 amino acid residues of mature alpha-lactalbumin. The coding region is followed by a 3' untranslated sequence of 271 nucleotides. The derived amino acid sequence of ovine pre-alpha-lactalbumin differs from that of its bovine counterpart by 8 amino acid substitutions, all but one originating from single mutations. Comparison of sequences of guinea pig, rat and human alpha-lactalbumin mRNAs with their ovine and bovine counterparts has revealed that these molecules have rapidly evolved. The highest degree of conservation was observed in the region coding for the mature protein and corresponds essentially to sequences which interact with UDP-galactosyltransferase and Ca2+ ions.  相似文献   

9.
The protein product corresponding to the gene located in the region of the coliphage Ifl genome shown to contain the code for the single-stranded DNA (ssDNA)-binding proteins of all filamentous phages so far studied has been isolated from infected bacterial cells and its amino acid sequence determined. The mature protein contains 95 amino acids (calculated molecular mass 10553 Da). Its sequence corresponds to that predicted from the DNA sequence but lacks the initiating methionine residue. Although there is little direct sequence homology between the phage Ifl protein and the ssDNA-binding proteins of the other filamentous phages that have been studied, computer-based comparisons of various physical and structural parameters showed that the phage Ifl protein contains a domain that is closely related to domains in the coliphage T4 gene 32 protein and the Pseudomonas phage Pfl ssDNA-binding protein and suggest that the Ifl protein does have a ssDNA-binding function although we were unable to show this directly.  相似文献   

10.
11.
We have cloned a DNA complementary to the messenger RNA encoding the precursor of ornithine transcarbamylase from rat liver. This complementary DNA contains the entire protein coding region of 1062 nucleotides and 86 nucleotides of 5'- and 298 nucleotides of 3'-untranslated sequences. The predicted amino acid sequence has been confirmed by extensive protein sequence data. The mature rat enzyme contains the same number of amino acid residues (322) as the human enzyme and their amino acid sequences are 93% homologous. The rat and human amino-terminal leader sequences of 32 amino acids, on the other hand, are only 69% homologous. The rat leader contains no acidic and seven basic residues compared to four basic residues found in the human leader. There is complete sequence homology (residues 58-62) among the ornithine and aspartate transcarbamylases from E. coli and the rat and human ornithine transcarbamylases at the carbamyl phosphate binding site. Finally, a cysteine containing hexapeptide (residues 268-273), the putative ornithine binding site in Streptococcus faecalis, Streptococcus faecium, and bovine transcarbamylases, is completely conserved among the two E. coli and the two mammalian transcarbamylases.  相似文献   

12.
A cloned cDNA containing the entire coding sequence for the long-chain S-acyl fatty acid synthetase thioester hydrolase (thioesterase I) component as well as the 3'-noncoding region of the fatty acid synthetase has been isolated using an expression vector and domain-specific antibodies. The coding region was assigned to the thioesterase I domain by identification of sequences coding for characterized peptide fragments, amino-terminal analysis of the isolated thioesterase I domain and the presence of the serine esterase active-site sequence motif. The thioesterase I domain is 306 amino acids long with a calculated molecular mass of 33,476 daltons; its DNA is flanked at the 5'-end by a region coding for the acyl carrier protein domain and at the 3'-end by a 1,537-base pairs-long noncoding sequence with a poly(A) tail. The thioesterase I domain exhibits a low, albeit discernible, homology with the discrete medium-chain S-acyl fatty acid synthetase thioester hydrolases (thioesterase II) from rat mammary gland and duck uropygial gland, suggesting a distant but common evolutionary ancestry for these proteins.  相似文献   

13.
A full-length cDNA clone encoding osteocalcin from the bullfrog, Rana catesbeiana (bone Gla-protein, BGP) has been isolated, and the complete coding sequence for the 100-amino-acid pre-pro-osteocalcin protein was determined. The amino acid sequence of Rana catesbeiana osteocalcin, especially the mature 49-amino acid sequence, is closer to the mammalian than to the fish, Sparus osteocalcin. Rana mature osteocalcin has a similarity of 67% with human or 59% with rat osteocalcin, and only 42% with fish mature osteocalcin. The 51-amino-acid pre-pro-peptide contains the expected hydrophobic leader sequence and the dibasic Arg-Arg sequence preceding the NH2-terminal Ser of the mature 49-amino-acid Rana osteocalcin. The pro-peptide sequence also contains the expected motif of polar and hydrophobic residues, which targets vitamin K-dependent gamma-carboxylation of three specific Glu residues at positions 17, 21, and 24 in the mature protein. At the native protein expression levels, extraction from Rana cortical bone in the presence of protease inhibitor cocktail resulted in the isolation of two distinct forms of osteocalcin, P-1 and P-2, with a 3:2 distribution. Using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF-MS) and amino acid sequence analysis of the N-terminal domain, we confirmed that P-1 is the intact 49-residue osteocalcin with N-terminal SNLRNAVFG., and that P-2 lacks four amino acids from the N-terminus, (NAVFG.). These results demonstrate the existence of a form of osteocalcin lacking four N-terminal amino acids in Rana bone, and that mature Rana osteocalcins remained highly conserved in their molecular evolution, especially with respect to the conservation of the C-terminal domain (residues 14-49).  相似文献   

14.
The nucleotide sequence of part of the late region of the polyoma virus genome was determined. It contains coding information for the major capsid protein VP1 and the C-terminal region of the minor proteins VP2 and VP3. In the sequence with the same polarity as late mRNA's, all coding frames are blocked by termination codons in a region around 48 units on the physical map. This is the region where the N-terminus of VP1 and the C-termini of VP2 and VP3 have been located (T. Hunter and W. Gibson, J. Virol. 28:240-253, 1978; S. G. Siddell and A. E. Smith, J. Virol. 27:427-431, 1978; Smith et al., Cell 9:481-487, 1976). There are two long uninterrupted coding frames in the late region of polyoma virus DNA. One lies at the 5' end of the sequence and contains potential coding sequences for VP2 and VP3. The other contains 383 consecutive sense codons starting with the ATG at nucleotide position 1,218, extends from 47.5 to 25.8 units counterclockwise on the physical map, and is located where the VP1 gene has been mapped. The VP1 gene overlaps the genes for proteins VP2/VP3 by 32 nucleotides and uses a different coding frame. From the DNA sequence, the amino acid sequence of VP1 was predicted. The proposed VP1 sequence is in good agreement with other data, namely, with the partial N-terminal amino acid sequence and the total amino acid composition. The VP1 coding frame terminates with a TAA codon at 25.8 map units. This is followed by an AATAAA sequence, which may act as a processing signal for the viral late mRNA's. When both nucleotide and amino acid sequences are compared with their counterparts in the related simian virus 40, extensive homologies are found over the entire region of the two viral genomes. Maximum homology appears to occur in those regions which code for the C-termini of the VP1 proteins. The overlap region of VP1 with VP2/VP3 of polyoma virus is shorter by 90 nucleotides than is that of simian virus 40 and shows very limited homology with the simian virus 40 sequence. This leads to the suggestion that the overlap segments of both viruses have been freed from stringency imposed on drifting during evolution and that proteins VP2 and VP3 of polyoma virus may have been truncated by the appearance of a termination codon within the sequence.  相似文献   

15.
cDNA encoding the human homologue of mouse APEX nuclease was isolated from a human bone-marrow cDNA library by screening with cDNA for mouse APEX nuclease. The mouse enzyme has been shown to possess four enzymatic activities, i.e., apurinic/apyrimidinic endonuclease, 3'-5' exonuclease, DNA 3'-phosphatase and DNA 3' repair diesterase activities. The cDNA for human APEX nuclease was 1420 nucleotides long, consisting of a 5' terminal untranslated region of 205 nucleotide long, a coding region of 954 nucleotide long encoding 318 amino acid residues, a 3' terminal untranslated region of 261 nucleotide long, and a poly(A) tail. Determination of the N-terminal amino acid sequence of APEX nuclease purified from HeLa cells showed that the mature enzyme lacks the N-terminal methionine. The amino acid sequence of human APEX nuclease has 94% sequence identity with that of mouse APEX nuclease, and shows significant homologies to those of Escherichia coli exonuclease III and Streptococcus pneumoniae ExoA protein. The coding sequence of human APEX nuclease was cloned into the pUC18 SmaI site in the control frame of the lacZ promoter. The construct was introduced into BW2001 (xth-11, nfo-2) strain and BW9109 (delta xth) strain cells of E. coli. The transformed cells expressed a 36.4 kDa polypeptide (the 317 amino acid sequence of APEX nuclease headed by the N-terminal decapeptide derived from the part of pUC18 sequence), and were less sensitive to methylmethanesulfonate and tert-butyl-hydroperoxide than the parent cells. The N-terminal regions of the constructed protein and APEX nuclease were cleaved frequently during the extraction and purification processes of protein to produce the 31, 33 and 35 kDa C-terminal fragments showing priming activities for DNA polymerase on acid-depurinated DNA and bleomycin-damaged DNA. Formation of such enzymatically active fragments of APEX nuclease may be a cause of heterogeneity of purified preparations of mammalian AP endonucleases. Based on analyses of the deduced amino acid sequence and the active fragments of APEX nuclease, it is suggested that the enzyme is organized into two domains, a 6 kDa N-terminal domain having nuclear location signals and 29 kDa C-terminal, catalytic domain.  相似文献   

16.
The gene for the small subunit (SS) of ribulose-1,5-bisphosphate carboxylase/oxygenase from a cyanobacterium, Anacystis nidulans 6301, has been cloned and subjected to sequence analysis. The SS coding region is located close to and downstream from the large subunit (LS) coding region on the same DNA strand. The spacer region between the LS and the SS coding regions contains 93 base pairs (bp), and has no promoter-like sequences. The coding region of A. nidulans SS gene contains 333 bp (111 codons). The deduced amino acid sequence of the A. nidulans SS protein shows 40% homology with those of higher plants.  相似文献   

17.
《Gene》1996,179(2):279-286
A 4040-bp cDNA was cloned from a human placenta library by screening with a polymerase chain reaction-amplified fragment. The fragment was generated from the library using primers corresponding to conserved sequences encompassing the topa quinone (TPQ) cofactor sites of the copper-containing proteins, bovine serum amine oxidase (BSAO) and human kidney diamine oxidase (DAO). The cloned cDNA contains a coding sequence from positions 161 to 2449. Between bases 2901 and 2974, in a very long 1591-bp 3′-untranslated region, there is a G/A-rich region in the minus strand, which contains a (AGG)5 tandem repeat. The human placenta cDNA sequence and its translated amino acid sequence are 84% and 81% identical to the corresponding BSAO sequences, while the identities for the placenta sequences and those for human kidney DAO are 60% and 41%, respectively. The TPQ consensus nucleotide and protein sequences are identical for the placenta enzyme and BSAO, but the corresponding sequences for human kidney DAO are nonidentical. Three His residues that have been identified as Cu(II) ligands in other amine oxidases are conserved in the human placenta amine oxidase protein sequence. It was concluded that the placenta cDNA open-reading frame codes for a copper-containing, TPQ-containing monoamine oxidase. A putative 19-amino acid signal peptide was identified for human placenta amine oxidase. The resulting mature protein would be composed of 744 amino acids, and would have a Mr of 82 525. Comparison of the human placenta amine oxidase with DNA sequences found in GenBank suggests that the gene for this enzyme is located in the q21 region of human chromosome 17, near the BRCA1 gene.  相似文献   

18.
19.
The nucleotide sequences coding for murine complement component C3 have been determined from a cloned genomic DNA fragment and several overlapping cloned complementary DNA fragments. The amino acid sequence of the protein was deduced. The mature beta and alpha subunits contain 642 and 993 amino acids respectively. Including a 24 amino acid signal peptide and four arginines in the beta-alpha transition region, which are probably not contained in the mature protein, the unglycosylated single chain precursor protein preproC3 would have a molecular mass of 186 484 Da and consist of 1663 amino acid residues. The C3 messenger RNA would be composed of a 56 +/- 2 nucleotide long 5' non-translated region, 4992 nucleotides of coding sequence, and a 3' non-translated region of 39 nucleotides, excluding the poly A tail. The beta chain contains only three cysteine residues, the alpha chain 24, ten of which are clustered in the carboxy terminal stretch of 175 amino acids. Two potential carbohydrate attachment sites are predicted for the alpha chain, none for the beta chain. From a comparison with human C3 cDNA sequence (of which over 80% has been determined) an extensive overall sequence homology was observed. Human and murine preproC3 would be of very similar length and share several noteworthy properties: the same order of the subunits in the precursor, the same basic residue multiplet in the beta-alpha transition region, and a glutamine residue in the thioester region. The equivalent position of the known factor I cleavage sites in human C3 alpha could be located in the murine C3 alpha chain and the size and sequence of the resulting peptide were deduced. A comparison of the amino acid sequences of murine C3 and human alpha 2-macroglobulin is given. Several areas of strong sequence homology are observed, and we conclude that the two genes must have evolved from a common ancestor.  相似文献   

20.
The sequence of human serum albumin cDNA and its expression in E. coli   总被引:41,自引:6,他引:35       下载免费PDF全文
A recombinant plasmid has been constructed which contains the mature protein coding region of the human serum albumin (HSA) gene. Bacteria containing this plasmid synthesize HSA protein under control of the E. coli trp promoter-operator. The DNA sequence and predicted protein sequence of HSA were determined from the cDNA plasmid and are compared to existing data obtained from direct protein sequencing. The DNA sequence predicts a mature protein of 585 amino acids preceded by a 24 amino acid "prepro" peptide.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号