Similar articles
20 similar articles found (search time: 31 ms)
1.
Genomic DNA sequence for human C-reactive protein   Total citations: 12, self-citations: 0, citations by others: 12
The gene for the prototype acute phase reactant, C-reactive protein, has been isolated from two lambda phage libraries containing inserted human DNA fragments using synthetic oligonucleotide probes. Nucleotide sequence analysis indicates that after coding for a signal peptide of 18 amino acids and the first two amino acids of the mature protein, there is an intron of 278 base pairs followed by the nucleotide sequence for the remaining 204 amino acids. The intron is unusual in that it contains on the positive strand a poly(A) stretch 16 nucleotides long and a poly(GT) region 30 nucleotides long which could adopt the Z-form of DNA. The nucleotide sequence reported here confirms the amino acid sequence of mature C-reactive protein as originally reported except that it codes for an additional 19 amino acids beginning at position 62. Thus DNA sequence analysis predicts that the mature protein consists of 206 amino acids rather than 187 as originally reported. The mRNA cap site is located 104 nucleotides from the start of the signal peptide and there is a 3' noncoding region 1.2 kilobase pairs in length. The gene has a typical promoter containing the sequences TATAAAT and CAAT 29 and 81 base pairs upstream, respectively, of the cap site.

2.
The nucleotide sequence of the gene that codes for the major inner capsid protein of the simian rotavirus SA11 has been determined. A DNA copy of mRNA from gene 6 was cloned in the E. coli plasmid pBR322. The full-length gene is 1357 nucleotides long with a 5'-noncoding region of 23 nucleotides and a 3'-noncoding region of 140 nucleotides. The gene contains a single, long, open reading frame of 1194 nucleotides capable of coding for a protein of 397 amino acids with a molecular weight of 44,816. The predicted protein product is relatively proline-rich with a net charge at neutral pH of -3.5. One stretch of 53 amino acids (encoded by nucleotides 327-485) is basic.
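The lengths quoted in this abstract can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original paper; the constant names are hypothetical, and it assumes the 1194-nucleotide open reading frame figure includes the stop codon, which is what the quoted numbers imply.

```python
# Figures quoted in the abstract for SA11 gene 6 (all in nucleotides
# unless noted). Constant names are illustrative only.
GENE_LENGTH = 1357
FIVE_PRIME_NONCODING = 23
OPEN_READING_FRAME = 1194
THREE_PRIME_NONCODING = 140
PROTEIN_RESIDUES = 397                    # amino acids
BASIC_START, BASIC_END = 327, 485         # inclusive nucleotide positions
BASIC_RESIDUES = 53                       # amino acids


def codon_count(nucleotides: int) -> int:
    """Return the number of whole codons, rejecting lengths not divisible by 3."""
    if nucleotides % 3 != 0:
        raise ValueError(f"{nucleotides} nt is not a whole number of codons")
    return nucleotides // 3


# The three segments should add up to the full-length gene.
segments_total = FIVE_PRIME_NONCODING + OPEN_READING_FRAME + THREE_PRIME_NONCODING

# Assumption: the ORF figure counts the stop codon, so one codon is
# subtracted to obtain the number of amino acids.
residues_from_orf = codon_count(OPEN_READING_FRAME) - 1

# The basic stretch is given as an inclusive nucleotide range.
basic_stretch_residues = codon_count(BASIC_END - BASIC_START + 1)

print(segments_total == GENE_LENGTH)              # segments match gene length
print(residues_from_orf == PROTEIN_RESIDUES)      # ORF matches protein length
print(basic_stretch_residues == BASIC_RESIDUES)   # range matches stretch length
```

Under these assumptions all three comparisons hold, so the figures in the abstract are mutually consistent.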

3.
Two DNA molecules complementary to human liver mRNA coding for the alpha-subunit of the stimulatory regulatory component Gs of adenylyl cyclase were cloned. One of the two forms is a full-length cDNA of 1614 nucleotides plus a poly(A) tail of 59 nucleotides. The deduced sequence of 394 amino acids encoded by its open reading frame is essentially identical to that of the alpha-subunits of Gs identified by molecular cloning from bovine adrenals, bovine brain and rat brain. Two independent clones of the other type of cDNA were isolated. Both were incomplete, beginning within the open reading frame coding for the alpha s polypeptide. One codes for amino acids 5 through 394 and the other for amino acids 48 through 394 of the above described cDNA of 1614 nucleotides, and both have the identical 3'-untranslated sequence. They differ from the first cDNA, however, in that they lack a stretch of 42 nucleotides (numbers 214 through 255) and have nucleotides 213 (G) and 256 (G) replaced with C and A, respectively. This results in a predicted amino acid composition of another alpha-subunit of Gs that is shorter by 14 amino acids and contains two substitutions (Asp for Glu and Ser for Gly) at the interface between the deletion and the unchanged sequence. We call the smaller subunit alpha s1 and the larger alpha s2. This is the first demonstration of a structural heterogeneity in alpha s subunits that is due to a difference in amino acid sequence.
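The relationship between the missing nucleotide stretch and the shorter subunit can be sketched as below. This is an illustrative calculation only; the variable names are hypothetical, and the length of the shorter form is derived here from the quoted figures rather than stated in the abstract.

```python
# Figures quoted in the abstract.
FULL_LENGTH_RESIDUES = 394                   # amino acids encoded by the 1614-nt cDNA
DELETION_START, DELETION_END = 214, 255      # inclusive nucleotide numbers

# Length of the missing stretch, counting both end positions.
deleted_nucleotides = DELETION_END - DELETION_START + 1

# A deletion that is a multiple of three removes whole codons and
# leaves the downstream reading frame unchanged.
preserves_reading_frame = deleted_nucleotides % 3 == 0
deleted_residues = deleted_nucleotides // 3

# Derived value (not given in the abstract): length of the smaller subunit.
shorter_form_residues = FULL_LENGTH_RESIDUES - deleted_residues

print(deleted_nucleotides, deleted_residues, shorter_form_residues)
```

Because 42 is divisible by three, the deletion removes 14 whole codons without shifting the frame, which agrees with the statement that the smaller subunit is shorter by 14 amino acids.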

4.
The mRNA of a putative small hydrophobic protein (SH) of mumps virus was identified in mumps virus-infected Vero cells, and its complete nucleotide sequence was determined by sequencing the genomic RNA and cDNA clones and partial sequencing of mRNA. The SH mRNA is 310 nucleotides long excluding the poly(A) and contains a single open reading frame encoding a protein of 57 amino acids with a calculated molecular weight of 6,719. The predicted protein is highly hydrophobic and contains a stretch of 25 hydrophobic amino acids near the amino terminus which could act as a membrane anchor region. There is no homology between the putative SH protein of mumps virus and the SH protein of simian virus 5, even though the SH genes are located in the same locus in the corresponding genome. One interesting observation is that the hydrophobic domain of simian virus 5 SH protein is at the carboxyl terminus, whereas that of mumps virus putative SH protein is near the amino terminus.

5.
The nucleotide sequence of a complete chicken delta-crystallin cDNA   Total citations: 8, self-citations: 2, citations by others: 6
The nucleotide sequence of a full length cDNA of delta-crystallin mRNA from chicken lens has been determined using a delta-crystallin cDNA clone (pB delta 11), which represents the mRNA sequence of 1530 nucleotides from the poly(A) junction but does not contain the 5'-terminal sequence of 44 nucleotides of the mRNA. The 5'-terminal sequence of the mRNA, absent in the cDNA clone, has been determined with a stretch of cDNA sequence by the primer extension procedure. The amino acid sequence deduced from the nucleotide sequence is consistent with the amino acid sequences of several tryptic peptides, the total amino acid composition, and the mol. wt. of delta-crystallin estimated by SDS-polyacrylamide gel electrophoresis. The computer-assisted analysis predicts high alpha-helical content throughout the polypeptide. Sequence analyses have revealed that gene 1 encodes the mRNA from which the cDNA clone was derived.

6.
7.
The amino acid sequence of the human respiratory syncytial (RS) virus nucleocapsid (NC) protein, deduced from the DNA sequence of a recombinant plasmid, is presented. The cDNA plasmid (pRSB11) has 1412 bp of RS viral NC sequence and lacks six nucleotides of the 5' end of the mRNA. There is a single long open reading frame encoding 467 amino acids. This 51,540-dalton protein is rich in basic amino acids and has no homologies with other known viral capsid proteins.

8.
Skin of Xenopus laevis contains relatively large quantities of thyrotropin releasing hormone (TRH). Total mRNA isolated from skin was cloned in the plasmid pUC8. Among 1400 cDNA clones, one was found with an insert of 478 nucleotides coding for the amino-terminal part of prepro-TRH. This clone was detected using a mixture of two synthetic undecanucleotides for colony hybridization. The single open reading frame starts with a methionine residue and a stretch of hydrophobic amino acids, as is typical for signal peptides, and terminates at the poly(C) tail without a stop codon. The deduced polypeptide of 123 amino acids contains three copies of the sequence Lys-Arg-Gln-His-Pro-Gly-Lys Arg-Arg and a fourth incomplete copy at the carboxyl end. Typical pro-hormone processing at this sequence would yield pGlu-His-Pro-NH2, i.e., TRH. It is concluded that the cloned part of the mRNA codes for prepro-TRH and that the TRH precursor from skin of X. laevis is a polyprotein containing at least four copies of the end product in its amino acid sequence.

9.
The cDNA clones encoding the precursor form of glycinin A3B4 subunit have been identified from a library of soybean cotyledonary cDNA clones in the plasmid pBR322 by a combination of differential colony hybridizations, and then by immunoprecipitation of hybrid-selected translation product with A3-mono-specific antiserum. A recombinant plasmid, designated pGA3B41425, from one of six clones covering codons for the NH2-terminal region of the subunit was sequenced, and the amino acid sequence was inferred from the nucleotide sequence, which showed that the mRNA codes for a precursor protein of 516 amino acids. Analysis of this cDNA also showed that it contained 1786 nucleotides of mRNA sequence with a 5'-terminal nontranslated region of 46 nucleotides, a signal peptide region corresponding to 24 amino acids, an A3 acidic subunit region corresponding to 320 amino acids followed by a B4 basic subunit region corresponding to 172 amino acids, and a 3'-terminal nontranslated region of 192 nucleotides, which contained two characteristic AAUAAA sequences that ended 110 nucleotides and 26 nucleotides from a 3'-terminal poly(A) segment, respectively. Our results confirm that glycinin is synthesized as precursor polypeptides which undergo post-translational processing to form the nonrandom polypeptide pairs via disulfide bonds. The inferred amino acid sequence of the mature basic subunit, B4, was compared to that of the basic subunit of pea legumin, Leg Beta, which contained 185 amino acids. Using an alignment that permitted a maximum homology of amino acids, it was found that overall 42% of the amino acid positions are identical in both proteins. These results led us to conclude that both storage proteins have a common ancestor.

10.
The complete nucleotide sequence of the neuraminidase (NA) gene of WSN/33 (H1N1) virus was determined. The entire sequence was derived from the insert of cDNA clones, except the last 20 nucleotides, which were determined by primer extension. The WSN NA gene contained 1,409 nucleotides beginning at the 5' end (sense strand), with an untranslated region of 19 nucleotides followed by 1,359 nucleotides coding for 453 amino acids and finally ending with a 31-nucleotide untranslated region at the 3' terminus. The amino acid sequence of WSN NA, as deduced from the DNA sequence, showed the presence of a stretch of 29 amino acids (7 to 35) enriched in hydrophobic amino acids, which may anchor the protein into the viral or cellular membrane. When compared with the PR8 NA sequence, WSN NA appeared to possess a similar structure, including the identical location of all cysteine and proline residues. However, WSN NA contained only three of the five potential glycosylation sites present in PR8 NA. Additionally, WSN NA contained a substitution of a five-amino acid sequence for a six-amino acid sequence in PR8 NA. The possible significance of these sequence changes in the primary structure of WSN NA in the unique role of WSN NA as a virulence factor in mouse brain and MDBK cells is discussed.
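As a quick consistency check on the segment lengths quoted for the WSN NA gene, the sketch below adds up the regions and converts the coding length to codons. The names are hypothetical and the calculation is illustrative, not taken from the paper. Since the coding figure equals exactly three times the residue count, the stop codon is presumably counted within the 3' untranslated region; that reading is an assumption.

```python
# Figures quoted in the abstract for the WSN NA gene.
GENE_LENGTH = 1409               # nucleotides
FIVE_PRIME_UNTRANSLATED = 19     # nucleotides
CODING_LENGTH = 1359             # nucleotides
THREE_PRIME_UNTRANSLATED = 31    # nucleotides
PROTEIN_RESIDUES = 453           # amino acids
HYDROPHOBIC_FIRST, HYDROPHOBIC_LAST = 7, 35   # inclusive residue positions

# The three regions should account for the whole gene.
regions_total = FIVE_PRIME_UNTRANSLATED + CODING_LENGTH + THREE_PRIME_UNTRANSLATED

# Assumption: the coding figure excludes the stop codon, so every
# codon corresponds to one amino acid.
coding_codons, leftover = divmod(CODING_LENGTH, 3)

# Length of the hydrophobic stretch, counting both end residues.
hydrophobic_length = HYDROPHOBIC_LAST - HYDROPHOBIC_FIRST + 1

print(regions_total == GENE_LENGTH)
print(leftover == 0 and coding_codons == PROTEIN_RESIDUES)
print(hydrophobic_length)
```

The regions sum to the stated gene length, the coding region is a whole number of codons matching 453 residues, and residues 7 to 35 span 29 positions, as the abstract states.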

11.
12.
The complete nucleotide sequences of the vesicular stomatitis virus mRNAs encoding the glycoprotein (G) and the matrix protein (M) have been determined from cDNA clones that contain the complete coding sequences from each mRNA. The G protein mRNA is 1,665 nucleotides long, excluding polyadenylic acid, and encodes a protein of 511 amino acids including a signal peptide of 16 amino acids. G protein contains two large hydrophobic domains, one in the signal peptide and the other in the transmembrane segment near the COOH terminus. Two sites of glycosylation are predicted at amino acid residues 178 and 335. The close correspondence of the positions of these sites with the reported timing of the addition of the two oligosaccharides during synthesis of G suggests that glycosylation occurs as soon as the appropriate asparagine residues traverse the membrane of the rough endoplasmic reticulum. The mRNA encoding the vesicular stomatitis virus M protein is 831 nucleotides long, excluding polyadenylic acid, and encodes a protein of 229 amino acids. The predicted M protein sequence does not contain any long hydrophobic or nonpolar domains that might promote membrane association. The protein is rich in basic amino acids and contains a highly basic amino terminal domain. Details of construction of the nearly full-length cDNA clones are presented.

13.
Rat apolipoprotein E mRNA. Cloning and sequencing of double-stranded cDNA   Total citations: 21, self-citations: 0, citations by others: 21
A 900-base pair clone corresponding to rat liver apolipoprotein E (apo-E) mRNA, and containing a 3'-terminal poly(A) segment, was identified from a library of rat liver cDNA clones in the plasmid pBR322 by specific hybrid selection and translation of mRNA. A restriction endonuclease DNA fragment from this recombinant plasmid was used to clone the 5'-terminal region of the apo-E mRNA by primed synthesis of cDNA. A portion of the double-stranded cDNA corresponding to the 3'-terminal region of apo-E mRNA was subcloned into the bacteriophage M13mp7 and employed as a template for the synthesis of a radioactively labeled cDNA hybridization probe. This cDNA probe was used in an RNA-blot hybridization assay that showed the length of the apo-E mRNA to be about 1200 nucleotides. The hybridization assay also demonstrated that apo-E mRNA is present in rat intestine, but at about a 100-fold lower level than that of the rat liver. The nucleotide sequence of rat liver apo-E mRNA was determined from the cloned, double-stranded cDNAs. The amino acid sequence of rat liver apo-E was inferred from the nucleotide sequence, which showed that the mRNA codes for a precursor protein of 311 amino acids. A comparison to the NH2-terminal amino acid sequence of rat plasma apo-E indicated that the first 18 amino acids of the primary translation product are not present in the mature protein and are probably removed during co-translational processing. The coding region was flanked by a 3'-untranslated region of 109 nucleotides, which contained a characteristic AAUAAA sequence that ended 13 nucleotides from a 3'-terminal poly(A) segment. At the 5'-terminal region of the mRNA, 23 nucleotides of an untranslated region were also determined. The inferred amino acid sequence of mature rat apo-E, which contains 293 amino acids, was compared to the amino acid sequence of human apo-E, which contains 299 amino acids. Using an alignment that permitted a maximum homology of amino acids, it was found that overall, 69% of the amino acid positions are identical in both proteins. The amino acid identities are clustered in two broad domains separated by a short region of nonhomology, an NH2-terminal domain of 173 residues where 80% are identical, and a COOH-terminal domain of 84 residues where 70% are identical. These two domains may be associated with specific functional roles in the protein.

14.
Nucleotide sequence of rat alpha 1-acid glycoprotein messenger RNA   Total citations: 9, self-citations: 0, citations by others: 9
The complete nucleotide sequence of rat alpha 1-acid glycoprotein (alpha 1-AGP) mRNA has been determined from cloned double-stranded cDNA. The coding portion of the mRNA was bounded at the ends by a 5'-untranslated region of 35 nucleotides in length and a 3'-untranslated region of 119 nucleotides in length. The 3'-untranslated region contains the characteristic AAUAAA sequence ending 18 nucleotides from the 3'-terminal poly(A) segment. The 5'-region of the mRNA contains two in-phase AUG codons separated by 12 nucleotides. Comparison with the known NH2-terminal amino acid sequence of serum rat alpha 1-AGP suggests that the primary translation product of the mRNA contains an additional 14 or 18 amino acids that are not present in the mature form of the protein, which contains 187 amino acids. The inferred amino acid sequence of rat alpha 1-AGP and the known amino acid sequence of human alpha 1-AGP have several regions of identity clustered in the NH2-terminal portion of the proteins. The carboxyl-terminal regions show significantly less homology. Six potential asparagine glycosylation sites are found in the rat sequence, and four of these sites are in positions similar to known glycosylation sites in the human protein. Furthermore, three of these potential glycosylation sites are in a region that exhibits extensive amino acid sequence conservation, suggesting that this region may be important for the biological function of alpha 1-AGP.

15.
16.
J. F. Mercer, A. Grimes. FEBS Letters, 1986, 203(2): 185-190
A number of cDNA clones encoding human ceruloplasmin were identified using two mixed oligonucleotide probes. One of these clones was shown by DNA sequence analysis to span from the complete N-terminal leader sequence to 114 amino acids short of the C-terminus. The leader sequence consists of 19 primarily hydrophobic amino acids. Northern blot analysis of RNA from human liver showed two species of ceruloplasmin mRNA: a minor species of 3600 nucleotides and a major one of 4400 nucleotides.

17.
The primary structure of rat ribosomal protein L38.   Total citations: 3, self-citations: 0, citations by others: 3
The amino acid sequence of the rat 60S ribosomal protein L38 was deduced from the sequence of nucleotides in three recombinant cDNAs. Ribosomal protein L38 has 69 amino acids (the NH2-terminal methionine is removed after translation of the mRNA) and has a molecular weight of 8,081. Hybridization of the cDNA to digests of nuclear DNA suggests that there are 11-13 copies of the L38 gene. The mRNA for the protein is about 450 nucleotides in length.

18.
A cDNA clone (pFD1) derived from Silene pratensis ferredoxin mRNA was selected from a cDNA library using the hybrid released translation technique. Nucleotide sequence analysis showed the cDNA insert to contain the complete coding region of the ferredoxin precursor protein. The ferredoxin precursor has a mol. wt. of 15,300; the transit peptide has a mol. wt. of 5,600. The length of the ferredoxin mRNA was found to be 700 nucleotides whereas the cDNA insert was about 1200 base pairs. S1 nuclease protection experiments showed the ferredoxin-specific DNA to be 660 base pairs in length and to start 39 nucleotides upstream of the ferredoxin coding sequence. Southern blot analysis of genomic DNA revealed the presence of only one fragment with homology to the ferredoxin cDNA probe, so it is probably a single-copy gene. Comparison of the ferredoxin transit sequence with transit sequences of another stromal protein, the small subunit of ribulosebisphosphate carboxylase, showed no apparent homology, except for a stretch of three amino acids near the processing site.

19.
Two species of folate binding protein (FBP), an integral membrane-associated form and a soluble secreted form, have been previously purified from cultured human KB cells. The complete nucleotide sequence of the complementary DNA (cDNA) clone for the coding region of the mature membrane-associated FBP has now been determined, and the deduced amino acid sequence has been computer-analyzed for a prediction of the secondary structure of the protein. The clone has 857 nucleotides of which 678 comprise the coding region for 226 amino acids. The deduced amino acid sequence contains the identical sequence of the published 18 NH2-terminal amino acids of the purified FBP from KB cells and the published partial amino acid sequence of the human milk FBP except for 1 residue. There was also over 90% homology with the published amino acid sequence of the bovine milk FBP. A total of 16 cysteine residues has been conserved in the three proteins indicating that this amino acid may provide a tertiary structure which is required for its ligand binding function. Northern blot analysis using the cDNA probe identified a single band of 1.28-kilobase pair mRNA in KB cells which was 4.7-fold more intense in folate-depleted cells than in normal cells. These results indicate that the membrane FBP and the soluble FBP in the medium are translation products of the same gene. Computer analysis of the deduced amino acid sequence indicates that there is only one stretch of amino acids of sufficient hydrophobicity and length to span the lipid bilayer of the plasma membrane, but it lacked a predictable helical structure. Those regions of the sequence which did have a predictable helical structure lacked sufficient hydrophobicity required for a membrane anchor. Thus, it is likely that the fatty acids previously reported to be present in the membrane-associated FBP from these cells rather than a peptide sequence provide an important membrane anchoring function.

20.
The complete nucleotide sequence of murine beta-glucuronidase (GUS) mRNA has been compiled from three overlapping cloned cDNAs and a single GUS-specific genomic clone. The sequence is composed of 2455 nucleotides, exclusive of the poly(A) tail. The 5' and 3' untranslated regions contain 12 and 499 bases, respectively, with the open reading frame encoding a polypeptide of 648 amino acids (74.2 kDa), including a 22 amino acid signal sequence. The nucleotide and deduced amino acid sequences of murine GUS are compared to those published for rat and human GUS and the results are presented. Murine GUS also shares amino acid sequence identity with Escherichia coli GUS and beta-galactosidase. The complete sequences of murine GUS mRNA and its deduced polypeptide provide a basis from which to study the mechanisms responsible for the well-characterized variation in GUS expression among inbred mouse strains.
