首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Cloning and sequencing of a phospholipase C gene of Clostridium perfringens   总被引:13,自引:0,他引:13  
The gene encoding phospholipase C (alpha-toxin) of Clostridium perfringens was cloned into lambda gt10. The maximal size of the coding region was 1.4 kb and the minimum was 1.1 kb as determined by subcloning into the vector pBR322 and testing for activity. The nucleotide sequence of this region contained a single open reading frame of 1194 bp corresponding to a protein of Mr 45473 with a possible N-terminal signal sequence of 28 amino acids which when removed, would give a mature protein of Mr 42521. This is in good agreement with the reported size of 43 kDa. The coding region has a dG + dC content of 33.7%, and the codon usage displays a pronounced preference for codons with the lowest dG + dC content.  相似文献   

2.
A segment of 1160 nucleotides of the FMDV genome has been sequenced using three overlapping fragments of cloned cDNA from FMDV strain O1K. This sequence contains the coding sequence for the viral capsid protein VP1 as shown by its homology to known and newly determined amino acid sequences from this man antigenic polypeptide of the FMDV virion. The structural gene for VP1 comprises 639 nucleotides which specify a sequence of 213 amino acids for the VP1 protein. The coding sequence is not flanked by start and stop codons which is consistent with the mode of biosynthesis of VP1 by post-translational processing of a polyprotein precursor.  相似文献   

3.
Hybrid molecules containing DNA sequences complementary to bovine pituitary mRNA were constructed in the Pst I site of pBR322 by the dC . dG tailing technique. Recombinant plasmids containing bovine prolactin (bPRL) sequences were amplified in bacteria and identified by hybridization to purified [32P]bPRL cDNA sequences. Nucleotide sequence analysis was performed on the inserts from two of the positive clones. One clone, pBPRL72, contained a 982-base pair insert that included 67 nucleotides of the 5'-untranslated region, the complete coding region of the preprolactin protein (690 nucleotides), and the entire 3'-untranslated region (150 nucleotides) of bPRL mRNA. The nucleotide sequence analysis of clone pBPRL72 predicted the sequence of a 30-amino acid signal peptide and confirmed the published amino acid sequence of the protein with one exception. A comparison of the pBPRL72 cDNA sequence with a second bPRL clone, pBPRL4, revealed four silent nucleotide differences. Three of the base changes occurred in the third position of amino acid codons, and one occurred in the 3'-noncoding region. The sequence polymorphism suggests the existence of alleles or multiple loci for bPRL that do not alter the protein structure.  相似文献   

4.
S Forss  K Strebel  E Beck    H Schaller 《Nucleic acids research》1984,12(16):6587-6601
A continuous 7802 nucleotide sequence spanning the 94% of foot and mouth disease virus RNA between the 5'-proximal poly(C) tract and the 3'-terminal poly(A) was obtained from cloned cDNA, and the total size of the RNA genome was corrected to 8450 nucleotides. A long open reading frame was identified within this sequence starting about 1300 bases from the 5' end of the RNA genome and extending to a termination codon 92 bases from its polyadenylated 3' end. The protein sequence of 2332 amino acids deduced from this coding sequence was correlated with the 260 K FMDV polyprotein. Its processing sites and twelve mature viral proteins were inferred from protein data, available for some proteins, a predicted cleavage specificity of an FMDV encoded protease for Glu/Gly(Thr, Ser) linkages, and homologies to related proteins from poliovirus. In addition, a short unlinked reading frame of 92 codons has been identified by sequence homology to the polyprotein initiation signal and by in vitro translation studies.  相似文献   

5.
Molecular cloning of DNA complementary to bovine growth hormone mRNA   总被引:13,自引:0,他引:13  
We have cloned DNA complementary to mRNA coding for bovine growth hormone (bGH). Double-stranded DNA complementary to bovine pituitary mRNA was inserted into the Pst I site of plasmid pBR322 by the dC x dG tailing technique and amplified in E. coli x 1776. A recombinant plasmid containing bGH cDNA ws identified by hybridization to cloned rat growth hormone cDNA. It contains the entire coding and 3'-untranslated regions and 31 bases in the 5'-untranslated region. Nucleotide sequence analysis determined the sequence of the 26-amino acid signal peptide and confirmed the published amino acid sequence of the secreted hormone at all but 2 residues. Codon usage is nonrandom, with 81.7% of the codons ending in G or C. The nucleotide sequence of bGH mRNA is 83.9% homologous with rat GH mRNA and 76.5% homologous with human GH mRNA, while the respective amino acid sequence homologies are 83.5% and 66.8%.  相似文献   

6.
Poly(A)-containing RNA from the bovine anterior pituitary has been used as a template for the enzymatic synthesis of double-stranded cDNA. The resulting double-stranded cDNA was inserted into the Pst I site of pBR322 with the oligo(dG)-oligo(dC) tailing technique and subsequently cloned in E. coli chi 1776. Clones containing sequences complementary to prolactin mRNA were identified by colony hybridization with partially purified prolactin cDNA. A 250 base pair sequence from one prolactin positive clone was extensively characterized and shown to contain the coding information for amino acids 119-192 of authentic bovine prolactin. The recombinant DNA from this clone was covalently attached to diazotized aminocellulose and used to purify prolactin mRNA from a mixture of mRNAs.  相似文献   

7.
Inosine-5'-monophosphate dehydrogenase, a key enzyme in the regulation of guanine nucleotide biosynthesis, was purified to homogeneity; and a polyclonal antibody directed against the purified protein was used to isolate human and Chinese hamster IMP dehydrogenase cDNA clones. These clones were sequenced and found to contain an open reading frame of a protein containing 514 amino acids. A sequence of 35 amino acids obtained by analysis of the purified protein is identical to a segment of the protein sequence deduced from the IMP dehydrogenase cDNA. The molecular mass of the deduced protein is 56 kDa, which is the observed molecular mass of the purified protein and of the immunoprecipitated in vitro translation product. Comparison of the protein sequences deduced from the human and Chinese hamster cDNA clones indicates only eight amino acid differences, suggesting that IMP dehydrogenase is a highly conserved protein.  相似文献   

8.
Binding of proteins to chloroplast-encoded mRNAs has been shown to be an essential part of chloroplast gene expression. Four nuclear-encoded proteins (38, 47, 55, and 60 kDa) have been identified that bind to the 5'-untranslated region of the Chlamydomonas reinhardtii psbA mRNA with high affinity and specificity. We have cloned a cDNA that represents the 38 kDa protein (RB38) and show that it encodes a novel RNA binding protein that is primarily localized within the chloroplast stroma. RB38 contains four 70 amino acid repeats with a high percentage of basic amino acids, as well as an amino-terminal extension predicted to act as a chloroplast import sequence. We demonstrate that the 38 kDa precursor protein is imported into isolated chloroplasts and interacts with high specificity to uridine-rich regions within the 5'-untranslated region of the psbA mRNA. While database searches have identified hypothetical proteins from several other eukaryotic species with high sequence similarity to the deduced amino acid sequence of RB38, no proteins with homology to RB38 have been biochemically characterized. Bioinformatic analysis of the RB38 sequence, together with structure analysis using circular dichroism and protein modeling, suggests that the 70 amino acid repeats within RB38 are similar in fold to previously identified RNA binding motifs, despite limited sequence homology.  相似文献   

9.
Using Simian-11 rotavirus RNA, a strategy has been developed for the production of full length cloned copies of the genes of double-stranded (dsRNA) viruses. Genomic RNA segments were polyadenylated and reverse transcribed to yield a mixture of full length cDNA copies of both possible polarities. The cDNAs were annealed, filled in to complete any partial copies, tailed and inserted into the PstI site of pBR322 using dG/dC tailing. Cloned rotavirus cDNA gene copies were assigned to genomic RNA segments by Northern hybridization. The complete sequence of gene 8 which codes for NCVP3, a non-structural protein of SA11 rotavirus, was determined from a cloned gene copy. It is 1059 bases in length and has an open reading frame which could code for a protein containing 317 amino acids. The apparent 5' and 3' terminal non coding regions are 46 and 59 bases in length, respectively. The sequence ATGTGACCOH at the 3' end of the plus strand is conserved in four of the eleven genes examined. The cloning procedures used should be generally applicable to viruses with segmented dsRNA genomes.  相似文献   

10.
Human adenosine deaminase. cDNA and complete primary amino acid sequence   总被引:20,自引:0,他引:20  
A previously cloned partial adenosine deaminase cDNA insert (0.8 kilobase) was used to clone additional nucleotide sequences from human HPB ALL cDNA libraries. cDNA encompassing the entire coding, and 3'-untranslated regions as well as nearly all of the 5'-untranslated region was obtained. The complete amino acid sequence of the enzyme deduced from the cDNA sequence and protein sequencing consists of 362 amino acids, excluding the initiator Met, and accounts for Mr = 40,638. Secondary structure predictions assign adenosine deaminase to the alpha/beta class of proteins. Northern blot analysis with a cDNA probe showed adenosine deaminase mRNA to be present in normal to above normal amounts in B-lymphoblasts derived from adenosine deaminase-deficient patients with severe combined immunodeficiency disease. Knowledge of the cDNA and primary amino acid sequence of adenosine deaminase will be pivotal in further defining the genetic abnormality and its functional consequences in adenosine deaminase expression defects.  相似文献   

11.
We have purified apolipoprotein C-II (apo C-II) from cynomolgus monkey plasma, prepared antibody against it and used the antibody to isolate a cDNA containing the complete coding sequence for cynomolgus monkey apo C-11. Sequence analysis indicated that the monkey apo C-11 cDNA was 200 by longer than the human and the difference in size was all in the 5° untranslated region of the mRNA. This was confirmed by Northern analysis of human and monkey RNA. There was an open reading frame in the monkey apo C-11 cDNA sequence encoding a preprotein of 101 amino acids — identical in size to the human protein. The carboxyl terminal 44 amino acids of the protein were 100% homologous to the human apo C-11 amino acid sequence indicating evolutionary conservation of both structure and function. However, the amino terminal 35 amino acids of the protein were only 75% homologous and the amino terminal 19 amino acids were only 58% homologous to the human sequence. The amino acid sequence derived from the nucleotide sequence predicts a more basic protein than the human apo C-11 and this is confirmed by isoelectric focusing and immunoblotting.  相似文献   

12.
A rice (Oryza sativa L.) cDNA clone coding for the cytoplasmic ribosomal protein L5, which associates with 5 S rRNA for ribosome assembly, was cloned and its nucleotide sequence was determined. The primary structure of rice L5, deduced from the nucleotide sequence, contains 294 amino acids and has intriguing features some of which are also conserved in other eucaryotic homologues. These include: four clusters of basic amino acids, one of which may serve as a nucleolar localization signal; three repeated amino acid sequences; the conservation of glycine residues. This protein was identified as the nuclear-encoded cytoplasmic ribosomal protein L5 of rice by sequence similarity to other eucaryotic ribosomal 5 S RNA-binding proteins of rat, chicken, Xenopus laevis, and Saccharomyces cerevisiae. Rice L5 shares 51 to 62% amino acid sequence identity with the homologues. A group of ribosomal proteins from archaebacteria including Methanococcus vanniellii L18 and Halobacterium cutirubrum L13, which are known to be associated with 5 S rRNA, also related to rice L5 and the other eucaryotic counterparts, suggesting an evolutionary relationship in these ribosomal 5 S RNA-binding proteins.  相似文献   

13.
Eukaryotic ribosomes contain an acidic ribosomal protein of about 38 kDa which shows immunological cross-reactivity with the 13 kDa-type acidic ribosomal proteins that are related to L7/L12 of bacterial ribosomes. By using a cDNA clone for 38 kDa-type acidic ribosomal protein A0 from the yeast Saccharomyces cerevisiae, we have cloned a genomic DNA encoding A0 and determined the sequence of 1,614 nucleotides including about 500 nucleotides in the 5'-flanking region. The gene lacks introns and possesses two boxes homologous to upstream activation sequences (UASrpg) in the 5'-flanking region. The amino acid sequence of A0 deduced from the nucleotide sequence shows that A0 shares a highly similar carboxyl-terminal region of about 40 amino acids in length with 13 kDa-type acidic ribosomal proteins, including an identical carboxyl-terminal, DDDMGFGLFD. In the amino-terminal region A0 contains an arginine-rich segment which shows a low but distinct similarity to that of bacterial ribosomal protein L10 through which L10 is thought to bind to 23S rRNA. On the other hand, the carboxyl-terminal half of A0 is enriched with hydrophobic amino acid residues including four pairs of phenylalanine residues which are all conserved in a human homologue.  相似文献   

14.
The cDNA clone encoding human prechymotrypsinogen was isolated from a human pancreas cDNA library and its nucleotide sequence was determined. The sequence consists of a 16 bp 5' non-coding region, a 789 bp amino acid coding region and a 60 bp 3' non-coding region. The predicted product consists of 263 amino acids, including 18 amino acids for a signal peptide and 15 amino acids possible for an activation peptide. Southern blot analyses using the cloned cDNA as a probe revealed that human genomic DNA carries at least two genes that are related to chymotrypsinogen.  相似文献   

15.
The amino acid sequence of the matrix protein of the human respiratory syncytial virus (RS virus) was deduced from the sequence of a cDNA insert in a recombinant plasmid harboring an almost full-length copy of this gene. It specifically hybridized to a single 1,050-base mRNA from infected cells. The recombinant containing 944 base pairs of RS viral matrix protein gene sequence lacked five nucleotides corresponding to the 5' end of the mRNA. The nucleotide sequence of the 5' end of the mRNA was determined by the dideoxy sequencing method and found to be 5' NGGGC, wherein the C residue is one nucleotide upstream of the cloned viral sequence. The initiator ATG codon for the matrix protein is embedded in an AATATGG sequence similar to the canonical PXXATGG sequence present around functional eucaryotic translation initiation codons. There is no conserved sequence upstream of the polyadenylate tail, unlike vesicular stomatitis virus and Sendai virus, in which four nucleotides upstream of the polyadenylate tail are conserved in all genes. There is no equivalent of the eucaryotic polyadenylation signal AAUAAA upstream of the polyadenylate tail. The matrix protein of 28,717 daltons has 256 amino acids. It is relatively basic and moderately hydrophobic. There are two clusters of hydrophobic amino acid residues in the C-terminal third of the protein that could potentially interact with the membrane components of the infected cell. The matrix protein has no homology with the matrix proteins of other negative-strand RNA viruses, implying that RS virus has undergone extensive evolutionary divergence. A second open reading frame potentially encoding a protein of 75 amino acids and partially overlapping the C terminus of the matrix protein was also identified.  相似文献   

16.
The sequence of about 4,500 nucleotides of the internal part of tobacco mosaic virus (TMV)-tomato strain (L) RNA has been newly determined using cloned cDNAs. Together with the previously determined partial sequences at both ends, the entire sequence of the 6,384 nucleotide genome has been completed. The 130K (1,115 amino acids), 180K (1,615 amino acids), 30K (263 amino acids) and coat protein (158 amino acids) cistrons are located at residues 72-3442, 72-4922, 4906-5700, and 5703-6182 on the genome, respectively. Sequence polymorphism was not observed except for heterogeneity in the length of the A cluster near the 3' end. The homology of the nucleotide sequences of TMV-L and TMV-vulgare, a common strain, is about 80% on average. Remarkable differences between them were found in a part of the N-terminal portion of the 130K/180K protein and the C-terminal portion of the 30K protein. A new method for cDNA cloning was developed by which the cDNA of the 5'-terminus of viral RNA can be cloned efficiently.  相似文献   

17.
Rat beta casein cDNA: sequence analysis and evolutionary comparisons.   总被引:10,自引:6,他引:4       下载免费PDF全文
The complete sequence of a 1072 nucleotide rat beta-casein cDNA insertion in the hybrid plasmid pC beta 23 has been determined. Primer extension was employed to determine the sequence of an additional 82 5'-terminal nucleotides in beta-casein mRNA. Rat beta-casein mRNA consists of a 696 nucleotide coding region, flanked by 52 nucleotide 5' and 406 nucleotide 3' noncoding regions, including a 40 nucleotide poly(A) tail. The derived 216 amino acid sequence of rat beta-casein was compared to the previously determined sequences of beta-caseins from several other species. Approximately 38% of the amino acids have been conserved among the rat, ovine, bovine and human sequences and these conserved amino acids occurred in clusters throughout the protein. One such cluster containing the majority of the potential casein phosphorylation sites was located near the amino terminus. Contrary to the considerable divergence observed for the processed beta-casein, 14 of 15 amino acids in the signal peptide sequence of the precasein were identical between the rat and ovine caseins.  相似文献   

18.
A human umbilical vein endothelial cell cDNA library in lambda gt11 was screened for expression of thrombomodulin antigens with affinity-purified rabbit polyclonal anti-thrombomodulin immunoglobulin G (IgG) and mouse monoclonal anti-human thrombomodulin IgG. Among 7 million recombinant clones screened, 12 were recognized by both antibodies. Two of these, lambda HTm10 and lambda HTm12, were shown to encode thrombomodulin by comparison of the amino acid sequence deduced from the nucleotide sequence to the amino acid sequence determined directly from tryptic peptides of thrombomodulin. Thrombomodulin mRNA was estimated to be 3.7 kilobases in length by Northern blot analysis of endothelial cell and placental poly(A)+ RNA. Thrombomodulin mRNA was not detected in human brain, HepG2 hepatoma cells, or the monocytic U937 cell line. Additional cDNA clones were selected by hybridization with the 1.2-kilobase insert of lambda HTm10. One isolate, lambda HTm15, contained a 3693 base pair cDNA insert with an apparent 5'-noncoding region of 146 base pairs, an open reading frame of 1725 base pairs, a stop codon, a 3'-noncoding region of 1779 base pairs, and a poly(A) tail of 40 base pairs. The cDNA sequence encodes a 60.3-kDa protein of 575 amino acids. The predicted protein sequence includes a signal peptide of approximately 21 amino acids, an amino-terminal ligand-binding domain of approximately 223 amino acids, an epidermal growth factor (EGF) homology region of 236 amino acids, a serine/threonine-rich segment of 34 amino acids, a membrane-spanning domain of 23 amino acids, and a cytoplasmic tail of 38 amino acids. The EGF-homology region consists of six tandemly repeated EGF-like domains.(ABSTRACT TRUNCATED AT 250 WORDS)  相似文献   

19.
We isolated a 38 kDa ssDNA-binding protein from the unicellular cyanobacterium Synechococcus sp. strain PCC 6301 and determined its N-terminal amino acid sequence. A genomic clone encoding the 38 kDa protein was isolated by using a degenerate oligonucleotide probe based on the amino acid sequence. The nucleotide sequence and predicted amino acid sequence revealed that the 38 kDa protein is 306 amino acids long and homologous to the nuclear-encoded 370 amino acid chloroplast ribosomal protein CS1 of spinach (48% identity), therefore identifying it as ribosomal protein (r-protein) S1. Cyanobacterial and chloroplast S1 proteins differ in size from Escherichia coli r-protein S1 (557 amino acids). This provides an additional evidence that cyanobacteria are closely related to chloroplasts. The Synechococcus gene rps1 encoding S1 is located 1.1 kb downstream from psbB, which encodes the photosystem 11 P680 chlorophyll a apoprotein. An open reading frame encoding a potential protein of 168 amino acids is present between psbB and rps1 and its deduced amino acid sequence is similar to that of E. coli hypothetical 17.2 kDa protein. Northern blot analysis showed that rps1 is transcribed as a monocistronic mRNA.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号