首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The sequence of the first 2831 nucleotides of bovine thyroglobulin mRNA has been determined from the analysis of a cDNA clone. Following a 41-nucleotide 5' untranslated sequence, a single open-reading frame encoding 930 amino acids was observed. This corresponds to the aminoterminal third of thyroglobulin, preceded by a putative signal peptide of 19 amino acids. The protein sequence was found to be essentially made of the sevenfold repetition of a 60-amino-acid-long building unit, interrupted at fixed positions by unrelated segments of variable length. The presence of an internal homology within the repetitive unit itself suggests that the 5' region of the thyroglobulin gene has evolved from the initial duplication of a relatively short sequence, followed by the serial duplication of the resulting unit. The tyrosine residue at position five has been assigned an important hormonogenic function [Mercken, L., Simons, M.-J. and Vassart, G. (1982) FEBS Lett. 149, 285-287]. This residue is flanked by sequence elements related to the repeated unit, suggesting that the hormonogenic domain evolved also from the basic ancestor sequence.  相似文献   

2.
We have sequenced a cDNA clone, pLgSSU, which encodes the small subunit of ribulose 1,5-bisphosphate carboxylase of Lemna gibba L.G-3 a monocot plant. This clone contains a 832 basepair insert which encodes the entire 120 amino acids of the mature small subunit polypeptide (Mr = 14,127). In addition this clone encodes 53 amino acids of the amino terminal transit peptide of the precursor polypeptide and 242 nucleotides of the 3' non-coding region. Comparison of the nucleotide sequence of pLgSSU with Lemna gibba genomic sequences homologous to the 5' end of the cDNA clone suggests that nucleotides encoding four amino-terminal amino acids of the transit peptide are not included in the cDNA clone. The deduced amino acid sequence of the Lemna gibba mature small subunit polypeptide shows 70-75% homology to the reported sequences of other species. The transit peptide amino acid sequence shows less homology to other species. There is 50% homology to the reported soybean sequence and only 25% homology to the transit sequence of another monocot, wheat.  相似文献   

3.
We determined the cDNA sequence of the mRNA for antithrombin III (AT III) from sheep liver. It encodes a protein of 465 amino acids, including a signal peptide of 32 amino acids. The amino acid sequence of the mature protein shows a sequence identity of 89.1%, 95.6% and 85.0% to the human, bovine and rabbit equivalents, respectively. Cysteine residues involved in disulfide bonds as well as potential glycosylation sites are conserved between the four species. In contrast, the amino acid sequence of the signal peptide shows a smaller identity, i.e., 68.7% and 56.3% compared to the human and rabbit preprotein, respectively.  相似文献   

4.
The nucleotide sequence of the xynZ gene, encoding the extracellular xylanase Z of Clostridium thermocellum, was determined. The putative xynZ gene was 2,511 base pairs long and encoded a polypeptide of 837 amino acids. A region of 60 amino acids containing a duplicated segment of 24 amino acids was found between residues 429 and 488 of xylanase Z. This region was strongly similar to the conserved domain found at the carboxy-terminal ends of C. thermocellum endoglucanases A, B, and D. Deletions removing up to 508 codons from the 5' end of the gene did not affect the activity of the encoded polypeptide, showing that the active site was located in the C-terminal half of the protein and that the conserved region was not involved in catalysis. Expression of xylanase activity in Escherichia coli was increased up to 220-fold by fusing fragments containing the 3' end of the gene with the start of lacZ present in pUC19. An internal translational initiation site which was efficiently recognized in E. coli was tentatively identified 470 codons downstream from the actual start codon.  相似文献   

5.
The design and rapid construction of libraries of genes coding beta-sheet forming repetitive and block-copolymerized polypeptides bearing various C- and N-terminal sequences are described. The design was based on the assembly of DNA cassettes coding for the (GA)3GX amino acid sequence where the (GAGAGA) sequences would constitute the beta-strand units of a larger beta-sheet assembly. The edges of this beta-sheet would be functionalized by the turn-inducing amino acids (GX). The polypeptides were expressed in Escherichia coli using conventional vectors and were purified by Ni-nitriloacetic acid (NTA) chromatography. The correlation of polymer structure with molecular weight was investigated by gel electrophoresis and mass spectrometry. The monomer sequences and post-translational chemical modifications were found to influence the mobility of the polypeptides over the full range of polypeptide molecular weights while the electrophoretic mobility of lower molecular weight polypeptides was more susceptible to C- and N-termini polypeptide modifications.  相似文献   

6.
The complete sequence of the structural gene encoding the immunoglobulin G binding protein from Streptococcus G148 has been determined, as well as its 5' and 3' flanking sequences. The sequence reveals an open reading frame encoding a putative preprotein with a relative molecular mass of 63294. N-Terminal sequencing of the mature protein, spontaneously released from streptococcal cells, demonstrates that the signal peptide consists of 33 amino acids. The DNA sequence reveals extensive internal homologies similar to other cell-wall-bound receptors from gram-positive bacteria. Comparisons with a related gene previously isolated from another strain of streptococci revealed large differences in size, due to variations in the number of internal repeats. The structure of the gene suggests an evolution through multiple duplications.  相似文献   

7.
鲑鱼生长激素cDNA的分子克隆和序列分析   总被引:8,自引:0,他引:8  
宋诗铎  丘才良 《遗传学报》1992,19(4):308-315
从太平洋切奴克鲑鱼(Pacific Chinook Salmon,Oncorthychus tschawytscha)垂体poly(A)~+ RNA构建cDNA文库。按照鲑鱼生长激素(sGH)部分氨基酸序列合成两个寡聚脱氧核苷酸探针,它们分别与编码第1—7和第166—172氨基酸序列互补。用探针筛查cDNA文库,得到了完整的sGH cDNA克隆。cDNA序列已测定,包括编码210个氨基酸的编码序列。其中含有22个氨基酸的信号肽序列和188个氨基酸的成熟GH序列。该克隆还包括了5'端和3'端非翻译区,分别为72个和438个碱基对长。与Chum鲑鱼比较表明,核酸序列和氨基酸序列的同源性分别为97%和99%。  相似文献   

8.
9.
The nucleotide sequence of the yeast MEL1 gene.   总被引:13,自引:1,他引:12       下载免费PDF全文
The complete nucleotide sequence of the MEL1 gene of the yeast, Saccharomyces cerevisiae, encoding alpha-galactosidase was determined. The nucleotide sequence contains an open reading frame of 1413 bp encoding a protein of 471 amino acids. Comparison with the known N-terminal amino acid sequence of the mature secreted protein indicated that alpha-galactosidase is synthesized as a precursor with an N-terminal signal sequence of 18 amino acids. The general features of this signal peptide resemble those of other yeast signal peptides. Molecular weight of the mature alpha-galactosidase polypeptide deduced from the nucleotide sequence is 50.049 kd. The 5' regulatory region has sequences in common with other yeast genes regulated by the GAL4-protein.  相似文献   

10.
The cDNAs corresponding to the mRNA encoding a polypeptide which is immunoreactive with the antisera specific to carcinoembryonic antigen (CEA) (1) are cloned. The amino acid sequences deduced from the nucleotide sequences of the cDNAs show that it is synthesized as a precursor with a signal peptide followed by 668 amino acids of the putative mature CEA peptide, whose N-terminal 24 amino acids and amino acids 286 to 295 exactly coincide with those known for N-terminal sequences of CEA (2) and NFA-1 (3), respectively. The first 108 N-terminal residues are followed by three very homologous repetitive domains of 178 residues each and then by 26 mostly hydrophobic residues which probably comprise a membrane anchor. Each repetitive domains contains 4 cysteines at precisely the same positions and as many as 28 possible N-glycosylation sites are found in the CEA peptide region agreeing with high carbohydrate content of purified CEA.  相似文献   

11.
Nucleotide sequence of a cDNA clone encoding mouse protamine 1   总被引:9,自引:0,他引:9  
The nucleotide sequence of a 404-base cDNA encoding the cysteine-rich, tyrosine-containing mouse protamine has been determined. This insert, isolated from a mouse testis cDNA library, encodes a polypeptide of 50 amino acids of which 28 are arginine, 9 are cysteine, and 3 are tyrosine. The insert contains the complete 3' noncoding region of 151 bases and most of the 5' noncoding region. The predicted amino acid sequence of mouse protamine 1 is about 80% homologous to boar protamine and 67% homologous to bull protamine and contains the central, highly basic domain of four arginine clusters found in the trout protamines. The identification of a cDNA clone for a mouse protamine will facilitate studies of the evolution, regulation, and protein-DNA interaction of this nuclear protein unique to haploid spermatogenic cells.  相似文献   

12.
13.
The cDNAs encoding human prostatic acid phosphatase were cloned and characterized. The mRNAs contain 3' noncoding regions of heterogeneous sizes 646, 1887 or 1913 nucleotides. A dimer and a monomer of the conserved Alu-repeats are present in the longer 3' noncoding sequences. The complete sequence of 354 amino acids for the mature enzyme was determined by sequencing both cDNA and protein. Human prostatic and lysosomal acid phosphatases exhibit 50% sequence homology, including five Cys residues and two putative N-linked glycosylation sites. The Acp-3 gene coding for human prostatic acid phosphatase was mapped onto chromosome 3 in this investigation. The Acp-2 gene coding for lysosomal acid phosphatase has previously been located on chromosome 11, while the Acp-1 gene coding for red blood cell acid phosphatase is on chromosome 2.  相似文献   

14.
Abstract The gene encoding an 18 kDa fimbrial subunit of Vibrio cholerae O1 was identified in a fimbriate strain Bgd17. Mixed oligoprimers were prepared based on the amino acid sequence of the N-terminus and that from a cyanogen bromide-cleaved fragment of the fimbrillin. A PCR-amplified 185 bp DNA fragment was sequenced. This 185 bp fragment was further extended to 540 bp to 3' and 5' termini by RNA-PCR using a primer containing a random hexamer at its 3' end. This fragment did not contain the stop codons. It was further extended by a gene walking method using Eco RI cassette and its primers. Finally a 660 bp fragment was obtained and sequenced. This fragment contained the complete open reading frame of the structural subunit of the fimbriae, composed of 169 amino acids with a molecular mass of 17435.65 and a leader sequence of 6 or 9 amino acids. The deduced amino acid sequence of the polypeptide encoded by the gene, designated fim A, displayed a highly conserved sequence of MKXXXGFTLI EL of type 4 fimbriae.  相似文献   

15.
The complete nucleotide sequence of murine beta-glucuronidase (GUS) mRNA has been compiled from three overlapping cloned cDNAs and a single GUS-specific genomic clone. The sequence is composed of 2455 nucleotides, exclusive of the poly(A) tail. The 5' and 3' untranslated regions contain 12 and 499 bases, respectively, with the open reading frame encoding a polypeptide of 648 amino acids (74.2 kDa), including a 22 amino acid signal sequence. The nucleotide and deduced amino acid sequences of murine GUS are compared to those published for rat and human GUS and the results are presented. Murine GUS also shares amino acid sequence identity with Escherichia coli GUS and beta-galactosidase. The complete sequences of murine GUS mRNA and its deduced polypeptide provide a basis from which to study the mechanisms responsible for the well-characterized variation in GUS expression among inbred mouse strains.  相似文献   

16.
Recently, an inhibitory polypeptide that could block the follicle-stimulating hormone-induced estradiol and progesterone production in rat ovary granulosa cells has been isolated from porcine ovarian follicular fluid. Amino-terminal sequence analysis of the purified inhibitor suggests that it could be the porcine congener of the 53-kDa subunit of the growth hormone-dependent insulin-like growth factor binding protein (IGF-BP3). Using amino acid sequence information derived from the purified inhibitor to construct oligonucleotide probes, we have now identified the complementary deoxyribonucleic acids (cDNAs) encoding the inhibitory polypeptide from a porcine liver and a porcine ovary library. The nucleotide and predicted amino acid sequences revealed that the cDNAs indeed encode the porcine homolog of the recently characterized human IGF-BP3. The mature polypeptide consists of 266 amino acids, which is 2 amino acids longer than the human sequence. Between the two species, there are 42 amino acid substitutions, but the 18 cysteines and the three Asn-linked glycosylation sites are totally conserved. A single mRNA species of 2.6 kilobases encoding the IGF-BP3 was detected in porcine gonadal, brain, and liver tissues by Northern analysis.  相似文献   

17.
Sunflower cystatin a (Sca) is distinguished from other phytocystatins by its lack of the N-terminal about 20 amino acids, resulting in the absence of the evolutionarily conserved Gly residue. The cDNA encoding Sca was amplified by PCR methods. The cDNA consists of 520 nucleotides and includes an open reading frame encoding a polypeptide of 98 amino acids. Comparison of the deduced amino acid sequence with the Sca protein sequence indicated that the deduced sequence has an extra 15 amino acids and one amino acid at the N- and C-termini, respectively. This result suggests that Sca is synthesized as a preprotein (preSca) and proteolytic cleavages at peptide bonds may give rise to the mature Sca. To address this assumption and also to investigate the significance of the N-terminal extension sequence to Sca for inhibitory activity, a recombinant pre-Sca (rpre-Sca), in which the N-terminal extension was fused to the matured Sca, and a recombinant matured Sca (rSca) were overproduced in Escherichia coli cells. Incubation of the rpre-Sca with a seed extract resulted in a mobility by SDS-PAGE that was the same as rSca, demonstrating a proteolytic cleavage by endogenous proteinases. The rSca and rpre-Sca proteins were further characterized with respect to inhibitory activity and sensorgrams of the interaction with papain. The result showed that rpre-Sca had stronger inhibitory activity than rSca, and that the increased activity toward papain was due to a lower dissociation rate constant. This finding indicates that the N-terminal region of rpre-Sca increases the inhibitory activity by stabilizing the rpre-Sca and papain complex.  相似文献   

18.
We have purified apolipoprotein C-II (apo C-II) from cynomolgus monkey plasma, prepared antibody against it and used the antibody to isolate a cDNA containing the complete coding sequence for cynomolgus monkey apo C-11. Sequence analysis indicated that the monkey apo C-11 cDNA was 200 by longer than the human and the difference in size was all in the 5° untranslated region of the mRNA. This was confirmed by Northern analysis of human and monkey RNA. There was an open reading frame in the monkey apo C-11 cDNA sequence encoding a preprotein of 101 amino acids — identical in size to the human protein. The carboxyl terminal 44 amino acids of the protein were 100% homologous to the human apo C-11 amino acid sequence indicating evolutionary conservation of both structure and function. However, the amino terminal 35 amino acids of the protein were only 75% homologous and the amino terminal 19 amino acids were only 58% homologous to the human sequence. The amino acid sequence derived from the nucleotide sequence predicts a more basic protein than the human apo C-11 and this is confirmed by isoelectric focusing and immunoblotting.  相似文献   

19.
Authentic cDNAs encoding the activator protein for acid beta-glucosidase (EC3.2.1.45), co-beta-glucosidase, were cloned from the pCD and lambda gt11 human cDNA libraries. Initial screening with oligonucleotide mixtures encoding amino acid sequences of co-beta-glucosidase identified partial cDNAs which were used to obtain a potentially full-length cDNA from the lambda gt11 library. This clone (2767 bp), EGTISI, contained 5' (38 bp) and 3' (1157 bp) noncoding sequences, a translation initiation site, and an open reading frame encoding 524 amino acids which included a typical hydrophobic signal sequence (16 amino acids). Computer analyses identified three regions of high similarity to co-beta-glucosidase encoded by tandem sequences in EGTISI. Searches revealed that two of these regions encoded peptides of known function; SAP1 (sphingolipid activator protein 1) and protein C (a new sphingolipid activator protein) were encoded by EGTISI sequences 5' and 3', respectively, to those for co-beta-glucosidase. The third region of similarity, encoding a theoretical peptide (undefined function), was located most 5' in the cDNA. EGTISI and its encoded polypeptide had high similarity (77% nucleotide identity and about 80% amino acid similarity) to a rat Sertoli cell cDNA and its encoded sulfated glycoprotein-1. These results indicate that a single highly conserved gene encodes the precursor for four potential sphingolipid activator proteins in rat and man.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号