首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 625 毫秒
1.
H C Lai  G Grove    C P Tu 《Nucleic acids research》1986,14(15):6101-6114
We have isolated a Yb-subunit cDNA clone from a GSH S-transferase (GST) cDNA library made from rat liver polysomal poly(A) RNAs. Sequence analysis of one of these cDNA, pGTR200, revealed an open reading frame of 218 amino acids of Mr = 25,915. The deduced sequence is in agreement with the 19 NH2-terminal residues for GST-A. The sequence of pGTR200 differs from another Yb cDNA, pGTA/C44 by four nucleotides and two amino acids in the coding region, thus revealing sequence microheterogeneity. The cDNA insert in pGTR200 also contains 36 nucleotides in the 5' noncoding region and a complete 3' noncoding region. The Yb subunit cDNA shares very limited homology with those of the Ya or Yc cDNAs, but has relatively higher sequence homology to the placental subunit Yp clone pGP5. The mRNA of pGTR200 is not expressed abundantly in rat hearts and seminal vesicles. Therefore, the GST subunit sequence of pGTR200 probably represents a basic Yb subunit. Genomic DNA hybridization patterns showed a complexity consistent with having a multigene family for Yb subunits. Comparison of the amino acid sequences of the Ya, Yb, Yc, and Yp subunits revealed significant conservation of amino acids (approximately 29%) throughout the coding sequences. These results indicate that the rat GSTs are products of at least four different genes that may constitute a supergene family.  相似文献   

2.
P J Curtis  E Withers  D Demuth  R Watt  P J Venta  R E Tashian 《Gene》1983,25(2-3):325-332
The nucleotide sequence of a clone containing mouse carbonic anhydrase (CA) cDNA in pBR322 has been determined. The cloned cDNA contains all of the coding region except for nucleotides specifying the first eight amino acids, and all of the 3' noncoding region, which consists of 700 nucleotides. A cDNA clone was identified which contains an additional 54 bp at the 5' end, so that the complete amino acid sequence of mouse CA could be deduced. This sequence showed a 73-81% homology with other mammalian CA form II isozymes, 56-63% with form I isozymes, and 52-56% with form III isozymes. By examination of the amino acids which are unique and invariant for each isozyme, the mouse amino acid sequence was found to contain 16 of the 23 residues that are unique and invariant to mammalian CA form II isozymes, but only one or no residue for forms I and III, respectively.  相似文献   

3.
Structure of rodent helix-destabilizing protein revealed by cDNA cloning   总被引:50,自引:0,他引:50  
A cDNA library of newborn rat brain poly(A+) RNA in lambda gt 11 was screened with a synthetic oligonucleotide probe corresponding to a five amino acid sequence in the N-terminal region of the calf helix-destabilizing protein, UP1. Six positive phage were isolated after testing 2 X 10(5) recombinants, and each phage was plaque purified. Four of these phage clones were positive with a second oligonucleotide probe corresponding to a 5 amino acid sequence in the C-terminal region of calf UP1; one of the clones positive with both probes was selected for detailed study. This phage, designated lambda HDP-182, contained a 1706-base pair cDNA insert corresponding to an mRNA with a poly(A) sequence at the 3' terminus and a single open reading frame starting 63 bases from the 5' terminus and extending 988 bases. The 3' untranslated region of the mRNA contained 718 bases, including an AAUAAA signal 21 bases from the poly(A) sequence and a 16-residue poly(U) sequence flanked on each side by oligonucleotide repeats. Primer extension analysis of newborn rat brain poly(A+) RNA suggested that the cDNA insert in lambda HDP-182 was full length except for about 35 nucleotide residues missing from the 5' end untranslated region, and Northern blot analysis revealed one relatively abundant mRNA species of approximately the same size as the cDNA insert. The 988-residue open reading frame in the cDNA predicted a 34,215-dalton protein of 320 amino acids. Residues 2 through 196 of this rat protein are identical to the 195-residue sequence of the calf helix-destabilizing protein, UP1. The 124-amino acid sequence in the C-terminal portion of the 34,215-dalton protein is not present in purified calf UP1. This 124-residue sequence has unusual amino acid content in that it is 11% asparagine, 15% serine, and 40% glycine and consists of 16 consecutive oligopeptide repeats. Computer-derived secondary structure predictions for the 34,215-dalton protein revealed two distinct domains consisting of residues 1 through approximately 196 and residues approximately 197 to 320, respectively.  相似文献   

4.
A 2112-bp cDNA clone (lambda CT29) encoding the entire sequence of the human lysosomal acid phosphatase (EC 3.1.3.2) was isolated from a lambda gt11 human placenta cDNA library. The cDNA hybridized with a 2.3-kb mRNA from human liver and HL-60 promyelocytes. The gene for lysosomal acid phosphatase was localized to human chromosome 11. The cDNA includes a 12-bp 5' non-coding region, an open reading frame of 1269 bp and an 831-bp 3' non-coding region with a putative polyadenylation signal 25 bp upstream of a 3' poly(A) tract. The deduced amino acid sequence reveals a putative signal sequence of 30 amino acids followed by a sequence of 393 amino acids that contains eight potential glycosylation sites and a hydrophobic region, which could function as a transmembrane domain. A 60% homology between the known 23 N-terminal amino acid residues of human prostatic acid phosphatase and the N-terminal sequence of lysosomal acid phosphatase suggests an evolutionary link between these two phosphatases. Insertion of the cDNA into the expression vector pSVL yielded a construct that encoded enzymatically active acid phosphatase in transfected monkey COS cells.  相似文献   

5.
Recombinant clones with cDNA inserts coding for a new serine protease (hepsin) have been isolated from cDNA libraries prepared from human liver and hepatoma cell line mRNA. The total length of the cDNA is approximately 1.8 kilobases and includes a 5' untranslated region, 1251 nucleotides coding for a protein of 417 amino acids, a 3' untranslated region, and a poly(A) tail. The amino acid sequence coded by the cDNA for hepsin shows a high degree of identity to pancreatic trypsin and other serine proteases present in plasma. It also exhibits features characteristic of zymogens to serine proteases in that it contains a cleavage site for protease activation and the highly conserved regions surrounding the His, Asp, and Ser residues that participate in enzyme catalysis. In addition, hepsin lacks a typical amino-terminal signal peptide. Hydropathy analysis of the protein sequence, however, revealed a very hydrophobic region of 27 amino acids starting 18 residues downstream from the apparent initiator Met. This region may serve as an internal signal sequence and a transmembrane domain. This putative transmembrane domain could be involved in anchoring hepsin to the cell membrane and orienting it in such a manner that its carboxyl terminus, containing the catalytic domain, is extracellular.  相似文献   

6.
Factor XIII is a plasma protein that participates in the final stages of blood coagulation. The complete amino acid sequence of the b subunit of human factor XIII was determined by a combination of cDNA cloning and amino acid sequence analysis. A lambda gt11 cDNA library prepared from human liver mRNA was screened with an affinity-purified antibody against the b subunit of human factor XIII. Nine positive clones were isolated from 2 X 10(6) phage and plaque-purified. The largest cDNA insert was sequenced and shown to contain 2180 base pairs coding for a portion of the leader sequence (19 amino acids), the mature protein (641 amino acids), a stop codon (TGA), a 3' noncoding region (187 nucleotides), and a poly(A) tail. When the b subunit of human factor XIII was digested with cyanogen bromide, nine peptides were isolated by gel filtration and reverse-phase high-performance liquid chromatography. Amino acid sequence analyses of these peptides were performed with an automated sequenator, and 299 amino acid residues were identified. These amino acid sequences were in complete agreement with the amino acid sequence predicted from the cDNA. The b subunit of factor XIII contained 10 repetitive homologous segments, each composed of about 60 amino acids and 4 half-cystine residues. Each of these repeated segments is a member of a family of repeats present in human beta 2-glycoprotein I, complement factor B, and haptoglobin alpha 1 chain. Three potential Asn-linked carbohydrate attachment sites were also identified in the b subunit of factor XIII.  相似文献   

7.
Complementary DNA clones for the boar preproacrosin have been isolated from a randomly primed testis cDNA library in lambda gt10 and from an oligo(dT)-primed testis cDNA in lambda gt11. The nucleotide sequence of the 1418-bp cDNA insert includes a 46-bp 5'-untranslated region, an open reading frame of 1248 bp corresponding to 416 amino acids (45.59 kDa) and a 121-bp 3'-untranslated region. The deduced amino acid sequence includes the active-site residues histidine, asparagine and serine of the catalytic triad of the serine proteinase super-family and is colinear with that determined by amino acid sequencing of the boar acrosin light chain and of a small region of the NH2-terminal sequence of the heavy chain. The preproacrosin cDNA contains at the 3' end a 381-bp sequence which codes for an amino acid sequence not yet found in any other serine proteinase. This amino acid sequence is rich in proline (42 out of 127 amino acids) and is suggested to be involved in the recognition and binding of the spermatozoa to the zona pellucida of the ovum. The mRNA for preproacrosin is synthesized as an approximately 1.6-kb-long molecule only in the postmeiotic stages of boar and bull spermatogenesis.  相似文献   

8.
The nucleotide sequences coding for murine complement component C3 have been determined from a cloned genomic DNA fragment and several overlapping cloned complementary DNA fragments. The amino acid sequence of the protein was deduced. The mature beta and alpha subunits contain 642 and 993 amino acids respectively. Including a 24 amino acid signal peptide and four arginines in the beta-alpha transition region, which are probably not contained in the mature protein, the unglycosylated single chain precursor protein preproC3 would have a molecular mass of 186 484 Da and consist of 1663 amino acid residues. The C3 messenger RNA would be composed of a 56 +/- 2 nucleotide long 5' non-translated region, 4992 nucleotides of coding sequence, and a 3' non-translated region of 39 nucleotides, excluding the poly A tail. The beta chain contains only three cysteine residues, the alpha chain 24, ten of which are clustered in the carboxy terminal stretch of 175 amino acids. Two potential carbohydrate attachment sites are predicted for the alpha chain, none for the beta chain. From a comparison with human C3 cDNA sequence (of which over 80% has been determined) an extensive overall sequence homology was observed. Human and murine preproC3 would be of very similar length and share several noteworthy properties: the same order of the subunits in the precursor, the same basic residue multiplet in the beta-alpha transition region, and a glutamine residue in the thioester region. The equivalent position of the known factor I cleavage sites in human C3 alpha could be located in the murine C3 alpha chain and the size and sequence of the resulting peptide were deduced. A comparison of the amino acid sequences of murine C3 and human alpha 2-macroglobulin is given. Several areas of strong sequence homology are observed, and we conclude that the two genes must have evolved from a common ancestor.  相似文献   

9.
Sequence of human DNA polymerase beta mRNA obtained through cDNA cloning   总被引:29,自引:0,他引:29  
A cDNA library from polyA+ RNA of a human teratocarcinoma cell line in phage lambda gt11 was screened with a fragment of the rat beta-polymerase cDNA, lambda pol beta-10, as probe. Five positive phage were identified and plaque purified. The cDNA of one positive clone selected for detailed study was 1257 bp. This insert was sequenced and found to contain the coding region for beta-polymerase, as well as 163 bp and 137 bp from the 5' and 3' untranslated regions, respectively. The primary structure of human beta-polymerase (318 amino acids, Mr = 36, 133) deduced from the cDNA was similar to rat beta-polymerase (95% matched residues). The greatest difference between the sequences of the human and rat cDNAs was in the 3' untranslated regions (64% matched base residues). These results provide necessary sequence information for study of the human beta-polymerase gene.  相似文献   

10.
The cDNA clone encoding human prechymotrypsinogen was isolated from a human pancreas cDNA library and its nucleotide sequence was determined. The sequence consists of a 16 bp 5' non-coding region, a 789 bp amino acid coding region and a 60 bp 3' non-coding region. The predicted product consists of 263 amino acids, including 18 amino acids for a signal peptide and 15 amino acids possible for an activation peptide. Southern blot analyses using the cloned cDNA as a probe revealed that human genomic DNA carries at least two genes that are related to chymotrypsinogen.  相似文献   

11.
根据从柽柳cDNA文库克隆获得的脂质转运蛋白(LTP)的部分序列,用RACE技术克隆出其全长cDNA序列.基因的5'非翻译区96bp,3'非翻译区222bp,开放阅读框285bp,编码94个氨基酸,预计蛋白的分子量为9.9 kD,等电点为8.02.此基因有8个位置保守的Cys残基及26个氨基酸的信号肽,为典型的植物脂质转运蛋白基因.其基因序列数据库(GenBank)登录号为AY574218(基因)和AAS79106(蛋白).  相似文献   

12.
By using synthetic oligonucleotides as probes, plasmid clones containing portions of cDNA coding for human C4b-binding protein were isolated from a liver cDNA library. The entire amino acid sequence of the C4b-binding protein can be predicted from this study of the cloned cDNA when allied to a previous sequence study at the protein level [Chung, Gagnon & Reid (1985) Mol. Immunol. 22, 427-435], in which over 55% of the amino acid sequence, including the N-terminal 62 residues, was obtained. The plasmid clones isolated allowed the unambiguous determination of 1717 nucleotides of cDNA sequence between the codon for the 32nd amino acid in the sequence of C4b-binding protein and the 164th nucleotide in the 3' non-translated region. The sequence studies show that the secreted form of C4b-binding protein, found in plasma, is composed of chains of apparent Mr 70 000 that contains 549 amino acid residues. Examination of the protein and cDNA sequence results show that there are at least two polymorphic sites in the molecule. One is at position 44, which can be glutamine or threonine, and the other is at position 309, which can be tyrosine or histidine. Northern-blot analysis indicated that the mRNA for C4b-binding protein is approx. 2.5 kilobases long. The N-terminal 491 amino acids of C4b-binding protein can be divided into eight internal homologous regions, each approx. 60 amino acids long, which can be aligned by the presence in each region of four half-cystine, one tryptophan and several other conserved residues. These regions in C4b-binding protein are homologous with the three internal-homology regions that have been reported to be present within the Ba region of the complement enzyme factor B and also to the internal-homology regions found in the non-complement beta 2-glycoprotein I.  相似文献   

13.
A cDNA clone encoding an anionic form of bovine trypsinogen was isolated from a pancreatic cDNA library. The corresponding 855-nucleotide mRNA contains a short 5' noncoding region of 8 nucleotides and a long 3' noncoding region of 56 nucleotides in addition to a poly(A) tail of at least 50 nucleotides. The deduced amino acid sequence for the anionic pretrypsinogen (247 residues) includes the N-terminal 15-amino-acid signal peptide followed by an 8-amino-acid activation peptide. The zymogen (232 residues) contains an additional C-terminal serine, compared with the amino acid sequence of bovine cationic trypsinogen. The identity between the anionic and cationic forms of bovine trypsinogen (65%) is lower than that existing between the anionic protein and other mammalian anionic trypsinogens (73-85%), suggesting that trypsin gene duplication in mammals occurred prior to the evolutionary events responsible for the species divergence. Bovine pancreatic anionic trypsin possesses all the key amino acids characteristic of the serine protease family.  相似文献   

14.
A cDNA coding for the catalytic subunit of phosphorylase phosphatase (phosphatase C-I/phosphatase-1c) was cloned from a rabbit muscle cDNA library by screening with oligonucleotide probes. Ten clones were analyzed. The full cDNA sequence of 1395 base pairs contained an open reading frame of 990 base pairs flanked by 3' and 5' noncoding regions of 84 and 321 base pairs, respectively. The DNA sequence (and deduced amino acid sequence) of this cDNA is distinctly different from that of a clone of 1492 base pairs previously reported. Our cDNA is essentially identical to the 1492-base pair clone from residue 182 in the 3' direction, but it is completely different in the 5' direction. Consequently, the amino acid sequence deduced from our cDNA differs by 14 amino acids in the amino terminal from that previously reported and extends for an additional 19 amino acids. Probes to the divergent and common region of our cDNA clone hybridized to an mRNA of the same size by Northern blotting. Thus the cDNA we have isolated appears to code for an isoform of the catalytic subunit of phosphorylase phosphatase.  相似文献   

15.
16.
The entire amino acid sequence of the alpha subunit (Mr 64,000) of the eighth component of complement (C8) was determined by characterizing cDNA clones isolated from a human liver cDNA library. Two clones with overlapping inserts of net length 2.44 kilobases (kb) were isolated and found to contain the entire alpha coding region [1659 base pairs (bp)]. The 5' end consists of an untranslated region and a leader sequence of 30 amino acids. This sequence contains an apparent initiation Met, signal peptide, and propeptide which ends with an arginine-rich sequence that is characteristic of proteolytic processing sites found in the pro form of protein precursors. The 3' untranslated region contains two polyadenylation signals and a poly(A) sequence. RNA blot analysis of total cellular RNA from the human hepatoma cell line HepG2 revealed a message size of approximately 2.5 kb. Features of the 5' and 3' sequences and the message size suggest that a separate mRNA codes for alpha and argues against the occurrence of a single-chain precursor form of the disulfide-linked alpha-gamma subunit found in mature C8. Analysis of the derived amino acid sequence revealed several membrane surface seeking domains and a possible transmembrane domain. These occur in a cysteine-free region of the subunit and may constitute the structural basis for alpha interaction with target membranes. Analysis of the carbohydrate composition indicates 1 or 2 asparagine-linked but no O-linked oligosaccharide chains, a result consistent with predictions from the amino acid sequence.(ABSTRACT TRUNCATED AT 250 WORDS)  相似文献   

17.
18.
The nucleotide sequence of a cloned 5.3 kilobase cDNA for the human low density lipoprotein receptor revealed five domains in the 839 amino acid protein: 322 NH2-terminal amino acids, extremely rich in disulfide-bonded cysteine residues (15%) and including an 8-fold repeat of 40 residues that may contain the LDL binding site; 350 residues homologous to the precursor of mouse epidermal growth factor; a region immediately outside the plasma membrane, rich in serine and threonine and the site of O-linked glycosylation; 22 hydrophobic amino acids, spanning the plasma membrane; and 50 COOH-terminal amino acids, projecting into the cytoplasm. The mRNA for the receptor contains a 3' untranslated region of 2.5 kilobases that includes multiple copies of the Alu family of repetitive DNAs. Transfection of simian COS cells with the human LDL receptor cDNA linked to the SV40 early promoter resulted in expression of functional cell surface receptors.  相似文献   

19.
鲑鱼生长激素cDNA的分子克隆和序列分析   总被引:8,自引:0,他引:8  
宋诗铎  丘才良 《遗传学报》1992,19(4):308-315
从太平洋切奴克鲑鱼(Pacific Chinook Salmon,Oncorthychus tschawytscha)垂体poly(A)~+ RNA构建cDNA文库。按照鲑鱼生长激素(sGH)部分氨基酸序列合成两个寡聚脱氧核苷酸探针,它们分别与编码第1—7和第166—172氨基酸序列互补。用探针筛查cDNA文库,得到了完整的sGH cDNA克隆。cDNA序列已测定,包括编码210个氨基酸的编码序列。其中含有22个氨基酸的信号肽序列和188个氨基酸的成熟GH序列。该克隆还包括了5'端和3'端非翻译区,分别为72个和438个碱基对长。与Chum鲑鱼比较表明,核酸序列和氨基酸序列的同源性分别为97%和99%。  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号