首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The eye lens contains a structural protein, alpha crystallin, composed of two homologous primary gene products alpha A2 and alpha B2. In certain rodents, still another alpha crystallin polypeptide, alpha AIns, occurs, which is identical to alpha A2 except that it contains an insertion peptide between residues 63 and 64. In this paper we describe the complete alpha A crystallin gene that has been cloned from DNA isolated from Syrian golden hamster. Evidence is provided that the alpha A gene is present as a single copy in the hamster genome. The detailed organization of the gene has been established by means of DNA sequence analysis and S1 nuclease mapping, revealing that the gene consists of four exons. The first exon contains the information for the 68 base-pair long 5' non-coding region as well as the coding information for the first 63 amino acids. The second exon encodes the 23 amino acid insertion sequence, the third exon codes for amino acid 87 to 127 of the alpha AIns chain, whereas the last exon encodes the C-terminal 69 amino acids and contains the information for the 523 base-pair long 3' non-coding region. The second exon is bordered by a 3' splice junction (A X G/G X C), which deviates from the consensus for donor splice sites (A X G/G X T). This deviation is found in both hamster and mouse. An internal duplication was detected in the first exon by using a DIAGON-generated matrix for comparison. By means of similar DIAGON-generated matrices it was confirmed that the amino acids coded for by the third and fourth exons are homologous to the small heat-shock proteins of Drosophila, Caenorhabditis and soyabean. The implications of the differential splicing and the evolutionary aspects of the detected homologies are discussed.  相似文献   

2.
Nucleotide sequence of rat alpha 1-acid glycoprotein messenger RNA   总被引:9,自引:0,他引:9  
The complete nucleotide sequence of rat alpha 1-acid glycoprotein (alpha 1-AGP) mRNA has been determined from cloned double-stranded cDNA. The coding portion of the mRNA was bounded at the ends by a 5'-untranslated region of 35 nucleotides in length and a 3'-untranslated region of 119 nucleotides in length. The 3'-untranslated region contains the characteristic AAUAAA sequence ending 18 nucleotides from the 3'-terminal poly(A) segment. The 5'-region of the mRNA contains two in-phase AUG codons separated by 12 nucleotides. Comparison with the known NH2-terminal amino acid sequence of serum rat alpha 1-AGP suggests that the primary translation product of the mRNA contains an additional 14 or 18 amino acids that are not present in the mature form of the protein, which contains 187 amino acids. The inferred amino acid sequence of rat alpha 1-AGP and the known amino acid sequence of human alpha 1-AGP have several regions of identity clustered in the NH2-terminal portion of the proteins. The carboxyl-terminal regions show significantly less homology. Six potential asparagine glycosylation sites are found in the rat sequence, and four of these sites are in positions similar to known glycosylation sites in the human protein. Furthermore, three of these potential glycosylation sites are in a region that exhibits extensive amino acid sequence conservation, suggesting that this region may be important for the biological function of alpha 1-AGP.  相似文献   

3.
The nucleotide sequence of a cloned cDNA (clone pRt(1)297; GENE (1982) 17, 131) coding for a 18 kDa polypeptide of the frog eye lens has been determined. The sequence, 791 nucleotide in length has only one long open reading frame (447 nucleotides). The derived amino acid sequence in this frame has greater than 90% homology with the region 25-173 of alpha A2-crystallin amino acid sequence from a related frog species Rana pipiens. The 5'-terminal part of mRNA corresponding to the first 24 amino acids of alpha A2-crystallin has been lost in cloning and substituted by an artefactual sequence. The 3'-terminal part appears to be intact as follows from the presence of the universal poly(A) addition site and poly(A) tract. The 3'-nontranslated region present in frog alpha A2-crystallin mRNA (130 nucleotides) is about 4-times shorter than in mammalian alpha A2-crystallin mRNA. Intact alpha A2-crystallin mRNA with a size of about 700 nucleotides as determined by Northern blot hybridization is about twice smaller than corresponding mammalian mRNAs.  相似文献   

4.
Complete sequence of ovine alpha s2-casein messenger RNA   总被引:1,自引:0,他引:1  
M Boisnard  G Petrissant 《Biochimie》1985,67(9):1043-1051
The primary structure of mRNA coding for ovine alpha s2 casein has been determined by chemical sequencing of three cDNA clones and the primer extension products of the longest one. The mRNA was 1,024 nucleotides long, excluding the poly(A) tail. The length of the 5' noncoding, coding and 3' noncoding regions was 53, 669 and 302 nucleotides, respectively. A comparison of the nucleotide sequence of ovine alpha s2-casein and guinea-pig casein A mRNAs revealed an extensive homology in the 5' and 3' noncoding regions. The deduced amino acid sequence of ovine alpha s2-casein was compared with its bovine and guinea-pig counterparts. Moreover, an heterogeneity was evidenced in the mRNA population of the alpha s2-casein.  相似文献   

5.
The entire amino acid sequence of the alpha subunit (Mr 64,000) of the eighth component of complement (C8) was determined by characterizing cDNA clones isolated from a human liver cDNA library. Two clones with overlapping inserts of net length 2.44 kilobases (kb) were isolated and found to contain the entire alpha coding region [1659 base pairs (bp)]. The 5' end consists of an untranslated region and a leader sequence of 30 amino acids. This sequence contains an apparent initiation Met, signal peptide, and propeptide which ends with an arginine-rich sequence that is characteristic of proteolytic processing sites found in the pro form of protein precursors. The 3' untranslated region contains two polyadenylation signals and a poly(A) sequence. RNA blot analysis of total cellular RNA from the human hepatoma cell line HepG2 revealed a message size of approximately 2.5 kb. Features of the 5' and 3' sequences and the message size suggest that a separate mRNA codes for alpha and argues against the occurrence of a single-chain precursor form of the disulfide-linked alpha-gamma subunit found in mature C8. Analysis of the derived amino acid sequence revealed several membrane surface seeking domains and a possible transmembrane domain. These occur in a cysteine-free region of the subunit and may constitute the structural basis for alpha interaction with target membranes. Analysis of the carbohydrate composition indicates 1 or 2 asparagine-linked but no O-linked oligosaccharide chains, a result consistent with predictions from the amino acid sequence.(ABSTRACT TRUNCATED AT 250 WORDS)  相似文献   

6.
The nucleotide sequences coding for murine complement component C3 have been determined from a cloned genomic DNA fragment and several overlapping cloned complementary DNA fragments. The amino acid sequence of the protein was deduced. The mature beta and alpha subunits contain 642 and 993 amino acids respectively. Including a 24 amino acid signal peptide and four arginines in the beta-alpha transition region, which are probably not contained in the mature protein, the unglycosylated single chain precursor protein preproC3 would have a molecular mass of 186 484 Da and consist of 1663 amino acid residues. The C3 messenger RNA would be composed of a 56 +/- 2 nucleotide long 5' non-translated region, 4992 nucleotides of coding sequence, and a 3' non-translated region of 39 nucleotides, excluding the poly A tail. The beta chain contains only three cysteine residues, the alpha chain 24, ten of which are clustered in the carboxy terminal stretch of 175 amino acids. Two potential carbohydrate attachment sites are predicted for the alpha chain, none for the beta chain. From a comparison with human C3 cDNA sequence (of which over 80% has been determined) an extensive overall sequence homology was observed. Human and murine preproC3 would be of very similar length and share several noteworthy properties: the same order of the subunits in the precursor, the same basic residue multiplet in the beta-alpha transition region, and a glutamine residue in the thioester region. The equivalent position of the known factor I cleavage sites in human C3 alpha could be located in the murine C3 alpha chain and the size and sequence of the resulting peptide were deduced. A comparison of the amino acid sequences of murine C3 and human alpha 2-macroglobulin is given. Several areas of strong sequence homology are observed, and we conclude that the two genes must have evolved from a common ancestor.  相似文献   

7.
Rat beta casein cDNA: sequence analysis and evolutionary comparisons.   总被引:10,自引:6,他引:4       下载免费PDF全文
The complete sequence of a 1072 nucleotide rat beta-casein cDNA insertion in the hybrid plasmid pC beta 23 has been determined. Primer extension was employed to determine the sequence of an additional 82 5'-terminal nucleotides in beta-casein mRNA. Rat beta-casein mRNA consists of a 696 nucleotide coding region, flanked by 52 nucleotide 5' and 406 nucleotide 3' noncoding regions, including a 40 nucleotide poly(A) tail. The derived 216 amino acid sequence of rat beta-casein was compared to the previously determined sequences of beta-caseins from several other species. Approximately 38% of the amino acids have been conserved among the rat, ovine, bovine and human sequences and these conserved amino acids occurred in clusters throughout the protein. One such cluster containing the majority of the potential casein phosphorylation sites was located near the amino terminus. Contrary to the considerable divergence observed for the processed beta-casein, 14 of 15 amino acids in the signal peptide sequence of the precasein were identical between the rat and ovine caseins.  相似文献   

8.
Sequence of the cDNA and gene for angiogenin, a human angiogenesis factor   总被引:29,自引:0,他引:29  
Human cDNAs coding for angiogenin, a human tumor derived angiogenesis factor, were isolated from a cDNA library prepared from human liver poly(A) mRNA employing a synthetic oligonucleotide as a hybridization probe. The largest cDNA insert (697 base pairs) contained a short 5'-noncoding sequence followed by a sequence coding for a signal peptide of 24 (or 22) amino acids, 369 nucleotides coding for the mature protein of 123 amino acids, a stop codon, a 3'-noncoding sequence of 175 nucleotides, and a poly(A) tail. The gene coding for human angiogenin was then isolated from a genomic lambda Charon 4A bacteriophage library employing the cDNA as a probe. The nucleotide sequence of the gene and the adjacent 5'- and 3'-flanking regions (4688 base pairs) was then determined. The coding and 3'-noncoding regions of the gene for human angiogenin were found to be free of introns, and the DNA sequence for the gene agreed well with that of the cDNA. The gene contained a potential TATA box in the 5' end in addition to two Alu repetitive sequences immediately flanking the 5' and 3' ends of the gene. The third Alu sequence was also found about 500 nucleotides downstream from the Alu sequence at the 3' end of the gene. The amino acid sequence of human angiogenin as predicted from the gene sequence was in complete agreement with that determined by amino acid sequence analysis. It is about 35% homologous with human pancreatic ribonuclease, and the amino acid residues that are essential for the activity of ribonuclease are also conserved in angiogenin. This provocative finding is thought to have important physiological implications.  相似文献   

9.
A lambda gt11 cDNA library containing DNA inserts prepared from human liver mRNA has been screened with an antibody to human alpha 2-thiol proteinase inhibitor that was isolated from fresh plasma. Eighteen positive clones were isolated from one million phage, and each was plaque purified. The cDNA insert of one of these phage was sequenced and shown to code for alpha 2-thiol proteinase inhibitor as identified by a partial amino acid sequence of the light chain of alpha 2-thiol proteinase inhibitor. This cDNA insert contained 1529 base pairs coding for the complete alpha 2-thiol proteinase inhibitor. It included 45 base pairs of 5' noncoding sequence, 1281 base pairs that code for pre alpha 2-thiol proteinase inhibitor, a stop codon, 160 base pairs of 3' noncoding sequence, and 40 base pairs of poly(A) tail. The noncoding sequence on the 3' end contained a potential recognition site (AATAAA) for processing and polyadenylation of precursor messenger RNA. The amino acid sequence of alpha 2-thiol proteinase inhibitor deduced from the cDNA showed a striking similarity (overall homology at 74%) to that of bovine low molecular weight (LMW) kininogen, including two internally repeated sequences and a nonapeptide sequence of bradykinin. These data clearly indicated that alpha 2-thiol proteinase inhibitor and LMW kininogen are identical. This was further supported by immunological cross-reactivity between alpha 2-thiol proteinase inhibitor and LMW kininogen.(ABSTRACT TRUNCATED AT 250 WORDS)  相似文献   

10.
We have isolated a cDNA clone (pRcol 2) which is complementary to the 5'-terminal portion of the rat pro-alpha 1(II) chain mRNA. A synthetic oligonucleotide was used both as a primer for cDNA synthesis and as a probe for screening a cDNA library. The probe was a mixture of sixteen 14-mers deduced from an amino acid sequence present in the amino-terminal telopeptide of the rat cartilage alpha 1(II) chain. This primer was chosen so that the resulting cDNA would contain the sequence of the 5' end of the mRNA. The nucleotide sequences of the cDNA were determined and compared with that of three other interstitial procollagen chain mRNAs (pro-alpha 1(I), pro-alpha 2(I), and pro-alpha 1(III) chain mRNA). pRcol 2 contains a 521-base pair (bp) insert, including 153 bp of the 5' untranslated region plus 368 bp coding for the signal peptide, the amino-terminal propeptide, and a part of the telopeptide. The signal peptide of the type II collagen chain is composed of about 20 amino acids. There is little homology between the amino acid sequence of the signal peptide in the pro-alpha 1(II) chain and that of three other interstitial procollagen chains. The NH2-terminal propeptide is deduced to contain short nonhelical sequences at its amino and carboxyl ends and an internal helical collagenous domain comprising 25 repeats of Gly-X-Y with one interruption. There is a strong conservation of the amino acid sequence of the carboxyl-terminal part of the NH2-terminal propeptide in the pro-alpha 1(II), pro-alpha 1(I), and pro-alpha 2(I) chains. Type II collagen mRNA does not contain a sequence corresponding to a uniquely conserved nucleotide sequence around the translation initiation site which occurs in mRNA for other procollagen chains.  相似文献   

11.
12.
13.
DNA complementary to mRNA of human immunoglobulin E heavy chain (epsilon chain) isolated and purified from U266 cells has been synthesized and inserted into the PstI site of pBR322 by G-C tailing. This recombinant plasmid was used to transform E. coli chi 1776 to screen 1445 tetracycline resistant colonies. Nine clones (pGETI - 9) containing cDNA coding for the human epsilon chain were recognized by colony hybridization and Southern blotting analysis with a nick-translated human IgE genome fragment. The nucleotide sequence of the longest cDNA contained in pGET2 was determined. The results indicate that the sequence of 1657 nucleotides codes for 494 amino acids covering a part of the variable region and all of the constant region of the human epsilon chain. Most of the amino acid sequence deduced from the nucleotide sequence is in substantial agreement with that reported. Furthermore a termination codon after the -COOH terminal amino acid marks the beginning of a 3' untranslated region of 125 nucleotides with a poly A tail. Taking this into account, the structure of the human epsilon chain mRNA, except a part of the 5' end, is conserved fairly well in the cDNA insert in pGET2.  相似文献   

14.
The mouse cell line IF2 secretes an immunoglobulin heavy chain lacking the CH1 domain. We have isolated and characterised a recombinant plasmid containing cDNA copies of the IF2 mutant mRNA. The cloned sequence extends from the nucleotides coding for amino acid 96 in the variable region through 100 nucleotides of untranslated region at the 3' end. The sequence of the cDNA insert reveals no discontinuity at the variable-hinge region junction, the site of the CH1 deletion. Experiments employing direct priming on the poly(A) tail of the IF2 heavy chain mRNA suggest that the 3' end of the cDNA clone (sequence C-C-C-T-G-C) is also the 3' end of the mRNA.  相似文献   

15.
Rat apolipoprotein E mRNA. Cloning and sequencing of double-stranded cDNA   总被引:21,自引:0,他引:21  
A 900-base pair clone corresponding to rat liver apolipoprotein E (apo-E) mRNA, and containing a 3'-terminal poly(A) segment, was identified from a library of rat liver cDNA clones in the plasmid pBR322 by specific hybrid selection and translation of mRNA. A restriction endonuclease DNA fragment from this recombinant plasmid was used to clone the 5'-terminal region of the apo-E mRNA by primed synthesis of cDNA. A portion of the double-stranded cDNA corresponding to the 3'-terminal region of apo-E mRNA was subcloned into the bacteriophage M13mp7 and employed as a template for the synthesis of a radioactively labeled, cDNA hybridization probe. This cDNA probe was used in a RNA-blot hybridization assay that showed the length of the apo-E mRNA to be about 1200 nucleotides. The hybridization assay also demonstrated that apo-E mRNA is present in rat intestine, but at about a 100-fold lower level than that of the rat liver. The nucleotide sequence of rat liver apo-E mRNA was determined from the cloned, double-stranded cDNAs. The amino acid sequence of rat liver apo-E was inferred from the nucleotide sequence, which showed that the mRNA codes for a precursor protein of 311 amino acids. A comparison to the NH2-terminal amino acid sequence of rat plasma apo-E indicated that the first 18 amino acids of the primary translation product are not present in the mature protein and are probably removed during co-translational processing. The coding region was flanked by a 3'-untranslated region of 109 nucleotides, which contained a characteristic AAUAAA sequence that ended 13 nucleotides from a 3'-terminal poly(A) segment. At the 5'-terminal region of the mRNA, 23 nucleotides of an untranslated region were also determined. The inferred amino acid sequence of mature rat apo-E, which contains 293 amino acids, was compared to the amino acid sequence of human apo-E, which contains 299 amino acids. Using an alignment that permitted a maximum homology of amino acids, it was found that overall, 69% of the amino acid positions are identical in both proteins. The amino acid identities are clustered in two broad domains separated by a short region of nonhomology, an NH2-terminal domain of 173 residues where 80% are identical, and a COOH-terminal domain of 84 residues where 70% are identical. These two domains may be associated with specific functional roles in the protein.  相似文献   

16.
H C Lai  G Grove    C P Tu 《Nucleic acids research》1986,14(15):6101-6114
We have isolated a Yb-subunit cDNA clone from a GSH S-transferase (GST) cDNA library made from rat liver polysomal poly(A) RNAs. Sequence analysis of one of these cDNA, pGTR200, revealed an open reading frame of 218 amino acids of Mr = 25,915. The deduced sequence is in agreement with the 19 NH2-terminal residues for GST-A. The sequence of pGTR200 differs from another Yb cDNA, pGTA/C44 by four nucleotides and two amino acids in the coding region, thus revealing sequence microheterogeneity. The cDNA insert in pGTR200 also contains 36 nucleotides in the 5' noncoding region and a complete 3' noncoding region. The Yb subunit cDNA shares very limited homology with those of the Ya or Yc cDNAs, but has relatively higher sequence homology to the placental subunit Yp clone pGP5. The mRNA of pGTR200 is not expressed abundantly in rat hearts and seminal vesicles. Therefore, the GST subunit sequence of pGTR200 probably represents a basic Yb subunit. Genomic DNA hybridization patterns showed a complexity consistent with having a multigene family for Yb subunits. Comparison of the amino acid sequences of the Ya, Yb, Yc, and Yp subunits revealed significant conservation of amino acids (approximately 29%) throughout the coding sequences. These results indicate that the rat GSTs are products of at least four different genes that may constitute a supergene family.  相似文献   

17.
Two DNA molecules complementary to human liver mRNA coding for the alpha-subunit of the stimulatory regulatory component Gs of adenylyl cyclase were cloned. One of the two forms is a full-length cDNA of 1614 nucleotides plus a poly(A) tail of 59 nucleotides. The deduced sequence of 394 amino acids encoded by its open reading frame is essentially identical to that of the alpha-subunits of Gs identified by molecular cloning from bovine adrenals, bovine brain and rat brain. Two independent clones of the other type of cDNA were isolated. Both were incomplete, beginning within the open reading frame coding for the alpha s polypeptide. One codes for amino acids 5 through 394 and the other for amino acids 48 through 394 of the above described cDNA of 1614 nucleotides, and both have the identical 3'-untranslated sequence. They differ from the first cDNA, however, in that they lack a stretch of 42 nucleotides (numbers 214 through 255) and have nucleotides 213 (G) and 256 (G) replaced with C and A, respectively. This results in a predicted amino acid composition of another alpha-subunit of Gs that is shorter by 14 amino acids and contains two substitutions (Asp for Glu and Ser for Gly) at the interface between the deletion and the unchanged sequence. We call the smaller subunit alpha s1 and the larger alpha s2. This is the first demonstration of a structural heterogeneity in alpha s subunits that is due to a difference in amino acid sequence.  相似文献   

18.
D W Chung  E W Davie 《Biochemistry》1984,23(18):4232-4236
cDNAs and the genomic DNA coding for the gamma and gamma' chains of human fibrinogen have been isolated and characterized by sequence analysis. The cDNAs coding for the gamma and gamma' chains share a common nucleotide sequence coding for the first 407 amino acid residues in each polypeptide chain. The predominant gamma chain contains an additional four amino acids on its carboxyl-terminal end (residues 408-411). These four amino acids, together with the 3' noncoding sequences, are encoded by the tenth exon. Removal of the ninth intervening sequence following the processing and polyadenylation reactions yields a mature mRNA coding for the predominant gamma chain. The less prevalent gamma' chain contains 20 amino acids at its carboxyl-terminal end (residues 408-417). These 20 amino acids are encoded by the immediate 5' end of the ninth intervening sequence. This results from an occasional processing and polyadenylation reaction that occurs within the region normally constituting the ninth intervening sequence. Accordingly, the gene for the gamma chain of human fibrinogen gives rise to two mRNAs that differ in sequence on their 3' ends. These mRNAs code for polypeptide chains with different carboxyl-terminal sequences. Both of these polypeptides are incorporated into the fibrinogen molecule present in plasma.  相似文献   

19.
20.
Nucleotide sequence of cloned cDNA specific for rat ribosomal protein S11   总被引:9,自引:0,他引:9  
A cDNA clone specific for rat ribosomal protein S11 was isolated by hybrid-selected translation from the cDNA library made for 8-9 S poly(A) RNA from regenerating rat liver. Since this cDNA had not enough length, another clone was selected by colony hybridization using a fragment of isolated cDNA as a probe. The nucleotide sequence of the cDNA was determined. The sequence contains 2 base pairs from the 5' noncoding region, the entire coding region of 477 base pairs, and the 3' noncoding region of 55 base pairs besides the poly(A) tail. The primary structure of the protein S11 was deduced from the nucleotide sequence. It consists of 157 amino acids. Its molecular weight is 18,299. The calculated amino acid composition is consistent with the reported composition of S11 determined on the protein hydrolysate. The amino acid sequence showed a marked homology with that of S16 of Halobacterium cutirubrum and an appreciable homology with that of S17 of Escherichia coli.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号