首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Sequence of the cDNA and gene for angiogenin, a human angiogenesis factor   总被引:29,自引:0,他引:29  
Human cDNAs coding for angiogenin, a human tumor derived angiogenesis factor, were isolated from a cDNA library prepared from human liver poly(A) mRNA employing a synthetic oligonucleotide as a hybridization probe. The largest cDNA insert (697 base pairs) contained a short 5'-noncoding sequence followed by a sequence coding for a signal peptide of 24 (or 22) amino acids, 369 nucleotides coding for the mature protein of 123 amino acids, a stop codon, a 3'-noncoding sequence of 175 nucleotides, and a poly(A) tail. The gene coding for human angiogenin was then isolated from a genomic lambda Charon 4A bacteriophage library employing the cDNA as a probe. The nucleotide sequence of the gene and the adjacent 5'- and 3'-flanking regions (4688 base pairs) was then determined. The coding and 3'-noncoding regions of the gene for human angiogenin were found to be free of introns, and the DNA sequence for the gene agreed well with that of the cDNA. The gene contained a potential TATA box in the 5' end in addition to two Alu repetitive sequences immediately flanking the 5' and 3' ends of the gene. The third Alu sequence was also found about 500 nucleotides downstream from the Alu sequence at the 3' end of the gene. The amino acid sequence of human angiogenin as predicted from the gene sequence was in complete agreement with that determined by amino acid sequence analysis. It is about 35% homologous with human pancreatic ribonuclease, and the amino acid residues that are essential for the activity of ribonuclease are also conserved in angiogenin. This provocative finding is thought to have important physiological implications.  相似文献   

2.
Whey acidic protein (WAP) is a major milk protein found in mouse and rat. Cloned WAP cDNAs from both species have been sequenced and the respective protein sequences have been deduced. Mouse and rat WAP (134 and 137 amino acids respectively) are acidic, cysteine rich proteins which contain a N-terminal signal peptide of 19 amino acids. Most of the cysteines are located in two clusters containing six cysteine residues each, arranged in an identical pattern. Comparison of the mouse and rat WAPs show that the signal peptide and the first cysteine domain are conserved to a greater extent than the rest of the protein. This result is reflected in the nucleotide sequence homology, where the regions coding for the signal peptide and cysteine domain I are the only regions where the rate of replacement substitution is lower than the rate of silent substitution. The 3' non-coding regions show a 91% conservation which is half the substitution rate for the coding region. This low rate of sequence divergence in the 3' non-translated region of the mRNA may indicate a functional importance for this region.  相似文献   

3.
4.
Filaggrin is an intermediate filament-associated protein that is involved in aggregation of keratin filaments in fully cornified cells of the mammalian epidermis, and is an important marker for epidermal differentiation. In this report, the sequence of a rat cDNA clone coding for a portion of the polymeric precursor, profilaggrin, is presented. The cDNA is 2,314 bp long with 1,875 bp of coding region ending with an A-T-rich 3' noncoding region. Genomic analysis indicates that the profilaggrin gene consists of 20 +/- 2 repeats of 1,218 bp of sequence coding for 406 amino acids, making the mRNA at least 25-27 kb in length. Each repeat consists of a filaggrin domain and a linker sequence with an estimated size of 380 and 26 amino acids, respectively. High levels of profilaggrin mRNA are found only in keratinizing epithelia. Comparison of the rat filaggrin sequence with that of mouse and human filaggrin and with the sequence of phosphorylated peptides from mouse profilaggrin indicates that the proteins share extensive amino acid sequence similarities, especially in the two phosphorylated regions. Proteolytic processing sites are also quite similar in rat and mouse. The three species show blocks of sequence that are similar in length and composition which alternate with sequences that are variable in length. This analysis suggests that the evolution of the present-day filaggrins has been constrained by maintenance of phosphorylation sites and overall amino acid composition. The cDNAs for the profilaggrins are similar in structure, reflecting genes that have simple repeating structures and lack introns within their coding regions. Mouse and rat profilaggrin terminate with a nonpolar sequence atypical of the rest of the coding region, and have similar 3' noncoding regions. To explain these observations, a novel evolutionary model is proposed.  相似文献   

5.
cDNAs encoding the entire coding regions of the precursors (p) of rat long chain acyl-CoA (LCAD), short chain acyl-CoA (SCAD) and isovaleryl-CoA dehydrogenase (IVD) have been cloned and sequenced. Three cDNAs for rat liver LCAD together cover a 1440-base pair region. These cDNAs encode the entire 430-amino acid sequence of pLCAD, including the 30-amino acid leader peptide and the 400-amino acid mature LCAD. A single 1773 base pair cDNA for rat SCAD covers the entire coding region (414 amino acids), including the 26-amino acid leader peptide and the 388-amino acid mature peptide. Four identified IVD cDNAs, when combined, encompass a 2104 base region, and encode 424 amino acids including a 30-amino acid leader peptide and the 394-amino acid mature peptide. The identities of all cDNA clones have been confirmed by matching the amino acid sequences predicted from the respective cDNAs to the amino-terminal and tryptic peptide sequences derived from the corresponding purified rat enzyme. Comparison of the sequences of four rat acyl-CoA dehydrogenases, including LCAD, MCAD, SCAD, and IVD, and two of their human counterparts (MCAD and SCAD) reveals a high degree of homology (57 invariant and 92 near invariant residues: 30.6-35.4% of identical residues in pairwise comparisons), suggesting that these enzymes belong to a gene family and have evolved from a common ancestral gene.  相似文献   

6.
The cDNA and protein sequences of human lactate dehydrogenase B.   总被引:9,自引:0,他引:9       下载免费PDF全文
Human lactate dehydrogenase B (LDH-B) cDNA was isolated and sequenced. The LDH-B cDNA insert consists of the protein-coding sequence (999 bp), the 5' (54 bp) and 3' (203 bp) non-coding regions, and the poly(A) tail (50 bp). The predicted sequence of 333 amino acid residues was confirmed by amino acid composition and/or sequence analyses of a total of 185 (56%) residues from tryptic peptides of human LDH-B protein. The nucleotide and amino acid sequences of the human LDH-B coding region show 68% and 75% homologies respectively with those of the human LDH-A. The peptide map and amino acid composition data have been deposited as Supplementary Publication SUP 50139 (7 pages) at the British Library Lending Division, Boston Spa, Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies are available on prepayment [see Biochem. J. (1987) 241, 5].  相似文献   

7.
Nucleotide sequences were determined for cloned cDNAs encoding for more than half of the pro alpha 2 chain of type I procollagen from man. Comparisons with previously published data on homologous cDNAs from chick embryos made it possible to examine evolution of the gene in two species which have diverged for 250-300 million years. The amino acid sequence of the alpha-chain domain supported previous indications that there is a strong selective pressure to maintain glycine as every third amino acid and to maintain a prescribed distribution of charged amino acids. However, there is little apparent selective pressure on other amino acids. The amino acid sequence of the C-propeptide domain showed less divergence than the alpha-chain domain. The 5' end or N terminus of the human C-propeptide, however, contained an insert of 12 bases coding for 4 amino acids not found in the chick C-propeptide. About 100 amino acid residues from the N terminus, two residues found in the chick sequence were missing from the human. In the second half of the C-propeptide, there was complete conservation of a 37 amino acid sequence and conservation of 50 out of 51 amino acids in the same region, an observation which suggested that the region serves some special purpose such as directing the association of one pro alpha 2(I) C-propeptide with two pro alpha 1(I) C-propeptides so as to produce the heteropolymeric structure of type I procollagen. In addition, comparison of human and chick DNAs for pro alpha 2(I) revealed three different classes of conservation of nucleotide sequence which have no apparent effect on the structure of the protein: a preference for U on the third base position of codons for glycine, proline, and alanine; a high degree of nucleotide conservation in the 51 amino acid highly conserved region of the C-propeptide; a high degree of nucleotide conservation in the 3'-noncoding region. These three classes of nucleotide conservation may reflect unusual features of collagen genes, such as their high GC content or their highly repetitive coding sequences.  相似文献   

8.
9.
A rat spleen cDNA library was screened for clones carrying the cDNAs for prothymosin alpha and parathymosin. Sequence analysis of a clone carrying the entire coding region for prothymosin alpha confirmed and completed the amino acid sequence for this polypeptide and established the number of amino acid residues as 111. Rat prothymosin alpha differs from human prothymosin alpha at six positions, including four substitutions and two insertions. The nucleotide sequences of the cDNAs for the rat and human polypeptides are more than 90% identical in the open reading frames, with significant homology extending into the 5' and 3' flanking regions. From the same library, we also isolated a clone carrying 80% of the coding region for rat parathymosin. The number of amino acid residues in rat parathymosin is 101, based on the sequence deduced from the cDNA insert and earlier information on the sequence in the amino-terminal portion of this polypeptide. Despite their similarity in size and amino acid composition, rat prothymosin alpha and rat parathymosin show only limited sequence homology, primarily in the segment including residues 14 through 25, where 10 of 12 positions are identical in the two polypeptides. this is also the region of significant sequence similarity to a 12-amino-acid segment in the p17 protein of the human immunodeficiency disease associated virus (HTLV-IIIB).  相似文献   

10.
Molecular cloning and nucleotide sequence of the streptavidin gene.   总被引:15,自引:2,他引:13       下载免费PDF全文
Using synthetic oligonucleotides as probes we have cloned the streptavidin gene from a genomic library of Streptomyces avidinii. Nucleotide sequence analysis indicated that a 2 Kb DNA-fragment contained the entire coding region, a signal peptide region and the 3' and 5' flanking regions of the gene. The deduced amino acid sequence shows several interrupted blocks of homology with the amino acid sequence of chicken egg-white avidin. Analysis of the secondary structure suggests a high content of beta-structure in both proteins and considerable overall structural similarity between them.  相似文献   

11.
12.
13.
H J Hong  A K Kim  C J Ryu  S S Park  H K Chung  K S Kwon  K L Kim  J Kim  M H Han 《Gene》1992,121(2):331-335
Binding specificity of a monoclonal antibody (mAb) (kappa, gamma 2b) H8 which can react with the pre-S2 peptide of hepatitis B virus (HBV) was determined by Western blot analyses. From the hybridoma cell line secreting mAb H8, poly(A)+ RNA was prepared and used as a template for cDNA synthesis and cloning. Full-length cDNAs coding for the heavy and kappa light chains of the mAb were cloned from the cDNA library and characterized by nucleotide (nt) sequence analyses and N-terminal amino acid sequencing. The sequence analyses revealed that both heavy and light chain-specific cDNAs are functional, and the variable regions of the heavy and light chains are members of mouse heavy chain subgroup III(c) and light chain group I, respectively. Comparison of the nt sequences with mouse immunoglobulin genes listed in the GenBank data base show that the cDNAs have not been previously reported. The cDNAs will be used for the construction of a therapeutic antibody for HBV infection.  相似文献   

14.
The amino acid sequence of the bovine mitochondrial nicotinamide nucleotide transhydrogenase was recently deduced from isolated cDNAs and reported [Yamaguchi, M., Hatefi, Y., Trach, K., and Hoch, J.A. (1988) J. Biol. Chem. 263, 2761-2767]. The cDNAs lacked the N-terminal coding region, however, and the 8 N-terminal residues were determined by protein sequencing. In the present study, the nucleotide sequence of the 5' upstream region was determined by dideoxynucleotide sequencing of the transhydrogenase messenger RNA, and amino acid sequences of the N-terminal region and the signal peptide of the enzyme were deduced from the nucleotide sequence. The N-terminal sequence of the enzyme as deduced from the mRNA sequence is the same as that determined by protein sequencing, with one difference. Protein sequencing showed Ser as the N-terminal residue. The mRNA sequence indicated that Ser is the second N-terminal residue, and the first is Cys. That preparations of the enzyme are mixtures of two polypeptides, one polypeptide being one residue shorter at the N terminus than the other, has been pointed out in the above reference. The signal peptide consists of 43 residues, is rich in basic (4 Lys, 2 Arg) and hydroxylated (4 Thr, 3 Ser) amino acids, and lacks acidic residues.  相似文献   

15.
Comparison of human brain and liver glutamate dehydrogenase cDNAS   总被引:1,自引:0,他引:1  
In order to investigate suggestions that more than one glutamate dehydrogenase (GDH) gene may be active in humans, seven human brain and seventeen human liver GDH cDNAs were isolated by probing with a 590 base cDNA from the coding region of human brain GDH. No sequence heterogeneity was revealed among any of the cDNAs by an oligonucleotide binding assay, nor did any cDNA appear to encode a hexapeptide contained in a published amino acid sequence of human liver GDH. Homologous regions of three liver and three brain cDNAs had identical sequences over more than 2 kb, including 3' nontranslated regions. This suggests that identical GDH mRNAs are present in human brain and human liver. Although only one gene appears to be expressed, human genomic DNA blots show a pattern of hybridization consistent with the existence of more than one GDH gene.  相似文献   

16.
Neural cell adhesion molecules (NCAMs) are cell surface glycoproteins that appear to mediate cell-cell adhesion. In vertebrates NCAMs exist in at least three different polypeptide forms of apparent molecular masses 180, 140, and 120 kD. The 180- and 140-kD forms span the plasma membrane whereas the 120-kD form lacks a transmembrane region. In this study, we report the isolation of NCAM clones from an adult rat brain cDNA library. Sequence analysis indicated that the longest isolate, pR18, contains a 2,574 nucleotide open reading frame flanked by 208 bases of 5' and 409 bases of 3' untranslated sequence. The predicted polypeptide encoded by clone pR18 contains a single membrane-spanning region and a small cytoplasmic domain (120 amino acids), suggesting that it codes for a full-length 140-kD NCAM form. In Northern analysis, probes derived from 5' sequences of pR18, which presumably code for extracellular portions of the molecule hybridized to five discrete mRNA size classes (7.4, 6.7, 5.2, 4.3, and 2.9 kb) in adult rat brain but not to liver or muscle RNA. However, the 5.2- and 2.9-kb mRNA size classes did not hybridize to either a large restriction fragment or three oligonucleotides derived from the putative transmembrane coding region and regions that lie 3' to it. The 3' probes did hybridize to the 7.4-, 6.7-, and 4.3-kb message size classes. These combined results indicate that clone pR18 is derived from either the 7.4-, 6.7-, or 4.3-kb adult rat brain RNA size class. Comparison with chicken and mouse NCAM cDNA sequences suggests that pR18 represents the amino acid coding region of the 6.7- or 4.3-kb mRNA. The isolation of pR18, the first cDNA that contains the complete coding sequence of an NCAM polypeptide, unambiguously demonstrates the predicted linear amino acid sequence of this probable rat 140-kD polypeptide. This cDNA also contains a 30-base pair segment not found in NCAM cDNAs isolated from other species. The significance of this segment and other structural features of the 140-kD form of NCAM can now be studied.  相似文献   

17.
Construction and sequence of cDNA for rat liver stearyl coenzyme A desaturase   总被引:23,自引:0,他引:23  
Hepatic poly(A+) RNA from rats induced for stearyl-CoA desaturase was used for primer-extension of cDNA coding for stearyl-CoA desaturase. Previously, Northern blot analysis showed that translatable desaturase mRNA is 4,900 nucleotides in length (Thiede, M. A., and Strittmatter, P. (1985) J. Biol. Chem. 260, 14459-14463). Six overlapping cDNAs, ranging from 850 to 1450 bases, were used to compile the 4,689-nucleotide sequence. The cDNA includes a 1,074-base open reading frame coding for 358 amino acids, corresponding to a molecular mass of 41,400 daltons. Positive identification of this open reading frame was accomplished by matching the amino acid sequence of both amino-terminal and cyanogen bromide peptides of the purified enzyme with regions of the sequence deduced from the cDNA. Amino acid composition data from the cDNA compares well with that from the desaturase. The protein contains 62% hydrophobic amino acids. An interesting feature of this mRNA is the 3,500-base 3' noncoding region, which has been localized on a single 3' exon by Southern blot analysis.  相似文献   

18.
Amino acid sequence of a specific antigenic peptide of protein B23   总被引:6,自引:0,他引:6  
A specific antigenic peptide was obtained from protein B23 (Mr/pI = 37,000/5.1) after 30 min of digestion with staphylococcal V8 protease (10 micrograms/ml/mg protein B23). The antigenic peptide was purified by DEAE-cellulose chromatography and high pressure liquid chromatography on a reverse-phase C18 column. The antigenic peptide contains 14.7 and 18.7 mol% of glutamic acid and lysine, respectively. Amino acid sequence analysis showed that the peptide has 68 amino acids and is located on the carboxyl-terminal sequence of protein B23. The sequence is Ser-Phe-Lys-Lys-Gln-Glu-Lys-Thr-Pro-Lys-Thr-Pro- Lys-Gly-Pro-Ser-Ser-Val-Glu-Asp-Ile-Lys-Ala-Lys-Met-Gln-Ala-Ser-Ile-Glu- Lys-Gly- Gly-Ser-Leu-Pro-Lys-Val-Glu-Ala-Lys-Phe-Ile-Asn-Tyr-Val-Lys-Asn-Cys-Phe- Arg-Met- Thr-Asp-Gln-Glu-Ala-Ile-Gln-Asp-Leu-Trp-Gln-Trp-Arg-Lys-Ser-Leu-Cooh. Extensive digestion of the antigenic peptide with V8 protease, trypsin, or chymotrypsin results in loss of the antigenic activity. Three cloned cDNAs (hpB1, hpB2, and hpB7) which code for the 82 amino acids at the COOH terminus of protein B23 and the 3' non-translating sequence were identified and characterized. All three clones have identical nucleotide sequences coding for the antigenic portion of the protein (68 amino acids at the COOH terminus), the stop codon, and the 3' non-translated region. However, mutation of 6 nucleotide bases of one clone (hpB2) caused changes in 4 amino acids in the sequence just preceding the immunoreactive region. The result suggests the presence of at least 2 immunologically similar but distinct proteins which are both recognized by the anti-B23 antibody.  相似文献   

19.
Two different bovine cDNAs have been characterized that encode closely related homologues of the mitochondrial membrane carrier protein ADP/ATP translocase. One of them codes for the protein that has been characterized previously from bovine heart mitochondria, and the other codes for a protein that differs from it in 33 amino acids out of 297. Including the base substitutions required to bring about these changes in amino acid sequence, the coding regions of the cDNAs differ at 184 positions. In addition, they are extensively diverged in their 3' noncoding sequences, which differ greatly in both length and sequence, and these segments of the cDNAs have been used as hybridization probes to demonstrate that the expression of the two genes giving rise to the two proteins is very different in various bovine tissues. Expression of one gene predominates in heart muscle and that of the other in intestine. Hybridization experiments with digests of genomic DNA have shown the presence of numerous sequences related to the two cDNAs in both the bovine and human genomes. Some of these probably arise from pseudogenes, but three expressed genes have been detected in the human genome. The study of the regulation of the expression of these genes may help to illuminate the basis of tissue-specific human mitochondrial diseases which arise because of defects in mitochondrial enzymes only in the affected tissue and not in other tissues of the same individual.  相似文献   

20.
D W Chung  E W Davie 《Biochemistry》1984,23(18):4232-4236
cDNAs and the genomic DNA coding for the gamma and gamma' chains of human fibrinogen have been isolated and characterized by sequence analysis. The cDNAs coding for the gamma and gamma' chains share a common nucleotide sequence coding for the first 407 amino acid residues in each polypeptide chain. The predominant gamma chain contains an additional four amino acids on its carboxyl-terminal end (residues 408-411). These four amino acids, together with the 3' noncoding sequences, are encoded by the tenth exon. Removal of the ninth intervening sequence following the processing and polyadenylation reactions yields a mature mRNA coding for the predominant gamma chain. The less prevalent gamma' chain contains 20 amino acids at its carboxyl-terminal end (residues 408-417). These 20 amino acids are encoded by the immediate 5' end of the ninth intervening sequence. This results from an occasional processing and polyadenylation reaction that occurs within the region normally constituting the ninth intervening sequence. Accordingly, the gene for the gamma chain of human fibrinogen gives rise to two mRNAs that differ in sequence on their 3' ends. These mRNAs code for polypeptide chains with different carboxyl-terminal sequences. Both of these polypeptides are incorporated into the fibrinogen molecule present in plasma.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号