Similar articles
 20 similar articles found (search time: 31 ms)
1.
We report here the molecular cloning and sequence analysis of DNAs complementary to mRNAs for myosin alkali light chain of chicken embryo and adult leg skeletal muscle. pSMA2-1 contained an 818 base-pair insert that includes the entire coding region and 5' and 3' untranslated regions of A2 mRNA. pSMA1-1 contained an 848 base-pair insert that included the 3' untranslated region and almost all of the coding region except for the N-terminal 13 amino acid residues of the A1 light chain. The 741 nucleotide sequences of A1 and A2 mRNAs corresponding to C-terminal 141 amino acid residues and 3' untranslated regions were identical. The 5' terminal nucleotide sequences corresponding to the N-terminal 35 amino acid residues of the A1 chain were quite different from the sequences corresponding to the N-terminal 8 amino acid residues and the 5' untranslated region of A2 mRNA. These findings are discussed in relation to the structures of the genes for A1 and A2 mRNA.

2.
cDNAs encoding the human lysosomal hydrolase, arylsulfatase B (ASB; N-acetylgalactosamine-4-sulfatase, EC 3.1.6.1), were isolated from a hepatoma cell cDNA library using an ASB-specific oligonucleotide generated by the MOPAC (mixed oligonucleotide primed amplification of cDNA) technique. To facilitate cDNA cloning, human ASB was purified to apparent homogeneity and a total of 112 amino acid residues were microsequenced from the N-terminus and four internal tryptic peptides of the 47-kDa subunit. Based on the ASB N-terminal amino acid sequence, two oligonucleotide mixtures containing inosines to reduce the mixture complexity were constructed and used as primers to amplify an ASB-specific product from human placental cDNA by the polymerase chain reaction. DNA sequencing of this MOPAC product demonstrated colinearity with 21 N-terminal ASB amino acids. Based on this sequence and on codon usage for the adjacent conserved amino acids in human arylsulfatases A and C, a unique 66-mer was synthesized and used to screen a human hepatoma cell cDNA library. Four putative positive cDNA clones were isolated, and the largest insert (pASB-1) was sequenced in both orientations. The 1834-bp pASB-1 insert had a 1278-bp open reading frame encoding 425 amino acids that was colinear with 85 microsequenced amino acids of the purified enzyme, demonstrating its authenticity. Using the pASB-1 cDNA as a probe, a full-length cDNA clone, pASB-4, was isolated from a human testes library and sequenced in both orientations. pASB-4 had a 2811-bp insert containing a 559-bp 5' untranslated sequence, a 1602-bp open reading frame encoding 533 amino acids (six potential N-glycosylation sites), a 641-bp 3' untranslated sequence, and a 9-bp poly(A) tract. Comparison of the predicted amino acid sequences of arylsulfatases A, B, and C revealed regions of identity, particularly in their N-termini.

3.
The serum level of the fourth component of complement (C4) in mice bearing the H-2k haplotype is only 1/10 to 1/20 of that of non-H-2k mice. We have analyzed C4 cDNA clones from B10.BR(H-2k) mouse liver and found aberrant C4 cDNA which contained a 200-base pair (bp) insertion between the exon 13 and exon 14 encoded sequences in addition to the normal C4 cDNA. The 5' 148 bp and the 3' 52 bp of this insert were derived from the B2 sequence, the short interspersed repeats of mouse genome, and the central part of intron 13, respectively. Sequence analysis of intron 13 of the C4k gene showed the presence of a complete copy of a B2 consensus sequence. The structure of aberrant C4 mRNA indicated that the possible 3' splice site in the B2 sequence and the cryptic 5' splice site in intron 13 were used. Both the insertion of the B2 sequence into intron 13 and the presence of aberrant mRNA in the liver were specific to H-2k-bearing mice, suggesting that the aberrant splicing due to the B2 insertion is the basis for low C4 expression in H-2k mice.

4.
The sequence of 3,687 nucleotides from the 3' end of the Sendai virus genome (Z strain) was determined by a molecular cloning technique followed by rapid sequence analysis. Two large open reading frames, one consisting of 1,572 nucleotides and the other of 1,704 nucleotides, were observed in the region, that is OP-1 and OP-2 from the 3' end of the genome. The amino acid sequences of the gene products were predicted from the observed sequence. Determination of amino acid compositions of viral proteins, P, HN, Fo, NP and M, led us to conclude that NP and P are the gene products of OP-1 and OP-2, respectively. An additional open reading frame consisting of 612 nucleotides (OP-3) was discovered in the 3' most proximal region of OP-2. The predicted product of OP-3 was considered to be viral non-structural protein C. The leader sequence of 51 nucleotides at the 3' terminal of the genome and consensus sequences at 3' and 5' ends of each gene for proteins NP and P were identified.

5.
A cDNA library in lambda-phage lambda gt11 containing DNA inserts prepared from human liver mRNA was screened with monoclonal antibodies to human protein C inhibitor. Six positive clones were isolated from 6 × 10^6 phages and plaque purified. The cDNA in the phage containing the largest insert, which hybridized to a DNA probe prepared on the basis of the amino-terminal amino acid sequence of the mature inhibitor, was sequenced. This cDNA insert contained 2106 base pairs coding for a 5'-noncoding region, a 19-amino acid signal peptide, a 387-amino acid mature protein, a stop codon, and a long 3'-noncoding region of 839 base pairs. Based on the amino acid sequence of the carboxyl-terminal peptide released by cleavage of protein C inhibitor by activated protein C as well as by thrombin, the reactive site peptide bond of protein C inhibitor is Arg354-Ser355. Five potential carbohydrate-binding sites were found in the mature protein. The high homology of the amino acid sequence of protein C inhibitor to the other known inhibitors clearly demonstrates that protein C inhibitor is a member of the superfamily of serine protease inhibitors including alpha 1-antichymotrypsin, alpha 1-antitrypsin, antithrombin III, ovalbumin, and angiotensinogen. Based on the difference matrices for these proteins, we present possible phylogenetic trees for these proteins.

6.
Isolation and analysis of a cDNA coding for human C1 inhibitor   Cited by: 1 (self-citations: 0, citations by others: 1)
A cDNA coding for C1 inhibitor was isolated from a human liver lambda gt11 expression library and sequenced by the dideoxy method. The amino acid sequence deduced from the cDNA indicated that the insert was a partial clone coding for 310 amino acids including the reactive site present at the carboxyl end of the molecule. The reactive site corresponds to that previously reported by Salvesen et al. (J. Biol. Chem. 260, 2432, 1985). The cDNA also contained a stop codon of TGA, 264 nucleotides at the 3' noncoding region, and a polyadenylation signal sequence of AATAAA 15 nucleotides upstream from the poly(A) tail. The amino acid sequence flanking the reactive site of the inhibitor is homologous to other members of the superfamily of plasma serine protease inhibitors.

7.
Structure of the horseradish peroxidase isozyme C genes   Cited by: 13 (self-citations: 0, citations by others: 13)
We have isolated, cloned and characterized three cDNAs and two genomic DNAs corresponding to the mRNAs and genes for the horseradish (Armoracia rusticana) peroxidase isoenzyme C (HRP C). The amino acid sequence of HRP C1, deduced from the nucleotide sequence of one of the cDNA clones, pSK1, contained the same primary sequence as that of the purified enzyme established by Welinder [FEBS Lett. 72, 19-23 (1976)] with additional sequences at the N and C termini. All three inserts in the cDNA clones, pSK1, pSK2 and pSK3, coded for peptides of the same size (308 amino acid residues) if these are processed in the same way, and the amino acid sequences were 91-94% homologous to each other. Functional amino acids, including His40, His170, Tyr185 and Arg183 and S-S-bond-forming Cys, were conserved in the three isozymes, but a few N-glycosylation sites were not the same. Two HRP C isoenzyme genomic genes, prxC1 and prxC2, were arranged in tandem on the chromosomal DNA and each gene consisted of four exons and three introns. The positions in the exons interrupted by introns were the same in the two genes. We observed a putative promoter sequence 5' upstream and a poly(A) signal 3' downstream in both genes. The gene product of prxC1 might be processed with a signal sequence of 30 amino acid residues at the N terminus and a peptide consisting of 15 amino acid residues at the C terminus.

8.
9.
T K Frey  L D Marr 《Gene》1988,62(1):85-99
The sequence of the 3' 4508 nucleotides (nt) of the genomic RNA of the Therien strain of rubella virus (RV) was determined for cDNA clones. The sequence contains a 3189-nt open reading frame (ORF) which codes for the structural proteins C, E2 and E1. C is predicted to have a length of 300 amino acids (aa). The N-terminal half of the C protein is highly basic and hydrophilic in nature, and is putatively the region of the protein which interacts with the virion RNA. At the C terminus of the C protein is a stretch of 20 hydrophobic aa which also serves as the signal sequence for E2, indicating that the cleavage of C from the polyprotein precursor may be catalyzed by signalase in the lumen of the endoplasmic reticulum. E2 is 282 aa in length and contains four potential N-linked glycosylation sites and a putative transmembrane domain near its C terminus. The sequence of E1 has been previously described [Frey et al., Virology 154 (1986) 228-232]. No homology could be detected between the amino acid sequence of the RV structural proteins and the amino acid sequence of the alphavirus structural proteins. From the position of a region of 30 nt in the RV genomic sequence which exhibited significant homology with the sequence in the alphavirus genome at which subgenomic RNA synthesis is initiated, the RV subgenomic RNA is predicted to be 3346 nt in length and the nontranslated region from the 5' end of the subgenomic RNA to the structural protein ORF is predicted to be 98 nt. In a different translation frame beginning at the 5' end of the RV nt sequence reported here is a 1407 nt ORF which is the C terminal region of the nonstructural protein ORF. This ORF overlaps the structural protein ORF by 149 nt. 
A low level of homology could be detected between the predicted amino acid sequence of the C-terminus of the RV nonstructural protein ORF and the replicase proteins of several positive RNA viruses of animals and plants, including nsp4 of the alphaviruses, the protein encoded by the C-terminal region of the alphavirus nonstructural ORF. However, the overall homology between RV and the alphaviruses in this region of the genome was only 18%, indicating that these two genera of the Togavirus family are only distantly related. Intriguingly, there is a 2844-nt ORF present in the negative polarity orientation of the RV sequence which could encode a 928-aa polyprotein.

10.
Structure of rodent helix-destabilizing protein revealed by cDNA cloning   Cited by: 50 (self-citations: 0, citations by others: 50)
A cDNA library of newborn rat brain poly(A+) RNA in lambda gt 11 was screened with a synthetic oligonucleotide probe corresponding to a five amino acid sequence in the N-terminal region of the calf helix-destabilizing protein, UP1. Six positive phage were isolated after testing 2 × 10^5 recombinants, and each phage was plaque purified. Four of these phage clones were positive with a second oligonucleotide probe corresponding to a 5 amino acid sequence in the C-terminal region of calf UP1; one of the clones positive with both probes was selected for detailed study. This phage, designated lambda HDP-182, contained a 1706-base pair cDNA insert corresponding to an mRNA with a poly(A) sequence at the 3' terminus and a single open reading frame starting 63 bases from the 5' terminus and extending 988 bases. The 3' untranslated region of the mRNA contained 718 bases, including an AAUAAA signal 21 bases from the poly(A) sequence and a 16-residue poly(U) sequence flanked on each side by oligonucleotide repeats. Primer extension analysis of newborn rat brain poly(A+) RNA suggested that the cDNA insert in lambda HDP-182 was full length except for about 35 nucleotide residues missing from the 5' end untranslated region, and Northern blot analysis revealed one relatively abundant mRNA species of approximately the same size as the cDNA insert. The 988-residue open reading frame in the cDNA predicted a 34,215-dalton protein of 320 amino acids. Residues 2 through 196 of this rat protein are identical to the 195-residue sequence of the calf helix-destabilizing protein, UP1. The 124-amino acid sequence in the C-terminal portion of the 34,215-dalton protein is not present in purified calf UP1. This 124-residue sequence has unusual amino acid content in that it is 11% asparagine, 15% serine, and 40% glycine and consists of 16 consecutive oligopeptide repeats. 
Computer-derived secondary structure predictions for the 34,215-dalton protein revealed two distinct domains consisting of residues 1 through approximately 196 and residues approximately 197 to 320, respectively.

11.
12.
In the eucaryotic nucleus, heterogeneous nuclear RNAs exist in a complex with a specific set of proteins to form heterogeneous nuclear ribonucleoprotein particles (hnRNPs). The C proteins, C1 and C2, are major constituents of hnRNPs and appear to play a role in RNA splicing as suggested by antibody inhibition and immunodepletion experiments. With the use of a previously described partial cDNA clone as a hybridization probe, full-length cDNAs for the human C proteins were isolated. All of the cDNAs isolated hybridized to two poly(A)+ RNAs of 1.9 and 1.4 kilobases (kb). DNA sequencing of a cDNA clone for the 1.9-kb mRNA (pHC12) revealed a single open reading frame of 290 amino acids coding for a protein of 31,931 daltons and two polyadenylation signals, AAUAAA, approximately 400 base pairs apart in the 3' untranslated region of the mRNA. DNA sequencing of a clone corresponding to the 1.4-kb mRNA (pHC5) indicated that the sequence of this mRNA is identical to that of the 1.9-kb mRNA up to the first polyadenylation signal which it uses. Both mRNAs therefore have the same coding capacity and are probably transcribed from a single gene. Translation in vitro of the 1.9-kb mRNA selected by hybridization with a 3'-end subfragment of pHC12 demonstrated that it by itself can direct the synthesis of both C1 and C2. The difference between the C1 and C2 proteins which results in their electrophoretic separation is not known, but most likely one of them is generated from the other posttranslationally. Since several hnRNP proteins appeared by sodium dodecyl sulfate-polyacrylamide gel electrophoresis as multiple antigenically related polypeptides, this raises the possibility that some of these other groups of hnRNP proteins are also each produced from a single mRNA. 
The predicted amino acid sequence of the protein indicates that it is composed of two distinct domains: an amino terminus that contains what we have recently described as an RNP consensus sequence, which is the putative RNA-binding site, and a carboxy terminus that is very negatively charged, contains no aromatic amino acids or prolines, and contains a putative nucleoside triphosphate-binding fold, as well as a phosphorylation site for casein kinase type II. The RNP consensus sequence was also found in the yeast poly(A)-binding protein (PABP), the heterogeneous nuclear RNA-binding proteins A1 and A2, and the pre-rRNA binding protein C23. All of these proteins are also composed of at least two distinct domains: an amino terminus, which possesses one or more RNP consensus sequences, and a carboxy terminus, which is unique to each protein, being very acidic in the C proteins, rich in glycine in A1 and C23, and rich in proline in the poly(A)-binding protein. These findings suggest that the amino terminus of these proteins possesses a highly conserved RNA-binding domain, whereas the carboxy terminus contains a region essential to the unique function and interactions of each of the RNA-binding proteins.

13.
In contrast to hepatic hydroxysteroid dehydrogenases (HSDs) of the aldo-keto reductase family (AKR1C), little is known about the stomach enzymes. From a mouse stomach cDNA library, we isolated two clones encoding proteins of 323 amino acid residues. They shared 93.2% amino acid sequence identity with each other and 64-68% identity with known HSDs. Recombinant proteins expressed in Escherichia coli reduced 9,10-phenanthraquinone with NAD(P)H as cofactor. The mRNAs were exclusively expressed in stomach, liver and ileum. The present study demonstrates that these proteins are new members of the HSD subfamily and they are named AKR1C12 and AKR1C13. Immunohistochemical analysis suggests that they are involved in detoxification of xenobiotics in the stomach.

14.
Two clones (p17 and p13), each containing the complete coding sequence for the bovine cardiac Na+/Ca2+ exchanger, were obtained from a lambda gt10 cDNA library by screening with cDNA probes from the canine exchanger. The coding sequence of clone p17 was 92 and 98% identical to the canine cDNA at the nucleotide and amino acid levels, respectively. Nine of the 21 amino acid differences between the two exchangers were found within the 32-amino acid signal sequence. The sequenced portions of the 3' untranslated regions of the cow and dog clones were 88% identical. Na+/Ca2+ exchange activity was expressed in Xenopus laevis oocytes injected with cRNA from clone p17, and in COS cells transfected with expression vectors containing p17. Immunoprecipitation of 35S-labeled proteins from transfected cells with an antibody against the N-terminal portion of the bovine exchanger showed the presence of a 120-kDa protein corresponding to the intact cardiac exchanger. The second bovine clone (p13) did not express exchange activity in either of the above expression systems, presumably because it contained a 300-bp insert with multiple stop codons which interrupted the coding sequence. Comparison of the 5' untranslated regions of p13 and p17 revealed a 156-bp segment in p17 that was apparently spliced out of p13. This segment contained a short open reading frame. A chimera encoding the 5' untranslated region of p13 and the coding sequence of p17 exhibited only a modest (74%) increase in expressed exchange activity in transfected cells compared to p17, suggesting that the presence of the upstream open reading frame in p17 did not greatly reduce translation efficiency. The results suggest that alternate splicing mechanisms may be involved in processing mRNA for the bovine cardiac exchanger.

15.
We have cloned a full length cDNA for the small subunit of ribulose-1,5-bisphosphate carboxylase from C4 monocot maize, determined the complete nucleotide sequence of this cDNA and deduced its amino acid sequence. The cDNA insert included 513 bp of the coding region, and 65 and 252 nucleotides of the 5' and 3' untranslated regions, respectively. The transit and mature peptides have, respectively, 47 and 123 amino acids. Comparison with the small subunit genes from other plants revealed that the maize small subunit is similar to the wheat one, there being 73% homology between the transit peptides and 64% between the mature proteins. This indicates that there is no noteworthy difference between the C3 and C4 small subunit structures. Extreme codon bias was observed for this gene, and similar codon preferences are observed for other proteins highly expressed in maize leaf, light harvesting chlorophyll binding protein and phosphoenolpyruvate carboxylase. The results indicate that preferential codon usage for highly expressed genes occurs in maize leaf.
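The lengths quoted in the abstract above can be cross-checked with simple codon arithmetic. The following is a minimal sketch, not part of the original study; the function name `codons_in_orf` is hypothetical, and it assumes the 513-bp coding region includes one stop codon, which the abstract does not state.

```python
def codons_in_orf(length_bp: int) -> int:
    """Return the number of codons in a coding region of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("coding region length is not a multiple of 3")
    return length_bp // 3


# Figures quoted in the abstract above.
CODING_REGION_BP = 513
TRANSIT_PEPTIDE_AA = 47
MATURE_PEPTIDE_AA = 123

# Assumption (not stated in the abstract): the 513 bp include one stop codon.
STOP_CODONS = 1

total_codons = codons_in_orf(CODING_REGION_BP)
residues = total_codons - STOP_CODONS

# Compare the encoded residue count with transit + mature peptide lengths.
print(total_codons, residues, residues == TRANSIT_PEPTIDE_AA + MATURE_PEPTIDE_AA)
```

Under that assumption, the coding region length agrees with the combined length of the transit and mature peptides.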

16.
17.
The amino acid sequence of the matrix protein of the human respiratory syncytial virus (RS virus) was deduced from the sequence of a cDNA insert in a recombinant plasmid harboring an almost full-length copy of this gene. It specifically hybridized to a single 1,050-base mRNA from infected cells. The recombinant containing 944 base pairs of RS viral matrix protein gene sequence lacked five nucleotides corresponding to the 5' end of the mRNA. The nucleotide sequence of the 5' end of the mRNA was determined by the dideoxy sequencing method and found to be 5' NGGGC, wherein the C residue is one nucleotide upstream of the cloned viral sequence. The initiator ATG codon for the matrix protein is embedded in an AATATGG sequence similar to the canonical PXXATGG sequence present around functional eucaryotic translation initiation codons. There is no conserved sequence upstream of the polyadenylate tail, unlike vesicular stomatitis virus and Sendai virus, in which four nucleotides upstream of the polyadenylate tail are conserved in all genes. There is no equivalent of the eucaryotic polyadenylation signal AAUAAA upstream of the polyadenylate tail. The matrix protein of 28,717 daltons has 256 amino acids. It is relatively basic and moderately hydrophobic. There are two clusters of hydrophobic amino acid residues in the C-terminal third of the protein that could potentially interact with the membrane components of the infected cell. The matrix protein has no homology with the matrix proteins of other negative-strand RNA viruses, implying that RS virus has undergone extensive evolutionary divergence. A second open reading frame potentially encoding a protein of 75 amino acids and partially overlapping the C terminus of the matrix protein was also identified.

18.
19.
Mouse lactate dehydrogenase-B cDNAs were isolated from cDNA libraries of macrophage (ICR strain) and thymus (F1 hybrid of C57BL/6 and CBA strains), and their nucleotide sequences determined. The lactate dehydrogenase-B cDNA insert of thymus clone mB188 consists of the protein-coding sequence (1002 nucleotides), the 5' (46 nucleotides) and 3' (190 nucleotides) non-coding regions, and poly(A) tail (19 nucleotides), while macrophage clone mB168 contains a partial lactate dehydrogenase cDNA insert from codon no. 55 to the poly(A) tail. Seven silent nucleotide substitutions at codon no. 142, 143, 186, 187, 241, 285 and 292, as well as a single nucleotide change in the 3' non-coding region, were found between these different strains of mice. The predicted sequence of 333 amino acids, excluding initiation methionine, was confirmed by sequencing and/or compositional analyses of a total of 103 (31%) amino acids from tryptic peptides of mouse lactate dehydrogenase-B protein. The nucleotide sequence of the mouse coding region for lactate dehydrogenase B shows 86% identity with that of the human isoenzyme, and only eight of the 139 nucleotide differences resulted in amino acid substitutions at residues 10, 13, 14, 17, 52, 132, 236 and 317. The rates of nucleotide substitutions at synonymous and nonsynonymous sites in the mammalian lactate dehydrogenase genes are calculated. The rates of synonymous substitutions for lactate dehydrogenase genes A (muscle) and B (heart) are considerably higher than the average rate computed from human and rodent genes. The rates of nonsynonymous substitutions for lactate dehydrogenase genes A (muscle) and B (heart), particularly the latter, are highly conservative. The rates of synonymous and nonsynonymous substitutions for the lactate dehydrogenase-C gene are about the same as the average rates for mammalian genes. A phylogenetic tree of vertebrate lactate dehydrogenase protein sequences is constructed. 
In agreement with the previous results, this analysis further indicates that lactate dehydrogenase-C gene branched off earlier than did lactate dehydrogenase-A and lactate dehydrogenase-B genes.

20.
Suzuki Y  Gojobori T 《Gene》2001,276(1-2):83-87
To predict the amino acid sites important for the clearance of hepatitis C virus (HCV) subtype 1b in vivo, positively selected amino acid sites were detected by analyzing the sequence data collected from the international DNA databank. The rate of nonsynonymous substitutions per nonsynonymous site was compared with that of synonymous substitutions per synonymous site for each codon site in the entire coding region. As a result, 13 out of 3010 amino acid sites were found to be positively selected. Among the 13 positively selected amino acid sites, eight were located in the structural proteins and five were in the nonstructural proteins. Moreover, eight were located in B-cell epitopes and two were in T-cell epitopes. These observations suggest that both the antibody and the cytotoxic T lymphocyte are involved in the clearance of HCV subtype 1b in vivo. These positively selected amino acid sites represent candidate vaccination targets for HCV subtype 1b.
