首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
2.
V Bernan  D Filpula  W Herber  M Bibb  E Katz 《Gene》1985,37(1-3):101-110
The sequence of a 1.56-kb DNA fragment containing the tyrosinase gene (mel) from Streptomyces antibioticus was determined and the Mr (30612) and amino acid (aa) sequence of the protein were deduced from the nucleotide (nt) sequence. Intracellular and extracellular tyrosinase from S. antibioticus, transformed with pIJ702 (containing mel), were purified to homogeneity; the Mr (29 500), as determined by Sephadex G-75 chromatography and sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE), was consistent with the value derived from the nt sequence. Edman degradation established that the N-terminal sequence of both the intracellular and extracellular forms of tyrosinase are identical and correspond to the aa sequence derived from the structural gene. In addition, this sequence exhibits striking homology to the N-terminal region of the intracellular and extracellular enzyme purified from Streptomyces glaucescens (Crameri et al., 1982). An additional open reading frame (ORF438) upstream of the mel gene, was also identified that appears to code for a protein (Mr = 14 754) with a putative signal sequence.  相似文献   

3.
T K Frey  L D Marr 《Gene》1988,62(1):85-99
The sequence of the 3' 4508 nucleotides (nt) of the genomic RNA of the Therien strain of rubella virus (RV) was determined for cDNA clones. The sequence contains a 3189-nt open reading frame (ORF) which codes for the structural proteins C, E2 and E1. C is predicted to have a length of 300 amino acids (aa). The N-terminal half of the C protein is highly basic and hydrophilic in nature, and is putatively the region of the protein which interacts with the virion RNA. At the C terminus of the C protein is a stretch of 20 hydrophobic aa which also serves as the signal sequence for E2, indicating that the cleavage of C from the polyprotein precursor may be catalyzed by signalase in the lumen of the endoplasmic reticulum. E2 is 282 aa in length and contains four potential N-linked glycosylation sites and a putative transmembrane domain near its C terminus. The sequence of E1 has been previously described [Frey et al., Virology 154 (1986) 228-232]. No homology could be detected between the amino acid sequence of the RV structural proteins and the amino acid sequence of the alphavirus structural proteins. From the position of a region of 30 nt in the RV genomic sequence which exhibited significant homology with the sequence in the alphavirus genome at which subgenomic RNA synthesis is initiated, the RV subgenomic RNA is predicted to be 3346 nt in length and the nontranslated region from the 5' end of the subgenomic RNA to the structural protein ORF is predicted to be 98 nt. In a different translation frame beginning at the 5' end of the RV nt sequence reported here is a 1407 nt ORF which is the C terminal region of the nonstructural protein ORF. This ORF overlaps the structural protein ORF by 149 nt. A low level of homology could be detected between the predicted amino acid sequence of the C-terminus of the RV nonstructural protein ORF and the replicase proteins of several positive RNA viruses of animals and plants, including nsp4 of the alphaviruses, the protein encoded by the C-terminal region of the alphavirus nonstructural ORF. However, the overall homology between RV and the alphaviruses in this region of the genome was only 18%, indicating that these two genera of the Togavirus family are only distantly related. Intriguingly, there is a 2844-nt ORF present in the negative polarity orientation of the RV sequence which could encode a 928-aa polyprotein.  相似文献   

4.
F K Chu  G F Maley  A M Wang  F Maley 《Gene》1987,57(1):143-148
The nucleotide (nt) sequence in a 757-bp [corrected] segment downstream from the intron-containing T4 phage thymidylate synthase gene (td) has been determined. This region was found to contain two open reading frames (ORFs). The first ORF(ORF2) [corrected] 261 bp [corrected] in length, is 24 [corrected] nt downstream from the td gene. The second ORF(ORF3) [corrected]) is 200 bp long at 558 [corrected] nt from the td gene and extends to the end of the Eco RI fragment. The amino acid (aa) sequence (66 aa residues) deduced from the second truncated ORF shows 59% homology to the sequence of the N-terminal portion of the ribonucleotide reductase large subunit of either Escherichia coli (B1 subunit) or mouse (M1 subunit). This tentatively identifies the truncated gene to be the 5' end of the T4 phage ribonucleotide reductase subunit B1 (nrdA) gene and pinpoints its exact location on the T4 phage genomic map. Southern hybridization analysis suggests good sequence homology among the nrdA genes of various T-even phages.  相似文献   

5.
Structure of the gene encoding the exoglucanase of Cellulomonas fimi   总被引:29,自引:0,他引:29  
G O'Neill  S H Goh  R A Warren  D G Kilburn  R C Miller 《Gene》1986,44(2-3):325-330
In Cellulomonas fimi the cex gene encodes an exoglucanase (Exg) involved in the degradation of cellulose. The gene now has been sequenced as part of a 2.58-kb fragment of C. fimi DNA. The cex coding region of 1452 bp (484 codons) was identified by comparison of the DNA sequence to the N-terminal amino acid (aa) sequence of the Exg purified from C. fimi. The Exg sequence is preceded by a putative signal peptide of 41 aa, a translational initiation codon, and a sequence resembling a ribosome-binding site five nucleotides (nt) before the initiation codon. The nt sequence immediately following the translational stop codon contains four inverted repeats, two of which overlap, and which can be arranged in stable secondary structures. The codon usage in C. fimi appears to be quite different from that of Escherichia coli. A dramatic (98.5%) bias occurs for G or C in the third position for the 35 codons utilized in the cex gene.  相似文献   

6.
《Gene》1997,184(2):273-278
Genes for the snRNP proteins U1-70K, U1-A, Sm-B′/B, Sm-D1 and Sm-E have been isolated from various metazoan species. The genes for Sm-D1 and Sm-E, which were isolated from a murine and human source respectively, appear to belong to a multigene family. It has been suggested that also for the mammalian U1-C protein such a multigene family exists. With the human U1-C cDNA as a probe, two genes containing sequences homologous to the probe sequence were isolated from a mouse genomic library. Simultaneously, a murine U1-C cDNA was isolated from a mouse cDNA library. This 0.74 kb cDNA contains an open reading frame (ORF) of 477 bp encoding a polypeptide of 159 amino acids (aa) which differs at only one position (position 65) from the human U1-C protein. One of the isolated U1-C genes contains an ORF as well and shares 92% nucleotide sequence identity with the mouse U1-C cDNA. The features of this gene, in particular the absence of introns, the acquisition of a 3′ poly(A) tail and flanking direct repeats, indicate that it represents a processed pseudogene. At the predicted aa sequence level, substitutions of conserved residues at functionally important positions are observed, strongly suggesting that expression of this gene would not lead to a functional polypeptide. The second U1-C gene appeared to be a pseudogene as well because it is also intronless and contains a frameshift mutation compared to the ORF in the mouse U1-C cDNA. The characterization of these two pseudogenes points to the existence of a U1-C multigene family in mice. Furthermore, comparison of aa sequences of the murine, human and Xenopus U1-C shows that the protein is highly conserved through evolution. Since the Xenopus U1-C differs from the two mammalian counterparts solely at a number of positions in the C-terminal region, it can be concluded that aa changes are less well tolerated in the N-terminal region of U1-C than in the rest of the protein.  相似文献   

7.
Two BamHI fragments (0.8 and 5.2 kb) of Cellulomonas fimi containing an endoglucanase (Eng) gene (cenA) were individually cloned into the BamHI site of pBR322; they expressed carboxymethylcellulase activity in Escherichia coli. The nucleotide (nt) sequence of the cenA gene was determined by sequencing overlapping deletions. The cenA gene is 1350 bp long encoding a polypeptide of 449 amino acids (aa) and stop codon. The 0.8-kb BamHI component encodes the first 76 aa, whereas the 5.2-kb BamHI component encodes the rest of the Eng. The Eng lacking the N-terminal 76 aa retains its activity and antigenicity, and it forms an active fusion protein with the N-terminal portion of the TcR determinant. The C-terminal region of the Eng is crucial for activity and a deletion of as little as 12 aa from that end results in the loss of all Eng activity. The N-terminal 31 aa of the Eng constitute a leader peptide which appears to be functional in exporting the enzyme to the periplasm in E. coli.  相似文献   

8.
《Gene》1997,189(1):73-78
A cDNA encoding a two-domain hemoglobin (Hb) chain of Daphnia magna was cloned and its nucleotide (nt) sequence of 1261 bp was determined. The nt sequence contained 74 bp of the leader sequence, 1047 bp of an open reading frame (ORF), and 119 bp of the 3′-untranslated region (UTR), excluding the polyadenylation tail. A sequence, AATACA, located 24 bp upstream from the polyA sequence was considered to be a polyadenylation signal. cDNA-derived amino acid (aa) sequence revealed that D. magna Hb chain is synthesized as a secretory precursor with a signal peptide of 18 aa. Mature D. magna Hb chain consists of 330-aa residues with a calculated molecular weight of 36 227, which is composed of two large repeated domains, domain 1 and 2. Several key aa that are invariant in all or most of other Hb and required for functional heme-binding are conserved in each of the two domains. The N-terminal extension (pre-A segment) of domain 1 was unusually long and contained an unusual threonine-rich sequence. The homology between the aa sequences of the two domains (24% identity) was much lower than that observed in other two-domain Hb chains from clams or nematode. Hb mRNA level in D. magna reared under low oxygen concentration was more than 12 times higher than that in D. magna reared with sufficient aeration, indicating that the expression of Hb gene is regulated by mRNA level.  相似文献   

9.
J Eldridge  Z Zehner  B M Paterson 《Gene》1985,36(1-2):55-63
The entire nucleotide sequence of the chicken cardiac alpha-actin (CC alpha A) gene has been determined. This is the first complete sequence of a cardiac actin gene that includes the promoter region, cap site, all the introns, and the polyadenylation site. The gene contains six introns, five of which interrupt the coding region at amino acids (aa) 41, 150, 204, 267, and 327. The first intron is in the 5'-noncoding region and is 438 bp in length. The CC alpha A gene encodes an mRNA of approx. 1400 bp with 5'- and 3'-untranslated region of 59 and 184 nucleotides (nt), respectively. Like the chicken skeletal alpha-actin gene, the CC alpha A gene has the codon for the aa cysteine between the initiator ATG and the codon for the N-terminal aspartic acid residue of the mature protein. There are no strong homologies (less than 13 consecutive nt) in the promoter or 3'-untranslated regions between the CC alpha A and chicken skeletal alpha-actin genes even though both are expressed in skeletal muscle during development. However, the 3'-untranslated region of the CC alpha A gene demonstrates significant sequence homology (76% over a 200-nt region) with the same region in the partial sequence of the human cardiac gene. The conservation of these sequence homologies between identical isoforms rather than the different alpha actin genes suggests these conserved regions may have a role in regulation rather than tissue-specific expression, as previously proposed.  相似文献   

10.
11.
The primary structure of rat ribosomal protein L9   总被引:3,自引:0,他引:3  
K Suzuki  J Olvera  I G Wool 《Gene》1990,93(2):297-300
The amino acid (aa) sequence of rat ribosomal (r) protein L9 was deduced from the nucleotide (nt) sequence in a recombinant cDNA and confirmed from the N-terminal aa sequence of the protein. L9 contains 192 aa and has an Mr of 21879. Hybridization of the cDNA to digests of nuclear DNA suggests that there are 20-23 copies of the L9 gene. The mRNA for the protein is about 800 nt in length. Rat L9 is related to Saccharomyces cerevisiae YL11, Methanococcus vannielii L6, Escherichia coli L6 and other members of the prokaryotic L6 family. The protein contains a possible internal duplication of 11 aa.  相似文献   

12.
13.
W W Mulbry 《Gene》1992,121(1):149-153
Using degenerate oligodeoxyribonucleotides (oligos) derived from the N-terminal sequence of an aryldialkylphosphatase (ADPase) from Nocardia sp. strain B-1, an amplification reaction was used to isolate a DNA segment containing a 57-bp fragment from the adpB gene. Based on the nucleotide (nt) sequence of this fragment, a nondegenerate oligo was synthesized and used to screen a subgenomic library of strain B-1 DNA for fragments containing adpB. A 3.55-kb PstI fragment containing adpB was cloned into Escherichia coli, and the nt sequence of a 1600-bp region containing adpB was determined. Under control of the lac promoter of pUC19, adpB expression in E. coli cultures was approx. 15-fold higher than in strain B-1 under the native adpB promoter. Comparison of adpB with the Flavobacterium ADPase-encoding gene, opd, revealed no significant homology at the nt or aa levels.  相似文献   

14.
An exopolygalacturonase (exo-PGase; EC 3.2.1.82) was found in the culture broth of a Bacillus isolate. The gene encoding the exo-PGase, pehK, was cloned by polymerase chain reaction using mixed primers designed from N-terminal and internal amino acid (aa) sequences of the enzyme (PehK). The determined nucleotide (nt) sequence of pehK revealed a 2940 bp open reading frame (980 aa) that encoded a putative signal sequence (27 aa) and a mature protein (953 aa; 103810 Da). The recombinant enzyme was purified to homogeneity from a culture broth of Bacillus subtilis harboring a pehK-containing plasmid. It had a molecular mass of 105 kDa and a pI value of 5.0. The maximum activity was observed at pH 8 and 55 degrees C in Tris-HCl buffer. The degradation products from polygalacturonic or oligogalacturonic acids were digalacturonic acid, like the exo-PGases, PehX of Erwinia chrysanthemi and PehB of Ralstonia solanacearum. The deduced aa sequence of PehK exhibited moderate homology to those of PehX and PehB with approx. 30% identity for both. High homology was observed in a suitably aligned internal region of the three enzymes (65% identity), and some of the conserved aa residues appeared to form the catalytic core of the enzymes.  相似文献   

15.
H P Lerch  R Frank  J Collins 《Gene》1989,83(2):263-270
The gene (L-HicDH) encoding L-2-hydroxyisocaproate dehydrogenase (L-HicDH) from Lactobacillus confusus was cloned in Escherichia coli. A 69-mer oligodeoxyribonucleotide probe, derived to be complementary to the N-terminal amino acid (aa) coding sequence, was used for screening. The complete nucleotide (nt) sequence of the L-HicDH gene was determined. The 5'-end of the mRNA was mapped by primer extension and the promoter identified. Downstream from the L-HicDH gene is a typical Rho-independent terminator. The aa sequence of L-HicDH, deduced from the nt sequence, has an overall similarity of 30% to the aa sequence of L-lactate dehydrogenase (L-LDH) from Lactobacillus casei. The aa residues involved in binding of coenzyme and substrate are highly conserved in L-HicDH with respect to prokaryotic and eukaryotic L-LDHs. The L-HicDH gene could be expressed under control of phage lambda 'Leftward' and 'rightward' promoters in E. coli up to 35% of total cell protein. The enzyme produced under these conditions exhibits full specific activity and is found exclusively in soluble form.  相似文献   

16.
Dysfunctions of the genes coding for the two chains of the human type-I procollagen result in genetic disorders that affect the integrity of bone, ligaments, tendons, and other connective tissues. While the primary amino acid (aa) sequence of one of the two type-I subunits, pro alpha 2(I), has been derived in its entirety from the analysis of overlapping cDNAs, the sequence of the first 247 aa residues of the helical domain of the other polypeptide, pro alpha 1(I), had yet to be determined. To this end, we have sequenced nearly 4 kb of the human pro alpha 1(I) collagen gene and identified twelve open reading frames whose conceptual amino acid translation exhibits 95% homology to the first 247 aa of rat alpha 1(I) chain. Furthermore, with these and other data, some of which previously unpublished, we have derived the complete sequence of the first 7618 bp of the gene. This region comprises the 25 exons encoding the N-terminal pre-propeptide and five of the eight cyanogen-bromide-derived peptides. This information therefore represents a most useful reference for the characterization of molecular defects in individuals affected by various connective tissue disorders.  相似文献   

17.
18.
N Mori  J Singer-Sam  C Y Lee  A D Riggs 《Gene》1986,45(3):275-280
A clone containing cDNA for X chromosome-linked phosphoglycerate kinase (PGK-1) was isolated from a mouse myeloma cDNA library. The nucleotide (nt) sequence of the cDNA has been determined, and the amino acid (aa) sequence of the enzyme thereby deduced. At the nt level, the coding region of mouse PGK cDNA has 93% homology with human X-linked cDNA and 60% homology with the yeast gene. Mouse PGK-1 protein contains 416 aa and is 98%, 96% and 64% homologous with human, horse, and yeast enzyme sequences, respectively.  相似文献   

19.
20.
V A David  A H Deutch  A Sloma  D Pawlyk  A Ally  D R Durham 《Gene》1992,112(1):107-112
The gene (nprV), encoding the extracellular neutral protease, vibriolysin (NprV), of the Gram- marine microorganism, Vibrio proteolyticus, was isolated from a V. proteolyticus DNA library constructed in Escherichia coli. The recombinant E. coli produced a protease that co-migrated with purified neutral protease from V. proteolyticus on non-denaturing polyacrylamide gels, and that demonstrated enzymatic specificity towards the neutral protease substrate N-[3-(2-furyl)acryloyl]-L-alanylphenylalanine amide. The nucleotide (nt) sequence of the cloned nprV gene revealed an open reading frame encoding 609 amino acids (aa) including a putative signal peptide sequence followed by a long 'pro' sequence consisting of 172 aa. The N-terminal aa sequence of NprV purified from cultures of V. proteolyticus, identified the beginning of the mature protein within the aa sequence deduced from the nt sequence. Comparative analysis of mature NprV to the sequences of the neutral proteases from Bacillus thermoproteolyticus (thermolysin) and Bacillus stearothermophilus identified extensive regions of conserved aa homology, particularly with respect to active-site residues, zinc-binding residues, and calcium-binding sites. NprV was overproduced in Bacillus subtilis by placing the DNA encoding the 'pro' and mature enzyme downstream from a Bacillus promoter and signal sequence.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号