Similar literature
1.
The secreted form of the halogenating glycoenzyme, chloroperoxidase, is processed from a precursor containing a 21-residue-long, moderately hydrophobic signal sequence, at an atypical Gln-Glu peptide bond. Following cleavage, the N-terminal glutamic acid readily cyclizes into pyroglutamic acid. Chloroperoxidase contains two high-mannose N-glycosylation sites, identified as Asn12 and Asn213. Other modifications include deamidation of residues Asn13, Asn198, and Gln183 into the corresponding acids. Finally, structural arguments suggest that Cys87 may be the axial heme ligand in the active site of chloroperoxidase.

2.
Fructose induces and glucose represses chloroperoxidase mRNA levels (cited by: 1; self-citations: 0; citations by others: 1)
The fungus Caldariomyces fumago can be induced to secrete the heme protein chloroperoxidase at levels of 500 mg/liter. Chloroperoxidase synthesis is controlled at the mRNA level. Glucose strongly represses production of chloroperoxidase mRNA and protein, whereas fructose induces both to high levels. Chloroperoxidase has been partially sequenced by automated Edman degradation of tryptic peptides. Based on this amino acid sequence data, a 2-fold degenerate, 29-base oligonucleotide (29-mer) complementary to chloroperoxidase mRNA was synthesized. Polyadenylated RNA, purified from C. fumago, was used as substrate for cDNA synthesis using the 29-mer as primer. cDNAs were made double-stranded and cloned into plasmid pBR322 by conventional methods. Screening the resultant cDNA bank by colony hybridization with the 29-mer as probe showed that 18% of the clones contained the 29-mer sequence. Dideoxy sequencing of one clone (pMA340) identified it as part of the coding region for chloroperoxidase by comparison with known amino acid sequences. In addition, the amino-terminal coding region of clone pMA340 reveals a putative signal peptide for chloroperoxidase. Clone pMA340 was then used in Northern analysis of chloroperoxidase mRNA levels under conditions which induce and repress enzyme secretion.

3.
DEK is an abundant chromatin protein in metazoans reaching copy numbers of several millions/nucleus. Previous work has shown that human DEK, a protein of 375 amino acids, has two functional DNA-binding domains, of which one resides in a central part of the molecule and contains sequences corresponding to the scaffold attachment factor-box (SAF-box) domain as found in a growing number of nuclear proteins. Isolated SAF-box peptides (amino acids 137–187) bind weakly to DNA in solution, but when many SAF-box peptides are brought into close proximity on the surface of Sephadex beads, cooperative effects lead to a high affinity to DNA. Furthermore, a peptide (amino acids 87–187) that includes a sequence on the N-terminal side of the SAF-box binds efficiently to DNA. This peptide prefers four-way junction DNA over straight DNA and induces supercoils in relaxed circular DNA just like the full-length DEK. Interestingly, however, the 87–187 amino acid peptide introduces negative supercoils in contrast to the full-length DEK, which is known to introduce positive supercoils. We found that two adjacent regions (amino acids 68–87 and 187–250) are necessary for the formation of positive supercoils. Our data contribute to the ongoing characterization of the abundant and ubiquitous DEK chromatin protein.

4.
The complete nucleotide sequence of the Pseudomonas chromosomal gene coding for the enzyme carboxypeptidase G2 (CPG2) has been determined. The nucleotide sequence obtained has been confirmed by comparing the predicted amino acid sequence with that of randomly derived peptide fragments and by N-terminal sequencing of the purified protein. The gene has been shown to code for a 22 amino acid signal peptide at its N-terminus which closely resembles the signal peptides of other secreted proteins. An alternative 36 amino acid signal peptide which may function in Pseudomonas has also been identified. The codon utilisation of the gene is influenced by the high G + C (67.2%) content of the DNA and exhibits a 92.8% preference for codons ending in G or C. This unusual codon preference may contribute to the generally observed weak expression of Pseudomonas genes in Escherichia coli. A region of DNA upstream of the structural gene has also been sequenced and a ribosome binding site and two putative promoter sequences identified.

5.
The amino acid sequence of purified gene 0.3 protein of T7, the protein responsible for overcoming host restriction, has been determined. The nucleotide sequence of the 0.3 RNA, the messenger RNA that codes for both the 0.3 protein and the gene 0.4 protein, a T7 protein of unknown function, has also been determined. The 0.3 RNA is 578 nucleotides long, 509 of which are used to code for the 2 proteins. The coding sequences do not overlap, but the termination codon for the 0.3 protein and the presumed initiation codon for the 0.4 protein do overlap in the sequence UAAUG. The 0.3 protein is very acidic: 34 of its 116 amino acids are aspartic or glutamic acid and only 6 are arginine or lysine. The 0.3 protein contains no cysteine. The nucleotide sequence predicts that the 0.4 protein consists of 50 amino acids and contains no histidine or proline. The effects of different mutations indicate that a protein which contains only the first 87 amino acids of the 0.3 protein is unable to prevent host restriction in vivo; one that contains the first 93 amino acids has weak function; and one that has the first 94 amino acids (plus 2 that are not in the wild type sequence) is fully able to prevent host restriction. The apparently critical 94th amino acid is tryptophan. The mutant 0.3 proteins that contain 87 or more amino acids appear to be reasonably stable in vivo, but those that contain 78 or fewer are apparently too unstable to have been observed by gel electrophoresis.

6.
7.
8.
Isolation and sequencing of mouse angiogenin DNA (cited by: 2; self-citations: 0; citations by others: 2)
The mouse genomic DNA for angiogenin, a potent blood vessel inducing protein, has been isolated from a bacteriophage library using the human angiogenin gene as a probe. The 1129 bp fragment contains 499 bp in the 5' flanking region, 192 bp in the 3' flanking region, and 438 bp coding for the mature protein (121 amino acids) and signal peptide (24 amino acids). Potential TATA box and AATAAA polyadenylation sequences are present, and a consensus sequence for an intron 3' boundary occurs 16 bp upstream of the Met-(24) codon, suggesting the presence of an intron in the 5' region. The protein sequence inferred from the DNA is 76% identical to that of human angiogenin, and matches the sequences obtained previously from tryptic peptides of a serum-derived mouse angiogenin. The critical catalytic residues of human angiogenin are conserved in the mouse protein, as are the six cysteines necessary for disulfide bond formation.

9.
The nucleotide sequence of a 2224 bp region of the Escherichia coli chromosome that carries the LexA regulated recN gene has been determined. A region of 1701 nucleotides encoding a polypeptide of 567 amino acids with a predicted molecular weight of 63,599 was identified as the most probable sequence for the recN structural gene. The proposed initiation codon is preceded by a reasonable Shine-Dalgarno sequence and a promoter region containing two 16 bp sequences, separated by 6 bp, that match the consensus sequence (SOS box) for binding LexA protein. DNA fragments containing this putative promoter region are shown to bind LexA in vitro and to have LexA-regulated promoter activity in vivo. The amino acid sequence of RecN predicted from the DNA contains a region that is homologous to highly conserved sequences found in several DNA repair enzymes and other proteins that bind ATP. A sequence of 9 amino acids was found to be homologous to a region of the RecA protein of E. coli postulated to have a role in DNA/nucleotide binding.

10.
Computer programs that can be used for the design of synthetic genes and that are run on an Apple Macintosh computer are described. These programs determine nucleic acid sequences encoding amino acid sequences. They select DNA sequences based on codon usage as specified by the user, and determine the placement of base changes that can be used to create restriction enzyme sites without altering the amino acid sequence. A new algorithm for finding restriction sites by translating the restriction endonuclease target sequence in all three reading frames and then searching the given peptide or protein amino acid sequence with these short restriction enzyme peptide sequences is described. Examples are given for the creation of synthetic DNA sequences for the bovine prethrombin-2 and ribonuclease A genes. Received on October 18, 1988; accepted on December 9, 1988.
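The frame-translation search described in this abstract can be illustrated with a minimal Python sketch. It is not the published Macintosh program: the function names `site_peptides` and `find_sites` are hypothetical, the standard genetic code is assumed, and the EcoRI site GAATTC is used only as a familiar example. The recognition sequence is padded with unknown bases (N) to each of the three codon frames, each padded codon is expanded into the set of amino acids it could encode, and the protein is scanned for runs of residues compatible with those sets.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order.
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}


def site_peptides(site):
    """For frame offsets 0..2, return one set of possible amino acids per codon."""
    frames = []
    for offset in range(3):
        # Pad with N so the site starts `offset` bases into a codon.
        padded = "N" * offset + site
        padded += "N" * (-len(padded) % 3)
        sets = []
        for i in range(0, len(padded), 3):
            choices = [BASES if ch == "N" else ch for ch in padded[i:i + 3]]
            codons = ("".join(c) for c in product(*choices))
            # Stop codons cannot occur inside a coding sequence.
            sets.append({CODON_TABLE[c] for c in codons} - {"*"})
        frames.append(sets)
    return frames


def find_sites(protein, site):
    """Return (residue_index, frame_offset) pairs where the site could be encoded."""
    hits = []
    for offset, sets in enumerate(site_peptides(site)):
        for pos in range(len(protein) - len(sets) + 1):
            if all(protein[pos + k] in s for k, s in enumerate(sets)):
                hits.append((pos, offset))
    return sorted(hits)
```

A hit `(pos, offset)` means the site could begin at nucleotide `3 * pos + offset` of the coding sequence, provided synonymous codons are chosen to match the recognition sequence.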

11.
Genomic DNA sequence for human C-reactive protein (cited by: 12; self-citations: 0; citations by others: 12)
The gene for the prototype acute phase reactant, C-reactive protein, has been isolated from two lambda phage libraries containing inserted human DNA fragments using synthetic oligonucleotide probes. Nucleotide sequence analysis indicates that after coding for a signal peptide of 18 amino acids and the first two amino acids of the mature protein, there is an intron of 278 base pairs followed by the nucleotide sequence for the remaining 204 amino acids. The intron is unusual in that it contains on the positive strand a poly(A) stretch 16 nucleotides long and a poly(GT) region 30 nucleotides long which could adopt the Z-form of DNA. The nucleotide sequence reported here confirms the amino acid sequence of mature C-reactive protein as originally reported except that it codes for an additional 19 amino acids beginning at position 62. Thus DNA sequence analysis predicts that the mature protein consists of 206 amino acids rather than 187 as originally reported. The mRNA cap site is located 104 nucleotides from the start of the signal peptide and there is a 3' noncoding region 1.2 kilobase pairs in length. The gene has a typical promoter containing the sequences TATAAAT and CAAT 29 and 81 base pairs upstream, respectively, of the cap site.

12.
A nucleotide sequence of 2271 base pairs has been determined from cloned E. coli DNA which contains ompA. Within that sequence, starting at nucleotide 1037, an open translational reading frame encodes a protein of 367 amino acids which, starting with amino acid 22, agrees with the primary structure of protein II. The preceding 21 amino acids constitute a typical signal sequence. There is a non-translated region of 360 nucleotides in front of the translational start. The insertion point of an IS1 element 110 nucleotides upstream from the start codon and an amber codon at the position of amino acid residue 28 have been localized in the DNA from two ompA mutants.

13.
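This paper describes a computer program system for nucleic acid sequence analysis implemented on a microcomputer (IBM PC). The system consists of three levels and 18 functional modules; menus and interactive dialogue let users learn and use it quickly. The implementation uses data-structure techniques such as tree structures, first-in-last-out stacks, and sparse matrices, statistical methods such as the Bayes method, and graph-theoretic methods such as Kruskal's and Floyd's algorithms. The release of this software system should benefit molecular biology research.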

14.
Amino acid sequence of a specific antigenic peptide of protein B23 (cited by: 6; self-citations: 0; citations by others: 6)
A specific antigenic peptide was obtained from protein B23 (Mr/pI = 37,000/5.1) after 30 min of digestion with staphylococcal V8 protease (10 micrograms/ml/mg protein B23). The antigenic peptide was purified by DEAE-cellulose chromatography and high pressure liquid chromatography on a reverse-phase C18 column. The antigenic peptide contains 14.7 and 18.7 mol% of glutamic acid and lysine, respectively. Amino acid sequence analysis showed that the peptide has 68 amino acids and is located on the carboxyl-terminal sequence of protein B23. The sequence is Ser-Phe-Lys-Lys-Gln-Glu-Lys-Thr-Pro-Lys-Thr-Pro-Lys-Gly-Pro-Ser-Ser-Val-Glu-Asp-Ile-Lys-Ala-Lys-Met-Gln-Ala-Ser-Ile-Glu-Lys-Gly-Gly-Ser-Leu-Pro-Lys-Val-Glu-Ala-Lys-Phe-Ile-Asn-Tyr-Val-Lys-Asn-Cys-Phe-Arg-Met-Thr-Asp-Gln-Glu-Ala-Ile-Gln-Asp-Leu-Trp-Gln-Trp-Arg-Lys-Ser-Leu-COOH. Extensive digestion of the antigenic peptide with V8 protease, trypsin, or chymotrypsin results in loss of the antigenic activity. Three cloned cDNAs (hpB1, hpB2, and hpB7) which code for the 82 amino acids at the COOH terminus of protein B23 and the 3' non-translated sequence were identified and characterized. All three clones have identical nucleotide sequences coding for the antigenic portion of the protein (68 amino acids at the COOH terminus), the stop codon, and the 3' non-translated region. However, mutation of 6 nucleotide bases of one clone (hpB2) caused changes in 4 amino acids in the sequence just preceding the immunoreactive region. The result suggests the presence of at least 2 immunologically similar but distinct proteins which are both recognized by the anti-B23 antibody.

15.
Mast cell carboxypeptidase A has been isolated from the secretory granules of mouse peritoneal connective tissue mast cells (CTMC) and from a mouse Kirsten sarcoma virus-immortalized mast cell line (KiSV-MC), and a cDNA that encodes this exopeptidase has been cloned from a KiSV-MC-derived cDNA library. KiSV-MC-derived mast cell carboxypeptidase A was purified with a potato-derived carboxypeptidase-inhibitor affinity column and was found by analytical sodium dodecyl sulfate-polyacrylamide gel electrophoresis to be a Mr 36,000 protein. Secretory granule proteins from KiSV-MC and from mouse peritoneal CTMC were then resolved by preparative sodium dodecyl sulfate-polyacrylamide gel electrophoresis and transblotted to polyvinylidene difluoride membranes. Identical amino-terminal amino acid sequences were obtained for the prominent Mr 36,000 protein present in the granules of both cell types. Based on the amino-terminal sequence, an oligonucleotide probe was synthesized and used to isolate a 1,470-base pair cDNA that encodes this mouse exopeptidase. The deduced amino acid sequence revealed that, after cleavage of a 15-amino acid hydrophobic signal peptide and a 94-amino acid activation peptide from a 417-amino acid preproenzyme, the mature mast cell carboxypeptidase A protein core has a predicted Mr of 35,780 and a high positive charge [(Lys + Arg) - (Asp + Glu) = 17] at neutral pH. Although critical zinc-binding amino acids (His67, Glu70, His195), substrate-binding amino acids (Arg69, Asn142, Arg143, Tyr197, Asp255, Phe278), and cysteine residues that participate in intrachain disulfide bonds (Cys64-Cys77, Cys136-Cys159) of pancreatic carboxypeptidases were also present in mast cell carboxypeptidase A, the overall amino acid sequence identities for mouse mast cell carboxypeptidase A relative to rat pancreatic carboxypeptidases A1, A2, and B were only 43, 41, and 53%, respectively. RNA and DNA blot analyses revealed that mouse peritoneal CTMC, KiSV-MC, and bone marrow-derived mast cells all express a prominent 1.5-kilobase mast cell carboxypeptidase A mRNA which is transcribed from a single gene. We conclude that mouse mast cell carboxypeptidase A is a prominent secretory granule enzyme of mast cells of the CTMC subclass and represents a novel addition to the carboxypeptidase gene family.

16.
A 2.9 kbp region from within the inverted repeat of Nicotiana chloroplast DNA hybridized with a chloroplast DNA fragment from Euglena containing the complete rps12 gene coding for ribosomal protein S12. Nucleotide sequencing within this region revealed the existence of two rps12 coding stretches interrupted by 540 bp having class II intron structure. Joining and decoding the exon regions produced a sequence of 85 amino acids colinear and 81% homologous to the S12 protein of Euglena chloroplasts and E. coli, starting from amino acid residue 38 to the stop codon. Immediately upstream of codon 38, conserved intron sequences were located. However, the 5' 37 codons of Nicotiana chloroplast rps12 could not be identified by electron microscopy of RNA-DNA hybrids within a DNA region extending 4000 bp upstream of codon 38, nor by computer search of a completely sequenced region extending for more than 9000 bp upstream of this codon. In E. coli, alteration in rps12 codons 42 or 87 causes streptomycin resistance. However, the nucleotide sequences of the identified rps12 exons in two Nicotiana chloroplast mutants resistant to streptomycin were found to be identical to that of wild type.

17.
The evolutionary potential of a gene is constrained not only by the amino acid sequence of its product, but by its DNA sequence as well. The topology of the genetic code is such that half of the amino acids exhibit synonymous codons that can reach different subsets of amino acids from each other through single mutation. Thus, synonymous DNA sequences should access different regions of the protein sequence space through a limited number of mutations, and this may deeply influence the evolution of natural proteins. Here, we demonstrate that this feature can be of value for manipulating protein evolvability. We designed an algorithm that, starting from an input gene, constructs a synonymous sequence that systematically includes the codons with the most different evolutionary perspectives; i.e., codons that maximize accessibility to amino acids previously unreachable from the template by point mutation. A synonymous version of a bacterial antibiotic resistance gene was computed and synthesized. When concurrently submitted to identical directed evolution protocols, both the wild type and the recoded sequence led to the isolation of specific, advantageous phenotypic variants. Simulations based on a mutation isolated only from the synthetic gene libraries were conducted to assess the impact of sub-functional selective constraints, such as codon usage, on natural adaptation. Our data demonstrate that rational design of synonymous synthetic genes stands as an affordable improvement to any directed evolution protocol. We show that using two synonymous DNA sequences improves the overall yield of the procedure by increasing the diversity of mutants generated. These results provide conclusive evidence that synonymous coding sequences do experience different areas of the corresponding protein adaptive landscape, and that a sequence's codon usage effectively constrains the evolution of the encoded protein.
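The idea that synonymous codons reach different amino acids by single point mutation can be made concrete with a short Python sketch. This is a simplified illustration under the standard genetic code, not the authors' algorithm: the helper names `reachable` and `most_divergent_synonym` are hypothetical, and the per-codon greedy choice ignores any whole-gene considerations the published method may apply.

```python
BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order.
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}


def reachable(codon):
    """Amino acids reachable from `codon` by one base change (no stops, no self)."""
    own = CODON_TABLE[codon]
    found = set()
    for i in range(3):
        for base in BASES:
            if base != codon[i]:
                aa = CODON_TABLE[codon[:i] + base + codon[i + 1:]]
                if aa not in ("*", own):
                    found.add(aa)
    return found


def most_divergent_synonym(codon):
    """Pick the synonymous codon that reaches the most amino acids the template cannot."""
    base_set = reachable(codon)
    synonyms = [c for c, aa in CODON_TABLE.items()
                if aa == CODON_TABLE[codon] and c != codon]
    best = max(synonyms, key=lambda c: len(reachable(c) - base_set), default=codon)
    # Keep the template codon when no synonym opens up anything new.
    return best if reachable(best) - base_set else codon
```

For serine, for example, the template codon TCT reaches Pro, Thr, Ala, Phe, Tyr, and Cys, while the synonymous AGT and AGC codons additionally reach Arg, Gly, Ile, and Asn; methionine has no synonym, so ATG is returned unchanged.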

18.
A lambda gt11 cDNA library prepared from human liver poly(A) RNA has been screened with affinity-purified antibody to human factor XI, a blood coagulation factor composed of two identical polypeptide chains linked by a disulfide bond(s). A cDNA insert coding for factor XI was isolated and shown to contain 2097 nucleotides, including 54 nucleotides coding for a leader peptide of 18 amino acids and 1821 nucleotides coding for 607 amino acids that are present in each of the 2 chains of the mature protein. The cDNA for factor XI also contained a stop codon (TGA), a potential polyadenylation or processing sequence (AACAAA), and a poly(A) tail at the 3' end. Five potential N-glycosylation sites were found in each of the two chains of factor XI. The cleavage site for the activation of factor XI by factor XIIa was identified as an internal peptide bond between Arg-369 and Ile-370 in each polypeptide chain. This was based upon the amino acid sequence predicted by the cDNA and the amino acid sequence previously reported for the amino-terminal portion of the light chain of factor XI. Each heavy chain of factor XIa (369 amino acids) was found to contain 4 tandem repeats of 90 (or 91) amino acids plus a short connecting peptide. Each repeat probably forms a separate domain containing three internal disulfide bonds. The light chains of factor XIa (each 238 amino acids) contain the catalytic portion of the enzyme with sequences that are typical of the trypsin family of serine proteases. The amino acid sequence of factor XI shows 58% identity with human plasma prekallikrein.

19.
20.
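A preliminary study of coding sequences lacking 3-base periodicity (cited by: 4; self-citations: 0; citations by others: 4)
Fourier spectrum analysis of 120 short coding sequences (<1,200 bp) shows that 3-base periodicity is not always present in short coding sequences. Statistical analysis suggests that the presence or absence of 3-base periodicity is related to the base composition and distribution of the sequence, to the choice and order of amino acids in the encoded protein, and to synonymous codon usage. In general, A+U content exceeds G+C content in non-period-3 sequences, whereas the opposite holds for period-3 sequences; bases are distributed more evenly across the three codon positions in non-period-3 sequences than in period-3 sequences; and codon and amino acid usage biases are weaker in non-period-3 sequences than in period-3 sequences. These phenomena should be taken into account when using Fourier analysis to predict genes and exons in DNA sequences.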
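A common way to measure 3-base periodicity is to evaluate the Fourier power of the four base-indicator sequences at frequency 1/3. The Python sketch below shows that calculation; it is an assumption about the general technique, not the procedure or normalization used in this study, and the function name `period3_power` is hypothetical.

```python
import cmath


def period3_power(seq):
    """Summed power of the A, C, G, T indicator sequences at frequency 1/3, per base."""
    seq = seq.upper().replace("U", "T")  # accept RNA input as well
    if not seq:
        raise ValueError("sequence must not be empty")
    total = 0.0
    for base in "ACGT":
        # Discrete Fourier transform of the 0/1 indicator for this base at f = 1/3.
        coeff = sum(cmath.exp(-2j * cmath.pi * k / 3)
                    for k, ch in enumerate(seq) if ch == base)
        total += abs(coeff) ** 2
    return total / len(seq)
```

A sequence whose bases are confined to fixed codon positions, such as a repeated ATG, gives a large value that grows with length, whereas a sequence with bases spread evenly over the three codon positions gives a value near zero, consistent with the abstract's observation about non-period-3 sequences.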
