Similar literature
20 similar articles found (search time: 31 ms)
1.
A lambda gt11 cDNA library prepared from human liver poly(A) RNA has been screened with affinity-purified antibody to human factor XI, a blood coagulation factor composed of two identical polypeptide chains linked by a disulfide bond(s). A cDNA insert coding for factor XI was isolated and shown to contain 2097 nucleotides, including 54 nucleotides coding for a leader peptide of 18 amino acids and 1821 nucleotides coding for 607 amino acids that are present in each of the 2 chains of the mature protein. The cDNA for factor XI also contained a stop codon (TGA), a potential polyadenylation or processing sequence (AACAAA), and a poly(A) tail at the 3' end. Five potential N-glycosylation sites were found in each of the two chains of factor XI. The cleavage site for the activation of factor XI by factor XIIa was identified as an internal peptide bond between Arg-369 and Ile-370 in each polypeptide chain. This was based upon the amino acid sequence predicted by the cDNA and the amino acid sequence previously reported for the amino-terminal portion of the light chain of factor XI. Each heavy chain of factor XIa (369 amino acids) was found to contain 4 tandem repeats of 90 (or 91) amino acids plus a short connecting peptide. Each repeat probably forms a separate domain containing three internal disulfide bonds. The light chains of factor XIa (each 238 amino acids) contain the catalytic portion of the enzyme with sequences that are typical of the trypsin family of serine proteases. The amino acid sequence of factor XI shows 58% identity with human plasma prekallikrein.
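
The lengths quoted in this abstract can be cross-checked with simple codon arithmetic. The sketch below is only an illustrative consistency check, not part of the original study; the variable and function names are hypothetical, and the only inputs are the figures stated in the abstract.

```python
# Figures quoted in the abstract; all names here are illustrative assumptions.
INSERT_NT = 2097       # total length of the cDNA insert
LEADER_NT = 54         # nucleotides coding for the leader peptide
MATURE_NT = 1821       # nucleotides coding for each mature chain
HEAVY_CHAIN_AA = 369   # heavy chain of factor XIa
LIGHT_CHAIN_AA = 238   # light chain of factor XIa


def codons(nucleotides: int) -> int:
    """Return the number of whole codons in a coding stretch."""
    if nucleotides % 3 != 0:
        raise ValueError("coding length is not a multiple of 3")
    return nucleotides // 3


leader_aa = codons(LEADER_NT)   # residues in the leader peptide
mature_aa = codons(MATURE_NT)   # residues in each mature chain

# Whatever is left of the insert is not translated into the leader or the
# mature chain (stop codon, untranslated sequence, poly(A) tail).
remaining_nt = INSERT_NT - LEADER_NT - MATURE_NT

print(leader_aa, mature_aa, HEAVY_CHAIN_AA + LIGHT_CHAIN_AA, remaining_nt)
```

The heavy and light chain lengths sum to the mature chain length, which is consistent with a single activation cleavage between Arg-369 and Ile-370.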

2.
Lactocin S, a bacteriocin produced by Lactobacillus sake L45, has been purified to homogeneity by ion exchange, hydrophobic interaction and reverse-phase chromatography, and gel filtration. The purification resulted in approximately a 40,000-fold increase in the specific activity of lactocin S and enabled the determination of a major part of the amino acid sequence. Judging from the amino acid composition, lactocin S contained approximately 33 amino acid residues, of which about 50% were the nonpolar amino acids alanine, valine, and leucine. Amino acids were not detected upon direct N-terminal sequencing, indicating that the N-terminal amino acid was blocked. By cyanogen bromide cleavage at an internal methionine, the sequence of the 25 amino acids (including the methionine at the cleavage site) in the C-terminal part of the molecule was determined. The sequence was Met-Glu-Leu-Leu-Pro-Thr-Ala-Ala-Val-Leu-Tyr-Xaa-Asp-Val-Ala-Gly-Xaa-Phe-Lys-Tyr-Xaa-Ala-Lys-His-His, where Xaa represents unidentified residues. It is likely that the unidentified residues are modified forms of cysteine or amino acids associated with cysteine, since two cysteic acids per lactocin S molecule were found upon performic acid oxidation of lactocin S. The sequence was unique when compared to the SWISS-PROT data bank.
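
As a small illustration, the C-terminal fragment quoted above can be split into residues to confirm its length and locate the unidentified positions. This is a minimal sketch that uses only the sequence string from the abstract; the variable names are hypothetical.

```python
from collections import Counter

# C-terminal fragment exactly as quoted in the abstract (three-letter codes).
FRAGMENT = (
    "Met-Glu-Leu-Leu-Pro-Thr-Ala-Ala-Val-Leu-Tyr-Xaa-Asp-Val-Ala-Gly-"
    "Xaa-Phe-Lys-Tyr-Xaa-Ala-Lys-His-His"
)

residues = FRAGMENT.split("-")

# 1-based positions of the unidentified residues within the fragment.
unknown_positions = [i for i, r in enumerate(residues, start=1) if r == "Xaa"]

# Residue composition of the fragment only, not of the whole molecule.
composition = Counter(residues)

print(len(residues), unknown_positions, composition.most_common(3))
```

Note that the composition computed here covers only the sequenced fragment, so it is not expected to match the roughly 50% Ala/Val/Leu content reported for the whole molecule.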

4.
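The internalin B (InlB) gene of the wild-type Listeria monocytogenes strain TA was amplified by PCR, the sequence and structure of the encoded molecule were analyzed, and the gene was cloned into the Escherichia coli expression vector pET28a for induced expression. The gene is 1893 bp long and encodes 630 amino acids, of which the first 35 residues form the signal peptide. From the N-terminus to the C-terminus, the deduced InlB amino acid sequence comprises one α-helical Cap domain, six leucine-rich repeat (LRR) motifs, one immunoglobulin-like domain (IR), one B repeat, and three tandem GW domains. It also contains five potential N-linked glycosylation sites, and Leu accounts for 10.2% of all amino acid residues. Compared with the InlB genes of 18 different epidemic strains reported in GenBank, the nucleotide and deduced amino acid sequence homologies ranged from 91.1% to 99.6% and from 92.3% to 99.8%, respectively. SDS-PAGE and Western blot analysis of the recombinant cell lysate confirmed that the gene was correctly expressed. The recombinant InlB protein was purified on a Ni2+ affinity chromatography column.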

5.
The complete amino acid sequence of human complement factor H.   Cited by: 17 (self-citations: 2; citations by others: 17)
The complete amino acid sequence of the human complement system regulatory protein, factor H, has been derived from sequencing three overlapping cDNA clones. The sequence consists of 1213 amino acids arranged in 20 homologous units, each about 60 amino acids long, and an 18-residue leader sequence. The 60-amino-acid-long repetitive units are homologous with those found in a large number of other complement and non-complement proteins. Two basic C-terminal residues, deduced from the cDNA sequence, are absent from factor H isolated from outdated plasma. A tyrosine/histidine polymorphism was observed within the seventh homologous repeat unit of factor H. This is likely to represent a difference between the two major allelic variants of factor H. The nature of the cDNA clones indicates that there is likely to be an alternative splicing mechanism, resulting in the formation of at least two species of factor H mRNA.

6.
We report here a partial primary structure for human complement protein H. Tryptic peptides comprising 27% of the H molecule were isolated by conventional techniques and were sequenced (333 amino acid residues). Several mixed-sequence oligonucleotide probes were constructed, based on the peptide sequence data, and were used to screen a human liver cDNA library. The largest recombinant plasmid (pH1050), which hybridized with two probes, was further characterized. The cDNA insert of this plasmid contained coding sequence (672 bp) for 224 amino acids of H. The 3' end of this clone had a polyadenylated tail preceded by a polyadenylation recognition site (ATTAAA) and a 3'-untranslated region (229 bp). Four regions of internal homology, each about 60 amino acids in length, were observed in the derived protein sequence from this cDNA clone, and a further seven from the tryptic peptide sequences. The consensus sequence for each of the repetitive units of H was four cysteines, two prolines, three glycines, one tryptophan, and two tyrosines/phenylalanines. Based on the mole percent values for each of these amino acids, it is likely that H is composed of about 20 repetitive units of this nature. Furthermore, the repetitive unit of H shows pronounced homology with the Ba fragment of B, the C4b binding protein, and beta 2-glycoprotein I. Therefore, it seems that at least portions of these proteins have evolved from a common ancestral DNA element.

7.
The annexins are a widespread family of calcium-dependent membrane-binding proteins. No common function has been identified for the family and, until recently, no crystallographic data existed for an annexin. In this paper we draw together 22 available annexin sequences consisting of 88 similar repeat units, and apply the techniques of multiple sequence alignment, pattern matching, secondary structure prediction and conservation analysis to the characterisation of the molecules. The analysis clearly shows that the repeats cluster into four distinct families and that greatest variation occurs within the repeat 3 units. Multiple alignment of the 88 repeats shows amino acids with conserved physicochemical properties at 22 positions, with only Gly at position 23 being absolutely conserved in all repeats. Secondary structure prediction techniques identify five conserved helices in each repeat unit and patterns of conserved hydrophobic amino acids are consistent with one face of a helix packing against the protein core in predicted helices a, c, d, e. Helix b is generally hydrophobic in all repeats, but contains a striking pattern of repeat-specific residue conservation at position 31, with Arg in repeats 4 and Glu in repeats 2, but unconserved amino acids in repeats 1 and 3. This suggests repeats 2 and 4 may interact via a buried salt bridge. The loop between predicted helices a and b of repeat 3 shows features distinct from the equivalent loop in repeats 1, 2 and 4, suggesting an important structural and/or functional role for this region. No compelling evidence emerges from this study for uteroglobin and the annexins sharing similar tertiary structures, or for uteroglobin representing a derivative of a primordial one-repeat structure that underwent duplication to give the present day annexins. The analyses performed in this paper are re-evaluated in the Appendix, in the light of the recently published X-ray structure for human annexin V. The structure confirms most of the predictions and shows the power of techniques for the determination of tertiary structural information from the amino acid sequences of an aligned protein family.

8.
The complete amino acid sequence of human antileukoprotease has been determined by direct sequencing of the inhibitory active protein isolated from seminal plasma (HUSI-I) and by sequence analysis of cDNA reverse-transcribed from mRNA prepared from cervical tissue. The inhibitor (Mr 11726) consists of 107 amino acid residues including 16 cysteines presumably forming disulfide bonds. The molecule comprises two consecutive domains which are homologous to each other, to the second domain of the basic protease inhibitor from Red Sea turtle (chelonianin) and to both domains of the whey proteins of rat and mouse. Both domains contain a pattern of cysteines known as the 'four-disulfide-core' that has also been found in wheat germ agglutinin and neurophysin.

9.
Escherichia coli DNA topoisomerase I catalyzes interconversions of different DNA topological isomers by the breakage and rejoining of DNA phosphodiester bonds. It has a crucial role in maintaining an optimal DNA superhelicity in E. coli. It is a single polypeptide of 864 amino acids. Analysis of the amino acid sequence reveals three tandem repeat units each containing two pairs of cysteines suggesting that each unit may form a zinc-binding domain. We have determined that each enzyme molecule contains three to four zinc atoms using inductively coupled plasma-atomic emission analysis. Modification of the cysteine residues and removal of the zinc from the enzyme result in loss of activity. Zinc ions are needed for full recovery of enzyme activity when the cysteine modification is reversed. Comparison with the zinc-binding domains of the sequence-specific DNA-binding proteins shows significant differences.

10.
Characterization of rat heart tropoelastin   Cited by: 1 (self-citations: 0; citations by others: 1)
Several overlapping rat tropoelastin cDNA clones were isolated from a lambda gt11 rat heart cDNA library and their nucleotide sequence was determined. The corresponding deduced amino acid sequence of rat tropoelastin revealed strong homology to bovine and human tropoelastins although possessing some unique features including greater size (18%) and composition of repetitive units. Comparison of the amino acid sequence of rat tropoelastin to four other tropoelastin species reveals that the hydrophobic peptide repeat regions in the middle of each molecule and the crosslinking areas containing three lysine residues are remarkably conserved. A possible function for the clustering of three lysine residues in providing a mechanism for the in vivo reduction of dehydrolysinonorleucine via a redox shuttle with dihydrodesmosine is proposed. In addition, the COOH-terminal sequence of the rat tropoelastin is virtually identical to tropoelastins of other species in possessing a cysteine/arginine/lysine containing segment. There are no obvious amino acid insertions or substitutions in the COOH-terminal half of the rat tropoelastin molecule which would signal unique cleavage or glycosylation sites. Examination of the steady-state levels of rat tropoelastin mRNA in 8- and 12-day neonatal lung, heart, and aortic tissues showed that the amount of tropoelastin mRNA was abundant and of similar size (3.9 kb) in all three tissues.

11.
The complete sequence coding for the 57-kDa major soluble antigen of the salmonid fish pathogen, Renibacterium salmoninarum, was determined. The gene contained an open reading frame of 1671 nucleotides coding for a protein of 557 amino acids with a calculated M(r) value of 57,190. The first 26 amino acids constituted a signal peptide. The deduced sequence for amino acid residues 27-61 was in agreement with the 35 N-terminal amino acid residues determined by microsequencing, suggesting the protein is synthesized as a 557-amino acid precursor and processed to produce a mature protein of M(r) 54,505. Two regions of the protein contained imperfect direct repeats. The first region contained two copies of an 81-residue repeat, the second contained five copies of an unrelated 25-residue repeat. Also, a perfect inverted repeat (including three in-frame UAA stop codons) was observed at the carboxyl-terminus of the gene.

12.
Sequence analysis of the gtfB gene from Streptococcus mutans.   Cited by: 52 (self-citations: 13; citations by others: 39)
The nucleotide sequence of the gtfB gene from Streptococcus mutans GS-5, coding for glucosyltransferase I activity, was determined. The gene codes for a strongly hydrophilic protein with a molecular size of 165,800 daltons. The deduced amino acid sequence revealed a typical gram-positive bacterial signal sequence at the NH2 terminus of the protein and 3.5 direct repeating units (each containing 65 amino acids) at the COOH terminus. Nucleotide sequencing of the region immediately downstream from the gtfB gene revealed the presence of a putative gene coding for an extracellular protein. This open reading frame is partially homologous to the gtfB gene.

13.
N-terminal and cDNA characterization of murine lymphocyte antigen Ly-6C.2   Cited by: 5 (self-citations: 0; citations by others: 5)
The Ly-6C.2 molecule was purified from K36 tumor cells by affinity chromatography and gel filtration. The electrophoretically homogeneous preparation, with m.w. 15,000, was tested with a panel of antibodies that confirmed the presence of the Ly-6C.2 epitope. An N-terminal sequence of 39 amino acids was obtained showing 59% homology with the corresponding portion of the Ly-6A.2 polypeptide. Based on the least homologous (29%) 14 amino acid segment, an oligonucleotide probe was constructed, and Ly-6C.2 cDNA was cloned from a BW5147 cDNA library. A 794-base pair cDNA containing the entire coding region had 82% homology with Ly-6A.2 cDNA. The encoded polypeptide sequence of 131 amino acids containing a perfect correlation with the N-terminal sequence data was 63% homologous with that of Ly-6A.2. The greatest homology was in the leader, first 16 N-terminal and last 39 C-terminal amino acids. The latter are likely to be important in determining the attachment of glycophosphatidylinositol. Despite results indicating fewer disulfide constraints in the Ly-6C molecule, the predicted sequence contains 10 cysteine residues nearly perfectly matched with those predicted in Ly-6A.

14.
The amino acid sequence of subunit A of the potato chymotryptic inhibitor I was determined. The sequence was deduced from analysis of fragments and peptides derived from the protein by cleavage with cyanogen bromide, N-bromosuccinimide and dilute acid, and by digestion with trypsin, thermolysin, pepsin and papain. The molecule consists of a single polypeptide chain of 84 residues, which contains two homologous regions each of 13 amino acids. The protein does not appear to be homologous with any other known proteinase inhibitors.

15.
The complete sequence coding for the 57-kDa major soluble antigen of the salmonid fish pathogen, Renibacterium salmoninarum, was determined. The gene contained an open reading frame of 1671 nucleotides coding for a protein of 557 amino acids with a calculated M(r) value of 57190. The first 26 amino acids constituted a signal peptide. The deduced sequence for amino acid residues 27–61 was in agreement with the 35 N-terminal amino acid residues determined by microsequencing, suggesting the protein is synthesized as a 557-amino acid precursor and processed to produce a mature protein of M(r) 54505. Two regions of the protein contained imperfect direct repeats. The first region contained two copies of an 81-residue repeat, the second contained five copies of an unrelated 25-residue repeat. Also, a perfect inverted repeat (including three in-frame UAA stop codons) was observed at the carboxyl-terminus of the gene.

16.
A small enzyme showing diaphorase activity was purified from culture supernatant of Clostridium kluyveri and its N-terminal amino acid sequence was determined. This sequence identified a gene (diaA) encoding a protein (DiaA) of 229 amino acids with a predicted molecular weight of 24,981 in the genomic DNA sequence database of C. kluyveri constructed by the Research Institute of Innovative Technology for the Earth. The predicted protein was composed of a flavin reductase-like domain and a rubredoxin-like domain from its N-terminus. The diaA gene was cloned into an expression vector and expressed in an Escherichia coli recombinant. Recombinant enzyme rDiaA showed NADH/NADPH diaphorase activity with 2,6-dichlorophenolindophenol and nitro blue tetrazolium. The enzyme was most active at pH 8.0 at 40 °C. The UV-visible absorption spectrum and thin layer chromatography (TLC) analyses indicated that one rDiaA molecule contained a tightly bound FMN molecule as a prosthetic group. An iron molecule was also detected in an enzyme molecule.

17.
Thrombospondin is one of a class of adhesive glycoproteins that mediate cell-to-cell and cell-to-matrix interactions. We have used two monoclonal antibodies to isolate cDNA clones of thrombospondin from a human endothelial cell cDNA library and have determined the complete nucleotide sequence of the coding region. Three regions of known amino acid sequence of human platelet thrombospondin confirm that the clones are authentic. Three types of repeating amino acid sequence are present in thrombospondin. The first is 57 amino acids long and shows homology with circumsporozoite protein from Plasmodium falciparum. The second is 50-60 amino acids long and shows homology with epidermal growth factor precursor. The third occurs as a continuous eightfold repeat of a 38-residue sequence; structural homology with parvalbumin and calmodulin indicates that these repeats constitute the multiple calcium-binding sites of thrombospondin. The amino acid sequence Arg-Gly-Asp-Ala is included in the last type 3 repeat. This sequence is probably the site for the association of thrombospondin with cells. In addition, localized homologies with procollagen, fibronectin, and von Willebrand factor are present in one region of the thrombospondin molecule.

18.
The location of 16 of the 18 disulfide bonds in human plasma prekallikrein was determined by amino acid sequence analysis of cystinyl peptides produced by chemical and enzymatic digestions. A unique structure, named the apple domain, was established for each of the four tandem repeats in the amino-terminal portion of the molecule. The apple domains (90 or 91 amino acids) contain 3 highly conserved disulfide bonds linking the first and sixth, second and fifth, and third and fourth half-cystine residues present in each repeat. The fourth tandem repeat contains an extra disulfide bond that forms a second small loop within the apple domain. The carboxyl-terminal portion of plasma prekallikrein containing the catalytic region of the molecule was found to have disulfide bonds located in positions similar to those of other serine proteases.

19.
The amino acid sequence of an insect apolipoprotein, apolipophorin-III from Manduca sexta, was determined by a combination of cDNA and protein sequencing. The mature hemolymph protein consists of 166 amino acids. The cDNA also encodes an amino-terminal extension of 23 amino acids which is not represented in the mature hemolymph protein. The existence of a precursor protein was confirmed by in vitro translation of fat body mRNA. Computer-assisted comparative sequence analysis revealed the following points: 1) the protein is composed of tandemly repeating tetradecapeptide units with a high potential for forming amphiphilic helical structures. Compared to mammalian apolipoproteins the repeat units in the insect apolipoprotein show considerable length variability; 2) the sequence has a striking resemblance to several human apolipoproteins including apoE, AIV, AI, and CI. However, the homology seems to be entirely functional since, although the insect and mammalian apoproteins contain very similar types of amino acid residues, the actual degree of sequence identity is quite low. Whether the mammalian and insect apoproteins are derived from a common ancestral amphiphilic helix forming, lipid-binding protein, or arose by convergent evolution cannot be determined at present. This represents the first complete amino acid sequence for an insect apolipoprotein.

20.
A DNA sequence of 4,592 nucleotides (nt) was derived for the nonpathogenic ADV-G strain of Aleutian mink disease parvovirus (ADV). The 3' (left) end of the virion strand contained a 117-nt palindrome that could assume a Y-shaped configuration similar to, but less stable than, that of other parvoviruses. The sequence obtained for the 5' end was incomplete and did not contain the 5' (right) hairpin structure but ended just after a 25-nt A + T-rich direct repeat. Features of ADV genomic organization are (i) major left (622 amino acids) and right (702 amino acids) open reading frames (ORFs) in different translational frames of the plus-sense strand, (ii) two short mid-ORFs, (iii) eight potential promoter motifs (TATA boxes), including ones at 3 and 36 map units, and (iv) six potential polyadenylation sites, including three clustered near the termination of the right ORF. Although the overall homology to other parvoviruses is less than 50%, there are short conserved amino acid regions in both major ORFs. However, two regions in the right ORF allegedly conserved among the parvoviruses were not present in ADV. At the DNA level, ADV-G is 97.5% related to the pathogenic ADV-Utah 1. A total of 22 amino acid changes were found in the right ORF; changes were found in both hydrophilic and hydrophobic regions and generally did not affect the theoretical hydropathy. However, there is a short heterogeneous region at 64 to 65 map units in which 8 out of 11 residues have diverged; this hypervariable segment may be analogous to short amino acid regions in other parvoviruses that determine host range and pathogenicity. These findings suggested that this region may harbor some of the determinants responsible for the differences in pathogenicity of ADV-G and ADV-Utah 1.
