首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
A special matrix of amino acid antigenic similarity for computer detection of the potential antigenic proximity of unrelated proteins is proposed. The matrix was built using the data concerning affinities of amino acid residue interactions between subunits in oligomeric proteins. The diagonal elements of the matrix characterize the recognition of amino acid residues and the non-diagonal ones represent the relative similarity measure of antibody--amino acid residue interactions specificity. The application of the new matrix for comparing proteins allows the hydrophilic potentially immunologically active regions of sequences to be picked out as similar fragments. When the influenza virus hemagglutinin was compared with 116 human proteins, eight fragments were picked out, that could not be determined by means of the routinely used MDM78 matrix. The antigenic similarity matrix for defining the forbidden structures is proposed to be used for preparing the peptidic antiviral vaccines.  相似文献   

2.
A principal component analysis based on the physico-chemical properties of amino acid residues is developed to assign similar regions between distantly related families of proteins, taking account of the species diversities in respective families. The most important advantage of this analysis should be that it reflects different physico-chemical properties and thus can predict more detailed structural properties, including the transmembrane helices, than the hydropathy analysis. Its first application reconfirms the similarity between the core proteins of photosynthetic reaction center in purple bacteria and those of photosystem II, indicating that the low percentage of identical amino acid residues estimated previously between them is due to much allowance for amino acid substitutions in purple bacteria. The application of this analysis to the core proteins of photosystem I reveals that any of these proteins includes two domains, each showing high similarity to the amino acid sequences of core proteins in photosystem II and purple bacteria. A core structure model of A1 and A2 proteins folded into four layers of sheets of transmembrane helices is proposed to provide a molecular basis for the electron pathway suggested by spectroscopic experiments as well as for the interaction sites with plastocyanin, 9 kDa protein and LHC proteins.  相似文献   

3.
The presence in proteins of amino acid residues that change in concert during evolution is associated with keeping constant the protein spatial structure and functions. As in the case with morphological features, correlated substitutions may become the cause of homoplasies--the independent evolution of identical non-homological adaptations. Our data obtained on model phylogenetic trees and corresponding sets of sequences have shown that the presence of correlated substitutions distorts the results of phylogenetic reconstructions. A method for accounting for co-evolving amino acid residues in phylogenetic analysis is proposed. According to this method, only a single site from the group of correlated amino acid positions should remain, whereas other positions should not be used in further phylogenetic analysis. Simulations performed have shown that replacement on the average of 8% of variable positions in a pair of model sequences by coordinately evolving amino acid residues is able to change the tree topology. The removal of such amino acid residues from sequences before phylogenetic analysis restores the correct topology.  相似文献   

4.
The covalent structures of two, novel, neutrophile, leucocyte-derived, strongly basic proteins of porcine and human origin have been determined by microsequencing in combination with time-of-flight plasma desorption mass spectrometry. The porcine protein primary structure of 219 amino acid residues was shown to contain 6 cysteine residues, 2 putative carbohydrate sites and 14% basic residues. The human protein contained 221 amino acid residues of which 8 were cysteine, 4 putative carbohydrate sites and 12% basic. A 47% direct sequence similarity to human neutrophile elastase was found, but due to mutations of two of the three amino acids in the catalytic triad, proteolytic activity is absent. Modelling and alignment studies unveil a close relationship of both proteins to the serine protease family, the greatest similarity being to those serine proteases present in granules from peripheral blood cells. Both proteins have been shown to be chemotactically active for monocytes and fibroblasts in vitro.  相似文献   

5.
The general aromatic amino acid permease, AroP, of Escherichia coli is responsible for the active transport of phenylalanine, tyrosine, and tryptophan. A proposed topological model for the AroP permease, consisting of 12 hydrophobic transmembrane spans connected by hydrophilic loops, is very similar to that of the closely related phenylalanine-specific permease. The validity of this model and its similarity to that of the PheP permease were investigated by studying fusion proteins of AroP permease and alkaline phosphatase. Based on the results obtained from the AroP-alkaline phosphatase sandwich fusions, we have significantly revised the proposed topological model for AroP in two regions. In this modified AroP topological model, the three charged residues E151, E153, and K160 are repositioned within the membrane in span 5. These three residues are conserved in a large family of amino acid transport proteins, and site-directed mutagenesis identifies them as being essential for transport activity. It is postulated that these residues together with E110 in transmembrane span 3 may be involved in a proton relay system.  相似文献   

6.
Two proteins from the eggshell of Rhodnius prolixus were isolated, characterized and named Rp30 and Rp45 according to their molecular masses. Purified proteins were used to obtain specific antiserum which was later used for immunolocalization. The antiserum against Rp30 and Rp45 detected their presence inside the follicle cells, their secretion and their association with oocyte microvilli. Both proteins are expressed during the final stage of vitellogenesis, preserved during embryogenesis and discarded together with the eggshell. The amino terminals were sequenced and both proteins were further cloned using degenerated primers. The amino acid sequences appear to have a tripartite arrangement with a highly conserved central domain which presents a repetitive motif of valine-proline-valine (VPV) at intervals of 15 amino acid residues. Their amino acid sequence showed no similarity to any known eggshell protein. The expression of these proteins was also investigated; the results demonstrated that this occurred strictly in choriogenic follicles. Antifungal activity against Aspergillus niger was found to be associated with Rp45 but not with Rp30. A. niger exposed to Rp45 protein induced growth inhibition and several morphological changes such as large vacuoles, swollen mitochondria, multi-lamellar structures and a disorganized cell wall as demonstrated by electron microscopy analysis.  相似文献   

7.
It is a well known phenomenon that the occurrence of several distinct amino acids at the C-terminus of proteins is non-random. We have analysed all Saccharomyces cerevisiae proteins predicted by computer databases and found lysine to be the most frequent residue both at the last (-1) and at the penultimate amino acid (-2) positions. To test the hypothesis that C-terminal basic residues efficiently bind to phospholipids we randomly expressed GST-fusion proteins from a yeast genomic library. Fifty-four different peptide fragments were found to bind phospholipids and 40% of them contained lysine/arginine residues at the (-1) or (-2) positions. One peptide showed high sequence similarity with the yeast protein Sip18p. Mutational analysis revealed that both C-terminal lysine residues of Sip18p are essential for phospholipid-binding in vitro. We assume that basic amino acid residues at the (-1) and (-2) positions in C-termini are suitable to attach the C-terminus of a given protein to membrane components such as phospholipids, thereby stabilizing the spatial structure of the protein or contributing to its subcellular localization. This mechanism could be an additional explanation for the C-terminal amino acid bias observed in proteins of several species.  相似文献   

8.
Successful genome mining is dependent on accurate prediction of protein function from sequence. This often involves dividing protein families into functional subtypes (e.g., with different substrates). In many cases, there are only a small number of known functional subtypes, but in the case of the adenylation domains of nonribosomal peptide synthetases (NRPS), there are >500 known substrates. Latent semantic indexing (LSI) was originally developed for text processing but has also been used to assign proteins to families. Proteins are treated as ‘‘documents’’ and it is necessary to encode properties of the amino acid sequence as ‘‘terms’’ in order to construct a term-document matrix, which counts the terms in each document. This matrix is then processed to produce a document-concept matrix, where each protein is represented as a row vector. A standard measure of the closeness of vectors to each other (cosines of the angle between them) provides a measure of protein similarity. Previous work encoded proteins as oligopeptide terms, i.e. counted oligopeptides, but used no information regarding location of oligopeptides in the proteins. A novel tokenization method was developed to analyze information from multiple alignments. LSI successfully distinguished between two functional subtypes in five well-characterized families. Visualization of different ‘‘concept’’ dimensions allows exploration of the structure of protein families. LSI was also used to predict the amino acid substrate of adenylation domains of NRPS. Better results were obtained when selected residues from multiple alignments were used rather than the total sequence of the adenylation domains. Using ten residues from the substrate binding pocket performed better than using 34 residues within 8 Å of the active site. Prediction efficiency was somewhat better than that of the best published method using a support vector machine.  相似文献   

9.
We report the complete amino acid sequence of bovine conglutinin obtained by structural characterization of peptides derived from the protein by various chemical and enzymatic fragmentation methods. The protein consists of 351 amino acid residues including 55 apparent Gly-X-Y repeats with two interruptions. This 171-residue-long collagenous domain separates a short noncollagenous NH2-terminal region of 25 residues from the 155-residue-long globular COOH terminus revealing the structural relation of conglutinin with mannose-binding proteins, pulmonary surfactant-associated proteins, and a complement component C1q. Eight hydroxylysine residues were found in the collagenous domain. All of these hydroxylysine residues which occupy a Y position in a Gly-X-Y triplet are possible glycosylation sites since no phenylthiohydantoin amino acid was identified in automated Edman degradation cycles corresponding to these sites. The noncollagenous COOH domain of conglutinin, on the other hand, contains a carbohydrate recognition domain which shares substantial sequence homology with C-type animal lectins. Conglutinin has the greatest sequence similarity with mannose-binding proteins and pulmonary surfactant-associated proteins.  相似文献   

10.
11.
The amino acid sequence of ribosomal protein S18 from Bacillus stearothermophilus has been completely determined by automated sequence analysis of the intact protein as well as of peptides derived from digestion with Staphylococcus aureus protease at pH 4.0 and cleavage with cyanogen bromide. The carboxy-terminal region was verified by both amino acid analyses of chymotryptic peptides and by mass spectrometry from the terminal region. The protein contains 77 amino acid residues and has an Mr of 8838. Comparison of this sequence with the sequences of the S18 proteins from tobacco and liverwort chloroplasts and E. coli shows a relatively high similarity, ranging from 42 to 55% identical residues with the B. stearothermophilus S18 protein. The regions of homology common to all four proteins consist of several positively charged sections spanning the entire length of the protein.  相似文献   

12.
The complete amino acid sequence (186 amino acid residues) of a basic cytosolic protein from bovine brain has been determined. It was previously described as a phosphatidylethanolamine binding protein. Computer analyses have been used to calculate its hydropathy profile and to predict its secondary structure. Comparison with other proteins did not detect any significant sequence similarity, except for a short region which presents 53% sequence homology with bovine phosphatidylcholine transfer protein.  相似文献   

13.
An LC-MS-based approach is presented for the identification and quantification of proteins from unsequenced organisms. The method relies on the preservation of homology across species and the similarity in detection characteristics of proteomes in general. Species related proteomes share similarity that progresses from the amino acid frequency distribution to the complete amino sequence of matured proteins. Moreover, the comparative analysis between theoretical and experimental proteome distributions can be used as a measure for the correctness of detection and identification obtained through LC-MS-based schemes. Presented are means to the identification and quantification of rabbit myocardium proteins, immediately after inducing cardiac arrest, using a data-independent LC-MS acquisition strategy. The employed method of acquisition affords accurate mass information on both the precursor and associated product ions, whilst preserving and recording the intensities of the ions. The latter facilitates label-free quantification. The experimental ion density observations obtained for the rabbit sub proteome were found to share great similarity with five other mammalian samples, including human heart, human breast tissue, human plasma, rat liver and a mouse cell line. Redundant, species-homologues peptide identifications from other mammalian organisms were used for initial protein identification, which were complemented with peptide identifications of translated gene sequences. The feasibility and accuracy of label-free quantification of the identified peptides and proteins utilizing above mentioned strategy is demonstrated for selected cardiac rabbit proteins.  相似文献   

14.
Conformational similarity among amino acid residues, a property derived by analysing (φ, ψ)-probability distributions of 20 proteinous amino acids from 38 different globular proteins, is used to arrive at a set of six ‘obligatory’ amino acids of primitive proteins. The amino acids Ser, Val, Leu, Asp, Gly and Pro have been argued to be ‘obligatory’ and to represent, conformationally, the remaining amino acids. The reasons for consideration of these six residues as ‘obligatory’ are discussed. Methods to check the validity of our proposition are suggested.  相似文献   

15.
The amino acid composition of human alcohol dehydrogenase (ADH) was compared with alcohol dehydrogenases from different organisms and with other proteins. Similar amino acid sequences in human ADH (template protein) and in other proteins were determined by means of an original computer program. Analysis of amino acid motifs reveals that the ADHs from evolutionary more close organisms have more common amino acid sequences. The quantity measure of amino acid similarity was the number of similar motifs in analyzed protein per protein length. This value was measured for ADHs and for different proteins. For ADHs, this quotient was higher than for proteins with different functions; for vertebrates it correlated with evolutionary closeness. The similar operation of motif comparison was made with the help of program complex “MEME”. The analysis of ADHs revealed 4 motifs common to 6 of 10 tested organisms and no such motifs for proteins of different function. The conclusion is that general amino composition is more important for protein function than amino acid order and for enzymes of similar function it better correlates with evolutionary distance between organisms.  相似文献   

16.
The gluA gene, encoding an endo-beta-1,3-glucanase from Arthrobacter sp. (strain NHB-10), was cloned and analyzed. The deduced endo-beta-1,3-glucanase amino acid sequence was 750 amino acids long and contained a 42 amino acid signal peptide with a mature protein of 708 amino acids. There was no similarity to known endo-beta-1,3-glucanases, but GluA was partially similar to two fungal exo-beta-1,3-glucanases in glycoside hydrolase (GH) family 55. Of five possible residues for catalysis and two motifs in two beta-helix heads of GH family 55, three residues and one motif were conserved in GluA, suggesting that GluA is the first bacterial endo-beta-1,3-glucanase in GH family 55. Significant similarity was also found to two proteins of unknown function from Streptomyces coelicolor A3(2) and S. avermitilis.  相似文献   

17.
MATRIX is a program designed primarily to enable the user to visualize all regions of similarity between two proteins at a glance. The program helps the user to see where they are similar—at what relative positions in the amino acid sequences of the two proteins in question does the similarity exist; how they are similar—what functional characteristics the two similar sequences have in common; and to what extent they are similar—is the similarity significant, if so how significant relative to other similar sequences in the protein. This is achieved by constructing a diagram in which quantitative parameters of amino acids are used to compare every amino acid residue of the first protein with every amino acid residue of the second.Another function of the program is, given two sets of atomic coordinates—either of different proteins or for the same protein (for self-comparison)—to demonstrate which residues of the two proteins, when the two proteins are superimposed upon each other, appear in the same space (or are close to each other).  相似文献   

18.
The final destination of glycosylphosphatidylinositol (GPI)-attached proteins in Saccharomyces cerevisiae is the plasma membrane or the cell wall. Two kinds of signals have been proposed for their cellular localization: (i) the specific amino acid residues V, I, or L at the site 4 or 5 amino acids upstream of the GPI attachment site (the omega site) and Y or N at the site 2 amino acids upstream of the omega site for cell wall localization and (ii) dibasic residues in the region upstream of the omega site (the omega-minus region) for plasma membrane localization. The relationships between these amino acid residues and efficiencies of cell wall incorporation were examined by constructing fusion reporter proteins from open reading frames encoding putative GPI-attached proteins. The levels of incorporation were high in the constructs containing the specific amino acid residues and quite low in those containing two basic amino acid residues in the omega-minus region. With constructs that contained neither specific residues nor two basic residues, levels of incorporation were moderate. These correlations clearly suggest that GPI-attached proteins have two different signals which act positively or negatively in cell wall incorporation for their cellular localization.  相似文献   

19.
Hansson K  Thämlitz AM  Furie B  Furie BC  Stenflo J 《Biochemistry》2006,45(42):12828-12839
Gamma-glutamyl carboxylase catalyzes the modification of specific glutamyl residues to gamma-carboxyglutamyl (Gla) residues in precursor proteins that possess the appropriate gamma-carboxylation recognition signal within the propeptide region. We describe the immunopurification and first biochemical characterization of an invertebrate high molecular weight Gla-containing protein with homologues in mammals. The protein, named GlaCrisp, was isolated from the venom of the marine cone snail Conus marmoreus. GlaCrisp gave intense signals in Western blot experiments employing the Gla-specific antibody M3B, and the presence of Gla was chemically confirmed by amino acid analysis after alkaline hydrolysis. Characterization of a full-length cDNA clone encoding GlaCrisp deduced a precursor containing an N-terminal signal peptide but, unlike other Gla-containing proteins, no apparent propeptide. The predicted mature protein of 265 amino acid residues showed considerable sequence similarity to the widely distributed cysteine-rich secretory protein family and closest similarity (65% identity) to the recently described substrate-specific protease Tex31. In addition, two cDNA clones encoding the precursors of two isoforms of GlaCrisp were identified. The predicted precursor isoforms differed at three amino acid positions (-6, 9, and 25). Analysis by Edman degradation and nanoelectrospray ionization mass spectrometry, before and after methyl esterfication, identified a Gla residue at amino acid position 9 in GlaCrisp. This is the first example of a Gla-containing protein without an obvious gamma-carboxylation recognition site. The results define a new class of Gla proteins and support the notion that gamma-carboxylation of glutamyl residues is phylogenetically older than blood coagulation and the vertebrate lineage.  相似文献   

20.
The partial amino acid sequence including the N- and C-terminal portions of tauropine dehydrogenase (EC 1.5.1.23) from the marine sponge Halichondria japonica was determined by enzymatic cleavages followed by peptide sequencing. This information was used to design degenerate primers for amplification of cDNA encoding the tauropine dehydrogenase. The cDNA included 1231 nucleotides with an open reading frame of 1002 nucleotides that encodes a protein of 334 amino acid residues. From the peptide and nucleotide sequencing, the mature tauropine dehydrogenase was estimated to consist of 333 amino acid residues with an acetylated N-terminal serine residue and no intrachain disulfide bonds. The primary structure of the H. japonica enzyme showed apparent similarity with a homolog of ornithine cyclodeaminase from Rhizobium meliloti and other proteins of the ornithine cyclodeaminase/mu-crystallin family, but it showed no significant similarity with the known sequences of octopine dehydrogenases and tauropine dehydrogenases from marine invertebrates. These findings indicate that opine dehydrogenases in marine invertebrates are not all homologous.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号