Similar literature
20 similar documents found
1.
Summary. We examine in this paper one of the expected consequences of the hypothesis that modern proteins evolved from random heteropeptide sequences. Specifically, we investigate the lengthwise distributions of amino acids in a set of 1,789 protein sequences with little sequence identity using the run test statistic (r_o) of Mood (1940, Ann. Math. Stat. 11, 367–392). The probability density of r_o for a collection of random sequences has mean = 0 and variance = 1 [the N(0,1) distribution] and can be used to measure the tendency of amino acids of a given type to cluster together in a sequence relative to that of a random sequence. We implement the run test using binary representations of protein sequences in which the amino acids of interest are assigned a value of 1 and all others a value of 0. We consider individual amino acids and sets of various combinations of them based upon hydrophobicity (4 sets), charge (3 sets), volume (4 sets), and secondary structure propensity (3 sets). We find that any sequence chosen randomly has a 90% or greater chance of having a lengthwise distribution of amino acids that is indistinguishable from the random expectation regardless of amino acid type. We regard this as strong support for the random-origin hypothesis. However, we do observe significant deviations from the random expectation, as might be expected after billions of years of evolution. Two important global trends are found: (1) Amino acids with a strong α-helix propensity show a strong tendency to cluster, whereas those with β-sheet or reverse-turn propensity do not. (2) Clustered rather than evenly distributed patterns tend to be preferred by the individual amino acids, and this is particularly so for methionine. Finally, we consider the problem of reconciling the random nature of protein sequences with structurally meaningful periodic “patterns” that can be detected by sliding-window, autocorrelation, and Fourier analyses. Two examples, rhodopsin and bacteriorhodopsin, show that such patterns are a natural feature of random sequences.
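The binary encoding and run test described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not give the formula for r_o, so the sketch assumes the standardized total-run count (the Wald–Wolfowitz form of the runs test), under which fewer runs than expected (clustering) gives a negative score. The function names `binary_encode` and `runs_z` are hypothetical, and the sign convention of the original paper may differ.

```python
import math


def binary_encode(seq, targets):
    """Map a protein sequence to 0/1: 1 for residues of interest, 0 otherwise."""
    return [1 if aa in targets else 0 for aa in seq]


def runs_z(bits):
    """Standardized number of runs in a binary sequence.

    Assumed form (not taken from the abstract): z = (R - E[R]) / sd(R),
    with E[R] and Var[R] from the classical two-category runs test.
    Returns None when the statistic is undefined (only one category
    present, or zero variance).
    """
    n = len(bits)
    n1 = sum(bits)
    n0 = n - n1
    if n1 == 0 or n0 == 0:
        return None
    # A new run starts at every position where the symbol changes.
    runs = 1 + sum(1 for a, b in zip(bits, bits[1:]) if a != b)
    mean = 2.0 * n1 * n0 / n + 1.0
    var = 2.0 * n1 * n0 * (2.0 * n1 * n0 - n) / (n * n * (n - 1.0))
    if var <= 0:
        return None
    return (runs - mean) / math.sqrt(var)


if __name__ == "__main__":
    # Toy sequence (made up for illustration), scoring methionine clustering.
    toy = "MMMKLAGSTVMMQ"
    print(runs_z(binary_encode(toy, {"M"})))
```

Under the random-sequence hypothesis, scores computed this way would be compared against the N(0,1) distribution, as the abstract describes for r_o.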

2.
Rhamnogalacturonase was purified from culture filtrate of Aspergillus aculeatus after growth in medium with sugar-beet pulp as carbon source. Purified protein was used to raise antibodies in mice, and with the antiserum obtained a gene coding for rhamnogalacturonase (rhgA) was isolated from a λ cDNA expression library. The cloned rhgA gene has an open reading frame of 1320 base pairs encoding a protein of 440 amino acids with a predicted molecular mass of 45 962 Da. The protein contains a potential signal peptidase cleavage site behind Gly-18 and three potential sites for N-glycosylation. Limited homology with A. niger polygalacturonase amino acid sequences is found. A genomic clone of rhgA was isolated from a recombinant phage λ genomic library. Comparison of the genomic and cDNA sequences revealed that the coding region of the gene is interrupted by three introns. Furthermore, amino acid sequences of four different peptides, derived from purified A. aculeatus rhamnogalacturonase, were also found in the deduced amino acid sequence of rhgA. A. aculeatus strains overexpressing rhamnogalacturonase were obtained by cotransformation using either the A. niger pyrA gene or the A. aculeatus pyrA gene as selection marker. For expression of rhamnogalacturonase in A. awamori, the A. awamori pyrA gene was used as selection marker. Degradation patterns of modified hairy regions, determined by HPLC, show the recombinant rhamnogalacturonase to be active, and the enzyme was found to have a positive effect in the apple hot-mash liquefaction process.

3.

Background  

The rapid burgeoning of available protein data makes the use of clustering within families of proteins increasingly important. The challenge is to identify subfamilies of evolutionarily related sequences. This identification reveals phylogenetic relationships, which provide prior knowledge to help researchers understand biological phenomena. A good evolutionary model is essential to achieve a clustering that reflects the biological reality, and an accurate estimate of protein sequence similarity is crucial to the building of such a model. Most existing algorithms estimate this similarity using techniques that are not necessarily biologically plausible, especially for hard-to-align sequences such as proteins with different domain structures, which cause many difficulties for the alignment-dependent algorithms. In this paper, we propose a novel similarity measure based on matching amino acid subsequences. This measure, named SMS for Substitution Matching Similarity, is especially designed for application to non-aligned protein sequences. It allows us to develop a new alignment-free algorithm, named CLUSS, for clustering protein families. To the best of our knowledge, this is the first alignment-free algorithm for clustering protein sequences. Unlike other clustering algorithms, CLUSS is effective on both alignable and non-alignable protein families. In the rest of the paper, we use the term "phylogenetic" in the sense of "relatedness of biological functions".

4.
A simple method for the sequence prediction of peptides capable of the in vivo stimulation of antibody production in mice without conjugation with protein carriers was proposed on the basis of literature data on the structure of T-helper epitopes active in vivo. According to this approach, a potentially active peptide should contain a nine-residue sequence with a hydrophobic amino acid residue in the first position and a positively charged residue in the ninth position. The efficiency of this approach was confirmed by the presence of such sequences in previously described synthetic peptides with immune activities, by the application of this approach to the choice of immunogenic fragments within the sequences of various proteins, which subsequently exhibited the specific activity, and by the construction of immunogenic peptides on the basis of inactive natural sequences.
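The nine-residue rule stated in this abstract (hydrophobic residue at position 1, positively charged residue at position 9) lends itself to a simple window scan. The sketch below is only an illustration of that rule: the abstract does not list which residues count as hydrophobic or positively charged, so the two residue sets are assumptions, and the function name `find_candidate_windows` is hypothetical.

```python
# Assumed residue classes; the abstract does not define them.
HYDROPHOBIC = set("AVLIMFWYC")
POSITIVE = set("KR")


def find_candidate_windows(seq, length=9):
    """Return (1-based start, window) for every window of `length` residues
    that starts with a hydrophobic residue and ends with a positive one."""
    seq = seq.upper()
    hits = []
    for i in range(len(seq) - length + 1):
        window = seq[i:i + length]
        if window[0] in HYDROPHOBIC and window[-1] in POSITIVE:
            hits.append((i + 1, window))
    return hits


if __name__ == "__main__":
    # Toy sequence, made up for illustration only.
    for start, window in find_candidate_windows("GSLTEAGDNQKRVA"):
        print(start, window)
```

A hit from such a scan marks only a candidate that satisfies the positional rule; it says nothing about measured immunogenicity.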

5.
Summary. In contrast to chemically induced mutants of tobacco mosaic virus (TMV), in which we have found replacement of one or at most two amino acids per coat protein chain, the protein chains of naturally occurring TMV strains differ from each other in numerous positions. The complete amino acid sequence of the naturally occurring TMV strain dahlemense, isolated by Melchers (1940), has been determined. It differs in 30 of the 158 amino acid positions from the TMV wild strain vulgare (Fig. 1). This is the first case in which complete amino acid sequences of the coat proteins of two virus strains can be compared. Such a comparison permits conclusions about the structure of the protein subunits and about certain aspects of the genetic code to be drawn. The electrophoretic mobility curves for the virus rods and the A proteins of vulgare and dahlemense (Fig. 4) can be explained on the basis of the amino acid sequences of the two strains. The spatial distribution of the positive and negative groups within the protein subunits is discussed. One particular segment of the protein chain appears to be so important for the secondary and/or tertiary structure of the protein subunit that amino acid replacements within this segment in general lead to a loss of infectivity. The 46 cases in which we have exactly located the positions of amino acid differences between vulgare and various TMV mutants and strains are summarized in Table 1. Combination of the data in Table 1 with the base compositions of the triplets as obtained from the cell-free system of E. coli permits conclusions about the nucleotide sequence within the triplets to be drawn. The triplets shown in Table 2 represent, at present, the best agreement between the data from the cell-free system and the work with TMV mutants.

With 4 text figures

6.
Summary. Enteropeptidase (enterokinase, EC 3.4.21.9), which catalyzes trypsinogen activation, exhibits unique properties for high-efficiency hydrolysis of the polypeptide chain after the N-terminal tetraaspartyl-lysyl sequence. This makes it a convenient tool for the processing of fusion proteins containing this sequence. We observed enteropeptidase-catalysed degradation of several bioactive peptides: cattle hemoglobin beta-chain fragments Hb (2–8) (LTAEEKA) and Hb (1–9) (MLTAEEKAA), and human angiotensin II (DRVYIHPF) (AT). Model peptides with the truncated linkers WDDRG and WDDKG were also shown to be susceptible to enteropeptidase action. Kinetic parameters of enteropeptidase hydrolysis for these substrates were determined. K_m values for all substrates with a truncated linker (≈10⁻³ M) are an order of magnitude higher than the corresponding values for typical enteropeptidase artificial peptide or fusion protein substrates with the full enteropeptidase linker -DDDDK- (K_m ≈ 10⁻⁴ M). k_cat values for AT, Hb (2–8), WDDRG and WDDKG are ≈30–40 min⁻¹. However, one additional amino acid residue at each of the N- and C-termini of Hb (2–8) results in a drastic increase in hydrolysis efficiency: the k_cat value for Hb (1–9) is 1510 min⁻¹. The present study demonstrates the possibility of undesirable cleavage of target peptides or proteins containing the above-mentioned truncated linker sequences; further, the ability of enteropeptidase to specifically hydrolyse several biologically active peptides in vitro, along with its unique natural substrate trypsinogen, was demonstrated.
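The practical warning in this abstract, that truncated linker sequences in a target protein may be cleaved, suggests a simple pre-screen of candidate sequences. The sketch below is an assumption-laden illustration rather than a validated predictor: the pattern `D{2,4}[KR]` is generalised here from the full linker DDDDK and the model peptides WDDRG and WDDKG, it does not cover the hemoglobin or angiotensin II sites mentioned above, and the function name `flag_linker_like_sites` is hypothetical.

```python
import re

# Assumed pattern: two to four Asp residues followed by Lys or Arg.
# This covers DDDDK (full linker) and the DDK / DDR cores of WDDKG / WDDRG.
LINKER_LIKE = re.compile(r"D{2,4}[KR]")


def flag_linker_like_sites(seq):
    """Return (1-based start, matched motif, 1-based position of the K/R)
    for each linker-like stretch; cleavage would be expected after the K/R."""
    seq = seq.upper()
    return [(m.start() + 1, m.group(), m.end()) for m in LINKER_LIKE.finditer(seq)]


if __name__ == "__main__":
    # Toy sequence, made up for illustration only.
    for start, motif, cut_after in flag_linker_like_sites("MAWDDKGSTDDDDKLV"):
        print(start, motif, cut_after)
```

A match only marks a stretch worth checking experimentally; the abstract reports that such truncated sites bind roughly an order of magnitude more weakly (higher K_m) than the full linker.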

7.
The Chou-Fasman predictive algorithm for determining the secondary structure of proteins from the primary sequence is reviewed. Many examples of its use are presented which illustrate its wide applicability, such as predicting (a) regions with the potential for conformational change, (b) sequences which are capable of assuming several conformations in different environments, (c) effects of single amino acid mutations, (d) amino acid replacements in the synthesis of peptides to bring about a change in conformation, (e) guidance for the synthesis of polypeptides with a definite secondary structure, e.g. signal sequences, (f) conformational homologues from varying sequences and (g) the amino acid requirements for amphiphilic α-helical peptides.

8.
The waxy proteins encoded by the genomes A, B, and D in polyploid wheats and related diploid species were isolated by SDS-PAGE. The N-terminal amino acid sequences of mature proteins and V8 protease-induced fragments were determined. A total of five amino acid substitutions was detected in these sequences, which represent about 10% of the whole sequences of the waxy proteins. A comparison of these sequences in polyploid wheats with those in related diploid species revealed the following: (i) waxy proteins encoded by the A genome of polyploid wheats were identical to that of Triticum monococcum, (ii) the waxy protein encoded by the B genome of T. turgidum was identical to that of T. searsii, but differed from those of T. speltoides and T. longissimum by one amino acid substitution, (iii) the waxy protein encoded by the B genome of T. aestivum differed from that encoded by the B genome of T. turgidum by one amino acid substitution, and (iv) the waxy protein encoded by the D genome of T. aestivum was identical to that of T. tauschii.

9.
Summary. Aspartate transcarbamoylase (ATCase, EC 2.1.3.2) is the first unique enzyme common to de novo pyrimidine biosynthesis and is involved in a variety of structural patterns in different organisms. In Escherichia coli, ATCase is a functionally independent, oligomeric enzyme; in hamster, it is part of a trifunctional protein complex, designated CAD, that includes the preceding and subsequent enzymes of the biosynthetic pathway (carbamoyl phosphate synthetase and dihydroorotase). The complete complementary DNA (cDNA) nucleotide sequence of the ATCase-encoding portion of the hamster CAD gene is reported here. A comparison of the deduced amino acid sequences of the hamster and E. coli catalytic peptides revealed an overall 44% amino acid similarity, substantial conservation of predicted secondary structure, and complete conservation of all the amino acids implicated in the active site of the E. coli enzyme. These observations led to the construction of a functional hybrid ATCase formed by intragenic fusion based on the known tertiary structure of the bacterial enzyme. In this fusion, the amino terminal half (the “polar domain”) of the fusion protein was provided by a hamster ATCase cDNA subclone, and the carboxyl terminal portion (the “equatorial domain”) was derived from a cloned pyrBI operon of E. coli K-12. The recombinant plasmid bearing the hybrid ATCase was shown to satisfy growth requirements of transformed E. coli pyrB cells. The functionality of this E. coli-hamster hybrid enzyme confirms conservation of essential structure-function relationships between evolutionarily distant and structurally divergent ATCases.

10.
A lectin was purified from the leaves of Allium altaicum and the corresponding gene was cloned. The lectin, named Allium altaicum agglutinin (AAA), was a ~24 kDa homodimeric protein similar to a typical garlic leaf lectin. It was synthesized as a 177-residue pre-proprotein, which included N- and C-terminal signal peptides 28 and 43 amino acids long, respectively. The plant expressed this protein more in scapes and flowers than in bulbs and leaves. Hemagglutination activity (with rabbit erythrocytes) was 1,428-fold higher than that of Allium sativum leaf agglutinin (ASAL), although the insecticidal activity against cotton aphid (Aphis gossypii) was relatively low. Glycan array analysis revealed that AAA had a higher affinity towards GlcAb1-3Galb than ASAL. Homology analysis showed 57–94% similarity with other Allium lectins. The mature protein was expressed in E. coli as a fusion with SUMO peptide in a soluble and biologically active form. The recombinant protein retained high hemagglutination activity.

11.
Summary. Ribosomes were isolated from two E. coli revertants from streptomycin dependence to independence, N660 and d1023. After separation of subunits, proteins were extracted from ribosomal 30S subunits and separated by CM-cellulose column chromatography and gel filtration. Pure S5 and S12 proteins of the two mutants were digested with trypsin and all resulting peptides were isolated by column and paper chromatography. The amino acid compositions of the peptides from the four mutant proteins were compared with the corresponding peptides of the wild type strain A19. The amino acid sequences of non-identical peptides were determined. The following amino acid replacements were found: glycine by arginine in peptide T2 of protein S5 from mutant N660, and glycine by aspartic acid in peptide T15 of protein S12 from the same mutant. In the other mutant, d1023, arginine in peptide T2 of protein S5 was replaced by leucine and, furthermore, arginine by serine in peptide T10 of protein S12. Besides the single amino acid replacements mentioned above, which are compatible with alterations of single nucleotides, a rather drastic difference between peptides T15 of proteins S12 isolated from strain A19 and mutant d1023 has been detected. The results presented in this paper are compared with amino acid replacements in proteins S5 and S12 from other ribosomal mutants of E. coli. Paper No. 62 on Ribosomal Proteins. Preceding paper is by Wittmann et al., Molec. gen. Genet., in press.

12.
Phosphorylation of ribosomal acidic proteins of Saccharomyces cerevisiae is an important mechanism regulating the number of active ribosomes. The key role in this regulatory mechanism is played by specific phosphoprotein kinases and phosphoprotein phosphatases. Three different cAMP-independent protein kinases phosphorylating acidic ribosomal proteins have been identified and characterized: protein kinase 60S (PK60S), RAP kinase, and casein kinase type 2 (CK2). All three protein kinases phosphorylate serine residues located at the C-terminal end of the phosphoproteins. Synthetic peptides were used to determine the amino acid sequence of the phosphoacceptor site for PK60S. The peptide AAEESDDD, derived from phosphoproteins YP1β/β′ and YP2α, turned out to be the best substrate for PK60S. A number of halogenated benzimidazoles and 2-azabenzimidazoles were tested as inhibitors of the three protein kinases. 4,5,6,7-Tetrabromo-2-azabenzimidazole inhibits phosphorylation only of those polypeptides phosphorylated by protein kinase 60S, namely YP1β/β′ and YP2α, but not of the others, YP1α and YP2β, which are phosphorylated by protein kinases RAP and CK2. RAP kinase has been found in an active form in the soluble fraction of S. cerevisiae. The enzyme uses ATP as a phosphate donor and is less sensitive to heparin than casein kinase 2. RAP kinase monophosphorylates the four acidic proteins. The ribosome-bound proteins are a better substrate for the enzyme. The multifunctional CK2 kinase phosphorylates all four acidic proteins. The kinase preferentially phosphorylates serine or threonine residues surrounded by a cluster of acidic residues. The enzyme activity is stimulated in vitro by the presence of polylysine and inhibited by heparin. Presented at the Symposium on Regulation of Translation of Genetic Information by Protein Phosphorylation, 21st Congress of the Czechoslovak Society for Microbiology, Hradec Králové (Czech Republic), September 6–10, 1998.

13.
The nucleotide sequence of a cDNA prepared from poly(A)+ RNA from Lycopersicon esculentum fruit codes for a protein, M_r 20812, with features representative of the protein core of arabinogalactan proteins. The deduced amino acid sequence resembles that of peptides of arabinogalactan proteins isolated from carrot and rose and is most similar to the sequence of tryptic peptides from Lolium multiflorum (Gleeson et al., Biochem J 264 (1989) 857–862). The similar sequences include a number of Ala-Pro repeats, a feature considered distinctive of arabinogalactan proteins. The amino acid composition is similar to that of the peptide core of the Lolium multiflorum arabinogalactan protein; alanine, serine and proline account for 57% of the polypeptide. The mRNA corresponding to the cDNA sequence was detected in roots, leaves and fruit. The levels of mRNA are reduced in older leaves, in fruit that have commenced ripening and in leaves and fruit that have been wounded.
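The two sequence features highlighted in this abstract, the combined Ala/Ser/Pro content and the number of Ala-Pro repeats, are straightforward to compute from a deduced amino acid sequence. The sketch below shows one way to do so; the function names are hypothetical, and the example sequence is a made-up toy string, not the tomato protein.

```python
def residue_fraction(seq, residues="ASP"):
    """Fraction of the sequence made up of the given one-letter residues."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return sum(seq.count(r) for r in residues) / len(seq)


def count_dipeptide(seq, dipeptide="AP"):
    """Count occurrences of a dipeptide, scanning every position."""
    seq = seq.upper()
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == dipeptide)


if __name__ == "__main__":
    toy = "APAPSSGAPK"  # illustrative only
    print(residue_fraction(toy), count_dipeptide(toy))
```

For the protein described above, the abstract reports that alanine, serine and proline together account for 57% of the polypeptide.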

14.
Length-dependent prediction of protein intrinsic disorder   Total citations: 2, self-citations: 0, citations by others: 2

Background  

Due to the functional importance of intrinsically disordered proteins or protein regions, prediction of intrinsic protein disorder from amino acid sequence has become an area of active research, as witnessed in the 6th experiment on Critical Assessment of Techniques for Protein Structure Prediction (CASP6). Since the initial work by Romero et al. (Identifying disordered regions in proteins from amino acid sequences, IEEE Int. Conf. Neural Netw., 1997), our group has developed several predictors optimized for long disordered regions (>30 residues) with prediction accuracy exceeding 85%. However, these predictors are less successful on short disordered regions (≤30 residues). A probable cause is the length dependence of the amino acid compositions and sequence properties of disordered regions.

15.
Red clover (Trifolium pratense) leaves accumulate several μmol of phaselic acid [2-O-caffeoyl-L-malate] per gram fresh weight. Post-harvest oxidation of such o-diphenols to o-quinones by endogenous polyphenol oxidases (PPO) prevents breakdown of forage protein during storage. Forages like alfalfa (Medicago sativa) lack both foliar PPO activity and o-diphenols. Consequently, breakdown of their protein upon harvest and storage results in economic losses and release of excess nitrogen into the environment. Understanding how red clover synthesizes o-diphenols such as phaselic acid will help in the development of forages utilizing this natural system of protein protection. We have proposed biosynthetic pathways in red clover for phaselic acid that involve a specific hydroxycinnamoyl-CoA:malate hydroxycinnamoyl transferase. It is unclear whether the transfer reaction to malate to form phaselic acid involves caffeic acid or p-coumaric acid and subsequent hydroxylation of the resulting p-coumaroyl-malate. The latter would require a coumarate 3′-hydroxylase (C3′H) capable of hydroxylating p-coumaroyl-malate, an activity not previously described. Here, a cytochrome P450 C3′H (CYP98A44) was identified and its gene cloned from red clover. CYP98A44 shares 96 and 79% amino acid identity with Medicago truncatula and Arabidopsis thaliana C3′H proteins that are capable of hydroxylating p-coumaroyl-shikimate and have been implicated in monolignol biosynthesis. CYP98A44 mRNA is expressed in stems and flowers and to a lesser extent in leaves. Immune serum raised against CYP98A44 recognizes a membrane-associated protein in red clover stems and leaves and cross-reacts with C3′H proteins from other species. CYP98A44 expressed in Saccharomyces cerevisiae is capable of hydroxylating p-coumaroyl-shikimate, but not p-coumaroyl-malate. This finding indicates that in red clover, phaselic acid is likely formed by transfer of a caffeoyl moiety to malic acid, although the existence of a second C3′H capable of hydroxylating p-coumaroyl-malate cannot be definitively ruled out.

16.
QSAR analysis of the primary sequence of human cytochrome c indicated that numerous putative cell penetrating peptide sequences are located within major amino- and carboxyl-terminal helical domains. Two such sequences, Cyt c 77–101 and the shorter homologue Cyt c 86–101, readily translocate the plasma membrane of U373MG astrocytoma cells but differentially accumulate in extranuclear or nuclear compartments. Such sequences could be employed for the selective intracellular targeting of cytotoxins and other bioactive cargoes. Moreover, these peptides also induce apoptosis, indicating that they mimic the role of cytochrome c as a key regulator of programmed cell death. We propose that these domains may constitute an integral transduction domain within cytochrome c that enables the protein to directly translocate plasma membranes. These, and other studies, indicate that cell penetrating peptides (CPP) are valuable tools both for the purposes of intracellular delivery and for the direct manipulation of therapeutically relevant proteins and cellular processes. We propose that the term bioportide should be introduced to distinguish biologically active, proteomimetic CPP from inert sequences that include the majority of common vectors.

17.
18.
Enterolobin, a 55-kDa cytolytic, inflammatory, and insecticidal protein isolated from seeds of the Brazilian tree Enterolobium contortisiliquum (Leguminosae-Mimosoideae), has been further purified and partially sequenced by using both manual and automated methods. A computational search of the enterolobin partial amino acid sequence against the PIR database revealed possible sequence similarities with aerolysins, cytolytic proteins from Aeromonas species. An alignment of the enterolobin partial sequence to the amino acid sequences of A. hydrophila and A. sobria aerolysins showed several similar regions with many residue identities. The seed protein enterolobin and the bacterial aerolysins may be homologous proteins despite the distant phylogenetic relationship.

19.
The bifunctional dihydrofolate reductase-thymidylate synthase (DHFR-TS) of Daucus carota has been further characterized as regards molecular weight, amino acid composition, protease digestion and microsequencing of proteolytic peptides. Data reported in this paper demonstrate that the carrot protein has a calculated M_r of 124000, thus indicating that, contrary to what has previously been suggested, it occurs as a dimer of identical subunits. Results of partial amino acid microsequencing show the presence of sequences highly homologous with those of the active sites of both DHFR and TS from other organisms, confirming, at the structural level, the bifunctional nature of the carrot protein. As in the case of Leishmania tropica DHFR-TS, incubation of the carrot protein with V8 protease led to a rapid loss of TS activity while retaining that of DHFR. However, the pattern of proteolysis did not allow us to establish whether the sequence of domains is DHFR-TS, as in Leishmania, or vice versa. Low homology of other amino acid sequences, as judged by computer analysis, and absence of common epitopes indicate an apparent divergence between the carrot and leishmanial proteins.

20.