Similar literature
 Found 20 similar articles; search time 375 ms
1.
Liang G  Yang L  Kang L  Mei H  Li Z 《Amino acids》2009,37(4):583-591
On the basis of exploratory factor analysis, six multidimensional patterns of 516 amino acid attributes, namely, factor analysis scales of generalized amino acid information (FASGAI) involving hydrophobicity, alpha and turn propensities, bulky properties, compositional characteristics, local flexibility and electronic properties, are proposed to represent structures of 48 bitter-tasting dipeptides and 58 angiotensin-converting enzyme inhibitors. Characteristic parameters related to bioactivities of the peptides studied are selected by genetic algorithm, and quantitative structure–activity relationship (QSAR) models are constructed by partial least squares (PLS). Our results by leave-one-out cross validation are compared with those of the previously known structure representation method and are shown to give slightly superior or comparable performance. Further, the two data sets are divided into training sets and test sets to validate the characterization repertoire of FASGAI. The performance of the PLS models, assessed by leave-one-out cross validation on the training samples and by external validation on the test samples, is satisfactory. These results demonstrate that FASGAI is an effective representation technique for peptide structures, and that FASGAI vectors have several advantages, such as straightforward physicochemical information, high characterization competence and easy manipulation. They can be further applied to investigate the relationship between structures and functions of various peptides, and even proteins.

2.
Liang GZ  Ma XY  Li YC  Lv FL  Yang L 《Bio Systems》2011,105(1):101-106
This article offers a novel sequence-based approach to discriminate outer membrane proteins (OMPs). The first step is to use a new representation approach, factor analysis scales of generalized amino acid information (FASGAI) representing hydrophobicity, alpha and turn propensities, bulky properties, compositional characteristics, local flexibility and electronic properties, etc., to characterize sequences of OMPs and non-OMPs. The resulting data are then transformed into a uniform matrix by the auto cross covariance (ACC). The second step is to develop predictors that discriminate OMPs from non-OMPs using a support vector machine (SVM). The SVM predictors achieve a high Matthews correlation coefficient (MCC) of 0.916 in discriminating 208 OMPs from non-OMPs, comprising 206 α-helical membrane proteins and 673 globular proteins, in a fivefold cross-validation test. Meanwhile, overall MCC values of 0.923 and 0.930 are obtained for the discrimination of OMPs from the α-helical membrane proteins and the globular proteins, respectively. The results demonstrate that the combined FASGAI-ACC-SVM approach shows considerable promise for application in bioinformatics and proteomics studies.
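The ACC step above turns sequences of different lengths into feature vectors of one fixed length, which is what an SVM needs. The following is a minimal sketch of one common ACC formulation, not necessarily the exact normalization used in the article. The function name `acc_transform`, the `max_lag` parameter and the toy descriptor values are assumptions made for illustration; real FASGAI scores are not reproduced here.

```python
def acc_transform(descriptors, max_lag):
    """Auto cross covariance of a per-residue descriptor matrix.

    descriptors: list of rows, one per residue, each holding d numeric
                 descriptor values (hypothetical stand-ins for FASGAI scores).
    Returns a flat list of d * d * max_lag values, whatever the sequence length.
    """
    n = len(descriptors)
    d = len(descriptors[0])
    if max_lag >= n:
        raise ValueError("max_lag must be smaller than the sequence length")
    # Centre every descriptor column on its mean over the sequence.
    means = [sum(row[j] for row in descriptors) / n for j in range(d)]
    centered = [[row[j] - means[j] for j in range(d)] for row in descriptors]
    features = []
    for lag in range(1, max_lag + 1):
        for j in range(d):
            for k in range(d):
                # Covariance between descriptor j at residue i
                # and descriptor k at residue i + lag.
                total = sum(centered[i][j] * centered[i + lag][k]
                            for i in range(n - lag))
                features.append(total / (n - lag))
    return features


# Two toy "sequences" of different length give vectors of equal length.
short_seq = [[0.1, 1.0], [0.4, 0.2], [0.3, 0.9], [0.8, 0.5]]
long_seq = short_seq + [[0.6, 0.1], [0.2, 0.7]]
print(len(acc_transform(short_seq, 2)), len(acc_transform(long_seq, 2)))
```

The output length depends only on the number of descriptors and the maximum lag, so proteins of any length map onto the same feature space.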

3.
4.
The sinus gland is a major neurosecretory structure in Crustacea. Five peptides, labeled C, D, E, F, and I, isolated from the sinus gland of the land crab have been hypothesized to arise from the incomplete proteolysis at two internal sites on a single biosynthetic intermediate peptide "H", based on amino acid composition additivities and pulse-chase radiolabeling studies. The presence of only a single major precursor for the sinus gland peptides implies that peptide H may be synthesized on a common precursor with crustacean hyperglycemic hormone forms, "J" and "L," and a peptide, "K," similar to peptides with molt inhibiting activity. Here I report amino acid sequences of these peptides. The amino terminal sequence of the parent peptide, H, (and the homologous fragments) proved refractory to Edman degradation. Data from amino acid analysis and carboxypeptidase digestion of the naturally occurring fragments and of fragments produced by endopeptidase digestion were used together with Edman degradation to obtain the sequences. Amino acid analysis of fragments of the naturally occurring "overlap" peptides (those produced by internal cleavage at one site on H) was used to obtain the sequences across the cleavage sites. The amino acid sequence of the land crab peptide H is Arg-Ser-Ala-Asp-Gly-Phe-Gly-Arg-Met-Glu-Ser-Leu-Leu-Thr-Ser-Leu-Arg-Gly-Ser-Ala-Glu-Ser-Pro-Ala-Ala-Leu-Gly-Glu-Ala-Ser-Ala-Ala-His-Pro-Leu-Glu. In vivo cleavage at one site involves excision of arginine from the sequence Leu-Arg-Gly, whereas cleavage at the other site involves excision of serine from the sequence Glu-Ser-Leu. Proteolysis at the latter sequence has not been previously reported in intact secretory granules. The aspartate at position 4 is possibly covalently modified.
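The statement that five peptides can arise from incomplete proteolysis at two sites can be checked by enumeration. The sketch below uses the peptide H sequence given above in one-letter code and removes the middle residue of Glu-Ser-Leu and of Leu-Arg-Gly. The helper names are hypothetical, and the abstract does not say which product corresponds to which of the labels C, D, E, F and I, so no such mapping is attempted.

```python
from itertools import combinations

# Peptide H from the abstract, transcribed to one-letter code (36 residues).
PEPTIDE_H = "RSADGFGRMESLLTSLRGSAESPAALGEASAAHPLE"


def excised_index(seq, motif):
    """0-based index of the middle residue of a three-residue motif."""
    start = seq.find(motif)
    if start < 0:
        raise ValueError("motif not found: " + motif)
    return start + 1


def partial_digest(seq, excised_positions):
    """All distinct products when any non-empty subset of sites is cleaved.

    Cleavage at a site removes the residue at that position entirely.
    """
    products = set()
    positions = sorted(excised_positions)
    for n in range(1, len(positions) + 1):
        for subset in combinations(positions, n):
            start = 0
            for pos in subset:
                products.add(seq[start:pos])
                start = pos + 1
            products.add(seq[start:])
    return products


sites = [excised_index(PEPTIDE_H, "ESL"), excised_index(PEPTIDE_H, "LRG")]
for fragment in sorted(partial_digest(PEPTIDE_H, sites), key=len):
    print(len(fragment), fragment)
```

Three products come from cleavage at both sites and two "overlap" products from cleavage at only one site, which gives five distinct peptides in total.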

5.
The sequences of the ubiquitous and phylogenetically diversified cyclophilin family of proteins were divided into six groups, namely, vertebrates, invertebrates, other metazoa, plants, fungi, and prokaryotes. These groups of sequences were aligned with the multiple sequence alignment program Clustal-W. The variations of amino acid substitutions and amino acid compositions for these six groups of cyclophilins were calculated using a novel suite of multiple-sequence alignment analysis routines. The cyclophilins from vertebrates can be divided into at least two distinct structural classes that differ from each other by a variable-length amino acid insert within the loop that links alpha-helix II and beta-strand III. A similar structural feature is also present in the other groups of cyclophilins, namely, those from invertebrates, other metazoa, plants, and fungi. The sequences of cyclophilins from fungi and prokaryotes are more diversified than those from vertebrates, and their alterations involve structures other than the amino acid inserts within the loops. Variations of the hydrophobicity and bulkiness of amino acid substitutions of the aligned sequences were calculated for each group of cyclophilins and for the alignment of all the sequences. The variations have clear asymmetry that may signify the need for modification of the physical properties of certain fragments of cyclophilins that are involved in interactions with various cellular components in the evolving environment.

6.
This study aimed to investigate the degradation of the natural substrates tropoelastin and elastin by the neutrophil-derived serine proteases human leukocyte elastase (HLE), proteinase 3 (PR3) and cathepsin G (CG). Focus was placed on determining their cleavage site specificities using mass spectrometric techniques. Moreover, the release of bioactive peptides from elastin by the three proteases was studied. Tropoelastin was comprehensively degraded by all three proteases, whereas less cleavage occurred in mature cross-linked elastin. An analysis of the cleavage site specificities of the three proteases in tropoelastin and elastin revealed that HLE and PR3 similarly tolerate hydrophobic and/or aliphatic amino acids such as Ala, Gly and Val at P1, which are also preferred by CG. In addition, CG prefers the bulky hydrophobic amino acid Leu and accepts the bulky aromatic amino acids Phe and Tyr. CG shows a strong preference for the charged amino acid Lys at P1 in tropoelastin, whereas Lys was not identified at P1 in CG digests of elastin due to extensive cross-linking at Lys residues in mature elastin. All three serine proteases showed a clear preference for Pro at P2 and P4′. With respect to the liberation of potentially bioactive peptides from elastin, the study revealed that all three serine proteases have a similar ability to release bioactive sequences, with CG producing the highest number of these peptides. In bioactivity studies, potentially bioactive peptides whose bioactivity had not previously been investigated were tested. Three new bioactive GxxPG motifs were identified: GVYPG, GFGPG and GVLPG.

7.
The primary structures of three peptides from extracts from the pancreatic islets of the daddy sculpin (Cottus scorpius) and three analogous peptides from the islets of the flounder (Platichthys flesus), two species of teleostean fish, have been determined by automated Edman degradation. The structures of the flounder peptides were confirmed by fast-atom bombardment mass spectrometry. The peptides show strong homology to residues (49-60), (63-96) and (98-125) of the predicted sequence of preprosomatostatin II from the anglerfish (Lophius americanus). The amino acid sequences of the peptides suggest that, in the sculpin, prosomatostatin II is cleaved at a dibasic amino acid residue processing site (corresponding to Lys61-Arg62 in anglerfish preprosomatostatin II). The resulting fragments are further cleaved at monobasic residue processing sites (corresponding to Arg48 and Arg97 in anglerfish preprosomatostatin II). In the flounder the same dibasic residue processing site is utilised but cleavage at different monobasic sites takes place (corresponding to Arg50 and Arg97 in anglerfish preprosomatostatin II). A peptide identical to mammalian somatostatin-14 was also isolated from the islets of both species and is presumed to represent a cleavage product of prosomatostatin I.

8.
Proteinases play critical roles in both intra- and extracellular processes by binding and cleaving their protein substrates. The cleavage can either be non-specific as part of degradation during protein catabolism or highly specific as part of proteolytic cascades and signal transduction events. Identification of these targets is extremely challenging. Current computational approaches for predicting cleavage sites are very limited since they mainly represent the amino acid sequences as patterns or frequency matrices. In this work, we developed a novel predictor based on the Random Forest (RF) algorithm, using the maximum relevance minimum redundancy (mRMR) method followed by incremental feature selection (IFS). The features of physicochemical/biochemical properties, sequence conservation, residual disorder, amino acid occurrence frequency, secondary structure and solvent accessibility were utilized to represent the peptides concerned. We compared our predictor with existing tools for predicting possible cleavage sites in candidate substrates. It is shown that our method makes much more reliable predictions in terms of the overall prediction accuracy. In addition, this predictor can be applied to a wide range of proteinases.

9.
Characterization of aggrecan core protein peptides appearing in the medium of adult articular cartilage maintained in tissue culture showed that eight major peptides could be detected. The two largest peptides had the same N-terminal sequence as bovine aggrecan core protein and probably represent partly degraded aggrecan lost to the medium in the form of the proteoglycan aggregate. The three next smallest peptides were all shown to have another N-terminal sequence which corresponded to a sequence in the interglobular domain starting at alanine residue 393 of the human aggrecan core protein (K. Doege et al., 1991, J. Biol. Chem. 266, 894-902). Two other peptides were isolated and shown to have two different N-terminal amino sequences corresponding to sequences in the chondroitin sulfate attachment domain 2 of the core protein starting at alanine residue 1839 and leucine residue 1939 of human aggrecan. This suggests that the catabolism of aggrecan by adult articular cartilage occurs by the proteolytic cleavage of the core protein of this proteoglycan at three separate sites. Examination of the amino acid sequences around each of these cleavage sites showed a similar pattern, TEGE↓ARGS, TAQE↓AGEG, and VSQE↓LGQR, suggesting that a single proteinase may be involved in the catabolism of aggrecan. Analysis of synovial fluids and serum of age-matched animals revealed the presence of aggrecan core protein peptides corresponding in size to those detected in vitro, thus indicating the cleavage observed in explant culture is the same as that which occurs in vivo.

10.
Beck ZQ  Lin YC  Elder JH 《Journal of virology》2001,75(19):9458-9469
We have used a random hexamer phage library to delineate similarities and differences between the substrate specificities of human immunodeficiency virus type 1 (HIV-1) and feline immunodeficiency virus (FIV) proteases (PRs). Peptide sequences were identified that were specifically cleaved by each protease, as well as sequences cleaved equally well by both enzymes. Based on amino acid distinctions within the P3-P3' region of substrates that appeared to correlate with these cleavage specificities, we prepared a series of synthetic peptides within the framework of a peptide sequence cleaved with essentially the same efficiency by both HIV-1 and FIV PRs, Ac-KSGVF↓VVNGLVK-NH2 (arrow denotes cleavage site). We used the resultant peptide set to assess the influence of specific amino acid substitutions on the cleavage characteristics of the two proteases. The findings show that when Asn is substituted for Val at the P2 position, HIV-1 PR cleaves the substrate at a much greater rate than does FIV PR. Likewise, Glu or Gln substituted for Val at the P2' position also yields peptides specifically susceptible to HIV-1 PR. In contrast, when Ser is substituted for Val at P1', FIV PR cleaves the substrate at a much higher rate than does HIV-1 PR. In addition, Asn or Gln at the P1 position, in combination with an appropriate P3 amino acid, Arg, also strongly favors cleavage by FIV PR over HIV PR. Structural analysis identified several protease residues likely to dictate the observed specificity differences. Interestingly, HIV PR Asp30 (Ile-35 in FIV PR), which influences specificity at the S2 and S2' subsites, and HIV-1 PR Pro-81 and Val-82 (Ile-98 and Gln-99 in FIV PR), which influence specificity at the S1 and S1' subsites, are residues which are often involved in development of drug resistance in HIV-1 protease. The peptide substrate KSGVF↓VVNGK, cleaved by both PRs, was used as a template for the design of a reduced amide inhibitor, Ac-GSGVFΨ(CH2NH)VVNGL-NH2. This compound inhibited both FIV and HIV-1 PRs with approximately equal efficiency. These findings establish a molecular basis for distinctions in substrate specificity between human and feline lentivirus PRs and offer a framework for development of efficient broad-based inhibitors.

11.
The centre of the human lens, which is composed of proteins that were synthesized prior to birth, is an ideal model for the evaluation of long‐term protein stability and processes responsible for the degradation of macromolecules. By analysing the sequences of peptides present in human lens nuclei, characteristic features of intrinsic protein instability were determined. Prominent was the cleavage on the N‐terminal side of serine residues. Despite accounting for just 9% of the amino acid composition of crystallins, peptides with N‐terminal Ser represented one‐quarter of all peptides. Nonenzymatic cleavage at Ser could be reproduced by incubating peptides at elevated temperatures. Serine residues may thus represent susceptible sites for autolysis in polypeptides exposed to physiological conditions over a period of years. Once these sites are cleaved, other chemical processes result in progressive removal or ‘laddering’ of amino acid residues from newly exposed N‐ and C‐termini. As N‐terminal Ser peptides originated from several crystallins with unrelated sequences, this may represent a general feature of long‐lived proteins.

12.
Several studies have addressed the interaction of the HIV Tat protein with the cell surface. Our analysis of the cell attachment-promoting activity of Tat and peptides derived from it revealed that the basic domain of Tat, not the arg-gly-asp (RGD) sequence, is required for cell attachment to Tat. Affinity chromatography with Tat peptides and immunoprecipitation with various anti-integrin antibodies suggest that the vitronectin-binding integrin, alpha v beta 5, is the cell surface protein that binds to the basic domain of Tat. The Tat basic domain contains the sequence RKKRRQRRR. A related sequence, KKQRFRHRNRKG, present in the heparin-binding domain of an alpha v beta 5 ligand, vitronectin, also bound alpha v beta 5 in affinity chromatography and, in combination with an RGD peptide, was an inhibitor of cell attachment to vitronectin. The alpha v beta 5 interaction with these peptides was not solely due to high content of basic amino acids in the ligand sequences; alpha v beta 5 did not bind substantially to peptides consisting entirely of arginine or lysine, whereas a beta 1 integrin did bind to these peptides. The interaction of alpha v beta 5 with Tat is atypical for integrins in that the binding to Tat is divalent cation independent, whereas the binding of the same integrin to an RGD-containing peptide or to vitronectin requires divalent cations. These data define an auxiliary integrin binding specificity for basic amino acid sequences. These basic domain binding sites may function synergistically with the binding sites that recognize RGD or equivalent sequences.

13.
The amino acid sequences of 11 peptides generated from human placental choline acetyltransferase were compared to the corresponding amino acid sequences predicted from the nucleotide sequence of a recently cloned porcine choline acetyltransferase cDNA. These peptides, which were generated by cyanogen bromide cleavage or tryptic digestion, accounted for 23% of the amino acids in the enzyme. Of the 145 amino acids sequenced, eight differed between the two species, yielding an identity of 94% over the regions sampled. Of the eight amino acids that differed, six could represent single base changes in the DNA sequence. These findings demonstrate strong sequence similarity between porcine and human choline acetyltransferase and indicate that they are closely related evolutionarily.
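The identity figure quoted above follows directly from the two counts in the abstract. A minimal sketch of the arithmetic, with the hypothetical helper name `percent_identity`:

```python
def percent_identity(compared, mismatches):
    """Percentage of compared positions that are identical."""
    if compared <= 0:
        raise ValueError("compared must be positive")
    return 100.0 * (compared - mismatches) / compared


# 145 residues sequenced, 8 differing between human and porcine enzyme.
identity = percent_identity(145, 8)
print(round(identity, 1))
```

Here 137 of 145 positions match, which is about 94.5% and agrees with the 94% reported.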

14.
The R3-R14 neurons of the marine mollusc Aplysia are neuroendocrine cells that express a gene encoding peptides I, II and histidine-rich basic peptide (HRBP), a myoactive peptide that excites Aplysia heart and enhances gut motility in vitro. Peptide II has been chemically characterized (35), but the complete primary structures of peptide I and HRBP have not been established by amino acid sequence analysis. HRBP, peptide I, and the prohormone (proHRBP) were therefore purified from acid extracts of Aplysia californica neural tissue using sequential gel filtration and reverse-phase high-performance liquid chromatography and chemically characterized. Amino acid sequence analysis demonstrated that HRBP was a 43-residue peptide whose sequence was: <Glu-Val-Ala-Gln-Met-His-Val-Trp-Arg-Ala-Val-Asn-His-Asp-Arg-Asn-His-Gly-Thr-Gly-Ser-Gly-Arg-His-Gly-Arg-Phe-Leu-Ile-Arg-Asn-Arg-Tyr-Arg-Tyr-Gly-Gly-Gly-His-Leu-Ser-Asp-Ala-COOH. Compositional and sequence analyses of peptide I and proHRBP demonstrated that peptide I was a 26-residue peptide with the following sequence: NH2-Glu-Glu-Val-Phe-Asp-Asp-Thr-Asp-Val-Gly-Asp-Glu-Leu-Thr-Asn-Ala-Leu-Glu-Ser-Val-Leu-Thr-Asp-Phe-Lys-Asp-COOH. These results demonstrated that the proHRBP sequence predicted by nucleotide sequence analysis of a cDNA clone (24) was in fact synthesized in R3-R14 neurons. Hydrophilicity and hydrophobicity profiles of preproHRBP, combined with charge distribution profiles and predictive secondary structural analysis, showed that cleavage at dibasic sequences was strongly associated with peaks of hydrophilicity in alpha-helical regions of the preprohormone.

15.
The amino acids of the B-chains of two abrins (designated as abrin-a and abrin-b) from the seeds of Abrus precatorius have been sequenced. The sequence of the B-chain of abrin-a was solved by analysis of peptides derived by enzymatic digestions with trypsin, lysylendopeptidase, and chymotrypsin, as well as by chemical cleavage with cyanogen bromide. The sequence of the B-chain of abrin-b was determined by sequence analysis of tryptic peptides and comparison of these sequences with those of the corresponding peptides of the B-chain of abrin-a. The B-chains of abrin-a and abrin-b consist of 268 amino acid residues and share 256 identical residues. Comparison of their sequences with that of the ricin B-chain shows that 60% of the residues of both abrin B-chains are identical to those of the ricin B-chain and that two saccharide-binding sites in ricin B-chain identified by a crystallographic study are highly conserved in both abrin B-chains.

16.
The homodimeric HIV-1 protease is the target of some of the most effective antiviral AIDS therapy, as it facilitates viral maturation by cleaving ten asymmetric and nonhomologous sequences in the Gag and Pol polyproteins. Since the specificity of this enzyme is not easily determined from the sequences of these cleavage sites alone, we solved the crystal structures of complexes of an inactive variant (D25N) of HIV-1 protease with six peptides that correspond to the natural substrate cleavage sites. When the protease binds to its substrate and buries nearly 1000 Å² of surface area, the symmetry of the protease is broken, yet most internal hydrogen bonds and waters are conserved. However, no substrate side chain hydrogen bond is conserved. Specificity of HIV-1 protease appears to be determined by an asymmetric shape rather than a particular amino acid sequence.

17.
Presecretory signal peptides of 39 proteins from diverse prokaryotic and eukaryotic sources have been compared. Although varying in length and amino acid composition, the labile peptides share a hydrophobic core of approximately 12 amino acids. A positively charged residue (Lys or Arg) usually precedes the hydrophobic core. Core termination is defined by the occurrence of a charged residue, a sequence of residues which may induce a beta-turn in a polypeptide, or an interruption in potential alpha-helix or beta-extended strand structure. The hydrophobic cores contain, by weight average, 37% Leu: 15% Ala: 10% Val: 10% Phe: 7% Ile plus 21% other hydrophobic amino acids arranged in a non-random sequence. Following the hydrophobic cores (aligned by their last residue) a highly non-random and localized distribution of Ala is apparent within the initial eight positions following the core: (formula; see text). Coincident with this observation, Ala-X-Ala is the most frequent sequence preceding signal peptidase cleavage. We propose the existence of a signal peptidase recognition sequence A-X-B with the preferred cleavage site located after the sixth amino acid following the core sequence. Twenty-two of the above 27 underlined Ala residues would participate as A or B in peptidase cleavage. Position A includes the larger aliphatic amino acids, Leu, Val and Ile, as well as the residues already found at B (principally Ala, Gly and Ser). Since a preferred cleavage site can be discerned from carboxyl and not amino terminal alignment of the hydrophobic cores it is proposed that the carboxyl ends are oriented inward toward the lumen of the endoplasmic reticulum where cleavage is thought to occur. This orientation coupled with the predicted beta-turn typically found between the core and the cleavage site implies reverse hairpin insertion of the signal sequence. The structural features which we describe should help identify signal peptides and cleavage sites in presumptive amino acid sequences derived from DNA sequences.
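The proposed A-X-B recognition sequence can be written as a simple scan over a sequence. The sketch below is an illustration under stated assumptions, not the authors' procedure: it takes A as Leu, Val, Ile, Ala, Gly or Ser and B as Ala, Gly or Ser (the abstract says "principally", so these sets are approximate), it ignores the distance from the hydrophobic core, and both the function name and the example sequence are hypothetical.

```python
# Residue sets read from the abstract; treated here as approximate.
A_RESIDUES = set("LVIAGS")  # first position of A-X-B
B_RESIDUES = set("AGS")     # last position of A-X-B, cleavage follows B


def candidate_cleavage_sites(seq):
    """Indices i where cleavage could fall between seq[i-1] and seq[i].

    A site qualifies when seq[i-3] is an A residue and seq[i-1] is a
    B residue; the residue in between (X) is unconstrained.
    """
    return [i for i in range(3, len(seq) + 1)
            if seq[i - 3] in A_RESIDUES and seq[i - 1] in B_RESIDUES]


# Made-up N-terminal sequence, for illustration only.
example = "MKLLLLLAVAQT"
for site in candidate_cleavage_sites(example):
    print(example[:site] + " | " + example[site:])
```

A scan like this will over-predict on its own. The abstract's further criterion, that the preferred site lies after the sixth residue following the core, would be needed to rank the candidates.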

18.
The feline immunodeficiency virus (FIV) protease is essential for virion maturation and subsequent viral replication in that it cleaves the Gag and Gag/Pol polyproteins at eight sites to release the respective structural proteins and enzymes. During purification of a recombinant FIV protease (PR), we noted that it underwent autoproteolysis (autolysis) to give discrete cleavage products. These additional PR cleavage sites were defined using N-terminal amino acid sequence analysis and mass spectrometry. Protease breakdown products were also found in FIV virions and were of the same apparent molecular weights as the in vitro autolysis products. Four primary PR autolysis sites were blocked via substitution of either the P1 amino acid with a beta-branched amino acid or the P1' amino acid with lysine. Cleavage-resistant PRs which had Km and kcat values similar to those of FIV PR were constructed. An autolysis time course determined that blocking all four primary autolysis sites yielded a cleavage-resistant PR which was enzymatically stable. Concomitant with autolysis is the generation of an N-terminally truncated form of the PR (Thr6/PR) which has enhanced stability with respect to that of FIV PR. A structural basis for the Thr6/PR activity is presented, as are the possible roles of autolysis in the viral replication cycle.

19.
20.
MOTIVATION: PHYSEAN predicts protein classes with highly variable sequences on the basis of their physical, chemical and biological characteristics such as diverse hydrophobicity, structural propensity and steric properties. These characteristics, calculated from multiple positions in a sequence, may be conserved even between sequences that fail to produce alignments at any acceptable level of statistical significance. PHYSEAN complements methods that require sequence alignments (BLAST, FASTA, dynamic programming) by adding less residue- and position-specific physicochemical information on the protein or the domain. RESULTS: We predict proteins or their domains, such as signal peptides, using physical, chemical, geometric, and biological properties of the 20 amino acids. This comprehensive set of properties may cover the diagnostic functional and structural aspects of a domain or a protein class. We automatically select and weight a subset of properties so as to discriminate between, e.g., signal peptides and amino-termini of cytosolic proteins with the lowest number of incorrect predictions. This optimal selection of properties and their weights significantly decreases the number of incorrect predictions as compared to any single property or any combination of unweighted properties. Weights have been optimized by high-performance linear programming models that systematically find the optimal solution from among an astronomical number of property/weight combinations. PHYSEAN's performance is demonstrated by highly accurate predictions of signal peptides (the vehicles for protein transport across membranes) and their cleavage sites. The results indicate that reliable predictions are possible even in the absence of sequence conservation using an automated physical and chemical analysis of proteins.
