Similar articles
 20 similar articles found
1.
O-linked glycosylation is a post-translational and post-folding event involving exposed S/T residues at beta-turns or in regions with extended conformation. O-linked sites are difficult to predict from sequence analyses compared to N-linked sites. Here we compare the results of chemical analyses of isolated glycopeptides with the prediction using the neural network prediction method NetOGlyc3.1, a procedure that has been reported to correctly predict 76% of O-glycosylated residues in proteins. Using the heavily glycosylated human insulin receptor as the test protein, six sites of mucin-type O-glycosylation were found at residues T744, T749, S757, S758, T759, and T763, compared to the three sites (T759 and T763 correctly, T756 incorrectly) predicted by the neural network method. These six sites occur in a 20 residue segment that begins nine residues downstream from the start of the insulin receptor beta-chain. This region, which also includes N-linked glycosylation sites at N742 and N755, is predicted to lack secondary structure and is followed by residues 765-770, the known linear epitope for the monoclonal antibody 18-44.
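The comparison reported in this abstract can be restated as simple set arithmetic. The sketch below uses only the residue labels given above; the variable names and the choice of sensitivity and precision as summary figures are illustrative assumptions, not part of the original study.

```python
# Sites as reported in the abstract (residue letter + position).
observed = {"T744", "T749", "S757", "S758", "T759", "T763"}  # chemical analysis
predicted = {"T756", "T759", "T763"}                          # reported NetOGlyc3.1 output

true_pos = observed & predicted    # predicted and experimentally confirmed
false_pos = predicted - observed   # predicted but not confirmed
false_neg = observed - predicted   # confirmed but missed by the predictor

# Summary figures for this single protein (illustrative choice of metrics).
sensitivity = len(true_pos) / len(observed)
precision = len(true_pos) / len(predicted)

print(sorted(true_pos), sorted(false_pos), sorted(false_neg))
print(f"sensitivity={sensitivity:.2f} precision={precision:.2f}")
```

For this one protein, two of the six experimentally determined sites are recovered, which is well below the 76% figure quoted for the method in general.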

2.
Database analysis of O-glycosylation sites in proteins
Statistical analysis was carried out to study the sequential aspects of amino acids around the O-glycosylated Ser/Thr. 992 sequences containing O-glycosylated Ser/Thr were selected from the O-GLYCBASE database of O-glycosylated proteins. The frequency of occurrence of amino acid residues around the glycosylated Ser/Thr revealed that there is an increased number of proline residues around the O-glycosylation sites in comparison with the nonglycosylated serine and threonine residues. The deviation parameter, calculated as a measure of preferential and nonpreferential occurrence of amino acid residues around the glycosylation site, shows that Pro has the maximum preference around the O-glycosylation site. Pro at +3 and/or -1 positions strongly favors glycosylation irrespective of single and multiple glycosylation sites. In addition, serine and threonine are preferred around the multiple glycosylation sites due to the effect of clusters of closely spaced glycosylated Ser/Thr. The preference of amino acids around the sites of mucin-type glycosylation is found likely to be similar to that of the O-glycosylation sites when taken together, but acidic amino acids are more strongly preferred around Ser/Thr in mucin-type glycosylation when the data are compared as a whole. Aromatic amino acids hinder O-glycosylation, in contrast to N-glycosylation. Cysteine and amino acids with bulky side chains inhibit O-glycosylation. The preference for certain potential sequence motifs of glycosylation is discussed.
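The kind of position-specific counting described in this abstract might look like the following minimal sketch. The abstract does not define its deviation parameter, so the code only compares raw residue frequencies at a given offset; the function names, the toy sequence, and the site annotation are all hypothetical.

```python
from collections import Counter

def residue_counts(seq, sites, offset):
    """Count the residues found at `offset` from each 0-based site index."""
    counts = Counter()
    for i in sites:
        j = i + offset
        if 0 <= j < len(seq):       # ignore positions that fall off the sequence
            counts[seq[j]] += 1
    return counts

def frequency(counts, residue):
    """Fraction of counted positions occupied by `residue` (0.0 if none counted)."""
    total = sum(counts.values())
    return counts[residue] / total if total else 0.0

# Hypothetical toy data, for illustration only (not taken from O-GLYCBASE).
seq = "APTSPAPGSTAPPAGKSLEW"
glyco_sites = [2, 9]  # made-up glycosylated positions (0-based)
other_sites = [i for i, aa in enumerate(seq) if aa in "ST" and i not in glyco_sites]

for offset in (-1, +3):
    f_gly = frequency(residue_counts(seq, glyco_sites, offset), "P")
    f_non = frequency(residue_counts(seq, other_sites, offset), "P")
    print(f"offset {offset:+d}: Pro frequency glycosylated={f_gly:.2f} other={f_non:.2f}")
```

With a real data set, the same two functions would be run over all annotated sequences and the counts pooled before frequencies are compared.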

3.
Dictyostelium discoideum has been suggested as a eukaryotic model organism for glycobiology studies. Presently, the characteristics of acceptor sites for the N-acetylglucosaminyl-transferases in Dictyostelium discoideum, which link GlcNAc in an alpha linkage to hydroxyl residues, are largely unknown. This motivates the development of a species-specific method for prediction of O-linked GlcNAc glycosylation sites in secreted and membrane proteins of D. discoideum. The method presented here employs a jury of artificial neural networks. These networks were trained to recognize the sequence context and protein surface accessibility in 39 experimentally determined O-alpha-GlcNAc sites found in D. discoideum glycoproteins expressed in vivo. Cross-validation of the data revealed a correlation in which 97% of the glycosylated and nonglycosylated sites were correctly identified. Based on the currently limited data set, an abundant periodicity of two (positions -3, -1, +1, +3, etc.) in proline residues alternating with hydroxyl amino acids was observed upstream and downstream of the acceptor site. This was a consequence of the spacing of the glycosylated residues themselves, which were peculiarly found to be situated only at even positions with respect to each other, indicating that these may be located within beta-strands. The method has been used for a rapid and ranked scan of the fraction of the Dictyostelium proteome available in public databases, remarkably 25-30% of which were predicted glycosylated. The scan revealed acceptor sites in several proteins known experimentally to be O-glycosylated at unmapped sites. The available proteome was classified into functional and cellular compartments to study any preferential patterns of glycosylation. A sequence based prediction server for GlcNAc O-glycosylations in D. discoideum proteins has been made available through the WWW at http://www.cbs.dtu.dk/services/DictyOGlyc/ and via E-mail to DictyOGlyc@cbs.dtu.dk.

4.
Although posttranslational protein modifications are generally thought to perform important cellular functions, recent studies showed that a large fraction of phosphorylation sites are not evolutionarily conserved. Whether the same is true for other protein modifications, such as N-glycosylation, is an open question. N-glycosylation is a form of cotranslational and posttranslational modification that occurs by enzymatic addition of a polysaccharide, or glycan, to an asparagine (N) residue of a protein. Examining a large set of experimentally determined mouse N-glycosylation sites, we find that the evolutionary rate of glycosylated asparagines is significantly lower than that of nonglycosylated asparagines of the same proteins. We further confirm that the conservation of glycosylated asparagines is accompanied by the conservation of the canonical motif sequence for glycosylation, suggesting that the above substitution rate difference is related to glycosylation. Interestingly, when solvent accessibility is considered, the substitution rate disparity between glycosylated and nonglycosylated asparagines is highly significant at solvent accessible sites but not at solvent inaccessible sites. Thus, although the solvent inaccessible glycosylation sites were experimentally identified, they are unlikely to be genuine or physiologically important. For solvent accessible asparagines, our analysis reveals a widespread and strong functional constraint on glycosylation, unlike what has been observed for phosphorylation sites in most studies, including our own analysis. Because the majority of N-glycosylation occurs at solvent accessible sites, our results show an overall functional importance for N-glycosylation.

5.

Background

As one of the most common protein post-translational modifications, glycosylation is involved in a variety of important biological processes. Computational identification of glycosylation sites in protein sequences becomes increasingly important in the post-genomic era. A new encoding scheme was employed to improve the prediction of mucin-type O-glycosylation sites in mammalian proteins.

Results

A new protein bioinformatics tool, CKSAAP_OGlySite, was developed to predict mucin-type O-glycosylation serine/threonine (S/T) sites in mammalian proteins. Using the composition of k-spaced amino acid pairs (CKSAAP) based encoding scheme, the proposed method was trained and tested in a new and stringent O-glycosylation dataset with the assistance of Support Vector Machine (SVM). When the ratio of O-glycosylation to non-glycosylation sites in training datasets was set as 1:1, 10-fold cross-validation tests showed that the proposed method yielded a high accuracy of 83.1% and 81.4% in predicting O-glycosylated S and T sites, respectively. Based on the same datasets, CKSAAP_OGlySite resulted in a higher accuracy than the conventional binary encoding based method (about +5.0%). When trained and tested in 1:5 datasets, the CKSAAP encoding showed a more significant improvement than the binary encoding. We also merged the training datasets of S and T sites and integrated the prediction of S and T sites into one single predictor (i.e. S+T predictor). Either in 1:1 or 1:5 datasets, the performance of this S+T predictor was always slightly better than those predictors where S and T sites were independently predicted, suggesting that the molecular recognition of O-glycosylated S/T sites seems to be similar and the increase of the S+T predictor's accuracy may be a result of expanded training datasets. Moreover, CKSAAP_OGlySite was also shown to have better performance when benchmarked against two existing predictors.

Conclusion

Because of the CKSAAP encoding's ability to reflect characteristics of the sequences surrounding mucin-type O-glycosylation sites, CKSAAP_OGlySite has been proved more powerful than the conventional binary encoding based method. This suggests that it can be used as a competitive mucin-type O-glycosylation site predictor by the biological community. CKSAAP_OGlySite is now available at http://bioinformatics.cau.edu.cn/zzd_lab/CKSAAP_OGlySite/.
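The composition of k-spaced amino acid pairs named in this abstract can be sketched as follows. This is a minimal illustration and not the authors' implementation: the abstract does not state the window length or the range of k, so `k_max`, the 20-letter alphabet without a gap symbol, and the example window are all assumptions.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
PAIRS = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]  # 400 ordered pairs

def cksaap(window, k_max=2):
    """Return the k-spaced pair composition of `window` for k = 0..k_max.

    For each gap k, the pairs (window[i], window[i + k + 1]) are counted and
    divided by the number of such pairs, giving 400 values per k.
    """
    features = []
    for k in range(k_max + 1):
        counts = dict.fromkeys(PAIRS, 0)
        n_pairs = max(len(window) - k - 1, 0)
        for i in range(n_pairs):
            pair = window[i] + window[i + k + 1]
            if pair in counts:      # pairs with non-standard symbols are not counted
                counts[pair] += 1
        features.extend(counts[p] / n_pairs if n_pairs else 0.0 for p in PAIRS)
    return features

# Hypothetical 7-residue window centred on a candidate Thr.
vec = cksaap("APPTSAP", k_max=1)
print(len(vec), vec[PAIRS.index("AP")], vec[400 + PAIRS.index("AP")])
```

The resulting fixed-length vector is the kind of input a support vector machine can be trained on; a binary (one-hot) encoding of the same window would instead record which residue sits at each position.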

6.
The tandem repeat of the MUC1 protein core is a major site of O-glycosylation that is catalyzed by several polypeptide GalNAc-transferases. To define structural features of the peptide substrates that contribute to acceptor substrate efficiency, solution structures of the 21-residue peptide AHGVTSAPDTRPAPGSTAPPA (AHG21) from the MUC1 protein core and four isoforms, glycosylated with alpha-N-acetylgalactosamine on corresponding Thr residues, AHG21 (T5), AHG21 (T10), AHG21 (T17), and AHG21 (T5,T17), were investigated by NMR spectroscopy and computational methods. NMR studies revealed that sugar attachment affected the conformational equilibrium of the peptide backbone near the glycosylated Thr residues. The clustering of the low-energy conformations for nonglycosylated and glycosylated counterparts within the VTSA, DTR, and GSTA fragments (including all sites of potential glycosylation catalyzed by GalNAc-T1, -T2, and -T4 transferases) showed that the glycosylated peptides display distinct structural propensities that may explain, in part, the differences in substrate specificities exhibited by these polypeptide GalNAc-transferases.

7.
Gerken TA, Tep C, Rarick J. Biochemistry, 2004, 43(30): 9888-9900
A large family of uridine 5'-diphosphate (UDP)-alpha-N-acetylgalactosamine (GalNAc):polypeptide N-acetylgalactosaminyl transferases (ppGalNAc Ts) initiates mucin-type O-glycan biosynthesis at serine and threonine. The peptide substrate specificities of individual family members are not well characterized or understood, leaving an inability to rationally predict or comprehend sites of O-glycosylation. Recently, a kinetic modeling approach demonstrated neighboring residue glycosylation as a major factor modulating the O-glycosylation of the porcine submaxillary gland mucin 81 residue tandem repeat by ppGalNAc T1 and T2 [Gerken et al. (2002) J. Biol. Chem. 277, 49850-49862]. To confirm the general applicability of this model and its parameters, the ppGalNAc T1 and T2 glycosylation kinetics of the 80+ residue tandem repeat from the canine submaxillary gland mucin was obtained and characterized. To reproduce the glycosylation patterns of both mucins (comprising 50+ serine/threonine residues), specific effects of neighboring peptide sequence, in addition to the previously described effects of neighboring residue glycosylation, were required of the model. Differences in specificity of the two transferases were defined by their sensitivities to neighboring proline and nonglycosylated hydroxyamino acid residues, from which a ppGalNAc T2 motif was identified. Importantly, the model can approximate the previously reported ppGalNAc T2 glycosylation kinetics of the IgA1 hinge domain peptide [Iwasaki, et al. (2003) J. Biol. Chem. 278, 5613-5621], further validating both the approach and the ppGalNAc T2 positional weighting parameters. The characterization of ppGalNAc transferase specificity by this approach may prove useful for the search for isoform-specific substrates, the creation of isoform-specific inhibitors, and the prediction of mucin-type O-glycosylation sites.

8.
N-Glycosylation, the most common and most versatile protein modification reaction, occurs at the beta-amide of the asparagine of the Asn-Xaa-Ser/Thr sequon. For reasons that are unclear, not all such sequons are glycosylated. To find patterns that affect glycosylation, we examined the amino acid residues from the 20th preceding the sequon to the 20th residue following it, using bioinformatics tools. A clean data set of annotated, experimentally verified, glycosylated and nonglycosylated sequons derived from 617 well-defined nonredundant N- and N-,O-glycoproteins listed in SWISS-PROT (June 2002) was used. NXS and NXT sequons were analyzed separately. Although no overt patterns were found to explain sequon occupancy or nonoccupancy, trends for over- or underrepresentation of certain amino acids at particular positions were statistically significant and different in NXS and NXT sequons. In extension of earlier reports, none of the 80 Asn-Pro-Ser/Thr found were glycosylated, and a markedly low level of glycosylation was seen in sequons with Pro at the position following the Ser/Thr. In addition, a general observation was made that the considerable number of glycosylated sequons in the C-terminal 10 residues of glycoproteins suggests that N-glycosylation in these cases may be posttranslational and not cotranslational, as widely accepted.
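As a small illustration of the Asn-Xaa-Ser/Thr sequon discussed in this abstract, the sketch below scans a sequence for candidate sequons and excludes Pro at the Xaa position, in line with the observation that no Asn-Pro-Ser/Thr was glycosylated. The function name and the example sequence are hypothetical, and a match only marks a potential site: as the abstract stresses, not all sequons are occupied.

```python
import re

# A lookahead is used so that overlapping sequons (e.g. "NNST") are all reported.
SEQUON = re.compile(r"(?=N[^P][ST])")

def find_sequons(seq):
    """Return (1-based Asn position, tripeptide) for each Asn-Xaa-Ser/Thr, Xaa != Pro."""
    return [(m.start() + 1, seq[m.start():m.start() + 3]) for m in SEQUON.finditer(seq)]

# Hypothetical sequence: NGS matches, NPT is excluded, NNS and NST overlap.
hits = find_sequons("MNGSANPTLNNSTK")
print(hits)
```

Splitting the hits by their third residue would give the separate NXS and NXT groups that the study analyzed.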

9.
In some hosts, human proteins are expressed incorrectly glycosylated or nonglycosylated. Although it is accepted that glycosylation contributes to the stability of the protein in solution, the effect of glycosylation on the stability of human antibodies is not fully understood. In this work, we present solubility studies of two human antibodies that have the same primary structure but different glycosylation patterns. The studies were done by monitoring the partitioning behavior of both proteins in a series of aqueous two‐phase systems at and away from the isoelectric point of the proteins and at different temperatures. Our studies show that in the absence of direct electrostatic forces, the partitioning behavior of the antibodies depends on the presence or absence of the polysaccharide chains. Overall, the nonglycosylated protein is less soluble than the glycosylated one. The potential of aqueous two‐phase systems for the separation of the glycosylated and nonglycosylated proteins was also explored. A simple series of extractions seems to be enough to separate the glycosylated variety from the nonglycosylated one at high purity but low yields. © 2013 American Institute of Chemical Engineers Biotechnol. Prog., 29:943–950, 2013

10.
The secretion of N-linked glycosylated alpha-lactalbumin was much higher in the expression system of yeast Pichia pastoris carrying goat alpha-lactalbumin cDNA than in mammalian milk. This is possibly because of the presence of N-linked glycosylation signal sequences, Asn(45)-Asp(46)-Ser(47) and Asn(74)-Ile(75)-Ser(76), in wild-type alpha-lactalbumin. Attempts to elucidate the mechanism of the higher secretion of glycosylated alpha-lactalbumin in P. pastoris were made. Mutant N45D that deleted the N-linked glycosylation signal sequence at position 45 predominantly secreted nonglycosylated protein. On the other hand, mutant D46N with another N-glycosylation signal site at position 46 only secreted N-linked glycosylated alpha-lactalbumin, i.e. not the nonglycosylated protein. The total secreted amount of mutant N45D was greatly enhanced, while the secreted amounts of the wild-type and mutant D46N were very low, suggesting that the increase in the number of glycosylation sites greatly reduced the secretion of alpha-lactalbumin. It seems likely that the glycosylated alpha-lactalbumin may be degraded by the quality control system.

11.
Two-pore domain potassium (K2P) channels play fundamental roles in cellular processes by enabling a constitutive leak of potassium from cells in which they are expressed, thus influencing cellular membrane potential and activity. Hence, regulation of these channels is of critical importance to cellular function. A key regulatory mechanism of K2P channels is the control of their cell surface expression. Membrane protein delivery to and retrieval from the cell surface is controlled by their passage through the secretory and endocytic pathways, and post-translational modifications regulate their progression through these pathways. All but one of the K2P channels possess consensus N-linked glycosylation sites, and here we demonstrate that the conserved putative N-glycosylation site in K2P3.1 and K2P9.1 is a glycan acceptor site. Patch clamp analysis revealed that disruption of channel glycosylation reduced K2P3.1 current, and flow cytometry was instrumental in attributing this to a decreased number of channels on the cell surface. Similar findings were observed when cells were cultured in reduced glucose concentrations. Disruption of N-linked glycosylation has less of an effect on K2P9.1, with a small reduction in number of channels on the surface observed, but no functional implications detected. Because nonglycosylated channels appear to pass through the secretory pathway in a manner comparable with glycosylated channels, the evidence presented here suggests that the decreased number of nonglycosylated K2P3.1 channels on the cell surface may be due to their decreased stability.

12.
The mammalian zona pellucida (ZP) is an extracellular glycoprotein coat that plays vital roles throughout fertilisation and preimplantation development. Like that of eutherian mammals, the brushtail possum ZP is composed of three glycosylated proteins of 137 kDa, 92 kDa and 62 kDa. The 62 kDa protein is a ZP3 orthologue based on its nucleotide and deduced amino acid sequence. The brushtail possum ZP3 cDNA isolated in this study is 1305 nucleotides with an open reading frame encoding a 422 amino acid peptide of 45.7 kDa. Possum ZP3 has a 46% amino acid identity with eutherian ZP3 and shares similar structural characteristics including 12 conserved cysteine residues, N-linked glycosylation sites and hydrophobic regions. Like human and rabbit ZP1, an altered furin cleavage site upstream of the C-terminal hydrophobic domain also occurs in possum ZP3 (S-R-K-R), suggestive of processing by a furin-related endoprotease. Expression of brushtail possum ZP3 is limited to the ovary. Characterisation of brushtail possum ZP3 will enable examination of its functional role in marsupial fertilisation and its effectiveness as an immunocontraceptive agent.

13.
O-GlcNAcylation is an inducible, highly dynamic and reversible posttranslational modification, which regulates numerous cellular processes such as gene expression, translation, immune reactions, protein degradation, protein–protein interaction, apoptosis, and signal transduction. In contrast to N-linked glycosylation, O-GlcNAcylation does not display a strict amino acid consensus sequence, although serine or threonine residues flanked by proline and valine are preferred sites of O-GlcNAcylation. Based on this information, computational prediction tools of O-GlcNAc sites have been developed. Here, we retrospectively assessed the performance of two available O-GlcNAc prediction programs, the YinOYang 1.2 server and OGlcNAcScan, by comparing their predictions for recently discovered experimentally validated O-GlcNAc sites. Both prediction programs efficiently identified O-GlcNAc sites situated in an environment resembling the consensus sequence P-P-V-[ST]-T-A. However, both prediction programs revealed numerous false negative O-GlcNAc predictions when the site of modification was located in an amino acid sequence differing from the known consensus sequence. By searching for a common sequence motif, we found that O-GlcNAcylation of nucleocytoplasmic proteins preferably occurs at serine and threonine residues flanked downstream by proline and valine and upstream by one to two alanines followed by a stretch of serine and threonine residues. However, O-GlcNAcylation of proteins located in the mitochondria or in the secretory lumen occurs at different sites and does not follow a distinct consensus sequence. Thus, our study indicates the limitations of the presently available computational prediction methods for O-GlcNAc sites and suggests that experimental validation is mandatory. Continuous updating and further development of available databases will be the key to improving the performance of O-GlcNAc site prediction.

14.
A recombinant expression system was developed to analyse sequence determinants involved in O-glycosylation of proteins in mycobacteria. By expressing peptide sequences corresponding to known glycosylation sites within a chimeric lipoprotein construct, amino acids flanking modified threonine residues were found to have an important influence on glycosylation. The expression system was used to screen mycobacterial sequences selected using a neural network (NetOglyc) trained on eukaryotic O-glycoproteins. Evidence of glycosylation was obtained for eight of 11 proteins tested. The results suggest that sites involved in O-glycosylation of mycobacterial and eukaryotic proteins share similar structural features.

15.
Vaccinia virus complement control protein (VCP) is one of the proteins encoded by vaccinia virus to modulate the host inflammatory response. VCP modulates the inflammatory response and protects viral habitat by inhibiting the classical and the alternative pathways of complement activation. The extended structure of VCP, mobility between its sequential domains, charge distribution and type of residues at the binding regions are factors that have been identified to influence its ability to bind to complement proteins. We report that a Lister strain of vaccinia virus encodes a VCP homolog (Lis VCP) that is functional, glycosylated, has two amino acids less than the well-characterized VCP from vaccinia virus WR strain (WR VCP), and the human smallpox inhibitor of complement enzymes (SPICE) from variola virus. The glycosylated VCP of Lister is immunogenic in contrast to the weak immunogenicity of the nonglycosylated VCP. Lis VCP is the only orthopoxviral VCP homolog found to be glycosylated, and we speculate that glycosylation influences its pattern of complement inhibition. We also correlate dimerization of VCP observed only in mammalian and baculovirus expression systems to higher levels of activity than monomers, observed in the yeast expression system.

16.
We have isolated the Xenopus homolog (Xint-1) of the mouse protooncogene int-1 from a neurula stage 17 cDNA library. The deduced protein sequence of Xint-1 includes 371 amino acids. The Xint-1 protein is more similar to the mammalian int-1 product (69%), than to the Drosophila counterpart of int-1, wingless (50%). Xint-1 shares several characteristics of secreted proteins with the other int-1 homologs: it has a hydrophobic leader, multiple conserved potential N-linked glycosylation sites and is rich in cysteine residues. All 23 cysteines are conserved in the three proteins. Xint-1 is transiently expressed during the neurula stages of early Xenopus development.

17.
Glycosylation and translocation of the simian rotavirus protein VP7, a resident ER protein, do not occur co-translationally in vivo. In pulse-chase experiments in COS cells, nonglycosylated VP7 was still detectable after a 25-min chase period, although the single glycosylation site was only 18 residues beyond the signal peptide cleavage site. After labeling, glycosylated and nonglycosylated VP7 were recovered in microsomes but the latter was sensitive to trypsin (i.e., the nascent protein became membrane associated) but most of it entered the ER posttranslationally because of a rate-limiting step early in translocation. In contrast with the simian protein, bovine VP7 was glycosylated and translocated rapidly. Thus, delayed translocation per se was not required for retention of VP7 in the ER. By constructing hybrid proteins, it was further shown that the signal peptide together with residues 64-111 of the simian protein caused delayed translocation. The same sequences were also necessary and sufficient for retention of simian VP7 in the ER. The data are consistent with the idea that certain proteins are inserted into the ER membrane in a loop configuration.

18.
In this study we present a method for determination of O-glycosylation sites in glycopeptides, based on partial vapor-phase acid hydrolysis in combination with mass spectrometric analysis. Pentafluoropropionic acid and hydrochloric acid were used for the hydrolysis of glycosylated peptides. The reaction conditions were optimized for efficient polypeptide backbone cleavages with minimal cleavage of glycosidic bonds. The glycosylated residues were identified by mass spectrometric analysis of the hydrolytic cleavage products. Although glycosidic bonds are partially cleaved under acid hydrolysis, the resulting mass spectra allowed unambiguous determination of the glycosylation sites. Examples are shown with mannosyl- and mucin-type glycopeptides. Performing the hydrolysis in vapor eliminates the risk for contamination of the sample with impurities from the reagents, thus allowing analysis of the reaction products without further purification both by matrix-assisted laser desorption/ionization and electrospray ionization mass spectrometry.

19.
The specificities of the UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferases, which link the carbohydrate GalNAc to the side-chain of certain serine and threonine residues in mucin type glycoproteins, are presently unknown. The specificity seems to be modulated by sequence context, secondary structure and surface accessibility. The sequence context of glycosylated threonines was found to differ from that of serine, and the sites were found to cluster. Non-clustered sites had a sequence context different from that of clustered sites. Charged residues were disfavoured at positions -1 and +3. A jury of artificial neural networks was trained to recognize the sequence context and surface accessibility of 299 known and verified mucin type O-glycosylation sites extracted from O-GLYCBASE. The cross-validated NetOglyc network system correctly found 83% of the glycosylated and 90% of the non-glycosylated serine and threonine residues in independent test sets, thus proving more accurate than matrix statistics and vector projection methods. Predictions of O-glycosylation sites in the envelope glycoprotein gp120 from the primate lentiviruses HIV-1, HIV-2 and SIV are presented. The most conserved O-glycosylation signals in these evolutionary-related glycoproteins were found in their first hypervariable loop, V1. However, the strain variation for HIV-1 gp120 was significant. A computer server, available through WWW or E-mail, has been developed for prediction of mucin type O-glycosylation sites in proteins based on the amino acid sequence. The server addresses are http://www.cbs.dtu.dk/services/NetOGlyc/ and netOglyc@cbs.dtu.dk.

20.
Mammalian mucin-type O-glycosylation is initiated by a large family of ~20 UDP-GalNAc:polypeptide α-N-acetylgalactosaminyltransferases (ppGalNAc Ts) that transfer α-GalNAc from UDP-GalNAc to Ser and Thr residues of polypeptide acceptors. Characterizing the peptide substrate specificity of each isoform is critical to understanding their properties, biological roles, and significance. Presently, only the specificities of ppGalNAc T1, T2, and T10 and the fly orthologues of T1 and T2 have been systematically characterized utilizing random peptide substrates. We now extend these studies to ppGalNAc T3, T5, and T12, transferases variously associated with human disease. Our results reveal several common features; the most striking is the similar pattern of enhancements for the three residues C-terminal to the site of glycosylation for those transferases that contain a common conserved Trp. In contrast, residues N-terminal to the site of glycosylation show a wide range of isoform-specific enhancements, with elevated preferences for Pro, Val, and Tyr being the most common at the -1 position. Further analysis reveals that the ratio of positive (Arg, Lys, and His) to negative (Asp and Glu) charged residue enhancements varied among transferases, thus further modulating substrate preference in an isoform-specific manner. By utilizing the obtained transferase-specific preferences, the glycosylation patterns of the ppGalNAc Ts against a series of peptide substrates could roughly be reproduced, demonstrating the potential for predicting isoform-specific glycosylation. We conclude that each ppGalNAc T isoform may be uniquely sensitive to peptide sequence and overall charge, which together dictate the substrate sites that will be glycosylated.
