首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Rao RS  Bernd W 《Bioinformation》2010,5(5):208-212
Protein N-glycosylation requires the presence of asparagine (N) in the consensus tri-peptide NXS/T (where X is any amino acid, S is serine and T is threonine). Several factors affect the glycosylation potential of NXS/T sequons and one such factor is the type of amino acid at position X. While proline was shown to negatively affect N-glycosylation, the nature of other amino acids at this position is not clear. Using Markov chain analysis of tri-peptide NXS/T from viral, archaeal and eukaryotic proteins as well as experimentally confirmed N-glycosylated sequons from eukaryotic proteins, we show here that the occurrence of most sequon types differ significantly from the expected probability. Sequon types with F, G, I, S, T and V amino acids are consistently preferred while those with P and charged amino acids are under-represented in all four groups. Further, proteins contained far fewer number of possible sequon types (maximum 20 types for NXS or NXT taken separately) for any given number of sequons, which may be explained based on random sampling. Consistent with the present finding, majority of the over-represented sequons found in two important viral envelope glycoproteins (hemagglutinin of influenza A H3N2 and glycoprotein120 of HIV-1) are indeed preferred sequon types, which may provide a selective advantage. Accordingly, although there seems to be some preference for sequons, this preference may not be unique to N-glycosylation.  相似文献   

2.
Many viruses are known to undergo rapid evolutionary changes under selective pressures. The HIV-1 envelope glycoprotein 120 (gp120) shows extreme selection for NXS/T sequons, the potential sites of N-glycosylation. Although the average number of sequons in gp120 appears to be relatively stable in the recent past, even slight changes in the distribution of sequons may potentially play crucial roles in protein interaction and viral infection. This study tracked the prevalence and distribution of NXS/T sequons in gp120 over a period of 29 years (from 1981 to 2009). The gp120 showed location specific distribution of sequons with higher density in the outer domain of the molecule. The NXT sequon density decreased in the outer domain (despite the increase in the sequon specific amino acid threonine), but increased in the inner domain. By contrast, the NXS sequon density increased specifically in the outer domain. Related changes were also seen in the distribution probabilities of sequons in two domains. The results indicate that the gp120, chiefly in subtype B, is redistributing NXS/T sequons within the molecule with specific selection for NXS sequons. The subtle evolution of sequons in gp120 may have implications in viral resistance and infection.  相似文献   

3.
Many eukaryotic proteins are modified by N-linked glycosylation, a process in which oligosaccharides are added to asparagine residues in the sequon Asn-X-Ser/Thr. However, not all such sequons are glycosylated. For example, rabies virus glycoprotein (RGP) contains three sequons, only two of which appear to be glycosylated in virions. To examine further the signals in proteins which regulate N-linked core glycosylation, the glycosylation efficiencies of each of the three sequons in the antigenic domain of RGP were compared. For these studies, mutants were generated in which one or more sequons were deleted by site-directed mutagenesis. Core glycosylation of these mutants was studied using two independent systems: 1) in vitro translation in rabbit reticulocyte lysate supplemented with dog pancreatic microsomes, and 2) transfection into glycosylation-deficient Chinese hamster ovary cells. Parallel results were obtained with both systems, demonstrating that the sequon at Asn37 is inefficiently glycosylated, the sequons at Asn247 and Asn319 are efficiently glycosylated, and the glycosylation efficiency of each sequon is not influenced by glycosylation at other sequons in this protein. High levels of cell surface expression of RGP in Chinese hamster ovary cells are seen with any mutant containing an intact sequon at Asn247 or Asn319, whereas low levels of cell surface expression are seen when the sequon at Asn37 is present alone; deletion of all three sequons completely blocks RGP cell surface expression. Thus, although core glycosylation at Asn37 is inefficient, it is still sufficient to support a biological function, cell surface expression. Future studies using mutagenesis of this model protein and its expression in these two well defined systems will aim to begin to unravel the rules governing core glycosylation of glycoproteins.  相似文献   

4.
Yeast invertase contains 14 sequons, all of which are glycosylated to varying degrees except for sequon 5 which is marginally glycosylated, if at all. This sequon overlaps with sequon 4 in a sequence consisting of Asn92-Asn93-Thr94-Ser95(Reddy et al., 1988, J. Biol. Chem., 263, 6978-6985). To determine whether glycosylation at Asn93is sterically hindered by the oligosaccharide on Asn92, the latter amino acid was converted to a glutamine residue by site-directed mutagenesis of the SUC2 gene in a plasmid vector which was expressed in Saccharomyces cerevisiae. A glycopeptide encompassing sequons 3 through 6 was purified from a tryptic digest of the mutagenized invertase and sequenced by Edman degradation, which revealed that Asn93 of sequon 5 contained very little, if any, carbohydrate, despite the elimination of sequon 4. When Ser and Thr were inverted to yield Asn-Asn-Ser-Thr carbohydrate was associated primarily with the second sequon, in agreement with numerous studies indicating that Asn-X-Thr is preferred to Asn-X-Ser as an oligosaccharide acceptor. However, when the invertase overlapping sequons were converted to Asn-Asn-Ser-Ser, both sequons were clearly glycosylated, with the latter sequon predominating. These findings rule out steric hindrance as a factor involved in preventing the glycosylation of sequon 5 in invertase. Comparable results were obtained using an in vitro system with sequon-containing tri- and tetrapeptides acceptors, in addition to larger oligosaccharide acceptors.  相似文献   

5.
The SWISS-PROT protein sequence data bank contains at present nearly 75,000 entries, almost two thirds of which include the potential N-glycosylation consensus sequence, or sequon, NXS/T (where X can be any amino acid but proline) and thus may be glycoproteins. The number of proteins filed as glycoproteins is however considerably smaller, 7942, of which 749 have been characterized with respect to the total number of their carbohydrate units and sites of attachment of the latter to the protein, as well as the nature of the carbohydrate-peptide linking group. Of these well characterized glycoproteins, about 90% carry either N-linked carbohydrate units alone or both N- and O-linked ones, attached at 1297 N-glycosylation sites (1.9 per glycoprotein molecule) and the rest are O-glycosylated only. Since the total number of sequons in the well characterized glycoproteins is 1968, their rate of occupancy is 2/3. Assuming that the same number of N-linked units and rate of sequon occupancy occur in all sequon containing proteins and that the proportion of solely O-glycosylated proteins (ca. 10%) will also be the same as among the well characterized ones, we conclude that the majority of sequon containing proteins will be found to be glycosylated and that more than half of all proteins are glycoproteins.  相似文献   

6.
Campylobacter jejuni is unusual among bacteria in possessing a eukaryotic-like system for N-linked protein glycosylation at Asn residues in sequons of the type Asp/Glu-Xaa-Asn-Xaa-Ser/Thr. However, little is known about the structural context of the glycosylated sequons, limiting the design of novel recombinant glycoproteins. To obtain more information on sequon structure, we have determined the crystal structure of the PEB3 (Cj0289c) dimer. PEB3 has the class II periplasmic-binding protein fold, with each monomer having two domains with a ligand-binding site containing citrate located between them, and overall resembles molybdate- and sulfate-binding proteins. The sequon around Asn90 is located within a surface-exposed loop joining two structural elements. The three key residues are well exposed on the surface; hence, they may be accessible to the PglB oligosaccharyltransferase in the folded state.  相似文献   

7.
Human and simian immunodeficiency viruses (HIV and SIV), influenza virus, and hepatitis C virus (HCV) have heavily glycosylated, highly variable surface proteins. Here we explore N-linked glycosylation site (sequon) variation at the population level in these viruses, using a new Web-based program developed to facilitate the sequon tracking and to define patterns (www.hiv.lanl.gov). This tool allowed rapid visualization of the two distinctive patterns of sequon variation found in HIV-1, HIV-2, and SIV CPZ. The first pattern (fixed) describes readily aligned sites that are either simply present or absent. These sites tend to be occupied by high-mannose glycans. The second pattern (shifting) refers to sites embedded in regions of extreme local length variation and is characterized by shifts in terms of the relative position and local density of sequons; these sites tend to be populated by complex carbohydrates. HIV, with its extreme variation in number and precise location of sequons, does not have a net increase in the number of sites over time at the population level. Primate lentiviral lineages have host species-dependent levels of sequon shifting, with HIV-1 in humans the most extreme. HCV E1 and E2 proteins, despite evolving extremely rapidly through point mutation, show limited sequon variation, although two shifting sites were identified. Human influenza A hemagglutinin H3 HA1 is accumulating sequons over time, but this trend is not evident in any other avian or human influenza A serotypes.  相似文献   

8.
Despite having provided the first example of a prokaryal glycoprotein, little is known of the rules governing the N-glycosylation process in Archaea. As in Eukarya and Bacteria, archaeal N-glycosylation takes place at the Asn residues of Asn-X-Ser/Thr sequons. Since not all sequons are utilized, it is clear that other factors, including the context in which a sequon exists, affect glycosylation efficiency. As yet, the contribution to N-glycosylation made by sequon-bordering residues and other related factors in Archaea remains unaddressed. In the following, the surroundings of Asn residues confirmed by experiment as modified were analyzed in an attempt to define sequence rules and requirements for archaeal N-glycosylation.  相似文献   

9.
10.
Rabies glycoprotein (RGP(WT)) contains N-glycosylation sequons at Asn(37), Asn(247), and Asn(319), although Asn(37) is not efficiently glycosylated. To examine N-glycan processing at Asn(247) and Asn(319), full-length glycosylation mutants, RGP(-2-) and RGP(--3), were expressed, and Endo H sensitivity was compared. When the Asn(247) sequon is present alone in RGP(-2-), 90% of its N-glycans are high-mannose type, whereas only 35% of the N-glycans at Asn(319) in RGP(--3) are high-mannose. When both sequons are present in RGP(-23), 87% of the N-glycans are of complex type. The differing patterns of Endo H sensitivity at sequons present individually or together suggests that glycosylation of one sequon affects glycosylation at another, distant sequon. To explore this further, we constructed soluble forms of RGP: RGP(WT)T441His and RGP(--3)T441His. Tryptic glycopeptides from these purified secreted proteins were isolated by HPLC and characterized by a 3D oligosaccharide mapping technique. RGP(WT)T441His had fucosylated, bi- and triantennary complex type glycans at Asn(247) and Asn(319). However, Asn(247) had half as many neutral glycans, more monosialylated glycans, and fewer disialylated glycans when compared with Asn(319). Moreover, when comparing the N-glycans at Asn(319) on RGP(--3)T441His and RGP(WT)T441His, the former had 30% more neutral, 28% more monosialylated, and 33% fewer disialylated glycans. This suggests that the N-glycan at Asn(247) allows additional N-glycan processing to occur at Asn(319), yielding more heavily sialylated bi- and triantennary forms. The mechanism(s) by which glycosylation at one sequon influences N-glycan processing at a distant sequon on the same glycoprotein remains to be determined.  相似文献   

11.
Roy Jefferis 《MABS-AUSTIN》2013,5(5):638-640
The glycoform profile of a glycoprotein is non-templated, i.e., is not encoded within the genome or otherwise predetermined; however, it is estimated that ~50% of human genes having an open reading frame encode a –N-X-S/T- amino acid sequence, where X represents any amino acid other than proline, that comprises a potential site (sequon) for N-linked glycosylation of the translated protein. N-linked glycosylation is both a co- and post-translational modification. The complex oligosaccharide GlcNAc2Man9Glu3 may be added at a –N-X-S/T- sequon as the polypeptide chain emerges from the ribosome tunnel. Local secondary structure determines whether oligosaccharide is added and the extent of addition. Higher occupancy is observed for –N-X-T- sequons than at –N-X-S- sequons, and the efficiency of addition can be further influenced by adjacent amino acid residues.  相似文献   

12.
13.
Human protein C (hPC) is glycosylated at three Asn‐X‐Ser/Thr and one atypical Asn‐X‐Cys sequons. We have characterized the micro‐ and macro‐heterogeneity of plasma‐derived hPC and compared the glycosylation features with recombinant protein C (tg‐PC) produced in a transgenic pig bioreactor from two animals having approximately tenfold different expression levels. The N‐glycans of hPC are complex di‐ and tri‐sialylated structures, and we measured 78% site occupancy at Asn‐329 (the Asn‐X‐Cys sequon). The N‐glycans of tg‐PC are complex sialylated structures, but less branched and partially sialylated. The porcine mammary epithelial cells glycosylate the Asn‐X‐Cys sequon with a similar efficiency as human hepatocytes even at these high expression levels, and site occupancy at this sequon was not affected by expression level. A distinct bias for particular structures was present at each of the four glycosylation sites for both hPC and tg‐PC. Interestingly, glycans with GalNAc in the antennae were predominant at the Asn‐329 site. The N‐glycan structures found for tg‐PC are very similar to those reported for a recombinant Factor IX produced in transgenic pig milk, and similar to the endogenous milk protein lactoferrin, which may indicate that N‐glycan processing in the porcine mammary epithelial cells is more uniform than in other tissues.  相似文献   

14.
The addition of N-linked oligosaccharides to Asn-X-(Ser/Thr) sites is catalyzed by the oligosaccharyltransferase, an enzyme closely associated with the translocon and generally thought to have access only to nascent chains as they emerge from the ribosome. However, the presence of the sequon does not automatically ensure core glycosylation because many proteins contain sequons that remain either nonglycosylated or glycosylated to a variable extent. In this study, hepatitis C virus (HCV) envelope protein E1 was used as a model to study the efficiency of N-glycosylation. HCV envelope proteins, E1 and E2, were released from a polyprotein precursor after cleavage by host signal peptidase(s). When expressed alone, E1 was not efficiently glycosylated. However, E1 glycosylation was improved when expressed as a polyprotein including full-length or truncated forms of E2. These data indicate that glycosylation of E1 is dependent on the presence of polypeptide sequences located downstream of E1 on HCV polyprotein.  相似文献   

15.
We recently reported statistical analysis of structural data on glycosidic linkages. Here we extend this analysis to the glycan-protein linkage, and the peptide primary, secondary, and tertiary structures around N-glycosylation sites. We surveyed 506 glycoproteins in the Protein Data Bank crystallographic database, giving 2592 glycosylation sequons (1683 occupied) and generated a database of 626 nonredundant sequons with 386 occupied. Deviations in the expected amino acid composition were seen around occupied asparagines, particularly an increased occurrence of aromatic residues before the asparagine and threonine at position +2. Glycosylation alters the asparagine side chain torsion angle distribution and reduces its flexibility. There is an elevated probability of finding glycosylation sites in which secondary structure changes. An 11-class taxonomy was developed to describe protein surface geometry around glycosylation sites. Thirty-three percent of the occupied sites are on exposed convex surfaces, 10% in deep recesses and 20% on the edge of grooves with the glycan filling the cleft. A surprisingly large number of glycosylated asparagine residues have a low accessibility. The incidence of aromatic amino acids brought into close contact with the glycan by the folding process is higher than their normal levels on the surface or in the protein core. These data have significant implications for control of sequon occupancy and evolutionary selection of glycosylation sites and are discussed in relation to mechanisms of protein fold stabilization and regional quality control of protein folding. Hydrophobic protein-glycan interactions and the low accessibility of glycosylation sites in folded proteins are common features and may be critical in mediating these functions.  相似文献   

16.
Database analysis of O-glycosylation sites in proteins   总被引:3,自引:0,他引:3       下载免费PDF全文
Statistical analysis was carried out to study the sequential aspects of amino acids around the O-glycosylated Ser/Thr. 992 sequences containing O-glycosylated Ser/Thr were selected from the O-GLYCBASE database of O-glycosylated proteins. The frequency of occurrence of amino acid residues around the glycosylated Ser/Thr revealed that there is an increased number of proline residues around the O-glycosylation sites in comparison with the nonglycosylated serine and threonine residues. The deviation parameter calculated as a measure of preferential and nonpreferential occurrence of amino acid residues around the glycosylation site shows that Pro has the maximum preference around the O-glycosylation site. Pro at +3 and/or -1 positions strongly favors glycosylation irrespective of single and multiple glycosylation sites. In addition, serine and threonine are preferred around the multiple glycosylation sites due to the effect of clusters of closely spaced glycosylated Ser/Thr. The preference of amino acids around the sites of mucin-type glycosylation is found likely to be similar to that of the O-glycosylation sites when taken together, but the acidic amino acids are more preferred around Ser/Thr in mucin-type glycosylation when compared totally. Aromatic amino acids hinder O-glycosylation in contrast to N-glycosylation. Cysteine and amino acids with bulky side chains inhibit O-glycosylation. The preference of certain potential sequence motifs of glycosylation has been discussed.  相似文献   

17.
Asn-linked glycosylation is the most ubiquitous posttranslational protein modification in eukaryotes and archaea, and in some eubacteria. Oligosaccharyltransferase (OST) catalyzes the transfer of preassembled oligosaccharides on lipid carriers onto asparagine residues in polypeptide chains. Inefficient oligosaccharide transfer results in glycoprotein heterogeneity, which is particularly bothersome in pharmaceutical glycoprotein production. Amino acid variation at the X position of the Asn-X-Ser/Thr sequon is known to modulate the glycosylation efficiency. The best amino acid at X is valine, for an archaeal Pyrococcus furiosus OST. We performed a systematic alanine mutagenesis study of the archaeal OST to identify the essential and dispensable amino acid residues in the three catalytic motifs. We then investigated the effects of the dispensable mutations on the amino acid preference in the N-glycosylation sequon. One residue position was found to selectively affect the amino acid preference at the X position. This residue is located within the recently identified DXXKXXX(M/I) motif, suggesting the involvement of this motif in N-glycosylation sequon recognition. In applications, mutations at this position may facilitate the design of OST variants adapted to particular N-glycosylation sites to reduce the heterogeneity of glycan occupancy. In fact, a mutation at this position led to 9-fold higher activity relative to the wild-type enzyme, toward a peptide containing arginine at X in place of valine. This mutational approach is potentially applicable to eukaryotic and eubacterial OSTs for the production of homogenous glycoproteins in engineered mammalian and Escherichia coli cells.  相似文献   

18.
Thermoplasma acidophilum is a thermoacidophilic archaeon that grows optimally at pH 2 and 59°C. This extremophile is remarkable by the absence of a cell wall or an S-layer. Treating the cells with Triton X-100 at pH 3 allowed the extraction of all of the cell surface glycoproteins while keeping cells intact. The extracted glycoproteins were partially purified by cation-exchange chromatography, and we identified five glycoproteins by N-terminal sequencing and mass spectrometry of in-gel tryptic digests. These glycoproteins are positive for periodic acid-Schiff staining, have a high content of Asn including a large number in the Asn-X-Ser/Thr sequon and have apparent masses that are 34-48% larger than the masses deduced from their amino acid sequences. The pooled glycoproteins were digested with proteinase K and the purified glycopeptides were analyzed by NMR. Structural determination showed that the carbohydrate part was represented by two structures in nearly equal amounts, differing by the presence of one terminal mannose residue. The larger glycan chain consists of eight residues: six hexoses, one heptose and one sugar with an unusual residue mass of 226 Da which was identified as 6-deoxy-6-C-sulfo-D-galactose (6-C-sulfo-D-fucose). Mass spectrometry analyses of the peptides obtained by trypsin and chymotrypsin digestion confirmed the principal structures to be those determined by NMR and identified 14 glycopeptides derived from the main glycoprotein, Ta0280, all containing the Asn-X-Ser/Thr sequons. Thermoplasma acidophilum appears to have a "general" protein N-glycosylation system that targets a number of cell surface proteins.  相似文献   

19.
A novel neural-network-based model has been developed for the prediction of N-linked glycosylation characteristics related to glycosylation site-occupancy. Intracellular oligosaccharide transfer to a polypeptide is known to be either robust or dependent upon culture conditions during pharmaceutical production. This glycan attachment is classified by the model as robust or variable and is based on an input of the polypeptide primary sequence around the site of glycosylation. The glycosylation model utilizes multiple recurrent neural networks followed by a perceptron classifier. The input length of the polypeptide chain around the site of glycosylation (glycosylation window) was optimized through multiple independent training sessions. Incorporation of five residues prior (n - 5) to the site of glycosylation (n) and four residues beyond (n + 4) the glycan attachment site led to optimal network performance. The size of the glycosylation window for site-occupancy determination is much larger than has been previously reported. This model was developed to evaluate the effects of theoretical polypeptide mutations on glycosylation site-occupancy characteristics. Following correct prediction of the model testing data set, 20 independent networks were used to predict site-occupancy characteristics of wild-type and mutants of the rabies virus glycoprotein (rgp). Simulation results strongly correlated with previously published experimental results (Kasturi, L.; Hegang, C.; Shakin-Eshleman, S. H. Regulation of N-linked core glycosylation: use of a site-directed mutagenesis approach to identify Asn-Xaa-Ser/Thr sequons that are poor oligosacchride acceptors. Biochem. J. 1997, 323, 415-419. Mellquist, J. L.; Kasturi, L.; Spitalnik, S. L.; Shakin-Eshleman, S. H. The amino acid following an Asn-X-Ser/Thr sequon is an important determinant of N-linked core glycosylation efficiency. Biochemistry 1998, 37, 6833-6837). Further simulations on purely theoretical sequences suggested that influences of charged residues were a subset of multiple mechanisms in the determination of glycosylation site-occupancy.  相似文献   

20.
External invertase is the product of the SUC2 gene of Saccharomyces cerevisiae. The deduced sequence of this enzyme (Taussig, R., and Carlson, M. (1983) Nucleic Acid Res. 11, 1943-1954) reveals it to contain 14 potential N-linked glycosylation sites, or sequons, although only 9-10 appear to be glycosylated (Trimble, R. B., and Maley, F. (1977) J. Biol. Chem. 252, 4409-4412). To determine the location of the glycosylated sequons, external invertase was deglycosylated with endo-beta-acetylglucosaminidase H and its component peptides analyzed by both fast atom bombardment mass spectrometry (FABMS) and classical peptide isolation procedures. By use of the former technique most of the glucosamine-containing sequons could be located and by the latter sufficient amounts of small glucosamine-containing peptides were isolated to enable their quantitation. From the combined FABMS and glucosamine analyses, it was established that eight of the sequons in a subunit of invertase are either completely or almost completely glycosylated, while five others are glycosylated to the extent of about 50% or less. In the case of two overlapping sequons (4 and 5), which include Asn92-Asn93-Thr-Ser, only the first Asn was glycosylated. Thus, all but one of the sequons of external invertase are glycosylated to some extent, giving an appearance of only 9-10 N-linked oligosaccharides/subunit. The sequence identity of both external and internal invertase was verified by FABMS and by peptide sequence analysis. In only one site was an amino acid found to differ from that deduced from the DNA sequence of the SUC2 gene. This occurred at position 390 where a proline was found in place of alanine, which could result from a single base change in the triplet specifying the latter amino acid.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号