首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 828 毫秒
1.
Hydroxyproline (Hyp) O-glycosylation characterizes the hydroxyproline-rich glycoprotein (HRGP) superfamily of the plant extracellular matrix. Hyp glycosylation occurs in two modes: Arabinosylation adds short oligoarabinosides (Hyp-arabinosides) while galactosylation leads to the addition of larger arabinogalactan polysaccharides (Hyp-polysaccharides). We hypothesize that sequence-dependent glycosylation of small peptide motifs results in glycomodules. These small functional units in combination with other repetitive peptide modules define the properties of HRGPs. The Hyp contiguity hypothesis predicts arabinosylation of contiguous Hyp residues and galactosylation of clustered noncontiguous Hyp residues. To determine the minimum level of Hyp contiguity that directs arabinosylation, we designed a series of synthetic genes encoding repetitive (Ser-Pro(2))(n), (Ser-Pro(3))(n), and (Ser-Pro(4))(n). A signal sequence targeted these endogenous substrates to the endoplasmic reticulum/Golgi for post-translational proline hydroxylation and glycosylation in transformed Nicotiana tabacum cells. The fusion glycoproteins also contained green fluorescence protein, facilitating their detection and isolation. The (Ser-Pro(2))(n) and (Ser-Hyp(4))(n) fusion glycoproteins yielded Hyp-arabinosides but no Hyp-polysaccharide. The motif (Ser-Pro(3))(n) was incompletely hydroxylated, yielding mixed contiguous/noncontiguous Hyp and a corresponding mixture of Hyp-arabinosides and Hyp-polysaccharides. These results plus circular dichroic spectra of the glycosylated and deglycosylated (Ser-Pro(2))(n), (Ser-Pro(3))(n), and (Ser-Pro(4))(n) modules corroborate the Hyp contiguity hypothesis and indicate that Hyp O-glycosylation is indeed sequence-driven.  相似文献   

2.
Most aspects of plant growth involve cell surface hydroxyproline (Hyp)-rich glycoproteins (HRGPs) whose properties depend on arabinogalactan polysaccharides and arabinosides that define the molecular surface. Potential glycosylation sites are defined by an O-Hyp glycosylation code: contiguous Hyp directs arabinosylation. Clustered non-contiguous Hyp directs arabinogalactosylation. Elucidation of this code involved a single species, tobacco (Nicotiana tabacum) BY-2 cells. However, recent work suggests species variation, perhaps tissue specific Hyp glycosylation. Thus, the extent to which the Hyp glycosylation code is 'global' needs testing. We compared the ability of distantly related Arabidopsis cell cultures to process putative HRGP glycosylation motifs encoded by synthetic genes. The genes included: repetitive Ser-Pro, Ser-Pro2, Ser-Pro4 and an analog of the tomato arabinogalactan-protein, LeAGP-1DeltaGPI. All were expressed as enhanced green fluorescent protein (EGFP) fusion glycoproteins, designated: AtSO-EGFP (O=Hyp), AtSO2-EGFP, AtSO4-EGFP and AtEGFP-LeAGP-1DeltaGPI, respectively. The Arabidopsis glycosylation patterns were essentially similar to those observed in Nicotiana: non-contiguous Hyp residues in AtSO-EGFP were glycosylated exclusively with arabinogalactan polysaccharides while contiguous Hyp in AtSO2-EGFP and AtSO4-EGFP was exclusively arabinosylated. Mixed contiguous and non-contiguous Hyp residues in AtEGFP-LeAGP-1DeltaGPI were also arabinosylated and arabinogalactosylated consistent with the code. However, slightly more arabinogalactosylated Hyp and less non-glycosylated Hyp in AtEGFP-LeAGP-1DeltaGPI than tobacco NtEGFP-LeAGP-1DeltaGPI suggested Arabidopsis prolyl hydroxylases have a slightly broader specificity. Arabidopsis Hyp-arabinogalactans differed from tobacco in decreased glucuronic acid content and lack of rhamnose. Yields of the EGFP fusion glycoproteins were dramatically higher than targeted EGFP lacking Hyp-glycomodules. This validates earlier suggestions that the glycosylation of proteins facilitates their secretion.  相似文献   

3.
Functional analysis of the hyperglycosylated arabinogalactan-proteins (AGPs) attempts to relate biological roles to the molecular properties that result largely from O-Hyp glycosylation putatively coded by the primary sequence. The Hyp contiguity hypothesis predicts contiguous Hyp residues as attachment sites for arabino-oligosaccharides (arabinosides) and clustered, non-contiguous Hyp residues as arabinogalactan polysaccharide sites. Although earlier tests of naturally occurring hydroxyproline-rich glycoproteins (HRGPs) and HRGPs designed by synthetic genes were consistent with a sequence-driven code, the predictive value of the hypothesis starting from the DNA sequences of known AGPs remained untested due to difficulties in purifying a single AGP for analysis. However, expression in tobacco (Nicotiana tabacum) of the major tomato (Lycopersicon esculentum) AGP, LeAGP-1, as an enhanced green fluorescent protein fusion glycoprotein (EGFP)-LeAGP-1, increased its hydrophobicity sufficiently for chromatographic purification from other closely related endogenous AGPs. We also designed and purified two variants of LeAGP-1 for future functional analysis: one lacking the putative glycosylphosphatidylinositol (GPI)-anchor signal sequence; the other lacking a 12-residue internal lysine-rich region. Fluorescence microscopy of plasmolysed cells confirmed the location of LeAGP-1 at the plasma membrane outer surface and in Hechtian threads. Hyp glycoside profiles of the fusion glycoproteins gave ratios of Hyp-polysaccharides to Hyp-arabinosides plus non-glycosylated Hyp consistent with those predicted from DNA sequences by the Hyp contiguity hypothesis. These results demonstrate a route to the purification of AGPs and the use of the Hyp contiguity hypothesis for predicting the Hyp O-glycosylation profile of an HRGP from its DNA sequence.  相似文献   

4.
Contemporary glycobiology reflects the intense interest in glycoproteins and their biological roles. Addition of saccharides by N- or O-glycosylation is precise rather than random and forms a uniquely interactive molecular surface. We designate these well conserved glycomotifs as glycomodules to emphasize their functional significance. Thus, elucidation of the glycosylation codes that determine saccharide addition is a significant goal. The focus here is on the Hyp O-glycosylation of cell wall proteins. This involves two consecutive posttranslational modifications, proline hydroxylation and glycosylation. Peptide sequence rather than conformation seems to determine these modifications. Hyp glycosylation occurs in two distinct modes: Hyp arabinosylation and Hyp galactosylation. The Hyp contiguity hypothesis predicts arabinosylation of contiguous Hyp residues and galactosylation of clustered non-contiguous Hyp. Elucidation of Hyp glycosylation codes involves the design and expression of putative glycomotifs as simple repetitive peptides. Thus, repetitive (Ser-Hyp), directed Hyp galactosylation resulting in the exclusive addition of arabinogalactan polysaccharide to all the non-contiguous Hyp residues. and a new AGP. Another repetitive peptide from gum arabic glycoprotein, containing both contiguous and non-contiguous Hyp, directed both modes of Hyp glycosylation. Furthermore, expression of the (Ser-Hypx)n series confirmed the arabinosylation of contiguous Hyp. Thus, the Hyp contiguity hypothesis is a useful predictive tool in the functional genomics toolbox.  相似文献   

5.
O-Linked glycans vary between eukaryotic cell types and play an important role in determining a glycoprotein's properties, including stability, target recognition, and potentially immunogenicity. We describe O-linked glycan structures of a recombinant human IgA1 (hIgA1) expressed in transgenic maize. Up to six proline/hydroxyproline conversions and variable amounts of arabinosylation (Pro/Hyp + Ara) were found in the hinge region of maize-expressed hIgA1 heavy chain (HC) by using a combination of matrix-assisted laser-desorption ionization mass spectrometry (MALDI MS), chromatography, and amino acid analysis. Approximately 90% of hIgA1 was modified in this way. An average molar ratio of six Ara units per molecule of hIgA1 was revealed. Substantial sequence similarity was identified between the HC hinge region of hIgA1 and regions of maize extensin-family of hydroxyproline-rich glycoproteins (HRGP). We propose that because of this sequence similarity, the HC hinge region of maize-expressed hIgA1 can become a substrate for posttranslational conversion of Pro to Hyp by maize prolyl-hydroxylase(s) with the subsequent arabinosylation of the Hyp residues by Hyp-glycosyltransferase(s) in the Golgi apparatus in maize endosperm tissue. The observation of up to six Pro/Hyp hydroxylations combined with extensive arabinosylation in the hIgA1 HC hinge region is well in agreement with the Pro/Hyp hydroxylation model and the Hyp contiguity hypothesis suggested earlier in literature for plant HRGP. For the first time, the extensin-like Hyp/Pro conversion and O-linked arabinosylation are described for a recombinant therapeutic protein expressed in transgenic plants. Our findings are of significance to the field of plant biotechnology and biopharmaceutical industry-developing transgenic plants as a platform for the production of recombinant therapeutic proteins.  相似文献   

6.
Graminaceous monocots generally contain low levels of hydroxyproline-rich Glycoproteins (HRGPs). As HRGPs are often at the cell surface, we used the intact cell elution technique (100 millimolar AlCl3) to isolate soluble surface proteins from Zea mays cell suspension cultures. Further fractionation of the trichloroacetic acid-soluble eluate on the cation exchangers phospho-cellulose and BioRex-70 gave several retarded, hence presumably basic fractions, which also contained hydroxyproline (Hyp). One of these fractions yielded a pure HRGP after a final purification step involving Superose-6 gel filtration. As this HRGP was unusually rich in threonine, (25 mole%) we designated it as a threonine-hydroxyproline-rich glycoprotein (THRGP); it contained about 27% carbohydrate occurring exclusively as arabinosylated Hyp, predominantly as the monosaccharide (15%), and trisaccharide (25%) with 48% Hyp nonglycosylated—a characteristically graminaceous monocot profile. Amino acid analysis confirmed the basic character, and gave a low alanine content. Reaction with Yariv artificial antigen was negative. These characteristics show that the THRGP is not an arabinogalactan protein. On the other hand, antibodies raised against tomato extensin P1 cross-reacted significantly with the THRGP; this cross-reactivity and the above analytical data provide the best evidence to date for the presence of extensin in a graminaceous monocot.  相似文献   

7.
Gum arabic glycoprotein (GAGP) is a large molecular weight, hydroxyproline-rich arabinogalactan-protein (AGP) component of gum arabic. GAGP has a simple, highly biased amino acid composition indicating a repetitive polypeptide backbone. Previous work (Qi, W., Fong, C., Lamport, D.T.A., 1991. Plant Physiology 96, 848), suggested small (approximately 11 residue) repetitive peptide motifs each with three Hyp-arabinoside attachment sites and a single Hyp-arabinogalactan polysaccharide attachment site. We tested that hypothesis by sequence analysis of the GAGP polypeptide after HF-deglycosylation. A family of closely related peptides confirmed the presence of a repetitive 19-residue consensus motif. However, the motif: Ser-Hyp-Hyp-Hyp-Thr-Leu-Ser-Hyp-Ser- Hyp-Thr-Hyp-Thr-Hyp-Hyp-Leu-Gly-Pro-His, was about twice the size anticipated. Thus, judging by Hyp-glycoside profiles of GAGP, the consensus motif contained six Hyp-arabinosides rather than three and two Hyp-polysaccharides rather than one. We inferred the glycosylation sites based on the Hyp contiguity hypothesis which predicts arabinosides on contiguous Hyp residues and arabinogalactan polysaccharides on clustered non-contiguous Hyp residues, i.e. the GAGP motif would consist of arabinosylated contiguous Hyp blocks flanking two central Hyp-polysaccharides. We predict this rigidifies the glycoprotein, enhances the overall symmetry of the glycopeptide motif, and may explain some of the remarkable properties of gum arabic.  相似文献   

8.
A synthetic gene encoding the fusion protein (Ala-Hyp)(51)-enhanced green fluorescent protein expressed in Nicotiana tabacum cells produced a fusion glycoprotein with all proline residues hydroxylated and substituted with an arabinogalactan polysaccharide. Alkaline hydrolysis of the fusion glycoprotein yielded a population of hydroxyproline (Hyp)-arabinogalactan polysaccharides ranging in size from 13 to 26 saccharide residues/Hyp, with a median size of 15-17 residues. We isolated a 15-residue Hyp-arabinogalactan for structure determination by sugar analyses and one- and two-dimensional nuclear magnetic resonance techniques that provided the assignment of proton and carbon signals of a small polysaccharide O-linked to the hydroxyl group of Hyp. The polysaccharide consisted of a 1,3-linked beta-D-Galp backbone with a single 1,6-linked beta-D-Galp "kink." The backbone had two side chains of Galp substituted at position 3 with an arabinose di- or trisaccharide and at position 6 with glucuronic acid or rhamnosyl glucuronic acid. Energy-minimized space-filling molecular models showed hydrogen bonding within polysaccharides attached to repetitive Ala-Hyp and also between polysaccharides and the peptide backbone. Polysaccharides distorted the peptide Ramachandran angles consistent with the circular dichroic spectra of isolated (Ala-Hyp)(51) and its reversion to a polyproline II-like helix after deglycosylation. This first complete structure of a Hyp-arabinogalactan polysaccharide shows that computer-based molecular modeling of Hyp-rich glycoproteins is now feasible and supports the suggestion that small repetitive subunits comprise larger arabinogalactan polysaccharides.  相似文献   

9.
Many plant gums, such as gum arabic, contain hydroxyproline-rich glycoproteins (HRGPs), which are also abundant components of the plant cell extracellular matrix. Here we expressed in transgenic BY2 Nicotiana tabacum (tobacco) cells, a synthetic gene encoding a novel HRGP-based gum, designated gum arabic-8 or (GA)(8). (GA)(8) encoded eight repeats of the consensus polypeptide sequence of gum arabic glycoprotein (GAGP): Gly-Pro-His-Ser-Pro-Pro-Pro-Pro-Leu-Ser-Pro-Ser-Pro-Thr-Pro-Thr-Pro-Pro-Leu, in which most of the Pro residues were posttranslationally modified to hydroxyproline (Hyp). (GA)(8) was expressed as a green fluorescent protein (GFP) fusion protein targeted to the culture medium, (GA)(8)GFP. The culture of the transgenic cells in a 5-L bioreactor showed that the production of (GA)(8)GFP was cell growth-associated. The extracellular yield of (GA)(8)GFP was 116.8 mg/L after 14 days of culture and accounted for 87% of the total fusion protein expressed. (GA)(8)GFP was purified from the culture medium by a combination of hydrophobic interaction, gel permeation, and reversed phase chromatography. Biochemical characterization indicated that the amino acid composition of the (GA)(8) module, after removal of GFP by proteolysis, was virtually identical to that of predicted by the GAGP consensus sequence and that carbohydrate, which occurred as arabinogalactan polysaccharides and small oligoarabinosides O-linked through the Hyp residues, accounted for 84% of the molecules' dry weight. Functional assays showed that (GA)(8) exhibited low viscosity in aqueous solution similar to native GAGP. However, neither GFP alone nor the (GA)(8) module could emulsify orange oil. However, the fusion protein (GA)(8)GFP possessed 1.28-fold better emulsification properties than native GAGP. This work demonstrates the feasibility and potential of a synthetic gene approach to the de novo design of novel glycoprotein-based gums and emulsifiers.  相似文献   

10.
The hydroxyproline-rich root nodules of legumes provide a microaerobic niche for symbiotic nitrogen-fixing Rhizobacteria. The contributions of the cell wall and associated structural proteins, particularly the hydroxyproline-rich glycoproteins (HRGPs), are therefore of interest. Our approach involved identification of the protein components by direct chemical analysis of the insoluble wall. Chymotryptic peptide mapping showed a "P3-type" extensin containing the highly arabinosylated Ser-Hyp4-Ser-Hyp-Ser-Hyp4-Tyr3-Lys motif as a major component. Cell wall amino acid analyses and quantitative hydroxyproline arabinoside profiles, predominantly of tri- and tetraarabinosides, confirmed this extensin as the major structural protein in the cell walls of both root nodules and uninfected roots. On the other hand, judging from the Pro, Glu and non-glycosylated Hyp content, the nodule-specific proline-rich glycoproteins, such as the early nodulins (ENOD-PRPs), are present in much lesser amounts. Although we isolated no PRP peptides from nodule cell walls, a single PRP peptide from root cell walls confirmed the presence of a PRP in roots and represented the first direct evidence for a crosslinked PRP in muro. Compared with root cell walls (approximately 7% protein dry weight) nodule cell walls contained significantly more protein (approximately 13% dry weight) with an overall amino acid and peptide composition indicating the presence of structural protein unrelated to the HRGPs.  相似文献   

11.
F. M. Klis  H. Eeltink 《Planta》1979,144(5):479-484
The arabinosylation patterns of wall-bound hydroxyproline in Phaseolus vulgaris L. cell suspension cultures were determined by separating free hydroxyproline and hydroxyproline-arabinose oligomers over a Bio-Gel P-2 column. Total hydroxyproline accounted for about 3.3% of wall dry weight during all growth phases of batch-cultured bean cells. The chemical arabinosylation patterns of wall-bound hydroxyproline varied during the lag phase and early log phase of the culture. First, an increase in nonglycosylated hydroxyproline occurred accompanied by a corresponding decrease in hydroxyproline tetra-arabinoside. During the early log phase the reverse happened. In later stages of growth the chemical arabinosylation patterns remained constant. The radiochemical arabinosylation patterns were also determined, after pulselabeling the cultures with [14C]proline at various times during growth, to be able to distinguish recently incorporated hydroxyproline. The time course of the arabinosylation pattern of this fraction indicated that the initial changes in the chemical pattern were due to the temporary incorporation of less extensively glycosylated hydroxyproline-containing protein into the cell wall.Abbreviations Hyp hydroxyproline - HAn hydroxyproline arabinoside - with n arabinosyl residues - TFA trifluoroacetic acid  相似文献   

12.
Many secretory and several vacuolar proteins in higher plants contain hydroxylated proline residues. In many cases, hydroxyprolines in proteins are glycosylated with either arabinogalactan or oligoarabinose. We have previously shown that a sporamin precursor is O-glycosylated at the hydroxylated proline 36 residue with an arabinogalactan-type glycan when this protein is expressed in tobacco BY-2 cells (Matsuoka et al., 1995). Taking advantage of the fact that this is the only site of proline hydroxylation and glycosylation in sporamin, we analyzed the amino acid requirement for proline hydroxylation and arabinogalactosylation. We expressed several deletion constructs and many amino acid substitution mutants in tobacco cells and analyzed glycosylation and proline hydroxylation of the expressed sporamins. Hydroxylation of a proline residue requires the five amino acid sequence [AVSTG]-Pro-[AVSTGA]-[GAVPSTC]-[APS or acidic] (where Pro is the modification site) and glycosylation of hydroxyproline (Hyp) requires the seven amino acid sequence [not basic]-[not T]-[neither P, T, nor amide]-Hyp-[neither amide nor P]-[not amide]-[APST], although charged amino acids at the -2 position and basic amide residues at the +1 position relative to the modification site seem to inhibit the elongation of the arabinogalactan side chain. Based on the combination of these two requirements, we concluded that the sequence motif for efficient arabinogalactosylation, including the elongation of the glycan side chain, is [not basic]-[not T]-[AVSG]-Pro-[AVST]-[GAVPSTC]-[APS].  相似文献   

13.
Extensins are cell wall hydroxyproline-rich glycoproteins that form covalent networks putatively involving tyrosyl and lysyl residues in cross-links catalyzed by one or more extensin peroxidases. The precise cross-links remain to be chemically identified both as network components in muro and as enzymic products generated in vitro with native extensin monomers as substrates. However, some extensin monomers contain variations within their putative cross-linking motifs that complicate cross-link identification. Other simpler extensins are recalcitrant to isolation including the ubiquitous P3-type extensin whose major repetitive motif, Hyp)(4)-Ser-Hyp-Ser-(Hyp)(4)-Tyr-Tyr-Tyr-Lys, is of particular interest, not least because its Tyr-Tyr-Tyr intramolecular isodityrosine cross-link motifs are also putative candidates for further intermolecular cross-linking to form di-isodityrosine. Therefore, we designed a set of extensin analogs encoding tandem repeats of the P3 motif, including Tyr --> Phe and Lys --> Leu variations. Expression of these P3 analogs in Nicotiana tabacum cells yielded glycoproteins with virtually all Pro residues hydroxylated and subsequently arabinosylated and with likely galactosylated Ser residues. This was consistent with earlier analyses of P3 glycopeptides isolated from cell wall digests and the predictions of the Hyp contiguity hypothesis. The tyrosine-rich P3 analogs also contained isodityrosine, formed in vivo. Significantly, these isodityrosine-containing analogs were further cross-linked in vitro by an extensin peroxidase to form the tetra-tyrosine intermolecular cross-link amino acid di-isodityrosine. This is the first identification of an inter-molecular cross-link amino acid in an extensin module and corroborates earlier suggestions that di-isodityrosine represents one mechanism for cross-linking extensins in muro.  相似文献   

14.
The green alga Chlamydomonas reinhardtii elaborates two biochemically and morphologically distinct cell walls during its life cycle: one surrounds the vegetative and gametic cell and the other encompasses the zygote. Hydroxyproline-rich glycoproteins (HRGPs) constitute a major component of both walls. We describe the isolation and characterization of a zygote-specific gene encoding a wall HRGP. The derived amino acid sequence of this algal HRGP is similar to those of higher plant extensins, rich in proline and serine residues and possessing repeating amino acid motifs, notably X(Pro)3 and (Ser-Pro)n. Antiserum against this zygote wall protein detected common epitopes in several other zygote polypeptides, at least one of which is also encoded by a zygote-specific gene. We conclude that there is one set of HRGP wall genes expressed only in zygotes and another set that is specific to vegetative and gametic cells.  相似文献   

15.
Hydroxyproline-rich glycoproteins (HRGPs) are a superfamily of plant cell wall proteins that function in diverse aspects of plant growth and development. This superfamily consists of three members: hyperglycosylated arabinogalactan proteins (AGPs), moderately glycosylated extensins (EXTs), and lightly glycosylated proline-rich proteins (PRPs). Hybrid and chimeric versions of HRGP molecules also exist. In order to “mine” genomic databases for HRGPs and to facilitate and guide research in the field, the BIO OHIO software program was developed that identifies and classifies AGPs, EXTs, PRPs, hybrid HRGPs, and chimeric HRGPs from proteins predicted from DNA sequence data. This bioinformatics program is based on searching for biased amino acid compositions and for particular protein motifs associated with known HRGPs. HRGPs identified by the program are subsequently analyzed to elucidate the following: (1) repeating amino acid sequences, (2) signal peptide and glycosylphosphatidylinositol lipid anchor addition sequences, (3) similar HRGPs via Basic Local Alignment Search Tool, (4) expression patterns of their genes, (5) other HRGPs, glycosyl transferase, prolyl 4-hydroxylase, and peroxidase genes coexpressed with their genes, and (6) gene structure and whether genetic mutants exist in their genes. The program was used to identify and classify 166 HRGPs from Arabidopsis (Arabidopsis thaliana) as follows: 85 AGPs (including classical AGPs, lysine-rich AGPs, arabinogalactan peptides, fasciclin-like AGPs, plastocyanin AGPs, and other chimeric AGPs), 59 EXTs (including SP5 EXTs, SP5/SP4 EXTs, SP4 EXTs, SP4/SP3 EXTs, a SP3 EXT, “short” EXTs, leucine-rich repeat-EXTs, proline-rich extensin-like receptor kinases, and other chimeric EXTs), 18 PRPs (including PRPs and chimeric PRPs), and AGP/EXT hybrid HRGPs.The genomics era has produced vast amounts of biological data that await examination. In order to “mine” such data effectively, a bioinformatics approach can be utilized to identify genes of interest, subject them to various in silico analyses, and extract relevant biological information on them from various public databases. Examination of such data produces novel insights with respect to the genes in question and can be used to facilitate and guide further research in the field. Such is the case here, where bioinformatics tools were developed to identify, classify, and analyze members of the Hyp-rich glycoprotein (HRGP) superfamily encoded by the Arabidopsis (Arabidopsis thaliana) genome.HRGPs are a superfamily of plant cell wall proteins that are subdivided into three families, arabinogalactan proteins (AGPs), extensins (EXTs), and Pro-rich proteins (PRPs), and extensively reviewed (Showalter, 1993; Kieliszewski and Lamport, 1994; Nothnagel, 1997; Cassab, 1998; José-Estanyol and Puigdomènech, 2000; Seifert and Roberts, 2007). However, it has become increasingly clear that the HRGP superfamily is perhaps better represented as a spectrum of molecules ranging from the highly glycosylated AGPs to the moderately glycosylated EXTs and finally to the lightly glycosylated PRPs. Moreover, hybrid HRGPs, composed of HRGP modules from different families, and chimeric HRGPs, composed of one or more HRGP modules within a non-HRGP protein, also can be considered part of the HRGP superfamily. Given that many HRGPs are composed of repetitive protein sequences, particularly the EXTs and PRPs, and many have low sequence similarity to one another, particularly the AGPs, BLAST searches typically identify only a few closely related family members and do not represent a particularly effective means to identify members of the HRGP superfamily in a comprehensive manner.Building upon the work of Schultz et al. (2002) that focused on the AGP family, a new bioinformatics software program, BIO OHIO, developed at Ohio University, makes it possible to search all 28,952 proteins encoded by the Arabidopsis genome and identify putative HRGP genes. Two distinct types of searches are possible with this program. First, the program can search for biased amino acid compositions in the genome-encoded protein sequences. For example, classical AGPs can be identified by their biased amino acid compositions of greater then 50% Pro (P), Ala (A), Ser (S), and Thr (T), as indicated by greater than 50% PAST. Similarly, arabinogalactan peptides (AG peptides) are identified by biased amino acid compositions of greater then 35% PAST, but the protein (i.e. peptide) must also be between 50 and 90 amino acids in length. Likewise, PRPs can be identified by a biased amino acid composition of greater then 45% PVKCYT. Second, the program can search for specific amino acid motifs that are commonly found in known HRGPs. For example, SP4 pentapeptide and SP3 tetrapeptide motifs are associated with EXTs, a fasciclin H1 motif is found in fasciclin-like AGPs (FLAs), and PPVX(K/T) (where X is any amino acid) and KKPCPP motifs are found in several known PRPs (Fowler et al., 1999). In addition to searching for HRGPs, the program can analyze proteins identified by a search. For example, the program checks for potential signal peptide sequences and glycosylphosphatidylinositol (GPI) plasma member anchor addition sequences, both of which are associated with HRGPs (Showalter, 1993, 2001; Youl et al., 1998; Sherrier et al., 1999; Svetek et al., 1999). Moreover, the program can identify repeated amino acid sequences within the sequence and has the ability to search for bias amino acid compositions within a sliding window of user-defined size, making it possible to identify HRGP domains within a protein sequence.Here, we report on the use of this bioinformatics program in identifying, classifying, and analyzing members of the HRGP superfamily (i.e. AGPs, EXTs, PRPs, hybrid HRGPs, and chimeric HRGPs) in the genetic model plant Arabidopsis. An overview of this bioinformatics approach is presented in Figure 1. In addition, public databases and programs were accessed and utilized to extract relevant biological information on these HRGPs in terms of their expression patterns, most similar sequences via BLAST analysis, available genetic mutants, and coexpressed HRGP, glycosyl transferase (GT), prolyl 4-hydroxylase (P4H), and peroxidase genes in Arabidopsis. This information provides new insight to the HRGP superfamily and can be used by researchers to facilitate and guide further research in the field. Moreover, the bioinformatics tools developed here can be readily applied to protein sequences from other species to analyze their HRGPs or, for that matter, any given protein family by altering the input parameters.Open in a separate windowFigure 1.Bioinformatics workflow diagram summarizing the identification, classification, and analysis of HRGPs (AGPs, EXTs, and PRPs) in Arabidopsis. Classical AGPs were defined as containing greater than 50% PAST coupled with the presence of AP, PA, SP, and TP repeats distributed throughout the protein, Lys-rich AGPs were a subgroup of classical AGPs that included a Lys-rich domain, and chimeric AGPs were defined as containing greater than 50% PAST coupled with the localized distribution of AP, PA, SP, and TP repeats. AG peptides were defined to be 50 to 90 amino acids in length and containing greater than 35% PAST coupled with the presence of AP, PA, SP, and TP repeats distributed throughout the peptide. FLAs were defined as having a fasciclin domain coupled with the localized distribution of AP, PA, SP, and TP repeats. Extensins were defined as containing two or more SP3 or SP4 repeats coupled with the distribution of such repeats throughout the protein; chimeric extensins were similarly identified but were distinguished from the extensins by the localized distribution of such repeats in the protein; and short extensins were defined to be less than 200 amino acids in length coupled with the extensin definition. PRPs were identified as containing greater than 45% PVKCYT or two or more KKPCPP or PVX(K/T) repeats coupled with the distribution of such repeats and/or PPV throughout the protein. Chimeric PRPs were similarly identified but were distinguished from PRPs by the localized distribution of such repeats in the protein. Hybrid HRGPs (i.e. AGP/EXT hybrids) were defined as containing two or more repeat units used to identify AGPs, extensins, or PRPs. The presence of a signal peptide was used to provide added support for the identification of an HRGP but was not used in an absolute fashion. Similarly, the presence of a GPI anchor addition sequence was used to provide added support for the identification of classical AGPs and AG peptides, which are known to contain such sequences. BLAST searches were also used to provide some support to our classification if the query sequence showed similarity to other members of an HRGP subfamily. Note that some AGPs, particularly chimeric AGPs, and PRPs were identified from an Arabidopsis database annotation search and that two chimeric extensins were identified from the primary literature as noted in the text.  相似文献   

16.
Art v 1, the major allergen of mugwort (Artemisia vulgaris) pollen contains galactose and arabinose. As the sera of some allergic patients react with natural but not with recombinant Art v 1 produced in bacteria, the glycosylation of Art v 1 may play a role in IgE binding and human allergic reactions. Chemical and enzymatic degradation, mass spectrometry, and 800 MHz (1)H and (13)C nuclear magnetic resonance spectroscopy indicated the proline-rich domain to be glycosylated in two ways. We found a large hydroxyproline-linked arabinogalactan composed of a short beta1,6-galactan core, which is substituted by a variable number (5-28) of alpha-arabinofuranose residues, which form branched side chains with 5-, 2,5-, 3,5-, and 2,3,5-substituted arabinoses. Thus, the design of the Art v 1 polysaccharide differs from that of the well known type II arabinogalactans, and we suggest it be named type III arabinogalactan. The other type of glycosylation was formed by single (but adjacent) beta-arabinofuranoses linked to hydroxyproline. In contrast to the arabinosylation of Ser-Hyp(4) motifs in other hydroxyproline-rich glycoproteins, such as extensins or solanaceous lectins, no oligo-arabinosides were found in Art v 1. Art v 1 and parts thereof produced by alkaline degradation, chemical deglycosylation, proteolytic degradation, and/or digestion with alpha-arabinofuranosidase were used in enzyme-linked immunosorbent assay and immunoblot experiments with rabbit serum and with the sera of patients. Although we could not observe antibody binding by the polysaccharide, the single hydroxyproline-linked beta-arabinose residues appeared to react with the antibodies. Mono-beta-arabinosylated hydroxyproline residues thus constitute a new, potentially cross-reactive, carbohydrate determinant in plant proteins.  相似文献   

17.
AtAGP17, AtAGP18 and AtAGP19 are homologous genes encoding three putative glycosylphosphatidylinositol (GPI)-anchored classical arabinogalactan-proteins (AGPs) in Arabidopsis. They are distinguished from other AGPs by a short, C-terminal lysine-rich region. Organ-specific expression of these genes was revealed by Northern blot analysis. AtAGP17 was strongly expressed in leaves and stems, and weakly expressed in flowers and roots; AtAGP18 was strongly expressed in flowers, and moderately expressed in roots, stems and young leaves; and AtAGP19 was strongly expressed in stems, moderately expressed in flowers and roots, and weakly expressed in young leaves. One of these genes, AtAGP17, was expressed and purified as a green fluorescent protein (GFP) fusion protein in transgenic tobacco cells using hydrophobic interaction chromatography, size exclusion chromatography and reverse phase high-performance liquid chromatography. The fusion (glyco)protein produced a characteristic AGP 'smear' with a molecular mass of 80-150 kDa when detected by Western blot analysis. Glycosyl composition and linkage analyses of purified GFP-AtAGP17 showed that carbohydrate accounted for approximately 86% of the molecule, with arabinose and galactose as major, and rhamnose and glucuronic acid as minor glycosyl residues and with 1,3,6-galactose, 1,4-glucuronic acid, 1,3-galactose and terminal arabinose as major linkages. GFP-AtAGP17 was also precipitated by beta-Yariv reagent, further confirming that AtAGP17 is a bona fide AGP. Confocal fluorescence microscopy of plasmolysed, transformed cells indicated that AtAGP17 is localized on the plasma membrane and in Hechtian strands. Hydroxyproline (Hyp) glycoside profiles of GFP-AtAGP17 in conjunction with the deduced protein sequence also served to corroborate the Hyp contiguity hypothesis, which predicts contiguous Hyp residues as attachment sites for arabinosides and clustered, non-contiguous Hyp residues as attachment sites for arabinogalactan polysaccharides.  相似文献   

18.
Proteins are important actors in plant cell walls because they contribute to their architecture and their dynamics. Among them, hydroxyproline (Hyp)-rich glycoproteins constitute a complex family of O-glycoproteins with various structures and functions. In this study, we characterized an atypical Hyp-rich glycoprotein, AGP31 (arabinogalactan protein 31), which displays a multidomain organization unique in Arabidopsis thaliana, consisting of a short arabinogalactan protein (AGP) motif, a His stretch, a Pro-rich domain, and a C-terminal PAC (PRP-AGP containing Cys) domain. The use of various mass spectrometry strategies was innovative and powerful: it permitted us to locate Hyp residues, to demonstrate the presence of carbohydrates, and to refine their distribution over the Pro-rich domain. Most Hyp were isolated within repeated motifs such as KAOV, KSOV, K(PO/OP)T, K(PO/OP)V, T(PO/OP)V, and Y(PO/OP)T. A few extensin-like motifs with contiguous Hyp (SOOA and SOOT) were also found. The Pro-rich domain was shown to carry Gal residues on isolated Hyp but also Ara residues. The existence of new type Hyp-O-Gal/Ara-rich motifs not recognized by the β-glucosyl Yariv reagent but interacting with the peanut agglutinin lectin was proposed. In addition, the N-terminal short AGP motif was assumed to be substituted by arabinogalactans. Altogether, AGP31 was found to be highly heterogeneous in cell walls because arabinogalactans could be absent, Hyp-O-Gal/Ara-rich motifs of different sizes were observed, and truncated forms missing the C-terminal PAC domain were found, suggesting degradation in muro and/or partial glycosylation prior to secretion.  相似文献   

19.
Intact cell elution of suspension cultures derived from Douglas fir, Pseudotsuga menziesii (Mirbel) Franco, yielded two extensin monomers, the first hydroxyproline-rich glycoproteins (HRGPs) to be isolated from a gymnosperm. These HRGPs resolved on Superose-6 gel filtration. The smaller monomer was compositionally similar to angiosperm extensins like tomato P1. The larger monomer had a simple composition reminiscent of repetitive proline-rich proteins (RPRPs) from soybean cell walls and contained proline, hydroxyproline, and sugar; hence designated a proline-hydroxyproline-rich glycoprotein (PHRGP). The simple composition of the PHRGP implied a periodic structure which was confirmed by the simple chymotryptic map and 45-residue partial sequence of the major proline-hydroxyproline-rich glycoprotein chymotryptide 5: Lys-Pro-Hyp-Val-Hyp-Val-Ile-Pro-Pro-Hyp-Val-Val-Lys-Pro-Hyp-Hyp-Val- Tyr-Lys-Pro-Hyp-Val-Hyp-Val-Ile-Pro-Pro-Hyp-Val-Val-Lys-Pro-Hyp-Hyp- Val-Tyr-Lys-Ile-Pro-Pro(Hyp)-Val-Ile-Lys-Pro. Proline-hydroxyproline-rich glycoprotein chymotryptide 5 contained an 18-residue tandem repeat devoid of tetra(hydroxy)-proline or serine; it also contained two instances of the five-residue motif Hyp-Hyp-Val-Tyr-Lys and five of the general Pro-Pro-X-X-Lys motif, thereby establishing its homology with typical angiosperm RPRPs and extensins from tomato, petunia, carrot, tobacco, sugar beet, and Phaseolus. Unlike the nonglycosylated soybean RPRP, the highly purified Douglas fir PHRGP was lightly glycosylated, confirmed by a quantitative hydroxyproline glycoside profile, indicating that extensins can range from highly glycosylated hydroxyproline to little or no glycosylated hydroxyproline. Comparison of extensin sequence data strongly indicates that a major determinant of hydroxyproline glycosylation specificity is hydroxyproline contiguity: extensins with tetrahydroxyproline blocks are very highly arabinosylated (>90% hydroxyproline glycosylated), tri- and dihydroxyproline are less so, and single hydroxyproline residues perhaps not at all. Despite high yields of extensins eluted from intact cells, the Douglas fir cell wall itself was hydroxyproline poor yet remarkably rich in protein (>20%), again emphasizing the existence of other structural cell wall proteins that are neither HRGPs nor glycine-rich proteins.  相似文献   

20.
In collagen, strands of the sequence XaaYaaGly form a triple-helical structure. The Yaa residue is often (2S,4R)-4-hydroxyproline (Hyp). The inductive effect of the hydroxyl group of Hyp residues greatly increases collagen stability. Here, electron withdrawal by the hydroxyl group in Hyp and its 4S diastereomer (hyp) is increased by the addition of an acetyl group or trifluoroacetyl group. The crystalline structures of AcHyp[C(O)CH3]OMe and Achyp[C(O)CH3]OMe are similar to those of AcHypOMe and AcProOMe, respectively. The O-acylation of AcHypOMe and AchypOMe increases the 13C chemical shift of its Cgamma atom: AcHyp[C(O)CF3]OMe congruent with Achyp[C(O)CF3]OMe > AcHyp[C(O)CH3]OMe congruent with Achyp[C(O)CH3]OMe > or = AcHypOMe congruent with AchypOMe. This increased inductive effect is not apparent in the thermodynamics or kinetics of amide bond isomerization. Despite apparently unfavorable steric interactions, (ProHypGly)(10), which is O-acylated with 10 acetyl groups, forms a triple helix that has intermediate stability: (ProHypGly)(10) > {ProHyp[C(O)CH3]Gly}(10) > (ProProGly)(10). Thus, the benefit to collagen stability endowed by the hydroxyl group of Hyp residues is largely retained by an acetoxyl group.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号