首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 562 毫秒
1.
Amino acid changes due to non-synonymous variation are included as annotations for individual proteins in UniProtKB/Swiss-Prot and RefSeq which present biological data in a protein-or gene-centric fashion. Unfortunately, proteome-wide analysis of non-synonymous singlenucleotide variations (nsSNVs) is not easy to perform because information on nsSNVs and functionally important sites are not well integrated both within and between databases and their search engines. We have developed SNVDis that allows evaluation of proteome-wide nsSNV distribution in functional sites, domains and pathways. More specifically, we have integrated human-specific data from major variation databases (UniProtKB, dbSNP and COSMIC), comprehensive sequence feature annotation from UniProtKB, Pfam, RefSeq, Conserved Domain Database (CDD) and pathway information from Protein ANalysis THrough Evolutionary Relationships (PANTHER) and mapped all of them in a uniform and comprehensive way to the human reference proteome provided by UniProtKB/Swiss-Prot. Integrated information of active sites, pathways, binding sites, domains, which are extracted from a number of different sources, provides a detailed overview of how nsSNVs are distributed over the human proteome and pathways and how they intersect with functional sites of proteins. Additionally, it is possible to find out whether there is an over-or under-representation of nsSNVs in specific domains, pathways or user-defined protein lists. The underlying datasets are updated once every 3 months. SNVDis is freely available at http://hive.biochemistry.gwu.edu/tool/snvdis.  相似文献   

2.
The asparagine-X-serine/threonine (NXS/T) motif, where X is any amino acid except proline, is the consensus motif for N-linked glycosylation. Significant numbers of high-resolution crystal structures of glycosylated proteins allow us to carry out structural analysis of the N-linked glycosylation sites (NGS). Our analysis shows that there is enough structural information from diverse glycoproteins to allow the development of rules which can be used to predict NGS. A Python-based tool was developed to investigate asparagines implicated in N-glycosylation in five species: Homo sapiens, Mus musculus, Drosophila melanogaster, Arabidopsis thaliana and Saccharomyces cerevisiae. Our analysis shows that 78% of all asparagines of NXS/T motif involved in N-glycosylation are localized in the loop/turn conformation in the human proteome. Similar distribution was revealed for all the other species examined. Comparative analysis of the occurrence of NXS/T motifs not known to be glycosylated and their reverse sequence (S/TXN) shows a similar distribution across the secondary structural elements, indicating that the NXS/T motif in itself is not biologically relevant. Based on our analysis, we have defined rules to determine NGS. Using machine learning methods based on these rules we can predict with 93% accuracy if a particular site will be glycosylated. If structural information is not available the tool uses structural prediction results resulting in 74% accuracy. The tool was used to identify glycosylation sites in 108 human proteins with structures and 2247 proteins without structures that have acquired NXS/T site/s due to non-synonymous variation. The tool, Structure Feature Analysis Tool (SFAT), is freely available to the public at http://hive.biochemistry.gwu.edu/tools/sfat.  相似文献   

3.
In the Gram-negative bacterium Campylobacter jejuni there is a pgl (protein glycosylation) locus-dependent general N-glycosylation system of proteins. One of the proteins encoded by pgl locus, PglB, a homolog of the eukaryotic oligosaccharyltransferase component Stt3p, is proposed to function as an oligosaccharyltransferase in this prokaryotic system. The sequence requirements of the acceptor polypeptide for N-glycosylation were analyzed by reverse genetics using the reconstituted glycosylation of the model protein AcrA in Escherichia coli. As in eukaryotes, the N-X-S/T sequon is an essential but not a sufficient determinant for N-linked protein glycosylation. This conclusion was supported by the analysis of a novel C. jejuni glycoprotein, HisJ. Export of the polypeptide to the periplasm was required for glycosylation. Our data support the hypothesis that eukaryotic and bacterial N-linked protein glycosylation are homologous processes.  相似文献   

4.
Identification of non-synonymous single nucleotide variations (nsSNVs) has exponentially increased due to advances in Next-Generation Sequencing technologies. The functional impacts of these variations have been difficult to ascertain because the corresponding knowledge about sequence functional sites is quite fragmented. It is clear that mapping of variations to sequence functional features can help us better understand the pathophysiological role of variations. In this study, we investigated the effect of nsSNVs on more than 17 common types of post-translational modification (PTM) sites, active sites and binding sites. Out of 1 705 285 distinct nsSNVs on 259 216 functional sites we identified 38 549 variations that significantly affect 10 major functional sites. Furthermore, we found distinct patterns of site disruptions due to germline and somatic nsSNVs. Pan-cancer analysis across 12 different cancer types led to the identification of 51 genes with 106 nsSNV affected functional sites found in 3 or more cancer types. 13 of the 51 genes overlap with previously identified Significantly Mutated Genes (Nature. 2013 Oct 17;502(7471)). 62 mutations in these 13 genes affecting functional sites such as DNA, ATP binding and various PTM sites occur across several cancers and can be prioritized for additional validation and investigations.  相似文献   

5.
N-Linked glycosylation is a post-translational event whereby carbohydrates are added to secreted proteins at the consensus sequence Asn-Xaa-Ser/Thr, where Xaa is any amino acid except proline. Some consensus sequences in secreted proteins are not glycosylated, indicating that consensus sequences are necessary but not sufficient for glycosylation. In order to understand the structural rules for N-linked glycosylation, we introduced N-linked consensus sequences by site-directed mutagenesis into the polypeptide chain of the recombinant human erythropoietin molecule. Some regions of the polypeptide chain supported N-linked glycosylation more effectively than others. N-Linked glycosylation was inhibited by an adjacent proline suggesting that sequence context of a consensus sequence could affect glycosylation. One N-linked consensus sequence (Asn123-Thr125) introduced into a position close to the existing O-glycosylation site (Ser126) had an additional O-linked carbohydrate chain and not an additional N-linked carbohydrate chain suggesting that structural requirements in this region favored O-glycosylation over N-glycosylation. The presence of a consensus sequence on the protein surface of the folded molecule did not appear to be a prerequisite for oligosaccharide addition. However, it was noted that recombinant human erythropoietin analogs that were hyperglycosylated at sites that were normally buried had altered protein structures. This suggests that carbohydrate addition precedes polypeptide folding.  相似文献   

6.
The Campylobacter jejuni pgl gene cluster encodes a complete N-linked protein glycosylation pathway that can be functionally transferred into Escherichia coli. In this system, we analyzed the interplay between N-linked glycosylation, membrane translocation and folding of acceptor proteins in bacteria. We developed a recombinant N-glycan acceptor peptide tag that permits N-linked glycosylation of diverse recombinant proteins expressed in the periplasm of glycosylation-competent E. coli cells. With this "glycosylation tag," a clear difference was observed in the glycosylation patterns found on periplasmic proteins depending on their mode of inner membrane translocation (i.e., Sec, signal recognition particle [SRP], or twin-arginine translocation [Tat] export), indicating that the mode of protein export can influence N-glycosylation efficiency. We also established that engineered substrate proteins targeted to environments beyond the periplasm, such as the outer membrane, the membrane vesicles, and the extracellular medium, could serve as substrates for N-linked glycosylation. Taken together, our results demonstrate that the C. jejuni N-glycosylation machinery is compatible with distinct secretory mechanisms in E. coli, effectively expanding the N-linked glycome of recombinant E. coli. Moreover, this simple glycosylation tag strategy expands the glycoengineering toolbox and opens the door to bacterial synthesis of a wide array of recombinant glycoprotein conjugates.  相似文献   

7.
Although posttranslational protein modifications are generally thought to perform important cellular functions, recent studies showed that a large fraction of phosphorylation sites are not evolutionarily conserved. Whether the same is true for other protein modifications, such as N-glycosylation is an open question. N-glycosylation is a form of cotranslational and posttranslational modification that occurs by enzymatic addition of a polysaccharide, or glycan, to an asparagine (N) residue of a protein. Examining a large set of experimentally determined mouse N-glycosylation sites, we find that the evolutionary rate of glycosylated asparagines is significantly lower than that of nonglycosylated asparagines of the same proteins. We further confirm that the conservation of glycosylated asparagines is accompanied by the conservation of the canonical motif sequence for glycosylation, suggesting that the above substitution rate difference is related to glycosylation. Interestingly, when solvent accessibility is considered, the substitution rate disparity between glycosylated and nonglycosylated asparagines is highly significant at solvent accessible sites but not at solvent inaccessible sites. Thus, although the solvent inaccessible glycosylation sites were experimentally identified, they are unlikely to be genuine or physiologically important. For solvent accessible asparagines, our analysis reveals a widespread and strong functional constraint on glycosylation, unlike what has been observed for phosphorylation sites in most studies, including our own analysis. Because the majority of N-glycosylation occurs at solvent accessible sites, our results show an overall functional importance for N-glycosylation.  相似文献   

8.
Genome‐wide association studies (GWAS) and whole‐exome sequencing (WES) generate massive amounts of genomic variant information, and a major challenge is to identify which variations drive disease or contribute to phenotypic traits. Because the majority of known disease‐causing mutations are exonic non‐synonymous single nucleotide variations (nsSNVs), most studies focus on whether these nsSNVs affect protein function. Computational studies show that the impact of nsSNVs on protein function reflects sequence homology and structural information and predict the impact through statistical methods, machine learning techniques, or models of protein evolution. Here, we review impact prediction methods and discuss their underlying principles, their advantages and limitations, and how they compare to and complement one another. Finally, we present current applications and future directions for these methods in biological research and medical genetics.  相似文献   

9.
Tie JK  Zheng MY  Pope RM  Straight DL  Stafford DW 《Biochemistry》2006,45(49):14755-14763
The vitamin K-dependent carboxylase is an integral membrane protein which is required for the post-translational modification of a variety of vitamin K-dependent proteins. Previous studies have suggested carboxylase is a glycoprotein with N-linked glycosylation sites. In this study, we identify the N-glycosylation sites of carboxylase by mass spectrometric peptide mapping analyses combined with site-directed mutagenesis. Our mass spectrometric results show that the N-linked glycosylation in carboxylase occurs at positions N459, N550, N605, and N627. Eliminating these glycosylation sites by changing asparagine to glutamine caused the mutant carboxylase to migrate faster on SDS-PAGE gels, adding further evidence that these sites are glycosylated. In addition, the mutation studies identified N525, a site that cannot be recovered by mass spectroscopy analysis, as a glycosylation site. Furthermore, the potential glycosylation site at N570 is glycosylated only if all five natural glycosylation sites are simultaneously mutated. Removal of the oligosaccharides by glycosidase from wild-type carboxylase or by elimination of the functional glycosylation sites by site-directed mutagenesis did not affect either the carboxylation or epoxidation activity when the small FLEEL pentapeptide was used as a substrate, suggesting that N-linked glycosylation is not required for the enzymatic function of carboxylase. In contrast, when site N570 and the five natural glycosylation sites were mutated simultaneously, the resulting carboxylase protein was degraded. Our results suggest that N-linked glycosylation is not essential for carboxylase enzymatic activity but is important for protein folding and stability.  相似文献   

10.
Neuroligins (NLs) are a family of transmembrane proteins that function in synapse formation and/or remodeling by interacting with beta-neurexins (beta-NXs) to form heterophilic cell adhesions. The large N-terminal extracellular domain of NLs, required for beta-NX interactions, has sequence homology to the alpha/beta hydrolase fold superfamily of proteins. By peptide mapping and mass spectrometric analysis of a soluble recombinant form of NL1, several structural features of the extracellular domain have been established. Of the nine cysteine residues in NL1, eight are shown to form intramolecular disulfide bonds. Disulfide pairings of Cys 117 to Cys 153 and Cys 342 to Cys 353 are consistent with disulfide linkages that are conserved among the family of alpha/beta hydrolase proteins. The disulfide bond between Cys 172 and Cys 181 occurs within a region of the protein encoded by an alternatively spliced exon. The disulfide pairing of Cys 512 and Cys 546 in NL1 yields a structural motif unique to the NLs, since these residues are highly conserved. The potential N-glycosylation sequons in NL1 at Asn 109, Asn 303, Asn 343, and Asn 547 are shown occupied by carbohydrate. An additional consensus sequence for N-glycosylation at Asn 662 is likely occupied. Analysis of N-linked oligosaccharide content by mass matching paradigms reveals significant microheterogeneous populations of complex glycosyl moieties. In addition, O-linked glycosylation is observed in the predicted stalk region of NL1, prior to the transmembrane spanning domain. From predictions based on sequence homology of NL1 to acetylcholinesterase and the molecular features of NL1 established from mass spectrometric analysis, a novel topology model for NL three-dimensional structure has been constructed.  相似文献   

11.
Protein glycosylation in bacteria: sweeter than ever   总被引:1,自引:0,他引:1  
Investigations into bacterial protein glycosylation continue to progress rapidly. It is now established that bacteria possess both N-linked and O-linked glycosylation pathways that display many commonalities with their eukaryotic and archaeal counterparts as well as some unexpected variations. In bacteria, protein glycosylation is not restricted to pathogens but also exists in commensal organisms such as certain Bacteroides species, and both the N-linked and O-linked glycosylation pathways can modify multiple proteins. Improving our understanding of the intricacies of bacterial protein glycosylation systems should lead to new opportunities to manipulate these pathways in order to engineer glycoproteins with potential value as novel vaccines.  相似文献   

12.
Envelope proteins E1 and E2 of the hepatitis C virus (HCV) play a major role in the life cycle of a virus. These proteins are the main components of the virion and are involved in virus assembly. Envelope proteins are modified by N-linked glycosylation, which is supposed to play a role in their stability, in the assembly of the functional glycoprotein heterodimer, in protein folding, and in viral entry. The effects of N-linked glycosylation of HCV protein E1 on the assembly of structural proteins were studied using site-directed mutagenesis in a model system of Sf9 insect cells producing three viral structural proteins with the formation of virus-like particles due to the baculovirus expression system. The removal of individual N-glycosylation sites in HCV protein E1 did not affect the efficiency of its expression in insect Sf9 cells. The electrophoretic mobility of E1 increased with a decreasing number of N-glycosylation sites. The destruction of E1 glycosylation sites N1 or N5 influenced the assembly of the noncovalent E1E2 glycoprotein heterodimer, which is the prototype of the natural complex within the HCV virion. It was also shown that the lack of glycans at E1 sites N1 and N5 significantly reduced the efficiency of E1 expression in mammalian HEK293 T cells.  相似文献   

13.
ANTXR 1 and 2, also known as TEM8 and CMG2, are two type I membrane proteins, which have been extensively studied for their role as anthrax toxin receptors, but with a still elusive physiological function. Here we have analyzed the importance of N-glycosylation on folding, trafficking and ligand binding of these closely related proteins. We find that TEM8 has a stringent dependence on N-glycosylation. The presence of at least one glycan on each of its two extracellular domains, the vWA and Ig-like domains, is indeed necessary for efficient trafficking to the cell surface. In the absence of any N-linked glycans, TEM8 fails to fold correctly and is recognized by the ER quality control machinery. Expression of N-glycosylation mutants reveals that CMG2 is less vulnerable to sugar loss. The absence of N-linked glycans in one of the extracellular domains indeed has little impact on folding, trafficking or receptor function of the wild type protein expressed in tissue culture cells. N-glycans do, however, seem required in primary fibroblasts from human patients. Here, the presence of N-linked sugars increases the tolerance to mutations in cmg2 causing the rare genetic disease Hyaline Fibromatosis Syndrome. It thus appears that CMG2 glycosylation provides a buffer towards genetic variation by promoting folding of the protein in the ER lumen.  相似文献   

14.
The Campylobacter jejuni pgl locus encodes an N-linked protein glycosylation machinery that can be functionally transferred into Escherichia coli. In this system, we analyzed the elements in the C. jejuni N-glycoprotein AcrA required for accepting an N-glycan. We found that the eukaryotic primary consensus sequence for N-glycosylation is N terminally extended to D/E-Y-N-X-S/T (Y, X not equalP) for recognition by the bacterial oligosaccharyltransferase (OST) PglB. However, not all consensus sequences were N-glycosylated when they were either artificially introduced or when they were present in non-C. jejuni proteins. We were able to produce recombinant glycoproteins with engineered N-glycosylation sites and confirmed the requirement for a negatively charged side chain at position -2 in C. jejuni N-glycoproteins. N-glycosylation of AcrA by the eukaryotic OST in Saccharomyces cerevisiae occurred independent of the acidic residue at the -2 position. Thus, bacterial N-glycosylation site selection is more specific than the eukaryotic equivalent with respect to the polypeptide acceptor sequence.  相似文献   

15.
The addition of N-linked glycans to a protein is catalyzed by oligosaccharyltransferase, an enzyme closely associated with the translocon. N-glycans are believed to be transferred as the protein is being synthesized and cotranslationally translocated in the lumen of the endoplasmic reticulum. We used a mannosylphosphoryldolichol-deficient Chinese hamster ovary mutant cell line (B3F7 cells) to study the temporal regulation of N-linked core glycosylation of hepatitis C virus envelope protein E1. In this cell line, truncated Glc(3)Man(5)GlcNAc(2) oligosaccharides are transferred onto nascent proteins. Pulse-chase analyses of E1 expressed in B3F7 cells show that the N-glycosylation sites of E1 are slowly occupied until up to 1 h after protein translation is completed. This posttranslational glycosylation of E1 indicates that the oligosaccharyltransferase has access to this protein in the lumen of the endoplasmic reticulum for at least 1 h after translation is completed. Comparisons with the N-glycosylation of other proteins expressed in B3F7 cells indicate that the posttranslational glycosylation of E1 is likely due to specific folding features of this acceptor protein.  相似文献   

16.
Nonglycosylated murine and human granulocyte-macrophage colony-stimulating factor have a molecular mass of approximately 14.5 kDa predicted from the primary amino acid sequence. The expression of both proteins in COS cells leads to a heterogeneous population of molecules that differ in the degree of glycosylation. Both human and murine molecules contain two N-linked glycosylation sites that are situated in nonhomologous locations along the linear sequence. Despite this difference both proteins show a similar size distribution among the glycosylation variants. These studies analyze the effects of introducing in the murine protein novel N-linked glycosylation sites corresponding to those sites found in the human molecule. A panel of molecules composed of various combinations of human N-linked glycosylation sites in either the presence or the absence of murine N-linked glycosylation was compared. Substitution of a proper human N-linked glycosylation consensus sequence at Asn 24 did not result in N-linked glycosylation, nor was there any considerable effect on bioactivity. Replacement of the N-linked glycosylation consensus sequence at Asn 34 results in glycosylation similar to that found in the human molecule and causes a significant decrease in bioactivity. These data suggest that the position of N-linked glycosylation is critical for maximal bioactivity in a particular species and that the changes in position of these sites in different species probably occurred during evolution in response to changes in their receptors.  相似文献   

17.
Shen J  Deininger PL  Zhao H 《Cytokine》2006,35(1-2):62-66
Understanding the functions of single nucleotide polymorphisms (SNPs) can greatly help to understand the genetics of the human phenotype variation and especially the genetic basis of human complex diseases. However, how to identify functional SNPs from a pool containing both functional and neutral SNPs is challenging. In this study, we analyzed the genetic variations that can alter the expression and function of a group of cytokine proteins using computational tools. As a result, we extracted 4552 SNPs from 45 cytokine proteins from SNPper database. Of particular interest, 828 SNPs were in the 5'UTR region, 961 SNPs were in the 3' UTR region, and 85 SNPs were non-synonymous SNPs (nsSNPs), which cause amino acid change. Evolutionary conservation analysis using the SIFT tool suggested that 8 nsSNPs may disrupt the protein function. Protein structure analysis using the PolyPhen tool suggested that 5 nsSNPs might alter protein structure. Binding motif analysis using the UTResource tool suggested that 27 SNPs in 5' or 3'UTR might change protein expression levels. Our study demonstrates the presence of naturally occurring genetic variations in the cytokine proteins that may affect their expressions and functions with possible roles in complex human disease, such as immune diseases.  相似文献   

18.
We have explored the structure, function, and membrane topography of enzymes that recognize dolichols and participate in glycosylation pathways in the endoplasmic reticulum. Enzymes that interact with dolichols, including dolichyl phosphate mannose (Dol-P-Man) synthase and UDP-GlcNAc:Dol-P-transferase, revealed a conserved amino acid sequence in membrane-spanning regions. The consensus is Phe-Ile/Val-Xaa-Phe/Try-Xaa-Xaa-Ile-Pro-Phe-Xaa-Phe/Tyr, and we propose it is involved in dolichol recognition. We have used yeast mutants to demonstrate the role of dolichols in three glycosylation pathways. At its nonpermissive temperature, a Dol-P-Man synthase mutant (dpm1) was blocked in N-glycosylation, O-mannosylation, and glycosyl phosphoinositol membrane anchoring of protein, most likely because Dol-P-Man serves as mannosyl donor in all three pathways. The secretion mutant sec59 has a similar phenotype to dpm1, and the presence of a dolichol recognition sequence in the SEC59 protein gave a clue to its defect, which is in dolichol kinase. Comparison of yeast glycosylation mutant suggests that the ability to carry out N-glycosylation alone is sufficient to allow yeast to secrete glycoproteins and that an N-linked saccharide of a minimum size must be attached to proteins for cells to be able to secrete them and maintain a functional secretory pathway.  相似文献   

19.
Carbohydrates comprise about 50% of the mass of gp120, the external envelope glycoprotein of simian immunodeficiency virus (SIV) and human immunodeficiency virus. We identified 11 replication-competent derivatives of SIVmac239 lacking two, three, four, or five potential sites for N-linked glycosylation. These sites were located within and around variable regions 1 and 2 of the surface envelope protein of the virus. Asn (AAT) of the canonical N-linked glycosylation recognition sequence (Asn X Ser/Thr) was changed in each case to the structurally similar Gln (CAG or CAA) such that two nucleotide changes in the codon would be required for reversion. Replication of one triple mutant (g456), however, was severely impaired. A revertant of the g456 mutant was recovered from CEMx174 cells with a Met-to-Val compensatory substitution at position 144, 2 amino acids upstream of attachment site 5. Thus, a debilitating loss of sites for N-linked glycosylation can be compensated for by amino acid changes not involving the Asn-X-Ser/Thr consensus motif. These results provide a framework to begin testing the hypothesis that carbohydrates form a barrier that can limit the humoral immune responses to the virus.  相似文献   

20.
N-linked glycosylation has a profound effect on the proper folding, oligomerization and stability of glycoproteins. These glycans impart many properties to proteins that may be important for their proper functioning, besides having a tendency to exert a chaperone-like effect on them. Certain glycosylation sites in a protein however, are more important than other sites for their function and stability. It has been observed that some N-glycosylation sites are conserved over families of glycoproteins over evolution, one such being the tyrosinase related protein family. The role of these conserved N-glycosylation sites in their trafficking, sorting, stability and activity has been examined here. By scrutinizing the different glycosylation sites on this family of glycoproteins it was inferred that different sites in the same family of polypeptides can perform distinct functions and conserved sites across the paralogues may perform diverse functions.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号