共查询到20条相似文献,搜索用时 15 毫秒
1.
P Broquet A Martin M J Peschard H Baubichon-Cortay M Serres-Guillaumond P Louisot 《The International journal of biochemistry》1987,19(7):653-656
Circadian variations of the acetylcholine muscarinic receptor and some glycosyltransferases were studied in brain using multivariate analysis. Highly significant correlations exist between fucosyltransferase, sialyltransferase and galactosyltransferase and to a lesser extent between both of these enzymes and acetylcholine receptor. No correlation appeared between these enzymes and dolichol phosphate mannose synthase. 相似文献
2.
Fold recognition was applied to the systematic analysis of the all sequences encoded by the genome of Mycoplasma tuberculosis H37Rv in order to identify new putative glycosyltransferases. The search was conducted against a library composed of all known crystal structures of glycosyltransferases and some related proteins. A clear relationship appeared between some sequences and some folds. It appears necessary to complete the fold recognition approach with a statistical approach in order to identify the relevant data above the background noise. Exploratory data analysis was carried out using several methods. Analytical methods confirmed the validity of the approach, while predictive methods, although very preliminary in the present case, allowed for identifying a number of sequences of interest that should be further investigated. This new approach of combining bioinformatics and chemometrics appears to be a powerful tool for analysis of newly sequenced genomes. Its application to glycobiology is of great interest. 相似文献
3.
Marlow AJ Fisher SE Francks C MacPhie IL Cherny SS Richardson AJ Talcott JB Stein JF Monaco AP Cardon LR 《American journal of human genetics》2003,72(3):561-570
Replication of linkage results for complex traits has been exceedingly difficult, owing in part to the inability to measure the precise underlying phenotype, small sample sizes, genetic heterogeneity, and statistical methods employed in analysis. Often, in any particular study, multiple correlated traits have been collected, yet these have been analyzed independently or, at most, in bivariate analyses. Theoretical arguments suggest that full multivariate analysis of all available traits should offer more power to detect linkage; however, this has not yet been evaluated on a genomewide scale. Here, we conduct multivariate genomewide analyses of quantitative-trait loci that influence reading- and language-related measures in families affected with developmental dyslexia. The results of these analyses are substantially clearer than those of previous univariate analyses of the same data set, helping to resolve a number of key issues. These outcomes highlight the relevance of multivariate analysis for complex disorders for dissection of linkage results in correlated traits. The approach employed here may aid positional cloning of susceptibility genes in a wide spectrum of complex traits. 相似文献
4.
OBJECTIVES: Multivariate tests for linkage can provide improved power over univariate tests but the type I error rates and comparative power of commonly used methods have not previously been compared. Here we studied the behavior of bivariate formulations of the variance component (VC) and Haseman-Elston (H-E) approaches. METHODS: We compared through simulation studies the bivariate H-E test with the unconstrained bivariate VC approach and with a VC approach in which the major-gene correlation is constrained to +/-1. We also compared these methods to univariate methods. RESULTS: Bivariate approaches are more powerful than univariate analyses unless the traits are very highly positively correlated. The power of the bivariate H-E test was less than the VC procedures. The constrained test was often less powerful than the unconstrained test. The empirical distributions of the bivariate H-E test and the unconstrained bivariate VC test conformed with asymptotic distributions for samples of 100 or more sibships of size 4. CONCLUSIONS: The unconstrained VC test is valuable for testing for preliminary linkages using multivariate phenotypes. The bivariate H-E test was less powerful than the bivariate VC tests. 相似文献
5.
Burkart MD Vincent SP Düffels A Murray BW Ley SV Wong CH 《Bioorganic & medicinal chemistry》2000,8(8):1937-1946
An effective procedure for the synthesis of 2-deoxy-2-fluoro-sugar nucleotides via Select fluor-mediated electrophilic fluorination of glycals with concurrent nucleophilic addition or chemo-enzymatic transformation has been developed, and the fluorinated sugar nucleotides have been used as probes for glycosyltransferases, including fucosyltransferase III, V, VI, and VII, and sialyl transferases. In general, these fluorinated sugar nucleotides act as competitive inhibitors versus sugar nucleotide substrates and form a tight complex with the glycosyltransferase. 相似文献
6.
Bioinformatics is a very powerful tool in the field of glycoproteomics as well as genomics and proteomics. As a part of the Glycogene Project (GG project), we have developed a novel bioinformatics system for the comprehensive identification and in silico cloning of human glycogenes. Using our system, a total of 105 candidate human glycogenes were identified and then engineered for heterologous expression. Of these candidates, 38 recombinant proteins were successfully identified for their enzyme activity and substrate specificity. We also classified 47 out of 60 carbohydrate-active enzyme glycosyltransferase families into 4 superfamilies using the profile Hidden Markov Model method. On the basis of our classification and the relationship between glycosylation pathways and superfamilies, we propose the evolution of glycosyltransferases. 相似文献
7.
Sugar nucleotide-dependent glycosyltransferases (GTs) are key enzymes that catalyze the formation of glycosidic bonds in nature. They have been increasingly applied in the synthesis of complex carbohydrates and glycoconjugates with or without in situ generation of sugar nucleotides. Human GTs are becoming more accessible and new bacterial GTs have been identified and characterized. An increasing number of crystal structures elucidated for GTs from mammalian and bacterial sources facilitate structure-based design of mutants as improved catalysts for synthesis. Automated platforms have also been developed for chemoenzymatic synthesis of carbohydrates. Recent progress in applying sugar nucleotide-dependent GTs in enzymatic and chemoenzymatic synthesis of mammalian glycans and glycoconjugates, bacterial surface glycans, and glycosylated natural products from bacteria and plants are reviewed. 相似文献
8.
9.
10.
Mehmood T Bohlin J Bråthen Kristoffersen A Sæbø S Warringer J Snipen L 《BMC bioinformatics》2012,13(1):97
ABSTRACT: BACKGROUND: Gene finding is a complicated procedure that encapsulates algorithms for coding sequence modeling, identification of promoter regions, issues concerning overlapping genes and more. In the present study we focus on coding sequence modeling algorithms; that is, algorithms for identification and prediction of the actual coding sequences from genomic DNA. In this respect, we promote a novel multivariate method known as Canonical Powered Partial Least Squares (CPPLS) as an alternative to the commonly used Interpolated Markov model (IMM). Comparisons between the methods were performed on DNA, codon and protein sequences with highly conserved genes taken from several species with different genomic properties. RESULTS: The multivariate CPPLS approach classified coding sequence substantially better than the commonly used IMM on the same set of sequences. We also found that the use of CPPLS with codon representation gave significantly better classification results than both IMM with protein (p < 0.001) and with DNA (p < 0.001). Further, although the mean performance was similar, the variation of CPPLS performance on codon representation was significantly smaller than for IMM (p < 0.001). CONCLUSIONS: The performance of coding sequence modeling can be substantially improved by using an algorithm based on the multivariate CPPLS method applied to codon or DNA frequencies. 相似文献
11.
Aarya Venkat Daniel Tehrani Rahil Taujale Wayland Yeung Nathan Gravel Kelley W. Moremen Natarajan Kannan 《The Journal of biological chemistry》2022,298(8)
Hydrophobic cores are fundamental structural properties of proteins typically associated with protein folding and stability; however, how the hydrophobic core shapes protein evolution and function is poorly understood. Here, we investigated the role of conserved hydrophobic cores in fold-A glycosyltransferases (GT-As), a large superfamily of enzymes that catalyze formation of glycosidic linkages between diverse donor and acceptor substrates through distinct catalytic mechanisms (inverting versus retaining). Using hidden Markov models and protein structural alignments, we identify similarities in the phosphate-binding cassette (PBC) of GT-As and unrelated nucleotide-binding proteins, such as UDP-sugar pyrophosphorylases. We demonstrate that GT-As have diverged from other nucleotide-binding proteins through structural elaboration of the PBC and its unique hydrophobic tethering to the F-helix, which harbors the catalytic base (xED-Asp). While the hydrophobic tethering is conserved across diverse GT-A fold enzymes, some families, such as B3GNT2, display variations in tethering interactions and core packing. We evaluated the structural and functional impact of these core variations through experimental mutational analysis and molecular dynamics simulations and find that some of the core mutations (T336I in B3GNT2) increase catalytic efficiency by modulating the conformational occupancy of the catalytic base between “D-in” and acceptor-accessible “D-out” conformation. Taken together, our studies support a model of evolution in which the GT-A core evolved progressively through elaboration upon an ancient PBC found in diverse nucleotide-binding proteins, and malleability of this core provided the structural framework for evolving new catalytic and substrate-binding functions in extant GT-A fold enzymes. 相似文献
12.
CY-007 and CY-049 pteridine glycosyltransferases (PGTs) that differ in sugar donor specificity to catalyze either glucose or xylose transfer to tetrahydrobiopterin were studied here touncover the structural determinants necessary for the specificity. The importance of the C-terminal domain and its residues 218 and 258 that are different between the two PGTs was assessed via structure-guided domain swapping or single and dual amino acid substitutions. Catalytic activity and selectivity were altered in all the mutants (2 chimeric and 6 substitution) to accept both UDP-glucose and UDP-xylose. In addition, the wild type activities were improved 1.6-4.2 fold in 4 substitution mutants and activity was observed towards another substrate UDP-Nacetylglucosamine in all the substitution mutants from CY-007 PGT. The results strongly support essential role of the C-terminal domain and the two residues for catalysis as well as sugar donor specificity, bringing insight into the structural features of the PGTs. [BMB Reports 2013; 46(1): 37-40] 相似文献
13.
Background
An organism's ability to adapt to its particular environmental niche is of fundamental importance to its survival and proliferation. In the largest study of its kind, we sought to identify and exploit the amino-acid signatures that make species-specific protein adaptation possible across 100 complete genomes.Results
Environmental niche was determined to be a significant factor in variability from correspondence analysis using the amino acid composition of over 360,000 predicted open reading frames (ORFs) from 17 archae, 76 bacteria and 7 eukaryote complete genomes. Additionally, we found clusters of phylogenetically unrelated archae and bacteria that share similar environments by amino acid composition clustering. Composition analyses of conservative, domain-based homology modeling suggested an enrichment of small hydrophobic residues Ala, Gly, Val and charged residues Asp, Glu, His and Arg across all genomes. However, larger aromatic residues Phe, Trp and Tyr are reduced in folds, and these results were not affected by low complexity biases. We derived two simple log-odds scoring functions from ORFs (CG) and folds (CF) for each of the complete genomes. CF achieved an average cross-validation success rate of 85 ± 8% whereas the CG detected 73 ± 9% species-specific sequences when competing against all other non-redundant CG. Continuously updated results are available at http://genome.mshri.on.ca.Conclusion
Our analysis of amino acid compositions from the complete genomes provides stronger evidence for species-specific and environmental residue preferences in genomic sequences as well as in folds. Scoring functions derived from this work will be useful in future protein engineering experiments and possibly in identifying horizontal transfer events. 相似文献14.
There are no well accepted criteria for the diagnosis of the metabolic syndrome. However, the metabolic syndrome is identified clinically by the presence of three or more of these five variables: larger waist circumference, higher triglyceride levels, lower HDL-cholesterol concentrations, hypertension, and impaired fasting glucose. We use sets of two or three variables, which are available in the Framingham Heart Study data set, to localize genes responsible for this syndrome using multivariate quantitative linkage analysis. This analysis demonstrates the applicability of using multivariate linkage analysis and how its use increases the power to detect linkage when genes are involved in the same disease mechanism. 相似文献
15.
Takamatsu S Inoue N Katsumata T Nakamura K Fujibayashi Y Takeuchi M 《Biochemistry》2005,44(16):6343-6349
Many recombinant proteins developed or under development for clinical use are glycoproteins, and trials aimed at improving their bioactivity or pharmacokinetics in vivo by altering specific glycan structures are ongoing. For pharmaceuticals of glycoproteins, it is important to characterize and, if possible, control the glycosylation profile. However, the mechanism responsible for the regulation of sugar chain structures found on naturally occurring glycoproteins is still unclear. To clarify the relationship between glycosyltransferases and sugar chain branch structure, we estimated six glycosyltransferases' activities (N-acetylglucosaminyltransferase (GlcNAcTase)-I, -II, -III, -IV, -V, and beta-1,4-galactosyltransferase (GalT)) which control the branch formation on asparagine (Asn)-linked sugar chains in 18 human cancer cell lines derived from several tissues. To visualize the balance of glycosyltransferase activity associated with each cell line, we expressed the relative glycosyltransferase activity in comparison to the average activity among the cell lines. These cell lines were classified into five groups according to their relative glycosyltransferase balance and were termed GlcNAcTase-I/-II, GlcNAcTase-III, GlcNAcTase-IV, GlcNAcTase-V, and GalT. We also characterized the structures of Asn-linked sugar chains on the cell surface of representative cell lines of each group. The branching structure of cell surface sugar chains roughly corresponded to the glycosyltransferase balance. This finding suggests that, for the sugar chain structure remodeling of glycoproteins, attention should be focused on the glycosyltransferase balance of host cells before introducing exogenous glycosyltransferases or down-regulating the activity of intrinsic glycosyltransferases. 相似文献
16.
The inhibition of glycosyltransferases was studied using uridine monophosphate derivatives and uridine diphosphate sugar analogs. Modification in the nucleoside portion caused selective inhibition of glycosyltransferases. 相似文献
17.
M van Heel 《Journal of molecular biology》1991,220(4):877-887
A novel multivariate statistical approach is presented for extracting and exploiting intrinsic information present in our ever-growing sequence data banks. The information extraction from the sequences avoids the pitfalls of intersequence alignment by analyzing secondary invariant functions derived from the sequences in the data bank rather than the sequences themselves. Such typical invariant function is a 20 x 20 histogram of occurrences of amino acid pairs in a given sequence or fragment thereof. To illustrate the potential of the approach an analysis of 10,000 protein sequences from the National Biomedical Research Foundation Protein Identification Resource is presented, whose analysis already reveals great biological detail. For example, zeta-hemoglobin is found to lie close to amphibian and fish chi-hemoglobin which, in turn, is an important clue to the physiological function of this mammalian early embryonic hemoglobin. The multivariate statistical framework presented unifies such apparently unrelated issues as phylogenetic comparisons between a set of sequences and distance matrices between the constituents of the biological sequences. The Multivariate Statistical Sequence Analysis (MSSA) principles can be used for a wide spectrum of sequence analysis problems such as: assignment of family memberships to new sequences, validation of new incoming sequences to be entered into the database, prediction of structure from sequence, discrimination of coding from non-coding DNA regions, and automatic generation of an atlas of protein or DNA sequences. The MSSA techniques represent a self-contained approach to learning continuously and automatically from the growing stream of new sequences. The MSSA approach is particularly likely to play a significant role in major sequencing efforts such as the human genome project. 相似文献
18.
We have compared the efficiency of the lod score test which assumes heterogeneity (lod2) to the standard lod score test which assumes homogeneity (lod1) when three-point linkage analysis is used in successive map intervals. If it is assumed that a gene located midway between two linked marker loci is responsible for a proportion of disease cases, then the lod1 test loses power relative to the lod2 test, as the proportion of linked families decreases, as the flanking markers are more closely linked, and as more map intervals are tested. Moreover, when multipoint analysis is used, linkage for a disease gene is more likely to be incorrectly excluded from a complete and dense linkage map if true genetic heterogeneity is ignored. We thus conclude that, in general, the lod2 linkage test is more efficient for detecting a true linkage when a complete genetic marker map is screened for a heterogeneous disorder. 相似文献
19.
Fifty-two 3D structures of Ig-like domains covering the immunoglobulin fold family (IgFF) were compared and classified according to the conservation of their secondary structures. Members of the IgFF are distantly related proteins or evolutionarily unrelated proteins with a similar fold, the Ig fold. In this paper, a multiple structural alignment of the conserved common core is described and the correlation between corresponding sequences is discussed. While the members of the IgFF exhibit wide heterogeneity in terms of tissue and species distribution or functional implications, the 3D structures of these domains are far more conserved than their sequences. We define topologically equivalent residues in the Ig-like domains, describe the hydrophobic common cores and discuss the presence of additional strands. The disulfide bridges, not necessary for the stability of the Ig fold, may have an effect on the compactness of the domains. Based upon sequence and structure analysis, we propose the introduction of two new subtypes (C3 and C4) to the previous classifications, in addition to a new global structural classification. The very low mean sequence identity between subgroups of the IgFF suggests the occurrence of both divergent and convergent evolutionary processes, explaining the wide diversity of the superfamily. Finally, this review suggest that hydrophobic residues constituting the common hydrophobic cores are important clues to explain how highly divergent sequences can adopt a similar fold. 相似文献