首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 0 毫秒
1.
Human immunoglobulin D (IgD) occurs most abundantly as a membrane-bound antibody on the surface of mature B cells (mIgD). IgD possesses the longest hinge sequence of all the human antibody isotypes, with 64 residues connecting the Fab and Fc fragments. A novel rapid purification scheme of secreted IgD from the serum of an IgD myeloma patient using thiophilic (T-gel) and lectin affinity chromatography gave a stable, homogeneous IgD preparation. Synchrotron X-ray scattering and analytical ultracentrifugation of IgD identified the solution arrangement of its Fab and Fc fragments, and thereby its hinge structure. The Guinier X-ray radius of gyration R(G) of 6.9(+/-0.1)nm showed that IgD is more extended in solution than the immunoglobulin subclass IgA1 (R(G) of 6.1-6.2nm). Its distance distribution function P(r) showed a single peak at 4.7nm and a maximum dimension of 23nm. Velocity experiments gave a sedimentation coefficient of 6.3S, which is similar to that for IgA1 at 6.2S. The complete IgD structure was modelled using molecular dynamics to generate IgD hinge structures, to which homology models for the Fab and Fc fragments were connected. Good scattering curve fits were obtained with 18 semi-extended best fit IgD models that were filtered from 8500 trial models. These best-fit models showed that the IgD hinge does not correspond to an extended polypeptide structure. The averaged solution structure arrangement of the Fab and Fc fragments in IgD is principally T-shaped and flexible, with contribution from Y-shaped and inverted Y-shaped structures. Although the linear sequence of the IgD hinge is much longer, comparison with previous scattering modelling of IgA1 and IgA2(m)1 suggests that the hinge of IgA1 and IgD are more similar than might have been expected, Both possess flexible T-shaped solution structures, probably reflecting the presence of restraining O-linked sugars.  相似文献   

2.
Protein structure prediction in genomics   总被引:1,自引:0,他引:1  
As the number of completely sequenced genomes rapidly increases, including now the complete Human Genome sequence, the post-genomic problems of genome-scale protein structure determination and the issue of gene function identification become ever more pressing. In fact, these problems can be seen as interrelated in that experimentally determining or predicting or the structure of proteins encoded by genes of interest is one possible means to glean subtle hints as to the functions of these genes. The applicability of this approach to gene characterisation is reviewed, along with a brief survey of the reliability of large-scale protein structure prediction methods and the prospects for the development of new prediction methods.  相似文献   

3.
One of the major bottlenecks in many ab initio protein structure prediction methods is currently the selection of a small number of candidate structures for high‐resolution refinement from large sets of low‐resolution decoys. This step often includes a scoring by low‐resolution energy functions and a clustering of conformations by their pairwise root mean square deviations (RMSDs). As an efficient selection is crucial to reduce the overall computational cost of the predictions, any improvement in this direction can increase the overall performance of the predictions and the range of protein structures that can be predicted. We show here that the use of structural profiles, which can be predicted with good accuracy from the amino acid sequences of proteins, provides an efficient means to identify good candidate structures. Proteins 2010. © 2009 Wiley‐Liss, Inc.  相似文献   

4.
This paper evaluates the results of a protein structure prediction contest. The predictions were made using threading procedures, which employ techniques for aligning sequences with 3D structures to select the correct fold of a given sequence from a set of alternatives. Nine different teams submitted 86 predictions, on a total of 21 target proteins with little or no sequence homology to proteins of known structure. The 3D structures of these proteins were newly determined by experimental methods, but not yet published or otherwise available to the predictors. The predictions, made from the amino acid sequence alone, thus represent a genuine test of the current performance of threading methods. Only a subset of all the predictions is evaluated here. It corresponds to the 44 predictions submitted for the 11 target proteins seen to adopt known folds. The predictions for the remaining 10 proteins were not analyzed, although weak similarities with known folds may also exist in these proteins. We find that threading methods are capable of identifying the correct fold in many cases, but not reliably enough as yet. Every team predicts correctly a different set of targets, with virtually all targets predicted correctly by at least one team. Also, common folds such as TIM barrels are recognized more readily than folds with only a few known examples. However, quite surprisingly, the quality of the sequence-structure alignments, corresponding to correctly recognized folds, is generally very poor, as judged by comparison with the corresponding 3D structure alignments. Thus, threading can presently not be relied upon to derive a detailed 3D model from the amino acid sequence. This raises a very intriguing question: how is fold recognition achieved? Our analysis suggests that it may be achieved because threading procedures maximize hydrophobic interactions in the protein core, and are reasonably good at recognizing local secondary structure. © 1995 Wiley-Liss, Inc.  相似文献   

5.
Protein structure prediction methods for drug design   总被引:1,自引:0,他引:1  
Along the long path from genomic data to a new drug, the knowledge of three-dimensional protein structure can be of significant help in several places.This paper points out such places, discusses the virtues of protein structure knowledge and reviews bioinformatics methods for gaining such knowledge on the protein structure.  相似文献   

6.
Structural class characterizes the overall folding type of a protein or its domain. A number of computational methods have been proposed to predict structural class based on primary sequences; however, the accuracy of these methods is strongly affected by sequence homology. This paper proposes, an ensemble classification method and a compact feature-based sequence representation. This method improves prediction accuracy for the four main structural classes compared to competing methods, and provides highly accurate predictions for sequences of widely varying homologies. The experimental evaluation of the proposed method shows superior results across sequences that are characterized by entire homology spectrum, ranging from 25% to 90% homology. The error rates were reduced by over 20% when compared with using individual prediction methods and most commonly used composition vector representation of protein sequences. Comparisons with competing methods on three large benchmark datasets consistently show the superiority of the proposed method.  相似文献   

7.

Background

Since experimental techniques are time and cost consuming, in silico protein structure prediction is essential to produce conformations of protein targets. When homologous structures are not available, fragment-based protein structure prediction has become the approach of choice. However, it still has many issues including poor performance when targets’ lengths are above 100 residues, excessive running times and sub-optimal energy functions. Taking advantage of the reliable performance of structural class prediction software, we propose to address some of the limitations of fragment-based methods by integrating structural constraints in their fragment selection process.

Results

Using Rosetta, a state-of-the-art fragment-based protein structure prediction package, we evaluated our proposed pipeline on 70 former CASP targets containing up to 150 amino acids. Using either CATH or SCOP-based structural class annotations, enhancement of structure prediction performance is highly significant in terms of both GDT_TS (at least +2.6, p-values < 0.0005) and RMSD (−0.4, p-values < 0.005). Although CATH and SCOP classifications are different, they perform similarly. Moreover, proteins from all structural classes benefit from the proposed methodology. Further analysis also shows that methods relying on class-based fragments produce conformations which are more relevant to user and converge quicker towards the best model as estimated by GDT_TS (up to 10% in average). This substantiates our hypothesis that usage of structurally relevant templates conducts to not only reducing the size of the conformation space to be explored, but also focusing on a more relevant area.

Conclusions

Since our methodology produces models the quality of which is up to 7% higher in average than those generated by a standard fragment-based predictor, we believe it should be considered before conducting any fragment-based protein structure prediction. Despite such progress, ab initio prediction remains a challenging task, especially for proteins of average and large sizes. Apart from improving search strategies and energy functions, integration of additional constraints seems a promising route, especially if they can be accurately predicted from sequence alone.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-015-0576-2) contains supplementary material, which is available to authorized users.  相似文献   

8.
Defining the shape, conformation, or assembly state of an RNA in solution often requires multiple investigative tools ranging from nucleotide analog interference mapping to X-ray crystallography. A key addition to this toolbox is small-angle X-ray scattering (SAXS). SAXS provides direct structural information regarding the size, shape, and flexibility of the particle in solution and has proven powerful for analyses of RNA structures with minimal requirements for sample concentration and volumes. In principle, SAXS can provide reliable data on small and large RNA molecules. In practice, SAXS investigations of RNA samples can show inconsistencies that suggest limitations in the SAXS experimental analyses or problems with the samples. Here, we show through investigations on the SAM-I riboswitch, the Group I intron P4-P6 domain, 30S ribosomal subunit from Sulfolobus solfataricus (30S), brome mosaic virus tRNA-like structure (BMV TLS), Thermotoga maritima asd lysine riboswitch, the recombinant tRNAval, and yeast tRNAphe that many problems with SAXS experiments on RNA samples derive from heterogeneity of the folded RNA. Furthermore, we propose and test a general approach to reducing these sample limitations for accurate SAXS analyses of RNA. Together our method and results show that SAXS with synchrotron radiation has great potential to provide accurate RNA shapes, conformations, and assembly states in solution that inform RNA biological functions in fundamental ways.  相似文献   

9.
Protein function prediction using local 3D templates   总被引:8,自引:0,他引:8  
The prediction of a protein's function from its 3D structure is becoming more and more important as the worldwide structural genomics initiatives gather pace and continue to solve 3D structures, many of which are of proteins of unknown function. Here, we present a methodology for predicting function from structure that shows great promise. It is based on 3D templates that are defined as specific 3D conformations of small numbers of residues. We use four types of template, covering enzyme active sites, ligand-binding residues, DNA-binding residues and reverse templates. The latter are templates generated from the target structure itself and scanned against a representative subset of all known protein structures. Together, the templates provide a fairly thorough coverage of the known structures and ensure that if there is a match to a known structure it is unlikely to be missed. A new scoring scheme provides a highly sensitive means of discriminating between true positive and false positive template matches. In all, the methodology provides a powerful new tool for function prediction to complement those already in use.  相似文献   

10.
Using a recently developed protein folding algorithm, a prediction of the tertiary structure of the KIX domain of the CREB binding protein is described. The method incorporates predicted secondary and tertiary restraints derived from multiple sequence alignments in a reduced protein model whose conformational space is explored by Monte Carlo dynamics. Secondary structure restraints are provided by the PHD secondary structure prediction algorithm that was modified for the presence of predicted U-turns, i.e., regions where the chain reverses global direction. Tertiary restraints are obtained via a two-step process: First, seed side-chain contacts are identified from a correlated mutation analysis, and then, a threading-based algorithm expands the number of these seed contacts. Blind predictions indicate that the KIX domain is a putative three-helix bundle, although the chirality of the bundle could not be uniquely determined. The expected root-mean-square deviation for the correct chirality of the KIX domain is between 5.0 and 6.2 Å. This is to be compared with the estimate of 12.9 Å that would be expected by a random prediction, using the model of F. Cohen and M. Sternberg (J. Mol. Biol. 138:321–333, 1980). Proteins 30:287–294, 1998. © 1998 Wiley-Liss, Inc.  相似文献   

11.
We present heuristic-based predictions of the secondary and tertiary structures of the cyclins A, B, and D, representatives of the cyclin superfamily. The list of suggested constraints for tertiary structure assembly was left unrefined in order to submit this report before an announced crystal structure for cyclin A becomes available. To predict these constraints, a master sequence alignment over 270 positions of cyclin types A, B, and D was adjusted based on individual secondary structure predictions for each type. We used new heuristics for predicting aromatic residues at protein-protein interfaces and to identify sequentially distinct regions in the protein chain that cluster in the folded structure. The boundaries of two conjectured domains in the cyclin fold were predicted based on experimental data in the literature. The domain that is important for interaction of the cyclins with cyclin-dependent kinases (CDKs) is predicted to contain six helices; the second domain in the consensus model contains both helices and a β-sheet that is formed by sequentially distant regions in the protein chain. A plausible phosphorylation site is identified. This work represents a blinded test of the method for prediction of secondary and, to a lesser extent, tertiary structure from a set of homologous protein sequences. Evaluation of our predictions will become possible with the publication of the announced crystal structure.  相似文献   

12.
Small-angle X-ray scattering (SAXS) was used to study structural characteristics of human serum albumin (HSA) in solution under different pH conditions. Guinier analysis of SAXS results yielded values of the molecular radius of gyration ranging from 26.7 Å to 34.5 Å for pH varying from 2.5 to 7.0. This suggests the existence of significant differences in the overall shape of the molecule at different pH. Molecular models based on subdomains with different spatial configurations were proposed. The distance distribution functions associated with these models were calculated and compared with those determined from the experimental SAXS intensity functions. The conclusion of this SAXS study is that the arrangement of molecular subdomains is clearly pH dependent; the molecule adopting more or less compact configuration for different pH conditions. The conclusions of this systematic study on the modification in molecular shape of HSA as a response to pH changes is consistent with those of previous investigations performed for particular pH conditions. Correspondence to: J. R. Olivieri  相似文献   

13.
We predict a structure of the glutamine amidotransferase subunit (hisH) of imidazole glycerol phosphate synthase (IGPS) which catalyzes the fifth step of the histidine biosynthesis in Escherichia coli. The model is constructed using an energy-based threading program augmented by a multiple sequence to structure profile analysis. In developing our model we identified a conserved core region within hisH and a variable domain which is the likely site of interaction with the synthase subunit (hisF) of IGPS. Information available from structural and functional genomics studies was used to improve the structure prediction, to discuss parallels between histidine biosynthesis and other amino acid and nucleotide metabolic pathways, and to better understand the protein-protein interactions between the hisH and hisF domains of IGPS. This work allows us to develop a preliminary model for the structure of the entire IGPS holoenzyme.  相似文献   

14.
The GroES protein from Escherichia coli is a well-known member of the molecular chaperones. GroES consists of seven identical 10 kDa subunits, and forms a dome-like oligomeric structure. In order to obtain information on the structural stability and unfolding-refolding mechanism of GroES protein, especially at protein concentrations (0.4-1.2 mM GroES monomer) that would mimic heat stress conditions in vivo, we have performed synchrotron small-angle X-ray scattering (SAXS) experiments. Surprisingly, in spite of the high protein concentration, reversibility in the unfolding-refolding reaction was confirmed by SAXS experiments structurally. Although the unfolding-refolding reaction showed an apparent single transition with a Cm of 1.1 M guanidium hydrochloride, a more detailed analysis of this transition demonstrated that the unfolding mechanism could be best explained by a sequential three-state model, which consists of native heptamer, dissociated monomer, and unfolded monomer. Together with our previous result that GroES unfolded completely via a partially folded monomer according to a three-state model at low protein concentration (5 microM monomer), the unfolding-refolding mechanism of GroES protein could be explained uniformly by the three-state model from low to high protein concentrations. Furthermore, to clarify an ambiguity of the native GroES structure in solution, especially mobile loop structures, we have estimated a solution structure of GroES using SAXS profiles obtained from experiments and simulation analysis. The result suggested that the native structure of GroES in solution was very similar to that seen in GroES-GroEL complex determined by crystallography.  相似文献   

15.
蛋白质结构预测的理论方法及阶段   总被引:2,自引:0,他引:2  
孙侠  殷志祥 《生物学杂志》2007,24(1):16-17,15
一直以来,蛋白质结构预测都是人们研究的焦点,综述了蛋白质结构预测的几种理论方法和不同阶段。  相似文献   

16.
Structural proteomics aims to understand the structural basis of protein interactions and functions. A prerequisite for this is the availability of 3D protein structures that mediate the biochemical interactions. The explosion in the number of available gene sequences set the stage for the next step in genome-scale projects – to obtain 3D structures for each protein. To achieve this ambitious goal, the slow and costly structure determination experiments are supplemented with theoretical approaches. The current state and recent advances in structure modeling approaches are reviewed here, with special emphasis on comparative protein structure modeling techniques.  相似文献   

17.
A major challenge in structural biology is to determine the configuration of domains and proteins in multidomain proteins and assemblies, respectively. All available data should be considered to maximize the accuracy and precision of these models. Small-angle X-ray scattering (SAXS) efficiently provides low-resolution experimental data about the shapes of proteins and their assemblies. Thus, we integrated SAXS profiles into our software for modeling proteins and their assemblies by satisfaction of spatial restraints. Specifically, we modeled the quaternary structures of multidomain proteins with structurally defined rigid domains as well as quaternary structures of binary complexes of structurally defined rigid proteins. In addition to SAXS profiles and the component structures, we used stereochemical restraints and an atomic distance-dependent statistical potential. The scoring function is optimized by a biased Monte Carlo protocol, including quasi-Newton and simulated annealing schemes. The final prediction corresponds to the best scoring solution in the largest cluster of many independently calculated solutions. To quantify how well the quaternary structures are determined based on their SAXS profiles, we used a benchmark of 12 simulated examples as well as an experimental SAXS profile of the homotetramer d-xylose isomerase. Optimization of the SAXS-dependent scoring function generally results in accurate models if sufficiently precise approximations for the constituent rigid bodies are available; otherwise, the best scoring models can have significant errors. Thus, SAXS profiles can play a useful role in the structural characterization of proteins and assemblies if they are combined with additional data and used judiciously. Our integration of a SAXS profile into modeling by satisfaction of spatial restraints will facilitate further integration of different kinds of data for structure determination of proteins and their assemblies.  相似文献   

18.
Protein structural class prediction is one of the challenging problems in bioinformatics. Previous methods directly based on the similarity of amino acid (AA) sequences have been shown to be insufficient for low-similarity protein data-sets. To improve the prediction accuracy for such low-similarity proteins, different methods have been recently proposed that explore the novel feature sets based on predicted secondary structure propensities. In this paper, we focus on protein structural class prediction using combinations of the novel features including secondary structure propensities as well as functional domain (FD) features extracted from the InterPro signature database. Our comprehensive experimental results based on several benchmark data-sets have shown that the integration of new FD features substantially improves the accuracy of structural class prediction for low-similarity proteins as they capture meaningful relationships among AA residues that are far away in protein sequence. The proposed prediction method has also been tested to predict structural classes for partially disordered proteins with the reasonable prediction accuracy, which is a more difficult problem comparing to structural class prediction for commonly used benchmark data-sets and has never been done before to the best of our knowledge. In addition, to avoid overfitting with a large number of features, feature selection is applied to select discriminating features that contribute to achieve high prediction accuracy. The selected features have been shown to achieve stable prediction performance across different benchmark data-sets.  相似文献   

19.
Wide-angle X-ray solution scattering (WAXS) patterns contain substantial information about the three-dimensional structure of a protein. Although WAXS data have far less information than is required for determination of a full three-dimensional structure, the actual amount of information contained in a WAXS pattern has not been carefully quantified. Here we carry out an analysis of the amount of information that can be extracted from a WAXS pattern and demonstrate that it is adequate to estimate the secondary-structure content of a protein and to strongly limit its possible tertiary structures. WAXS patterns computed from the atomic coordinates of a set of 498 protein domains representing all of known fold space were used as the basis for constructing a multidimensional space of all corresponding WAXS patterns (‘WAXS space’). Within WAXS space, each scattering pattern is represented by a single vector. A principal components analysis was carried out to identify those directions in WAXS space that provide the greatest discrimination among patterns. The number of dimensions that provide significant discrimination among protein folds agrees well with the number of independent parameters estimated from a naïve Shannon sampling theorem approach. Estimates of the relative abundances of secondary structures were made using training/test sets derived from this data set. The average error in the estimate of α-helical content was 11%, and of β-sheet content was 9%. The distribution of proteins that are members of the four structure classes, α, β, α/β and α+β, are well separated in WAXS space when data extending to a spacing of 2.2 Å are used. Quantification of the information embedded within a WAXS pattern indicates that these data can be used as a powerful constraint in homology modeling of protein structures.  相似文献   

20.
Methods for automated prediction of deleterious protein mutations have utilized both structural and evolutionary information but the relative contribution of these two factors remains unclear. To address this, we have used a variety of structural and evolutionary features to create simple deleterious mutation models that have been tested on both experimental mutagenesis and human allele data. We find that the most accurate predictions are obtained using a solvent-accessibility term, the C(beta) density, and a score derived from homologous sequences, SIFT. A classification tree using these two features has a cross-validated prediction error of 20.5% on an experimental mutagenesis test set when the prior probability for deleterious and neutral cases is equal, whereas this prediction error is 28.8% and 22.2% using either the C(beta) density or SIFT alone. The improvement imparted by structure increases when fewer homologs are available: when restricted to three homologs the prediction error improves from 26.9% using SIFT alone to 22.4% using SIFT and the C(beta) density, or 24.8% using SIFT and a noisy C(beta) density term approximating the inaccuracy of ab initio structures modeled by the Rosetta method. We conclude that methods for deleterious mutation prediction should include structural information when fewer than five to ten homologs are available, and that ab initio predicted structures may soon be useful in such cases when high-resolution structures are unavailable.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号