Similar articles
1.
Proteins that share even low sequence homology are known to adopt similar folds. The beta-propeller structural motif is one such example. Identifying sequences that adopt a beta-propeller fold is useful for annotating protein structure and function. Often, tandem sequence repeats provide the necessary signal for identifying beta-propellers in proteins. In our recent analysis to identify cell surface proteins in archaeal and bacterial genomes, we identified some proteins that contain novel tandem repeats "LVIVD", "RIVW" and "LGxL". In this work, based on protein fold predictions and three-dimensional comparative modeling methods, we predicted that these repeat types fold as beta-propellers. Further, the evolutionary trace analysis of all proteins containing amino acid sequence repeats in beta-propellers suggests that the novel repeats have diverged from a common ancestor.
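As a rough illustration of how repeat motifs such as "LVIVD", "RIVW" and "LGxL" might be located in a sequence, the sketch below scans with regular expressions. It assumes that "x" stands for any residue; the function name and the toy sequence in the usage note are hypothetical, and this is not the detection method used in the study.

```python
import re

# Motifs named in the abstract; 'x' is assumed here to mean "any residue".
MOTIFS = ["LVIVD", "RIVW", "LGxL"]

def find_repeats(sequence, motifs=MOTIFS):
    """Return {motif: [0-based start positions]} for every motif that occurs."""
    hits = {}
    for motif in motifs:
        # A lookahead lets overlapping occurrences be reported as well.
        pattern = re.compile("(?=(" + motif.replace("x", ".") + "))")
        positions = [m.start() for m in pattern.finditer(sequence)]
        if positions:
            hits[motif] = positions
    return hits
```

For a toy sequence such as `"AALVIVDGGLGALTTLVIVD"`, the function reports two "LVIVD" occurrences and one "LGxL" occurrence; whether such hits are true tandem repeats would still need the fold-level analysis described above.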

2.
The beta-propeller architecture observed in protein tertiary structure is classified into five types according to the number of 'blades' (or beta-sheets), plus a sixth type classified according to the secondary structure composition of the blades (the beta beta alpha beta-molecular unit). It is characterized by variations (or plasticity) in the structure. These correspond to the number of beta-strands associated with the blade, the number of amino acid residues associated with equivalent beta-strands in the different blades and the presence of alpha-helices and twisted beta-strands. We have generated a beta-sheet associated beta-strand pattern that may be important for protein structure prediction and modeling. Analysis of the beta-propellers extracted primarily from the SCOP database revealed 179 beta-propellers. Examination of the secondary structure of each beta-propeller using PDBsum, which was useful for defining the beta-sheet associated beta-strand pattern, combined with visualization on a graphics display, revealed structural plasticity associated with the beta-propeller architecture. In particular, the 6- and 7-bladed beta-propellers, which are known to be associated with sequence and functional diversity, are more common and show relatively more structural variation than the other beta-propeller types.

3.
The structural annotation of proteins with no detectable homologs of known 3D structure identified using sequence‐search methods is a major challenge today. We propose an original method that computes the conditional probabilities for the amino‐acid sequence of a protein to fit known protein 3D structures using a structural alphabet known as “Protein Blocks” (PBs). PBs constitute a library of 16 local structural prototypes that approximate every part of protein backbone structures. The library is used to encode 3D protein structures into 1D PB sequences and to capture sequence‐to‐structure relationships. Our method relies on amino acid occurrence matrices, one for each PB, to score global and local threading of query amino acid sequences to protein folds encoded into PB sequences. It does not use any information from residue contacts or sequence‐search methods, nor does it explicitly incorporate the hydrophobic effect. The performance of the method was assessed with independent test datasets derived from SCOP 1.75A. With a Z‐score cutoff that achieved 95% specificity (i.e., less than 5% false positives), global and local threading showed sensitivity of 64.1% and 34.2%, respectively. We further tested its performance on 57 difficult CASP10 targets that had no known homologs in PDB: 38 compatible templates were identified by our approach and 66% of these hits yielded correctly predicted structures. This method scales up well and offers promising perspectives for structural annotations at genomic level. It has been implemented in the form of a web‐server that is freely available at http://www.bo‐protscience.fr/forsa .
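The scoring idea described above, one amino acid matrix per PB and a Z-score over candidate folds, can be sketched as follows. This is a minimal illustration under stated assumptions: the alignment is ungapped and of equal length, the matrix values are toy numbers, and the function names `threading_score` and `z_scores` are hypothetical rather than taken from the published method.

```python
from statistics import mean, pstdev

def threading_score(aa_seq, pb_seq, matrices):
    """Score an ungapped alignment of an amino acid sequence to a PB sequence.

    matrices maps each PB letter to a dict {amino acid: score}; residues
    missing from a matrix are assumed to contribute 0.0.
    """
    if len(aa_seq) != len(pb_seq):
        raise ValueError("sequences must have equal length")
    return sum(matrices[pb].get(aa, 0.0) for aa, pb in zip(aa_seq, pb_seq))

def z_scores(scores):
    """Standardize raw scores so that a fixed cutoff can be applied."""
    mu = mean(scores)
    sigma = pstdev(scores)
    if sigma == 0:
        return [0.0] * len(scores)
    return [(s - mu) / sigma for s in scores]
```

A query would be scored against every PB-encoded fold in a library, and only hits whose Z-score exceeds a chosen cutoff would be kept; the cutoff corresponding to 95% specificity has to be calibrated on real data and is not reproduced here.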

4.
Nucleocytoplasmic transport occurs through nuclear pore complexes (NPCs) whose complex architecture is generated from a set of only approximately 30 proteins, termed nucleoporins. Here, we explore the domain structure of Nup133, a nucleoporin in a conserved NPC subcomplex that is crucial for NPC biogenesis and is believed to form part of the NPC scaffold. We show that human Nup133 contains two domains: a COOH-terminal domain responsible for its interaction with its subcomplex through Nup107; and an NH2-terminal domain whose crystal structure reveals a seven-bladed beta-propeller. The surface properties and conservation of the Nup133 beta-propeller suggest it may mediate multiple interactions with other proteins. Other beta-propellers are predicted in a third of all nucleoporins. These and several other repeat-based motifs appear to be major elements of nucleoporins, indicating a level of structural repetition that may conceptually simplify the assembly and disassembly of this huge protein complex.

5.
Actin-interacting protein 1 (Aip1p) is a 67-kDa WD repeat protein known to regulate the depolymerization of actin filaments by cofilin and is conserved in organisms ranging from yeast to mammals. The crystal structure of Aip1p from Saccharomyces cerevisiae was determined to a 2.3-Å resolution and a final crystallographic R-factor of 0.204. The structure reveals that the overall fold is formed by two connected seven-bladed beta-propellers and has important implications for the structure of Aip1 from other organisms and WD repeat-containing proteins in general. These results were unexpected because a maximum of 10 WD repeats had been reported in the literature for this protein using sequence data. The surfaces of the beta-propellers formed by the D-A and B-C loops are positioned adjacent to one another, giving Aip1p a shape that resembles an open "clamshell." The mapping of conserved residues to the structure of Aip1p reveals dense patches of conserved residues on the surface of one beta-propeller and at the interface of the two beta-propellers. These two patches of conserved residues suggest a potential binding site for F-actin on Aip1p and that the orientation of the beta-propellers with respect to one another plays a role in binding an actin-cofilin complex. In addition, the conserved interface between the domains is mediated by a number of interactions that appear to impart rigidity between the two domains of Aip1p and may make a large substrate-induced conformational change difficult.

6.
A G Murzin 《Proteins》1992,14(2):191-201
Twisted beta-sheets, packed face to face, may be arranged in circular formation like blades of a propeller or turbine. This beta-propeller fold has been found in three proteins: that in neuraminidase consists of six beta-sheets while those in methylamine dehydrogenase and galactose oxidase are composed of seven beta-sheets. A model for multisheet packing in the beta-propeller fold is proposed. This model gives both geometrical parameters of the beta-propellers composed of different numbers of sheets and patterns of residue packing at their sheet-to-sheet interfaces. All the known beta-propeller structures have been analyzed, and the observed geometries and residue packing are found to be in good agreement with those predicted by the model. It is shown that the unusual seven-fold symmetry is preferable to six- or eight-fold symmetry for propeller-like multi-sheet assembly. According to the model, a six-beta-sheet propeller has to have predominantly small residues in the beta-strands close to its six-fold axis, but no strong sequence constraints are necessary for a seven-fold beta-propeller.

7.
The six beta-propellers located within the N-terminus of low density lipoprotein receptor-related protein 1 (LRP1) are arranged in two clusters that contain two and four beta-propellers, respectively. Working with LRP1 deletion mutants, we found that randomly removing large segments of amino acid sequences did not affect the intracellular trafficking of LRP1 as long as the clustered beta-propeller domains were retained. However, deletion mutants with crippled beta-propeller clusters invariably exhibited retarded exit from the endoplasmic reticulum (ER). To determine potential functions of the clustered beta-propellers, we generated a series of deletion mutants in which the beta-propellers were systematically removed from the C-terminal end of the second cluster. The resulting minireceptors, designated LRPbeta1-6, beta1-5, beta1-4, beta1-3, and beta1-2 and containing decreasing numbers of beta-propellers, were stably expressed in LRP1-null CHO cells. Binding/degradation assays with receptor-associated protein or alpha2-macroglobulin showed that removing one or more beta-propellers had little effect on binding or degradation of these ligands. However, minireceptors containing an odd number of beta-propellers (i.e., LRPbeta1-3 and beta1-5) showed prolonged retention within the ER and remained endoglycosidase H-sensitive, whereas minireceptors containing an even number of beta-propellers (i.e., LRPbeta1-2, beta1-4 and beta1-6) exited the ER at variable rates. Cell surface biotinylation experiments showed that LRPbeta1-3 was absent from the cell surface. Prolonged retention of LRPbeta1-3 within the ER was accompanied by increased association with the molecular chaperone Grp78/Bip. These results suggest that the clustered beta-propellers may play a role in folding and intracellular trafficking of LRP1.

8.
In the study of protein complexes, is there a computational method for inferring which combinations of proteins in an organism are likely to form a crystallizable complex? Here we attempt to answer this question, using the Protein Data Bank (PDB) to assess the usefulness of inferred functional protein linkages from the Prolinks database. We find that of the 242 nonredundant prokaryotic protein complexes shared between the current PDB and Prolinks, 44% (107/242) contain proteins linked at high confidence by one or more methods of computed functional linkages. Similarly, high-confidence linkages detect 47% of known Escherichia coli protein complexes, with 45% accuracy. Together these findings suggest that functional linkages will be useful in defining protein complexes for structural studies, including for structural genomics. We offer a database of inferred linkages corresponding to likely protein complexes for some 629,952 pairs of proteins in 154 prokaryotes and archaea.

9.
We report an unsupervised structural motif discovery algorithm, FoldMiner, which is able to detect global and local motifs in a database of proteins without the need for multiple structure or sequence alignments and without relying on prior classification of proteins into families. Motifs, which are discovered from pairwise superpositions of a query structure to a database of targets, are described probabilistically in terms of the conservation of each secondary structure element's position and are used to improve detection of distant structural relationships. During each iteration of the algorithm, the motif is defined from the current set of homologs and is used both to recruit additional homologous structures and to discard false positives. FoldMiner thus achieves high specificity and sensitivity by distinguishing between homologous and nonhomologous structures by the regions of the query to which they align. We find that when two proteins of the same fold are aligned, highly conserved secondary structure elements in one protein tend to align to highly conserved elements in the second protein, suggesting that FoldMiner consistently identifies the same motif in members of a fold. Structural alignments are performed by an improved superposition algorithm, LOCK 2, which detects distant structural relationships by placing increased emphasis on the alignment of secondary structure elements. LOCK 2 obeys several properties essential in automated analysis of protein structure: It is symmetric, its alignments of secondary structure elements are transitive, its alignments of residues display a high degree of transitivity, and its scoring system is empirically found to behave as a metric.

10.
Mooney SD  Liang MH  DeConde R  Altman RB 《Proteins》2005,61(4):741-747
A primary challenge for structural genomics is the automated functional characterization of protein structures. We have developed a sequence-independent method called S-BLEST (Structure-Based Local Environment Search Tool) for the annotation of previously uncharacterized protein structures. S-BLEST encodes the local environment of an amino acid as a vector of structural property values. It has been applied to all amino acids in a nonredundant database of protein structures to generate a searchable structural resource. Given a query amino acid from an experimentally determined or modeled structure, S-BLEST quickly identifies similar amino acid environments using a K-nearest neighbor search. In addition, the method gives an estimation of the statistical significance of each result. We validated S-BLEST on X-ray crystal structures from the ASTRAL 40 nonredundant dataset. We then applied it to 86 crystallographically determined proteins in the protein data bank (PDB) with unknown function and with no significant sequence neighbors in the PDB. S-BLEST was able to associate 20 proteins with at least one local structural neighbor and identify the amino acid environments that are most similar between those neighbors.
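A K-nearest neighbor search over environment vectors, as described above, can be sketched in a few lines. The sketch assumes Euclidean distance and a brute-force scan; the abstract does not state which distance measure or index structure S-BLEST uses, and the function name `k_nearest` and the labels are hypothetical.

```python
import math

def k_nearest(query, database, k=3):
    """Return the k entries of database closest to query.

    database is a list of (label, vector) pairs, where each vector holds
    structural property values for one amino acid environment. Euclidean
    distance is an assumption made for this illustration.
    """
    scored = [(math.dist(query, vector), label) for label, vector in database]
    scored.sort()  # ascending distance; ties are broken by label
    return scored[:k]
```

With a handful of two-dimensional toy vectors, `k_nearest((0, 0), database, k=2)` returns the two labels whose vectors lie closest to the origin, each paired with its distance. A production tool would also need the significance estimate mentioned in the abstract, which is not attempted here.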

11.
TMCompare is an alignment and visualization tool for comparison of sequence information for membrane proteins contained in SWISS-PROT entries with structural information contained in PDB files. The program can be used for: detection of breaks in alpha helical structure of transmembrane regions; examination of differences in coverage between PDB and SWISS-PROT files; examination of annotation differences between PDB files and associated SWISS-PROT files; examination and comparison of assigned PDB alpha helix regions and assigned SWISS-PROT transmembrane regions in linear sequence (one letter code) format; and examination of these differences in 3D using the CHIME plugin, allowing analysis of the alpha and non-alpha content of transmembrane regions. AVAILABILITY: TMCompare is available for use through selection of a query protein via the internet (http://www.membraneproteins.org/TMCompare). CONTACT: tmcompare@membraneproteins.org

12.
The analysis of disulphide bond containing proteins in the Protein Data Bank (PDB) revealed that out of 27,209 protein structures analyzed, 12,832 proteins contain at least one intra-chain disulphide bond and 811 proteins contain at least one inter-chain disulphide bond. The intra-chain disulphide bond containing proteins can be grouped into 256 categories based on the number of disulphide bonds and the disulphide bond connectivity patterns (DBCPs) that were generated according to the position of half-cystine residues along the protein chain. The PDB entries corresponding to these 256 categories represent 509 unique SCOP superfamilies. A simple web-based computational tool is made freely available at the website http://www.ccmb.res.in/bioinfo/dsbcp that allows flexible queries to be made on the database in order to retrieve useful information on the disulphide bond containing proteins in the PDB. The database is useful to identify the different SCOP superfamilies associated with a particular disulphide bond connectivity pattern or vice versa. It is possible to define a query based either on a single field or on a combination of the following fields: PDB code, protein name, SCOP superfamily name, number of disulphide bonds, disulphide bond connectivity pattern and the number of amino acid residues in a protein chain, and to retrieve information that matches the criterion. The database may therefore be useful for selecting suitable protein structural templates in order to model more distantly related protein homologs/analogs using comparative modeling methods.
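One plausible way to derive a connectivity pattern from the positions of half-cystine residues is sketched below. It assumes that half-cystines are numbered in order of appearance along the chain and that each bond is written as a pair of those ranks; the output format and the function name `connectivity_pattern` are illustrative assumptions, not the database's actual encoding.

```python
def connectivity_pattern(bonds):
    """Convert disulphide bonds given as residue-number pairs into a pattern.

    Half-cystines are ranked 1, 2, 3, ... by position along the chain, and
    each bond is reported as 'rank-rank', sorted for a canonical result.
    """
    residues = sorted({res for pair in bonds for res in pair})
    rank = {res: i + 1 for i, res in enumerate(residues)}
    pairs = sorted(tuple(sorted((rank[a], rank[b]))) for a, b in bonds)
    return ",".join(f"{a}-{b}" for a, b in pairs)
```

For example, bonds between residues 3 and 40 and between residues 16 and 32 give the pattern `1-4,2-3`, regardless of the order in which the bonds or their endpoints are listed. Only bonded cysteines are ranked in this sketch.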

13.
The Conserved Key Amino Acid Positions DataBase (CKAAPs DB) provides access to an analysis of structurally similar proteins with dissimilar sequences where key residues within a common fold are identified. The derivation and significance of CKAAPs starting from pairwise structure alignments is described fully in Reddy et al. [Reddy,B.V.B., Li,W.W., Shindyalov,I.N. and Bourne,P.E. (2000) PROTEINS:, in press]. The CKAAPs identified from this theoretical analysis are provided to experimentalists and theoreticians for potential use in protein engineering and modeling. It has been suggested that CKAAPs may be crucial features for protein folding, structural stability and function. Over 170 substructures, as defined by the Combinatorial Extension (CE) database, which are found in approximately 3000 representative polypeptide chains have been analyzed and are available in the CKAAPs DB. CKAAPs DB also provides CKAAPs of the representative set of proteins derived from the CE and FSSP databases. Thus the database contains over 5000 representative polypeptide chains, covering all known structures in the PDB. A web interface to a relational database permits fast retrieval of structure-sequence alignments, CKAAPs and associated statistics. Users may query by PDB ID, protein name, function and Enzyme Classification number. Users may also submit protein alignments of their own to obtain CKAAPs. An interface to display CKAAPs on each structure from a web browser is also being implemented. CKAAPs DB is maintained by the San Diego Supercomputer Center and accessible at the URL http://ckaaps.sdsc.edu.

14.
To optimize the search for structural templates in protein comparative modelling, the query sequence is split into domains. The initial list of templates for each domain, extracted from PFAM plus PDB and SCOP, is then ranked according to sequence identity (%ID), coverage and resolution. If %ID is less than 30, secondary structure matching is used to filter out false templates. AVAILABILITY: http://www.bmm.icnet.uk/~3djigsaw/dom_fish
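The ranking and filtering steps described above can be sketched as follows. The sketch assumes that sequence identity takes priority over coverage, which in turn takes priority over resolution; the abstract lists the three criteria but not how they are combined. The `Template` class, its `ss_match` flag (standing in for the outcome of secondary structure matching) and the function name are hypothetical.

```python
from dataclasses import dataclass

@dataclass
class Template:
    name: str
    pct_id: float          # sequence identity to the query domain, in percent
    coverage: float        # fraction of the domain covered, 0.0 to 1.0
    resolution: float      # in angstroms; lower is better
    ss_match: bool = True  # hypothetical result of secondary structure matching

def rank_templates(templates, id_threshold=30.0):
    """Filter low-identity templates by ss_match, then rank the remainder."""
    kept = [t for t in templates if t.pct_id >= id_threshold or t.ss_match]
    # Assumed priority: higher %ID, then higher coverage, then better resolution.
    return sorted(kept, key=lambda t: (-t.pct_id, -t.coverage, t.resolution))
```

A template with 25% identity and a failed secondary structure match would be dropped, while one with 28% identity and a successful match would be kept but ranked below any template at 45% identity.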

15.
Screening of protein variants requires specific detection methods to assay protein levels and stability in crude mixtures. Many strategies apply fluorescence‐detection size‐exclusion chromatography (FSEC) using green fluorescent protein (GFP) fusion proteins to qualitatively monitor expression, stability, and monodispersity. However, GFP fusion proteins have several important disadvantages, including false positives, protein aggregation after proteolytic removal of GFP, and reductions in protein yields without the GFP fusion. Here we describe an FSEC screening strategy based on a fluorescent multivalent NTA probe that interacts with polyhistidine‐tags on target proteins. This method overcomes the limitations of GFP fusion proteins, and can be used to rank protein production based on qualitative and quantitative parameters. Domain boundaries of the human G‐protein coupled adenosine A2a receptor were readily identified from crude detergent extracts of a library of construct variants transiently produced in suspension‐adapted HEK293‐6E cells. Well‐expressing clones of MraY, an important bacterial infection target, could be identified from a library of 24 orthologs. This probe provides a highly sensitive tool to detect target proteins at expression levels down to 0.02 mg/L in crude lysate, and requires minimal amounts of cell culture.

16.
Homology-derived secondary structure of proteins (HSSP) is a well-known database of multiple sequence alignments (MSAs) which merges information on protein sequences and their three-dimensional structures. It is available for all proteins whose structure is deposited in the PDB. It is also used by STING and (Java)Protein Dossier to calculate and present relative entropy as a measure of the degree of conservation for each residue of proteins whose structure has been solved and deposited in the PDB. However, if STING and (Java)Protein Dossier are to provide support for analysis of protein structures modeled in computers or being experimentally solved but not yet deposited in the PDB, then we need a new method for building alignments having a flavor of HSSP alignments (myMSAr). The present study describes a new method and its corresponding databank (SH2QS--database of sequences homologue to the query [structure-having] sequence). Our main interest in making myMSAr was to measure the degree of residue conservation for a given query sequence, regardless of whether it has a corresponding structure deposited in the PDB. In this study, we compare the measurement of residue conservation provided by corresponding alignments produced by HSSP and SH2QS. As a case study, we also present two biologically relevant examples, the first one highlighting the equivalence of analysis of the degree of residue conservation by using HSSP or SH2QS alignments, and the second one presenting the degree of residue conservation for a structure modeled in a computer, which, as a consequence, does not have an alignment reported by HSSP.
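Relative entropy as a per-residue conservation measure can be illustrated with a short calculation over alignment columns. The sketch assumes a uniform background distribution over the 20 standard amino acids and ignores gap characters; the background actually used by STING or HSSP is not stated here, and the function names are hypothetical.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def relative_entropy(column, background=None):
    """Relative entropy (in bits) of one alignment column against a background."""
    residues = [c for c in column if c in AMINO_ACIDS]  # gaps are skipped
    if not residues:
        return 0.0
    if background is None:
        # Assumption: uniform background frequency for every amino acid.
        background = {aa: 1.0 / len(AMINO_ACIDS) for aa in AMINO_ACIDS}
    total = len(residues)
    return sum(
        (n / total) * math.log2((n / total) / background[aa])
        for aa, n in Counter(residues).items()
    )

def conservation_profile(alignment):
    """Relative entropy for every column of an alignment of equal-length rows."""
    return [relative_entropy(column) for column in zip(*alignment)]
```

A fully conserved column scores log2(20), about 4.32 bits, under the uniform background, while a column split evenly between two residues scores log2(10), about 3.32 bits. Higher values indicate stronger conservation.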

17.
18.
A single point mutation (G to T) in the low-density lipoprotein receptor related protein 5 (LRP5) gene results in a glycine to valine amino acid change (G171V) and is responsible for an autosomal dominant high bone mass trait (HBM) in two independent kindreds. LRP5 acts as a co-receptor to Wnts with Frizzled family members and transduces Wnt-canonical signals which can be antagonized by LRP5 ligand, Dickkopf 1 (Dkk1). In the presence of Wnt1, LRP5 or the HBM variant (LRP5-G171V) induces beta-catenin nuclear translocation and activates T cell factor (TCF)-luciferase reporter activity. HBM variant suppresses Dkk1 function and this results in reduced inhibition of TCF activity as compared to that with LRP5. Structural analysis of LRP5 revealed that the HBM mutation lies in the 4th blade of the first beta-propeller domain. To elucidate the functional significance and consequence of the LRP5-G171V mutation in vitro, we took a structure-based approach to design 15 specific LRP5 point mutations. These included (a) substitutions at the G171 in blade 4, (b) mutations in blades 2-6 of beta-propeller 1, and (c) mutations in beta-propellers 2, 3 and 4. Here we show that substitutions of glycine at 171 to K, F, I and Q also resulted in HBM-like activity in the presence of Wnt1 and Dkk1. This indicates the importance of the G171 site rather than the effect of specific amino acid modification to LRP5 receptor function. Interestingly, G171 equivalent residue mutations in other blades of beta-propeller 1 (A65V, S127V, L200V, A214V and M282V) resulted in LRP5-G171V-like block of Dkk1 function. However G171V type mutations in other beta-propellers of LRP5 did not result in resistance to Dkk1 function. These results indicate the importance of LRP5 beta-propeller 1 for Dkk1 function and Wnt signaling. 
These data and additional comparative structural analysis of the LRP5 family member LDLR suggest a potential functional role of the first beta-propeller domain through intramolecular interaction with other domains of LRP5 wherein Dkk1 can bind. Such studies may also lead to a better understanding of the mechanisms underlying the reduced function of Dkk1-like inhibitory ligands of LRP5 with HBM-like mutations and its relationship to increased bone density phenotypes.

19.
The gap between the number of known protein sequences and structures continues to widen, particularly as a result of sequencing projects for entire genomes. Recently there have been many attempts to generate structural assignments to all genes on sets of completed genomes using fold-recognition methods. We developed a method that detects false positives made by these genome-wide structural assignment experiments by identifying isolated occurrences. The method was tested using two sets of assignments, generated by SUPERFAMILY and PSI-BLAST, on 150 completed genomes. A phylogeny of these genomes was built and a parsimony algorithm was used to identify isolated occurrences by detecting occurrences that cause a gain at leaf level. Isolated occurrences tend to have high e-values, and in both sets of assignments, a sudden increase in isolated occurrences is observed for e-values >10^-8 for SUPERFAMILY and >10^-4 for PSI-BLAST. Conditions to predict false positives are based on these results. Independent tests confirm that the predicted false positives are indeed more likely to be incorrectly assigned. Evaluation of the predicted false positives also showed that the accuracy of profile-based fold-recognition methods might depend on secondary structure content and sequence length. We show that false positives generated by fold-recognition methods can be identified by considering structural occurrence patterns on completed genomes; occurrences that are isolated within the phylogeny tend to be less reliable. The method provides a new independent way to examine the quality of fold assignments and may be used to improve the output of any genome-wide fold assignment method.

20.