首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Prediction of natively unfolded regions in protein chains   总被引:1,自引:0,他引:1  
Analysis showed that the globular or natively unfolded state of a protein can be inferred not only from a lower hydrophobicity or a higher charge, but also from the average environment density (average number of close residues located within a certain distance of a given one) of its residues. A database of 6626 protein structures was used to construct a statistical scale of the average number of close residues in globular structures for the 20 amino acids. The portion of false predictions in distinguishing between 80 globular and 90 natively unfolded proteins was 11% with the new scale and 17% with a hydrophobicity scale. The new scale proved suitable for predicting the folded or unfolded state for native proteins or the natively unfolded regions for protein chains. In comparisons with the available algorithms, the new method yielded the highest portion of true predictions (87 and 77% with averaging over residues and over proteins, respectively).  相似文献   

2.
Intrinsically disordered proteins (IDPs) lack a well-defined three-dimensional structure under physiological conditions. Intrinsic disorder is a common phenomenon, particularly in multicellular eukaryotes, and is responsible for important protein functions including regulation and signaling. Many disease-related proteins are likely to be intrinsically disordered or to have disordered regions. In this paper, a new predictor model based on the Bayesian classification methodology is introduced to predict for a given protein or protein region if it is intrinsically disordered or ordered using only its primary sequence. The method allows to incorporate length-dependent amino acid compositional differences of disordered regions by including separate statistical representations for short, middle and long disordered regions. The predictor was trained on the constructed data set of protein regions with known structural properties. In a Jack-knife test, the predictor achieved the sensitivity of 89.2% for disordered and 81.4% for ordered regions. Our method outperformed several reported predictors when evaluated on the previously published data set of Prilusky et al. [2005. FoldIndex: a simple tool to predict whether a given protein sequence is intrinsically unfolded. Bioinformatics 21 (16), 3435-3438]. Further strength of our approach is the ease of implementation.  相似文献   

3.
Prediction of the location of structural domains in globular proteins   总被引:7,自引:0,他引:7  
The location of structural domains in proteins is predicted from the amino acid sequence, based on the analysis of a computed contact map for the protein, the average distance map (ADM). Interactions between residues i and j in a protein are subdivided into several ranges, according to the separation |i-j| in the amino acid sequence. Within each range, average spatial distances between every pair of amino acid residues are computed from a data base of known protein structures. Infrequently occurring pairs are omitted as being statistically insignificant. The average distances are used to construct a predicted ADM. The ADM is analyzed for the occurrence of regions with high densities of contacts (compact regions). Locations of rapid changes of density between various parts of the map are determined by the use of scanning plots of contact densities. These locations serve to pinpoint the distribution of compact regions. This distribution, in turn, is used to predict boundaries of domains in the protein. The technique provides an objective method for the location of domains both on a contact map derived from a known three-dimensional protein structure, the real distance map (RDM), and on an ADM. While most other published methods for the identification of domains locate them in the known three-dimensional structure of a protein, the technique presented here also permits the prediction of domains in proteins of unknown spatial structure, as the construction of the ADM for a given protein requires knowledge of only its amino acid sequence.  相似文献   

4.
The primary amino acid sequence of an abundant methionine-rich seed protein found in Brazil nut (Bertholletia excelsa H.B.K.) has been elucidated by protein sequencing and from the nucleotide sequence of cDNA clones. The 9 kDa subunit of this protein was found to contain 77 amino acids of which 14 were methionine (18%) and 6 were cysteine (8%). Over half of the methionine residues in this subunit are clustered in two regions of the polypeptide where they are interspersed with arginine residues. In one of these regions, methionine residues account for 5 out of 6 amino acids and four of these methionine residues are contiguous. The sequence data verifies that the Brazil nut sulfur-rich protein is synthesized as a precursor polypeptide that is considerably larger than either of the two subunits of the mature protein. Three proteolytic processing steps by which the encoded polypeptide is sequentially trimmed to the 9 kDa and 3 kDa subunit polypeptides have been correlated with the sequence information. In addition, we have found that the sulfur-rich protein from Brazil nut is homologous in its amino acid sequence to small water-soluble proteins found in two other oilseeds, castor bean (Ricinus communis) and rapeseed (Brassica napus). When the amino acid sequences of these three proteins are aligned to maximize homology, the arrangement of cysteine residues is conserved. However, the two subunits of the Brazil nut protein contain over 19% methionine whereas the homologous proteins from castor bean and rapeseed contain only 2.1% and 2.6% methionine, respectively.  相似文献   

5.
The posttranslational incorporation of arginine into proteins catalyzed by arginyl-tRNA protein transferase was determined in vitro in different rat brain regions. The incorporation was found in all the regions studied, although with different specific activities (pmol [14C]arginine incorporated/mg protein). Of the regions studied, hippocampus had the highest specific activity followed by striatum, medulla oblongata, cerebellum, and cerebral cortex. Electrophoretic analysis of the [14C]arginyl proteins from the different regions followed by autoradiography and scanner densitometry showed at least 13 polypeptide bands that were labeled with [14C]arginine. The radioactive bands were qualitatively coincident with protein bands revealed by Coomassie Blue. There were peaks that showed different proportions of labeling in comparison with peaks of similar molecular mass from total brain. Most notable because of their high proportions were those of molecular mass 125 kDa in hippocampus, striatum, and cerebral cortex; 112 and 98 kDa in striatum and cerebellum; and 33 kDa in hippocampus and striatum. In lower proportions than in total brain were the peaks of 33 kDa in medulla oblongata and cerebral cortex and of 125 kDa in medulla oblongata.  相似文献   

6.
Our objective was to identify cell surface proteins in the dinoflagellate Lingulodinium polyedrum. Proteins on the surface of living cells that had regions exposed to the external medium were labeled with Na125I. After partial purification of membrane proteins and analysis by two‐dimensional gel electrophoresis and autoradiography, a protein of roughly 43 kDa was found to have incorporated the radiolabel. This protein was cloned using a combination of protein microsequencing and PCR amplification. The derived protein sequence in the cDNA has a signal peptide at the N‐terminal end of the protein and thus represents the first plasma membrane protein ever reported for a dinoflagellate. The function of the protein is unknown, but its cloning provides a proof of principle for the general use of vectorial labeling in identifying cell surface proteins of marine algae.  相似文献   

7.
Winged bean Kunitz chymotrypsin inhibitor (WCI) accumulates abundantly in seeds and tuberous roots of winged bean plant. In seeds, the WCI mRNA is observed transiently during seed maturation period. The WCI is encoded by a multigene family and the major WCI (WCI-3) is encoded by two nearly identical genes (WCI-3a and WCI-3b genes), in which nucleotide sequences in the 1.1 kb 5 flanking regions are about 99% homologous to each other and the transcribed regions are completely identical. Here we report the detection of two types of nuclear proteins which bind to the multiple sites in the 5 upstream region of the WCI-3a gene. One of the proteins, band 1-forming protein, also bound to cauliflower mosaic virus 35S (CaMV35S) promoter, but another protein, band 3-forming protein, did not. DNaseI footprinting analysis showed that these proteins bound to AT-rich upstream regions in the WCI-3a gene. Addition of poly(dA-dT)-poly(dA-dT) to the binding reaction inhibited the formation of the retarded bands, while poly(dI-dC)-poly(dI-dC) did not. In various organs and throughout seed maturation period, proteins with invariable binding specificities were detected, and these binding proteins met some operational criteria for high-mobility-group (HMG) proteins. These results suggest that leguminous seed AT-binding proteins reported on several seed storage protein genes may be HMG-like proteins which are present ubiquitously in plant organs.deceased on September 15, 1992  相似文献   

8.
9.
A new criterion proposed for classification of the living world is based on the ability of the protein amino acid sequence to form disordered regions, appearing as loops in the 3D structure. The approach used fundamentally differs from the approaches based on comparisons of certain RNA or protein sequences of different organisms. Introduction of any new structural-functional criterion that could resolve the evolutionary relationships between the main groups of origin organisms is of interest in itself, as megasystematics and macrophylogeny lack informative criteria despite the apparent abundance of molecular characteristics. The specialized program FoldUnfold was used to search for disordered regions in the elongation factors EF1A (EFs). The reliability of loop prediction was verified against five EFs with the structures known from X-ray analysis. It was demonstrated with the example of several dozens of typical representatives of the living world that the program predicts extra loops in addition to two linkers between three structural domains in EFs. Besides the effector loop, contained in all EFs, six loops were detected at maximum. Of them, three loops (A, B, and C) are in domain I, one (D) is in domain II, and two (E and F) are in domain III. Moreover, all six loops are never present in the same EF. The EF signatures were determined for each of the superkingdoms of life. Each superkingdom displayed variations in the number of loops and their location within the EF domains. Not only the presence of a particular loop was important in the analysis, but also the specificity of its amino acid sequence. As the total number of predicted loops in EFs increases with the increasing complexity of organisms, the following evolutionary role was postulated for the loops. Following the principle of thrifty inventiveness, nature operates with different universal inserts (loops), adapting their number, location within the EF domains, and amino acid composition so that the protein performs specialized functions—single in protozoa and several in higher organisms.  相似文献   

10.
Flavors of protein disorder   总被引:1,自引:0,他引:1  
Intrinsically disordered proteins are characterized by long regions lacking 3-D structure in their native states, yet they have been so far associated with 28 distinguishable functions. Previous studies showed that protein predictors trained on disorder from one type of protein often achieve poor accuracy on disorder of proteins of a different type, thus indicating significant differences in sequence properties among disordered proteins. Important biological problems are identifying different types, or flavors, of disorder and examining their relationships with protein function. Innovative use of computational methods is needed in addressing these problems due to relative scarcity of experimental data and background knowledge related to protein disorder. We developed an algorithm that partitions protein disorder into flavors based on competition among increasing numbers of predictors, with prediction accuracy determining both the number of distinct predictors and the partitioning of the individual proteins. Using 145 variously characterized proteins with long (>30 amino acids) disordered regions, 3 flavors, called V, C, and S, were identified by this approach, with the V subset containing 52 segments and 7743 residues, C containing 39 segments and 3402 residues, and S containing 54 segments and 5752 residues. The V, C, and S flavors were distinguishable by amino acid compositions, sequence locations, and biological function. For the sequences in SwissProt and 28 genomes, their protein functions exhibit correlations with the commonness and usage of different disorder flavors, suggesting different flavor-function sets across these protein groups. Overall, the results herein support the flavor-function approach as a useful complement to structural genomics as a means for automatically assigning possible functions to sequences.  相似文献   

11.
Intracellularlocation of a viral unspliced mRNA in host cell is a crucial factor for normal life of the virus. Rex is a neucleo-cytoplasmic shuffling protein of Human T-cell Leukemia Virus-1(HTLV-1)which has important role in active transport of cargo-containing RNA from nucleus to cytoplasm. Therefore, it plays a crucial role in the disease development by the virus. In spite of its importance, the 3d-structurephosphorylated and unphosphorylated of this protein has not been determined. In this study, first we predicted whether Rex protein is an ordered or disordered protein. In second step protein 3Dstructure of Rex was obtained. The content of disorder-promoting amino acids, flexibility, hydrophobicity, short linear motifs (SLiMs) and protein binding regions and probability of Rex crystallization were calculated by various In Silico methods. The3D models of Rex protein were obtained by various In Silico methods, such as homology modeling, threading and ab initio, including; I-TASSER, LOMETS, SPARSKS, ROBBETA and QUARK servers. By comparing and analyzing Qmean, z-scores and energy levels of selected models, the best structures with highest favored region in Ramachandran plot (higher than 90%) was refined with MODREFINER software. In silico analysis of Rex physicochemical properties and also predicted SLiMs and binding regions sites confirms that unphosphorylated Rex protein in HTLV-1 as Rev protin in HIV is wholly disordered protein belongs to the class of intrinsically disordered proteins with extended disorder (native coils, native pre-molten globules).  相似文献   

12.
Accurate predictions of the three-dimensional structures of proteins from their amino acid sequences have come of age. AlphaFold, a deep learning-based approach to protein structure prediction, shows remarkable success in independent assessments of prediction accuracy. A significant epoch in structural bioinformatics was the structural annotation of over 98% of protein sequences in the human proteome. Interestingly, many predictions feature regions of very low confidence, and these regions largely overlap with intrinsically disordered regions (IDRs). That over 30% of regions within the proteome are disordered is congruent with estimates that have been made over the past two decades, as intense efforts have been undertaken to generalize the structure–function paradigm to include the importance of conformational heterogeneity and dynamics. With structural annotations from AlphaFold in hand, there is the temptation to draw inferences regarding the “structures” of IDRs and their interactomes. Here, we offer a cautionary note regarding the misinterpretations that might ensue and highlight efforts that provide concrete understanding of sequence-ensemble-function relationships of IDRs. This perspective is intended to emphasize the importance of IDRs in sequence-function relationships (SERs) and to highlight how one might go about extracting quantitative SERs to make sense of how IDRs function.  相似文献   

13.
Two-dimensional agarose gel (pH 8.6)-horizontal poly acrylamide gel (pH 9.0) electrophoresis of plasma proteins in the goose was done. This method resulted in improved separation of proteins in the pre-transferrin and transferrin regions. Preliminary evidence was presented for the existence of two new plasma protein poly morphisrns.  相似文献   

14.
Intrinsically disordered proteins (IDPs) are implicated in a range of human diseases, some of which are associated with the ability to bind to lipids. Although the presence of solvent‐exposed hydrophobic regions in IDPs should favor their interactions with low‐molecular‐weight hydrophobic/amphiphilic compounds, this hypothesis has not been systematically explored as of yet. In this study, the analysis of the DisProt database with regard to the presence of lipid‐binding IDPs (LBIDPs) reveals that they comprise, at least, 15% of DisProt entries. LBIDPs are classified into four groups by ligand type, functional categories, domain structure, and conformational state. 57% of LBIDPs are classified as ordered according to the CH‐CDF analysis, and 70% of LBIDPs possess lengths of disordered regions below 50%. To investigate the lipid‐binding properties of IDPs for which lipid binding is not reported, three proteins from different conformational groups are rationally selected. They all are shown to bind linoleic (LA) and oleic (OA) acids with capacities ranging from 9 to 34 LA/OA molecules per protein molecule. The association with LA/OA causes the formation of high‐molecular‐weight lipid–protein complexes. These findings suggest that lipid binding is common among IDPs, which can favor their involvement in lipid metabolism.  相似文献   

15.
Continuum electrostatic models are used to examine in detail the mechanism of protein stabilization and destabilization due to salt near physiological concentrations. Three wild-type cold shock proteins taken from mesophilic, thermophilic, and hyperthermophilic bacteria are studied using these methods. The model is validated by comparison with experimental data collected for these proteins. In addition, a number of single point mutants and three designed sequences are examined. The results from this study demonstrate that the sensitivity of protein stability toward salt is correlated with thermostability in the cold shock protein family. The calculations indicate that the mesophile is stabilized by the presence of salt while the thermophile and hyperthermophile are destabilized. A decomposition of the salt influence at a residue level permits identification of regions of the protein sequences that contribute toward the observed salt-dependent stability. This model is used to rationalize the effect of various point mutations with regard to sensitivity toward salt. Finally, it is demonstrated that designed cold shock protein variants exhibit electrostatic properties similar to the natural thermophilic and hyperthermophilic proteins.  相似文献   

16.
Summary Proteinase K digestions of bacteriorhodopsin were carried out with the aim of characterizing the membrane-embedded regions of the protein. Products of digestions for two, eight or 24 hours were separated by high-pressure liquid chromotography. A computerized search procedure was used to compare the amino acid analyses of peptide-containing peaks with segments of the bacteriorhodopsin sequence. Molecular weight distributions of the products were determined by sodium dodecylsulfate-urea polyacrylamide gel electrophoresis. The structural integrity of the protein after digestion was monitored through the visible absorption spectrum, by X-ray diffraction of partially dried membranes, and by following release of biosynthetically-incorporated3H leucine from the digested membranes.During mild proteolysis, bacteriorhodopsin was cleaved near the amino and carboxyl termini and at two internal regions previously identified as being accessible to the aqueous medium. Longer digestion resulted in cleavage at new sites. Under conditions where no fragments of bacteriorhodopsin larger than 9000 mol wt were observed, a significant proportion of the digested membranes retained diffraction patterns similar to those of native purple membranes. The harshest digestion conditions led to complete loss of the X-ray diffraction patterns and optical absorption and to release of half the hydrophobic segments of the protein from the membrane in the form of small soluble peptides. Upon cleavage of aqueous loop regions of the protein, isolated transmembrane segments may experience motion in a direction perpendicular to the plane of the membrane, allowing them access to protease.  相似文献   

17.
Abstract: Synaptosomes from five regions of adult rat brain were isolated, analyzed for methyl acceptor proteins, and probed for methyltransferases by photoaffinity labeling. Methylated proteins of 17 and 35 kDa were observed in all regions, but cerebellar synaptosomes were enriched in a 21–26-kDa family of methyl acceptor proteins and contained a unique major methylated protein of 52 kDa and a protein of 50 kDa, which was methylated only in the presence of EGTA. When cerebellar and liver subcellular fractions were compared, the cytosolic fractions of each tissue contained methylated proteins of 17 and 35 kDa; liver membrane fractions contained few methylated proteins, whereas cerebellar microsomes had robust methylation of the 21–26-kDa group. Differential centrifugation of lysed cerebellar synaptosomes localized the 17- and 35-kDa methyl acceptor proteins to the synaptoplasm, the 21–26-kDa family to the synaptic membranes, and the 52-kDa to synaptic vesicles. The 21–26-kDa family was identified as GTP-binding proteins by [α-32P]GTP overlay assay; these proteins contained a putative methylated carboxyl cysteine, based on the presence of volatile methyl esters and the inhibition of methylation by acetylfarnesylcysteine. The 52-kDa methylated protein also contained volatile methyl esters, but did not bind [α-32P]GTP. When synaptosomes were screened for putative methyltransferases by S -adenosyl-L-[ methyl -3H]methionine photoaffinity labeling, a protein of 24 kDa was detected only in cerebellum, and this labeled protein was localized to synaptic membranes.  相似文献   

18.
19.
A three-dimensional model of curculin, a sweet-tasting and taste-modifying protein from the fruits of Curculigo latifolia, was built from the X-ray coordinates of GNA, a mannose-binding lectin from snowdrop (Galanthus nivalis). The three mannose-binding sites present in GNA were found in curculin but are devoid of mannose-binding activity as shown by docking experiments performed with mannose. Some regions well exposed on the surface of the three-dimensional model of curculin could act as epitopes responsible for the sweet-tasting properties of this protein.  相似文献   

20.
Understanding the molecular composition and the formation mechanism of shell matrix framework is of great interest for biomineralization in mollusk shell. The cDNAs encoding a novel matrix protein family (KRMP) were cloned from the mantle of pearl oyster, Pinctada fucata. Analysis of the deduced amino acid sequences revealed that KRMP have a high proportion of lysine, glycine, and tyrosine, and their predict isoelectric points are higher than any other identified shell matrix protein to our knowledge. The deduced amino acid sequences of KRMP can be divided into three regions, including an N-terminal signal peptide, a lysine-rich basic region interacting with acidic proteins or CO(3)(2-), and a Gly/Tyr-rich region involved in the protein cross-link via quinone-tanning process. RT-PCR and in situ hybridization demonstrated that KRMP mRNA was specifically expressed in the mantle edge, involved in the prismatic layer formation. Taken together, it seems that KRMP is a matrix protein family participating in the framework formation of prismatic layer.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号