首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
MOTIVATION: Protein-protein interactions play critical roles in biological processes, and many biologists try to find or to predict crucial information concerning these interactions. Before verifying interactions in biological laboratory work, validating them from previous research is necessary. Although many efforts have been made to create databases that store verified information in a structured form, much interaction information still remains as unstructured text. As the amount of new publications has increased rapidly, a large amount of research has sought to extract interactions from the text automatically. However, there remain various difficulties associated with the process of applying automatically generated results into manually annotated databases. For interactions that are not found in manually stored databases, researchers attempt to search for abstracts or full papers. RESULTS: As a result of a search for two proteins, PubMed frequently returns hundreds of abstracts. In this paper, a method is introduced that validates protein-protein interactions from PubMed abstracts. A query is generated from two given proteins automatically and abstracts are then collected from PubMed. Following this, target proteins and their synonyms are recognized and their interaction information is extracted from the collection. It was found that 67.37% of the interactions from DIP-PPI corpus were found from the PubMed abstracts and 87.37% of interactions were found from the given full texts. AVAILABILITY: Contact authors.  相似文献   

2.
Several zinc finger proteins have been discovered recently that bind specifically to double-stranded RNA. These include the mammalian JAZ and wig proteins, and the seven-zinc finger protein ZFa from Xenopus laevis. We have determined the solution structure of a 127 residue fragment of ZFa, which consists of two zinc finger domains connected by a linker that remains unstructured in the free protein in solution. The first zinc finger consists of a three-stranded beta-sheet and three helices, while the second finger contains only a two-stranded sheet and two helices. The common structures of the core regions of the two fingers are superimposable. Each finger has a highly electropositive surface that maps to a helix-kink-helix motif. There is no evidence for interactions between the two fingers, consistent with the length (24 residues) and unstructured nature of the intervening linker. Comparison with a number of other proteins shows similarities in the topology and arrangement of secondary structure elements with canonical DNA-binding zinc fingers, with protein interaction motifs such as FOG zinc fingers, and with other DNA-binding and RNA-binding proteins that do not contain zinc. However, in none of these cases does the alignment of these structures with the ZFa zinc fingers produce a consistent picture of a plausible RNA-binding interface. We conclude that the ZFa zinc fingers represent a new motif for the binding of double-stranded RNA.  相似文献   

3.
Protein-protein interactions are involved in many metabolic pathways. This review will focus on the role of such associations in CO2 assimilation (Benson-Calvin cycle) and especially on the involvement of a GAPDH/CP12/PRK complex which has been identified in many photosynthetic organisms and may have an important role in the regulation of CO2 assimilation. The emergence of new kinetic and regulatory properties as a consequence of protein-protein interactions will be addressed as well as some of the questions raised by the existence of these supramolecular complexes such as composition, function, and assembly pathways. The presence and role of small intrinsically unstructured proteins like the 8.5 kDa protein CP12, involved in the regulation and/or assembly of these complexes will be discussed.  相似文献   

4.
Knr4, recently characterized as an intrinsically disordered Saccharomyces cerevisiae protein, participates in cell wall formation and cell cycle regulation. It is constituted of a functional central globular core flanked by a poorly structured N‐terminal and large natively unfolded C‐terminal domains. Up to now, about 30 different proteins have been reported to physically interact with Knr4. Here, we used an in vivo two‐hybrid system approach and an in vitro surface plasmon resonance (BIAcore) technique to compare the interaction level of different Knr4 deletion variants with given protein partners. We demonstrate the indispensability of the N‐terminal domain of Knr4 for the interactions. On the other hand, presence of the unstructured C‐terminal domain has a negative effect on the interaction strength. In protein interactions networks, the most highly connected proteins or “hubs” are significantly enriched in unstructured regions, and among them the transient hub proteins contain the largest and most highly flexible regions. The results presented here of our analysis of Knr4 protein suggest that these large disordered regions are not always involved in promoting the protein–protein interactions of hub proteins, but in some cases, might rather inhibit them. We propose that this type of regions could prevent unspecific protein interactions, or ensure the correct timing of occurrence of transient interactions, which may be of crucial importance for different signaling and regulation processes.  相似文献   

5.
6.
CP12 is an 8.5-kDa nuclear-encoded chloroplast protein, isolated from higher plants. It forms part of a core complex of two dimers of phosphoribulokinase (PRK), two tetramers of glyceraldehyde 3-phosphate dehydrogenase (GAPDH), and CP12. The role of CP12 in this complex assembly has not been determined. To address this question, we cloned a cDNA encoding the mature CP12 from the green alga Chlamydomonas reinhardtii and expressed it in Escherichia coli. Sequence alignments show that it is very similar to other CP12s, with four conserved cysteine residues forming two disulfide bridges in the oxidized CP12. On the basis of reconstitution assays and surface plasmon resonance binding studies, we show that oxidized, but not reduced, CP12 acts as a linker in the assembly of the complex, and we propose a model in which CP12 associates with GAPDH, causing its conformation to change. This GAPDH/CP12 complex binds PRK to form a half-complex (one unit). This unit probably dimerizes due partially to interactions between the enzymes of each unit. Reduced CP12 being unable to reconstitute the complex, we studied the structures of oxidized and reduced CP12 by NMR and circular dichroism to determine whether reduction induced structural transitions. Oxidized CP12 is mainly composed of alpha helix and coil segments, and is extremely flexible, while reduced CP12 is mainly unstructured. Remarkably, CP12 has similar physicochemical properties to those of "intrinsically unstructured proteins" that are also involved in regulating macromolecular complexes, or in their assembly. CP12s are thus one of the few protein families of intrinsically unstructured proteins specific to plants.  相似文献   

7.
Tsvetkov P  Asher G  Paz A  Reuven N  Sussman JL  Silman I  Shaul Y 《Proteins》2008,70(4):1357-1366
Intrinsically unstructured proteins (IUPs), also known as natively unfolded proteins, lack well-defined secondary and tertiary structure under physiological conditions. In recent years, growing experimental and theoretical evidence has accumulated, indicating that many entire proteins and protein sequences are unstructured under physiological conditions, and that they play significant roles in diverse cellular processes. Bioinformatic algorithms have been developed to identify such sequences in proteins for which structural data are lacking, but still generate substantial numbers of false positives and negatives. We describe here a simple and reliable in vitro assay for identifying IUP sequences based on their susceptibility to 20S proteasomal degradation. We show that 20S proteasomes digest IUP sequences, under conditions in which native, and even molten globule states, are resistant. Furthermore, we show that protein-protein interactions can protect IUPs against 20S proteasomal action. Taken together, our results thus suggest that the 20S proteasome degradation assay provides a powerful system for operational definition of IUPs.  相似文献   

8.
The structural stability of a protein requires a large number of interresidue interactions. The energetic contribution of these can be approximated by low-resolution force fields extracted from known structures, based on observed amino acid pairing frequencies. The summation of such energies, however, cannot be carried out for proteins whose structure is not known or for intrinsically unstructured proteins. To overcome these limitations, we present a novel method for estimating the total pairwise interaction energy, based on a quadratic form in the amino acid composition of the protein. This approach is validated by the good correlation of the estimated and actual energies of proteins of known structure and by a clear separation of folded and disordered proteins in the energy space it defines. As the novel algorithm has not been trained on unstructured proteins, it substantiates the concept of protein disorder, i.e. that the inability to form a well-defined 3D structure is an intrinsic property of many proteins and protein domains. This property is encoded in their sequence, because their biased amino acid composition does not allow sufficient stabilizing interactions to form. By limiting the calculation to a predefined sequential neighborhood, the algorithm was turned into a position-specific scoring scheme that characterizes the tendency of a given amino acid to fall into an ordered or disordered region. This application we term IUPred and compare its performance with three generally accepted predictors, PONDR VL3H, DISOPRED2 and GlobPlot on a database of disordered proteins.  相似文献   

9.
A method to enrich cell extracts in totally unfolded proteins was investigated. A literature search revealed that 14 of 29 proteins isolated by their failure to precipitate during perchloric acid (PCA) or trichloroacetic acid (TCA) treatment where also shown experimentally to be totally disordered. A near 100 000-fold reduction in yield was observed after 5% or 9% PCA treatment of total soluble E. coli protein. Despite this huge reduction, 158 and 142 spots were observed from the 5% and the 9% treated samples, respectively, on silver-stained 2-D SDS-PAGE gels loaded with 10 microg of protein. Treatment with 1% PCA was less selective with more visible spots and a greater than 3-fold higher yield. A substantial yield of unprecipitated protein was obtained after 3% TCA treatment, suggesting that the common use of TCA precipitation prior to 2-D gel analysis may result in loss of unstructured protein due to their failure to precipitate. Our preliminary analysis suggests that treating total protein extracts with 3-5% PCA and determining the identities of soluble proteins could be the starting point for uncovering unfoldomes (the complement of unstructured proteins in a given proteome). The 100 000-fold reduction in yield and concomitant reduction in number of proteins achieved by 5% PCA treatment produced a fraction suitable for analysis in its entirety using standard proteomic techniques. In this way, large numbers of totally unstructured proteins could be identified with minimal effort.  相似文献   

10.
Nuclear transport receptors (NTRs) mediate nucleocytoplasmic transport via their affinity for unstructured proteins (polymers) in the nuclear pore complex (NPC). Here, we have modeled the effect of NTRs on polymeric structure in the nanopore confinement of the NPC central conduit. The model explicitly takes into account inter- and intramolecular interactions, as well as the finite size of the NTRs (∼20% of the NPC channel diameter). It reproduces various proposed scenarios for the channel structure, ranging from a central polymer condensate (selective phase) to brushlike polymer arrangements localized at the channel wall (virtual gate, reduction of dimensionality), with the transport receptors lining the polymer surface. In addition, it predicts a new structure in which NTRs become an integral part of the transport barrier by forming a cross-linked network with the unstructured proteins stretching across the pore. The model provides specific and distinctive predictions for the equilibrium spatial distributions of NTRs for these different scenarios that can be experimentally verified by, e.g., superresolution fluorescence microscopy. Moreover, it suggests mechanisms by which globular macromolecules (colloidal particles) can cause polymer-coated nanopores to switch between open and closed configurations, a possible explanation of the biological function of the NPC, and suggests potential technological applications for filtration and single-molecule sensing.  相似文献   

11.
Bell L  Chowdhary R  Liu JS  Niu X  Zhang J 《PloS one》2011,6(6):e21474
A significant part of our biological knowledge is centered on relationships between biological entities (bio-entities) such as proteins, genes, small molecules, pathways, gene ontology (GO) terms and diseases. Accumulated at an increasing speed, the information on bio-entity relationships is archived in different forms at scattered places. Most of such information is buried in scientific literature as unstructured text. Organizing heterogeneous information in a structured form not only facilitates study of biological systems using integrative approaches, but also allows discovery of new knowledge in an automatic and systematic way. In this study, we performed a large scale integration of bio-entity relationship information from both databases containing manually annotated, structured information and automatic information extraction of unstructured text in scientific literature. The relationship information we integrated in this study includes protein-protein interactions, protein/gene regulations, protein-small molecule interactions, protein-GO relationships, protein-pathway relationships, and pathway-disease relationships. The relationship information is organized in a graph data structure, named integrated bio-entity network (IBN), where the vertices are the bio-entities and edges represent their relationships. Under this framework, graph theoretic algorithms can be designed to perform various knowledge discovery tasks. We designed breadth-first search with pruning (BFSP) and most probable path (MPP) algorithms to automatically generate hypotheses--the indirect relationships with high probabilities in the network. We show that IBN can be used to generate plausible hypotheses, which not only help to better understand the complex interactions in biological systems, but also provide guidance for experimental designs.  相似文献   

12.
Structural biology has traditionally focused on the structures of proteins, short nucleic acids, small molecules, and their complexes. However, it is now widely recognized that the 3D organization of chromosomes should also be included in this list, despite significant differences in scale and complexity of organization. Here we highlight some notable similarities between the folding processes that shape proteins and chromosomes. Both biomolecules are folded by two types of processes: the affinity-mediated interactions, and by active (ATP-dependent) processes. Both chromosome and proteins in vivo can have partially unstructured and non-equilibrium ensembles with yet to be understood functional roles. By analyzing these biological systems in parallel, we can uncover universal principles of biomolecular organization that transcend specific biopolymers.  相似文献   

13.
Start codon selection is a key step in translation initiation as it sets the reading frame for decoding. Two eukaryotic initiation factors, eIF1 and eIF1A, are key actors in this process. Recent work has elucidated many details of the mechanisms these factors use to control start site selection. eIF1 prevents the irreversible GTP hydrolysis that commits the ribosome to initiation at a particular codon. eIF1A both promotes and inhibits commitment through the competing influences of its two unstructured termini. Both factors perform their tasks through a variety of interactions with other components of the initiation machinery, in many cases mediated by the unstructured regions of the two proteins.  相似文献   

14.
GlobPlot: Exploring protein sequences for globularity and disorder   总被引:2,自引:0,他引:2  
A major challenge in the proteomics and structural genomics era is to predict protein structure and function, including identification of those proteins that are partially or wholly unstructured. Non-globular sequence segments often contain short linear peptide motifs (e.g. SH3-binding sites) which are important for protein function. We present here a new tool for discovery of such unstructured, or disordered regions within proteins. GlobPlot (http://globplot.embl.de) is a web service that allows the user to plot the tendency within the query protein for order/globularity and disorder. We show examples with known proteins where it successfully identifies inter-domain segments containing linear motifs, and also apparently ordered regions that do not contain any recognised domain. GlobPlot may be useful in domain hunting efforts. The plots indicate that instances of known domains may often contain additional N- or C-terminal segments that appear ordered. Thus GlobPlot may be of use in the design of constructs corresponding to globular proteins, as needed for many biochemical studies, particularly structural biology. GlobPlot has a pipeline interface--GlobPipe--for the advanced user to do whole proteome analysis. GlobPlot can also be used as a generic infrastructure package for graphical displaying of any possible propensity.  相似文献   

15.
Identification of potential viral-host protein interactions is a vital and useful approach towards development of new drugs targeting those interactions. In recent days, computational tools are being utilized for predicting viral-host interactions. Recently a database containing records of experimentally validated interactions between a set of HIV-1 proteins and a set of human proteins has been published. The problem of predicting new interactions based on this database is usually posed as a classification problem. However, posing the problem as a classification one suffers from the lack of biologically validated negative interactions. Therefore it will be beneficial to use the existing database for predicting new viral-host interactions without the need of negative samples. Motivated by this, in this article, the HIV-1-human protein interaction database has been analyzed using association rule mining. The main objective is to identify a set of association rules both among the HIV-1 proteins and among the human proteins, and use these rules for predicting new interactions. In this regard, a novel association rule mining technique based on biclustering has been proposed for discovering frequent closed itemsets followed by the association rules from the adjacency matrix of the HIV-1-human interaction network. Novel HIV-1-human interactions have been predicted based on the discovered association rules and tested for biological significance. For validation of the predicted new interactions, gene ontology-based and pathway-based studies have been performed. These studies show that the human proteins which are predicted to interact with a particular viral protein share many common biological activities. Moreover, literature survey has been used for validation purpose to identify some predicted interactions that are already validated experimentally but not present in the database. Comparison with other prediction methods is also discussed.  相似文献   

16.
We have performed a statistical analysis of unstructured amino acid residues in protein structures available in the databank of protein structures. Data on the occurrence of disordered regions at the ends and in the middle part of protein chains have been obtained: in the regions near the ends (at distance less than 30 residues from the N- or C-terminus), there are 66% of unstructured residues (38% are near the N-terminus and 28% are near the C-terminus), although these terminal regions include only 23% of the amino acid residues. The frequencies of occurrence of unstructured residues have been calculated for each of 20 types in different positions in the protein chain. It has been shown that relative frequencies of occurrence of unstructured residues of 20 types at the termini of protein chains differ from the ones in the middle part of the protein chain; amino acid residues of the same type have different probabilities to be unstructured in the terminal regions and in the middle part of the protein chain. The obtained frequencies of occurrence of unstructured residues in the middle part of the protein chain have been used as a scale for predicting disordered regions from amino acid sequence using the method (FoldUnfold) previously developed by us. This scale of frequencies of occurrence of unstructured residues correlates with the contact scale (previously developed by us and used for the same purpose) at a level of 95%. Testing the new scale on a database of 427 unstructured proteins and 559 completely structured proteins has shown that this scale can be successfully used for the prediction of disordered regions in protein chains.  相似文献   

17.
A truly disordered protein lacks a stable fold and its backbone amide protons exchange with solvent at rates predicted from studies of unstructured peptides. We have measured the exchange rates of two model disordered proteins, FlgM and α-synuclein, in buffer and in Escherichia coli using the NMR experiment, SOLEXSY. The rates are similar in buffer and cells and are close to the rates predicted from data on small, unstructured peptides. This result indicates that true disorder can persist inside the crowded cellular interior and that weak interactions between proteins and macromolecules in cells do not necessarily affect intrinsic rates of exchange.  相似文献   

18.
Coupling of folding and binding for unstructured proteins   总被引:1,自引:0,他引:1  
There are now numerous examples of proteins that are unstructured or only partially structured under physiological conditions and yet are nevertheless functional. Such proteins are especially prevalent in eukaryotes. In many cases, intrinsically disordered proteins adopt folded structures upon binding to their biological targets. Many new examples of coupled folding and binding events have been reported recently, providing new insights into mechanisms of molecular recognition.  相似文献   

19.
Neurofilaments and microtubules are important components of the neuronal cytoskeleton. In axons or dendrites, these filaments are aligned in parallel arrays, and separated from one another by nonrandom distances. This distinctive organization has been attributed to cross bridges formed by NF side arms or microtubule-associated proteins. We recently proposed a polymer-brush-based mechanism for regulating interactions between neurofilaments and between microtubules. In this model, the side arms of neurofilaments and the projection domains of microtubule-associated proteins are highly unstructured and exert long-range repulsive forces that are largely entropic in origin; these forces then act to organize the cytoskeleton in axons and dendrites. Here, we review the biochemical, biophysical, genetic and cell biological data for the polymer-brush and cross-bridging models. We explore how the data traditionally used to support cross bridging may be reconciled with a polymer-brush mechanism and compare the implications of recent experimental insights into axonal transport and physiology for each model.  相似文献   

20.
Wen WS  Lai JK  Lin YJ  Lai CM  Huang YC  Wang SS  Jan JS 《Biopolymers》2012,97(2):107-116
The fibrillation of hen egg-white lysozyme (HEWL) in the absence and presence of simple, unstructured D,L-lysine-co-glycine (D,L-Lys-co-gly) and D,L-lysine-co-L-phenylalanine (D,L-Lys-co-Phe) copolypeptides was studied by using a variety of analytical techniques. The attenuating and decelerating effects on fibrillation are significantly dependent on the polypeptide concentration and the composition ratios in the polypeptide chain. Interestingly, D,L-Lys-co-gly and D,L-Lys-co-Phe copolypeptides with the same composition ratio have comparable attenuating effects on fibrillation. The copolypeptide with highest molar fraction of glycine residue exhibits the strongest suppression of HEWL fibrillation. The copolypeptide has the highest hydrophobic interacting capacity due to the more molar ratio of apolar monomer in the polymer backbone. The major driving forces for the association of HEWL and copolypeptides are likely to be hydrogen bonding and hydrophobic interactions, and these interactions reduce the concentration of free protein in solution available to proceed to fibrillation, leading to the increase of lag time and attenuation of fibrillation. The results of this work may contribute to the understanding of the molecular factors affecting amyloid fibrillation and the molecular mechanism(s) of the interactions between the unstructured polypeptides and the amyloid proteins.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号