首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Han K  Nepal C 《FEBS letters》2007,581(9):1881-1890
A complete understanding of protein and RNA structures and their interactions is important for determining the binding sites in protein-RNA complexes. Computational approaches exist for identifying secondary structural elements in proteins from atomic coordinates. However, similar methods have not been developed for RNA, due in part to the very limited structural data so far available. We have developed a set of algorithms for extracting and visualizing secondary and tertiary structures of RNA and for analyzing protein-RNA complexes. These algorithms have been implemented in a web-based program called PRI-Modeler (protein-RNA interaction modeler). Given one or more protein data bank files of protein-RNA complexes, PRI-Modeler analyzes the conformation of the RNA, calculates the hydrogen bond (H bond) and van der Waals interactions between amino acids and nucleotides, extracts secondary and tertiary RNA structure elements, and identifies the patterns of interactions between the proteins and RNAs. This paper presents PRI-Modeler and its application to the hydrogen bond and van der Waals interactions in the most representative set of protein-RNA complexes. The analysis reveals several interesting interaction patterns at various levels. The information provided by PRI-Modeler should prove useful for determining the binding sites in protein-RNA complexes. PRI-Modeler is accessible at http://wilab.inha.ac.kr/primodeler/, and supplementary materials are available in the analysis results section at http://wilab.inha.ac.kr/primodeler/.  相似文献   

2.
Protein-RNA interactions are essential for many biological processes. However, the structural mechanisms underlying these interactions are not fully understood. Here, we analyzed the protein surface shape (dented, intermediate or protruded) and the RNA base pairing properties (paired or unpaired nucleotides) at the interfaces of 91 protein-RNA complexes derived from the Protein Data Bank. Dented protein surfaces prefer unpaired nucleotides to paired ones at the interface, and hydrogen bonds frequently occur between the protein backbone and RNA bases. In contrast, protruded protein surfaces do not show such a preference, rather, electrostatic interactions initiate the formation of hydrogen bonds between positively charged amino acids and RNA phosphate groups. Interestingly, in many protein-RNA complexes that interact via an RNA loop, an aspartic acid is favored at the interface. Moreover, in most of these complexes, nucleotide bases in the RNA loop are flipped out and form hydrogen bonds with the protein, which suggests that aspartic acid is important for RNA loop recognition through a base-flipping process. This study provides fundamental insights into the role of the shape of the protein surface and RNA secondary structures in mediating protein-RNA interactions.  相似文献   

3.
Kim H  Jeong E  Lee SW  Han K 《FEBS letters》2003,552(2-3):231-239
Structural analysis of protein-RNA complexes is labor-intensive, yet provides insight into the interaction patterns between a protein and RNA. As the number of protein-RNA complex structures reported has increased substantially in the last few years, a systematic method is required for automatically identifying interaction patterns. This paper presents a computational analysis of the hydrogen bonds in the most representative set of protein-RNA complexes. The analysis revealed several interesting interaction patterns. (1) While residues in the beta-sheets favored unpaired nucleotides, residues in the helices showed no preference and residues in turns favored paired nucleotides. (2) The backbone hydrogen bonds were more dominant than the base hydrogen bonds in the paired nucleotides, but the reverse was observed in the unpaired nucleotides. (3) The protein-RNA complexes contained more paired nucleotides than unpaired nucleotides, but the unpaired nucleotides were observed more frequently interacting with the proteins. And (4) Arg-U, Thr-A, Lys-A, and Asn-U were the most frequently observed pairs. The interaction patterns discovered from the analysis will provide us with useful information in predicting the structure of the RNA binding protein and the structure of the protein binding RNA.  相似文献   

4.
Ellis JJ  Broom M  Jones S 《Proteins》2007,66(4):903-911
A data set of 89 protein-RNA complexes has been extracted from the Protein Data Bank, and the nucleic acid recognition sites characterized through direct contacts, accessible surface area, and secondary structure motifs. The differences between RNA recognition sites that bind to RNAs in functional classes has also been analyzed. Analysis of the complete data set revealed that van der Waals interactions are more numerous than hydrogen bonds and the contacts made to the nucleic acid backbone occur more frequently than specific contacts to nucleotide bases. Of the base-specific contacts that were observed, contacts to guanine and adenine occurred most frequently. The most favored amino acid-nucleotide pairings observed were lysine-phosphate, tyrosine-uracil, arginine-phosphate, phenylalanine-adenine and tryptophan-guanine. The amino acid propensities showed that positively charged and polar residues were favored as expected, but also so were tryptophan and glycine. The propensities calculated for the functional classes showed trends similar to those observed for the complete data set. However, the analysis of hydrogen bond and van der Waal contacts showed that in general proteins complexed with messenger RNA, transfer RNA and viral RNA have more base specific contacts and less backbone contacts than expected, while proteins complexed with ribosomal RNA have less base-specific contacts than the expected. Hence, whilst the types of amino acids involved in the interfaces are similar, the distribution of specific contacts is dependent upon the functional class of the RNA bound.  相似文献   

5.
We investigate the sequence and structural properties of RNA-protein interaction sites in 211 RNA-protein chain pairs, the largest set of RNA-protein complexes analyzed to date. Statistical analysis confirms and extends earlier analyses made on smaller data sets. There are 24.6% of hydrogen bonds between RNA and protein that are nucleobase specific, indicating the importance of both nucleobase-specific and -nonspecific interactions. While there is no significant difference between RNA base frequencies in protein-binding and non-binding regions, distinct preferences for RNA bases, RNA structural states, protein residues, and protein secondary structure emerge when nucleobase-specific and -nonspecific interactions are considered separately. Guanine nucleobase and unpaired RNA structural states are significantly preferred in nucleobase-specific interactions; however, nonspecific interactions disfavor guanine, while still favoring unpaired RNA structural states. The opposite preferences of nucleobase-specific and -nonspecific interactions for guanine may explain discrepancies between earlier studies with regard to base preferences in RNA-protein interaction regions. Preferences for amino acid residues differ significantly between nucleobase-specific and -nonspecific interactions, with nonspecific interactions showing the expected bias towards positively charged residues. Irregular protein structures are strongly favored in interactions with the protein backbone, whereas there is little preference for specific protein secondary structure in either nucleobase-specific interaction or -nonspecific interaction. Overall, this study shows strong preferences for both RNA bases and RNA structural states in protein-RNA interactions, indicating their mutual importance in protein recognition.  相似文献   

6.
A detailed analysis of the DNA-binding sites of 26 proteins is presented using data from the Nucleic Acid Database (NDB) and the Protein Data Bank (PDB). Chemical and physical properties of the protein-DNA interface, such as polarity, size, shape, and packing, were analysed. The DNA-binding sites shared common features, comprising many discontinuous sequence segments forming hydrophilic surfaces capable of direct and water-mediated hydrogen bonds. These interface sites were compared to those of protein-protein binding sites, revealing them to be more polar, with many more intermolecular hydrogen bonds and buried water molecules than the protein-protein interface sites. By looking at the number and positioning of protein residue-DNA base interactions in a series of interaction footprints, three modes of DNA binding were identified (single-headed, double-headed and enveloping). Six of the eight enzymes in the data set bound in the enveloping mode, with the protein presenting a large interface area effectively wrapped around the DNA.A comparison of structural parameters of the DNA revealed that some values for the bound DNA (including twist, slide and roll) were intermediate of those observed for the unbound B-DNA and A-DNA. The distortion of bound DNA was evaluated by calculating a root-mean-square deviation on fitting to a canonical B-DNA structure. Major distortions were commonly caused by specific kinks in the DNA sequence, some resulting in the overall bending of the helix. The helix bending affected the dimensions of the grooves in the DNA, allowing the binding of protein elements that would otherwise be unable to make contact. From this structural analysis a preliminary set of rules that govern the bending of the DNA in protein-DNA complexes, are proposed.  相似文献   

7.
In prokaryotes, the recoding of a UGA stop codon as a selenocysteine codon requires a special elongation factor (EF) SelB and a stem-loop structure within the mRNA called a selenocysteine insertion sequence (SECIS). Here, we used NMR spectroscopy to determine the solution structure of the SECIS mRNA hairpin and characterized its interaction with the mRNA-binding domain of SelB. Our structural and biochemical data identified the conserved structural features important for binding to EF SelB within different SECIS RNA sequences. In the free SECIS mRNA structure, conserved nucleotides are strongly exposed for recognition by SelB. Binding of the C-terminal domain of SelB stabilizes the RNA secondary structure. In the protein-RNA complex, a Watson-Crick loop base-pair leaves a GpU sequence accessible for SelB recognition. This GpU sequence at the tip of the capping tetraloop and a bulge uracil five Watson-Crick base-pairs apart from the GpU are essential for interaction with SelB.  相似文献   

8.
Until recently, drawing general conclusions about RNA recognition by proteins has been hindered by the paucity of high-resolution structures. We have analyzed 45 PDB entries of protein-RNA complexes to explore the underlying chemical principles governing both specific and non-sequence specific binding. To facilitate the analysis, we have constructed a database of interactions using ENTANGLE, a JAVA-based program that uses available structural models in their PDB format and searches for appropriate hydrogen bonding, stacking, electrostatic, hydrophobic and van der Waals interactions. The resulting database of interactions reveals correlations that suggest the basis for the discrimination of RNA from DNA and for base-specific recognition. The data illustrate both major and minor interaction strategies employed by families of proteins such as tRNA synthetases, ribosomal proteins, or RNA recognition motifs with their RNA targets. Perhaps most surprisingly, specific RNA recognition appears to be mediated largely by interactions of amide and carbonyl groups in the protein backbone with the edge of the RNA base. In cases where a base accepts a proton, the dominant amino acid donor is arginine, whereas in cases where the base donates a proton, the predominant acceptor is the backbone carbonyl group, not a side-chain group. This is in marked contrast to DNA-protein interactions, which are governed predominantly by amino acid side-chain interactions with functional groups that are presented in the accessible major groove. RNA recognition often proceeds through loops, bulges, kinks and other irregular structures that permit use of all the RNA functional groups and this is seen throughout the protein-RNA interaction database.  相似文献   

9.
MOTIVATION: The recognition of specific RNA sequences and structures by proteins is critical to our understanding of RNA processing, gene expression and viral replication. The diversity of RNA structures suggests that RNA recognition is substantially different than that of DNA. RESULTS: The atomic coordinates of 41 protein-RNA complexes have been used to probe composite nucleoside binding pockets that form the structural and chemical underpinnings of base recognition. Composite nucleoside binding pockets were constructed using three-dimensional superpositions of each RNA nucleoside. Unlike protein-DNA interactions which are dominated by accessibility, RNA recognition frequently occurs in non-canonical and single-strand-like structures that allow interactions to occur from a much wider set of geometries and make fuller use of unique base shapes and hydrogen-bonding ability. By constructing composites that include all van der Waals, hydrogen-bonding, stacking and general non-polar interactions made to a particular nucleoside, the strategies employed are made readily visible. Protein-RNA interactions can result in the formation of a glove-like tight binding pocket around RNA bases, but the size, shape and non-polar binding patterns differ between specific RNA bases. We show that adenine can be distinguished from guanine based on the size and shape of the binding pocket and steric exclusion of the guanine N2 exocyclic amino group. The unique shape and hydrogen-bonding pattern for each RNA base allow proteins to make specific interactions through a very small number of contacts, as few as two in some cases. AVAILABILITY: The program ENTANGLE is available from http://www.bioc.rice.edu/~shamoo  相似文献   

10.
RNA-protein interactions play essential roles in regulating gene expression. While some RNA-protein interactions are “specific”, that is, the RNA-binding proteins preferentially bind to particular RNA sequence or structural motifs, others are “non-RNA specific.” Deciphering the protein-RNA recognition code is essential for comprehending the functional implications of these interactions and for developing new therapies for many diseases. Because of the high cost of experimental determination of protein-RNA interfaces, there is a need for computational methods to identify RNA-binding residues in proteins. While most of the existing computational methods for predicting RNA-binding residues in RNA-binding proteins are oblivious to the characteristics of the partner RNA, there is growing interest in methods for partner-specific prediction of RNA binding sites in proteins. In this work, we assess the performance of two recently published partner-specific protein-RNA interface prediction tools, PS-PRIP, and PRIdictor, along with our own new tools. Specifically, we introduce a novel metric, RNA-specificity metric (RSM), for quantifying the RNA-specificity of the RNA binding residues predicted by such tools. Our results show that the RNA-binding residues predicted by previously published methods are oblivious to the characteristics of the putative RNA binding partner. Moreover, when evaluated using partner-agnostic metrics, RNA partner-specific methods are outperformed by the state-of-the-art partner-agnostic methods. We conjecture that either (a) the protein-RNA complexes in PDB are not representative of the protein-RNA interactions in nature, or (b) the current methods for partner-specific prediction of RNA-binding residues in proteins fail to account for the differences in RNA partner-specific versus partner-agnostic protein-RNA interactions, or both.  相似文献   

11.
To assess whether there are universal rules that govern amino acid–base recognition, we investigate hydrogen bonds, van der Waals contacts and water-mediated bonds in 129 protein–DNA complex structures. DNA–backbone interactions are the most numerous, providing stability rather than specificity. For base interactions, there are significant base–amino acid type correlations, which can be rationalised by considering the stereochemistry of protein side chains and the base edges exposed in the DNA structure. Nearly two-thirds of the direct read-out of DNA sequences involves complex networks of hydrogen bonds, which enhance specificity. Two-thirds of all protein–DNA interactions comprise van der Waals contacts, compared to about one-sixth each of hydrogen and water-mediated bonds. This highlights the central importance of these contacts for complex formation, which have previously been relegated to a secondary role. Although common, water-mediated bonds are usually non-specific, acting as space-fillers at the protein–DNA interface. In conclusion, the majority of amino acid–base interactions observed follow general principles that apply across all protein–DNA complexes, although there are individual exceptions. Therefore, we distinguish between interactions whose specificities are ‘universal’ and ‘context-dependent’. An interactive Web-based atlas of side chain–base contacts provides access to the collected data, including analyses and visualisation of the three-dimensional geometry of the interactions.  相似文献   

12.
We extend our previous analysis of binding specificity of DNA-protein complexes to complexes containing water-mediated bridges. Inclusion of water bridges between phosphate and base, phosphate and sugar, as well as proteins and DNA, improves the prediction of specificity; six data sets studied in this paper yield correct predictions for all base pairs that have two or more hydrogen-bonds. Beside massive computation, our approach relies highly on experimental data. After deriving protein structures from DNA-protein complexes in which coordinates were established by X-ray diffraction techniques, we analysed all possible DNA sequences to which these proteins might bind, ranking them in terms of Lennard-Jones potential for the optimal docking configuration. Our prediction algorithm rests on the following assumptions: (1) specificity comes mainly from direct hydrogen bonding; (2) electrostatic forces stabilise DNA-protein complexes and contribute only weakly to specificity since they occur at the charged phosphate groups; (3) Van der Waals forces and electrostatic interactions between positively charged groups on the protein and phosphates on DNA can be neglected as they contribute primarily to the free energy of stabilisation as opposed to specificity.  相似文献   

13.
We report an optimized synthesis of all canonical 2'-O-TOM protected ribonucleoside phosphoramidites and solid supports containing [13C5]-labeled ribose moieties, their sequence-specific introduction into very short RNA sequences and their use for the structure determination of two protein-RNA complexes. These specifically labeled sequences facilitate RNA resonance assignments and are essential to assign a high number of sugar-sugar and intermolecular NOEs, which ultimately improve the precision and accuracy of the resulting structures. This labeling strategy is particularly useful for the study of protein-RNA complexes with single-stranded RNA in solution, which is rapidly an increasingly relevant research area in biology.  相似文献   

14.
We have determined and refined a crystal structure of the initial assembly complex of archaeal box C/D sRNPs comprising the Archaeoglobus fulgidus (AF) L7Ae protein and a box C/D RNA. The box C/D RNA forms a classical kink-turn (K-turn) structure and the resulting protein-RNA complex serves as a distinct platform for recruitment of the fibrillarin-Nop5p complex. The cocrystal structure confirms previously proposed secondary structure of the box C/D RNA that includes a protruded U, a UU mismatch, and two sheared tandem GA base pairs. Detailed structural comparisons of the AF L7Ae-box C/D RNA complex with previously determined crystal structures of L7Ae homologs in complex with functionally distinct K-turn RNAs revealed a set of remarkably conserved principles in protein-RNA interactions. These analyses provide a structural basis for interpreting the functional roles of the box C/D sequences in directing specific assembly of box C/D sRNPs.  相似文献   

15.
Barik A  C N  P M  Bahadur RP 《Proteins》2012,80(7):1866-1871
We have developed a nonredundant protein-RNA docking benchmark dataset, which is derived from the available bound and unbound structures in the Protein Data Bank involving polypeptide and nucleic acid chains. It consists of nine unbound-unbound cases where both the protein and the RNA are available in the free form. The other 36 cases are of unbound-bound type where only the protein is available in the free form. The conformational change upon complex formation is calculated by a distance matrix alignment method, and based on that, complexes are classified into rigid, semi-flexible, and full flexible. Although in the rigid body category, no significant conformational change accompanies complex formation, the fully flexible test cases show large domain movements, RNA base flips, etc. The benchmark covers four major groups of RNA, namely, t-RNA, ribosomal RNA, duplex RNA, and single-stranded RNA. We find that RNA is generally more flexible than the protein in the complexes, and the interface region is as flexible as the molecule as a whole. The structural diversity of the complexes in the benchmark set should provide a common ground for the development and comparison of the protein-RNA docking methods. The benchmark can be freely downloaded from the internet.  相似文献   

16.
17.
Phipps KR  Li H 《Proteins》2007,67(1):121-127
The crystal packing surfaces comprising protein-RNA interactions were analyzed for 50 RNA-protein crystal structures in the Protein Data Bank database. Protein-RNA crystal contacts, which represent nonspecific protein-RNA interfaces, were investigated for their amino acid propensities, hydrogen bond patterns, and backbone and side chain interactions. When compared to biologically relevant interactions, the protein-RNA crystal contacts exhibit similarities as well as differences with respect to the principles of protein-RNA interactions. Similar to what was observed at cognate protein-RNA interfaces, positively charged amino acids have high propensities at noncognate protein-RNA interfaces and preferentially form hydrogen bonds with RNA phosphate groups. In contrast, nonpolar residues are less frequently associated with noncognate interactions. These results highlight the important roles of both electrostatic and hydrogen bonding interactions, facilitated by positively charged amino acids, in mediating both specific and nonspecific protein-RNA interactions.  相似文献   

18.
To derive structural information about the vesicular stomatitis virus (VSV) nucleocapsid (N) protein, the N protein and the VSV phosphoprotein (P protein) were expressed together in Escherichia coli. The N and P proteins formed soluble protein complexes of various molar ratios when coexpressed. The major N/P protein complex was composed of 10 molecules of the N protein, 5 molecules of the P protein, and an RNA. A soluble N protein-RNA oligomer free of the P protein was isolated from the N/P protein-RNA complex using conditions of lowered pH. The molecular weight of the N protein-RNA oligomer, 513,879, as determined by analytical ultracentrifugation, showed that it was composed of 10 molecules of the N protein and an RNA of approximately 90 nucleotides. The N protein-RNA oligomer had the appearance of a disk with outer diameter, inner diameter, and thickness of 148 +/- 10 A, 78 +/- 9 A, and 83 +/- 8 A, respectively, as determined by electron microscopy. RNA in the complexes was protected from RNase digestion and was stable at pH 11. This verified that N/P protein complexes expressed in E. coli were competent for encapsidation. In addition to coexpression with the full-length P protein, the N protein was expressed with the C-terminal 72 amino acids of the P protein. This portion of the P protein was sufficient for binding to the N protein, maintaining it in a soluble state, and for assembly of N protein-RNA oligomers. With the results provided in this report, we propose a model for the assembly of an N/P protein-RNA oligomer.  相似文献   

19.
Protein-RNA complexes play many important roles in diverse cellular functions. They are involved in a wide variety of different processes in growth and differentiation at the various stages of the cell cycle. As their function and catalytic activity are directly coupled to the structural arrangement of their components--proteins and ribonucleic acids--the investigation of protein-RNA interactions is of great functional and structural importance. Here we discuss the most prominent examples of protein-RNA complexes and describe some frequently used purification strategies. We present various techniques and applications of mass spectrometry to study protein-RNA complexes. We discuss the analysis of intact complexes as well as proteomics-based and crosslinking-based approaches in which proteins are cleaved into smaller peptides. This article is part of a Special Section entitled: Understanding genome regulation and genetic diversity by mass spectrometry.  相似文献   

20.
Ellis JJ  Jones S 《Proteins》2008,70(4):1518-1526
Many protein-RNA recognition events are known to exhibit conformational changes from qualitative observations of individual complexes. However, a quantitative estimation of conformational changes is required if protein-RNA docking and template-based methods for RNA binding site prediction are to be developed. This study presents the first quantitative evaluation of conformational changes that occur when proteins bind RNA. The analysis of twelve RNA-binding proteins in the bound and unbound states using error-scaled difference distance matrices is presented. The binding site residues are mapped to each structure, and the conformational changes that affect these residues are evaluated. Of the twelve proteins four exhibit greater movements in nonbinding site residues, and a further four show the greatest movements in binding site residues. The remaining four proteins display no significant conformational change. When interface residues are found to be in conformationally variable regions of the protein they are typically seen to move less than 2 A between the bound and unbound conformations. The current data indicate that conformational changes in the binding site residues of RNA binding proteins may not be as significant as previously suggested, but a larger data set is required before wider conclusions may be drawn. The implications of the observed conformational changes for protein function prediction are discussed.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号