首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The wide structural diversity of RNA results in part from the diversity of non-Watson-Crick interactions between bases. To examine the repertoire of possible hydrogen bond interactions among bases, we computed databases of base-pairs and base-triples by systematically matching all possible hydrogen-bond donors and acceptors between bases and evaluating the geometries of each planar configuration. For base-pairs, we find 53 arrangements having at least two hydrogen bonds, including 23 pairs with protonated bases that have not previously been modeled. A comparison with experimentally observed base-pairs reveals an unexpected G:U pair recently observed in the ribosome. For base-triples, we find 840 arrangements in which the three bases are constrained by a total of at least three hydrogen bonds. Base-triples in particular exhibit a wide range of structural diversity, suggesting how compact or elongated nucleic acid structures may be constructed using different hydrogen-bonding patterns. Base-pair and base-triple conformations were systematically compared to identify structurally isomorphic combinations, and the experimentally observed arrangements within double and triple helices are among the most isomorphic. Unexpectedly, however, other combinations in the database are even more isomorphic, including several in which all-purine arrangements overlap with all-pyrimidine arrangements. These studies highlight some of the combinatoric and geometric versatility of base interactions and help provide a framework for analyzing and modeling isomorphic interactions and potentially for designing novel nucleic acid structures.  相似文献   

2.
Protein-RNA interactions are essential for many biological processes. However, the structural mechanisms underlying these interactions are not fully understood. Here, we analyzed the protein surface shape (dented, intermediate or protruded) and the RNA base pairing properties (paired or unpaired nucleotides) at the interfaces of 91 protein-RNA complexes derived from the Protein Data Bank. Dented protein surfaces prefer unpaired nucleotides to paired ones at the interface, and hydrogen bonds frequently occur between the protein backbone and RNA bases. In contrast, protruded protein surfaces do not show such a preference, rather, electrostatic interactions initiate the formation of hydrogen bonds between positively charged amino acids and RNA phosphate groups. Interestingly, in many protein-RNA complexes that interact via an RNA loop, an aspartic acid is favored at the interface. Moreover, in most of these complexes, nucleotide bases in the RNA loop are flipped out and form hydrogen bonds with the protein, which suggests that aspartic acid is important for RNA loop recognition through a base-flipping process. This study provides fundamental insights into the role of the shape of the protein surface and RNA secondary structures in mediating protein-RNA interactions.  相似文献   

3.
X-ray, phylogenetic and quantum chemical analysis of molecular interactions and conservation patterns of cis Watson-Crick (W.C.) A/G base-pairs in 16S rRNA, 23S rRNA and other molecules was carried out. In these base-pairs, the A and G nucleotides interact with their W.C. edges with glycosidic bonds oriented cis relative to each other. The base-pair is stabilised by two hydrogen bonds, the C1'-C1' distance is enlarged and the G(N2) amino group is left unpaired. Quantum chemical calculations show that, in the absence of other interactions, the unpaired amino group is substantially non-planar due to its partial sp(3) pyramidalization, while the whole base-pair is internally propeller twisted and very flexible. The unique molecular properties of the cis W.C. A/G base-pairs make them distinct from other base-pairs. They occur mostly at the ends of canonical helices, where they serve as interfaces between the helix and other motifs. The cis W.C. A/G base-pairs play crucial roles in natural RNA structures with salient sequence conservation patterns. The key contribution to conservation is provided by the unpaired G(N2) amino group that is involved in a wide range of tertiary and neighbor contacts in the crystal structures. Many of them are oriented out of the plane of the guanine base and utilize the partial sp(3) pyramidalization of the G(N2). There is a lack of A/G to G/A covariation, which, except for the G(N2) position, would be entirely isosteric. On the contrary, there is a rather frequent occurrence of G/A to G/U covariation, as the G/U wobble base-pair has an unpaired amino group in the same position as the cis W.C. G/A base-pair. The cis W.C. A/G base-pairs are not conserved when there is no tertiary or neighbor interaction. Obtaining the proper picture of the interactions and phylogenetic patterns of the cis W.C. A/G base-pairs requires a detailed analysis of the relation between the molecular structures and the energetics of interactions at a level of single H-bonds and contacts.  相似文献   

4.
Ellis JJ  Broom M  Jones S 《Proteins》2007,66(4):903-911
A data set of 89 protein-RNA complexes has been extracted from the Protein Data Bank, and the nucleic acid recognition sites characterized through direct contacts, accessible surface area, and secondary structure motifs. The differences between RNA recognition sites that bind to RNAs in functional classes has also been analyzed. Analysis of the complete data set revealed that van der Waals interactions are more numerous than hydrogen bonds and the contacts made to the nucleic acid backbone occur more frequently than specific contacts to nucleotide bases. Of the base-specific contacts that were observed, contacts to guanine and adenine occurred most frequently. The most favored amino acid-nucleotide pairings observed were lysine-phosphate, tyrosine-uracil, arginine-phosphate, phenylalanine-adenine and tryptophan-guanine. The amino acid propensities showed that positively charged and polar residues were favored as expected, but also so were tryptophan and glycine. The propensities calculated for the functional classes showed trends similar to those observed for the complete data set. However, the analysis of hydrogen bond and van der Waal contacts showed that in general proteins complexed with messenger RNA, transfer RNA and viral RNA have more base specific contacts and less backbone contacts than expected, while proteins complexed with ribosomal RNA have less base-specific contacts than the expected. Hence, whilst the types of amino acids involved in the interfaces are similar, the distribution of specific contacts is dependent upon the functional class of the RNA bound.  相似文献   

5.
N V Kumar  G Govil 《Biopolymers》1984,23(10):1995-2008
With a view to understanding the role of hydrogen bonds in the recognition of nucleic acids by proteins, hydrogen bonding between the bases and base pairs of nucleic acids and the amino acids (Asn, Gln, Asp and Glu, and charged residues Arg+, Glu?, and Asp?) has been studied by a second-order perturbation theory. Binding energies have been calculated for all possible configurations involving a pair of hydrogen bonds between the base (or base pair) and the amino acid residue. Our results show that the hydrogen bonding in these cases has a large contribution from electrostatic interaction. In general, the charged amino acids, compared to the uncharged ones, form more stable complexes with bases or base pairs. The hydrogen-bond energies are an order of magnitude smaller than the Coulombic interaction energies between basic amino acids (Lys+, Arg+, and His+) and the phosphate groups of nucleic acids. The stabilities of the complexes of amino acids Asn, Gln, Asp, and Glu with bases are in the order: G–X > C–X > A–X U–X or T–X, and G · C–X > A · T(U)–X, where X is one of these amino acid residues. It has been shown that Glu? and Asp? can recognize guanine in single-stranded nucleic acids; Arg+ can recognize G · C base pairs from A · T base pairs in double-stranded structures.  相似文献   

6.
We investigate the sequence and structural properties of RNA-protein interaction sites in 211 RNA-protein chain pairs, the largest set of RNA-protein complexes analyzed to date. Statistical analysis confirms and extends earlier analyses made on smaller data sets. There are 24.6% of hydrogen bonds between RNA and protein that are nucleobase specific, indicating the importance of both nucleobase-specific and -nonspecific interactions. While there is no significant difference between RNA base frequencies in protein-binding and non-binding regions, distinct preferences for RNA bases, RNA structural states, protein residues, and protein secondary structure emerge when nucleobase-specific and -nonspecific interactions are considered separately. Guanine nucleobase and unpaired RNA structural states are significantly preferred in nucleobase-specific interactions; however, nonspecific interactions disfavor guanine, while still favoring unpaired RNA structural states. The opposite preferences of nucleobase-specific and -nonspecific interactions for guanine may explain discrepancies between earlier studies with regard to base preferences in RNA-protein interaction regions. Preferences for amino acid residues differ significantly between nucleobase-specific and -nonspecific interactions, with nonspecific interactions showing the expected bias towards positively charged residues. Irregular protein structures are strongly favored in interactions with the protein backbone, whereas there is little preference for specific protein secondary structure in either nucleobase-specific interaction or -nonspecific interaction. Overall, this study shows strong preferences for both RNA bases and RNA structural states in protein-RNA interactions, indicating their mutual importance in protein recognition.  相似文献   

7.
Three highly conserved aromatic residues in RNA recognition motifs (RRM) participate in stacking interactions with RNA bases upon binding RNA. We have investigated the contribution of one of these aromatic residues, Phe56, to the complex formed between the N-terminal RRM of the spliceosomal protein U1A and stem–loop 2 of U1 snRNA. Previous work showed that the aromatic group is important for high affinity binding. Here we probe how mutation of Phe56 affects the kinetics of complex dissociation, the strength of the hydrogen bonds formed between U1A and the base that stacks with Phe56 (A6) and specific target site recognition. Substitution of Phe56 with Trp or Tyr increased the rate of dissociation of the complex, consistent with previously reported results. However, substitution of Phe56 with His decreased the rate of complex association, implying a change in the initial formation of the complex. Simultaneous modification of residue 56 and A6 revealed energetic coupling between the aromatic group and the functional groups of A6 that hydrogen bond to U1A. Finally, mutation of Phe56 to Leu reduced the ability of U1A to recognize stem–loop 2 correctly. Taken together, these experiments suggest that Phe56 contributes to binding affinity by stacking with A6 and participating in networks of energetically coupled interactions that enable this conserved aromatic amino acid to play a complex role in target site recognition.  相似文献   

8.
Phipps KR  Li H 《Proteins》2007,67(1):121-127
The crystal packing surfaces comprising protein-RNA interactions were analyzed for 50 RNA-protein crystal structures in the Protein Data Bank database. Protein-RNA crystal contacts, which represent nonspecific protein-RNA interfaces, were investigated for their amino acid propensities, hydrogen bond patterns, and backbone and side chain interactions. When compared to biologically relevant interactions, the protein-RNA crystal contacts exhibit similarities as well as differences with respect to the principles of protein-RNA interactions. Similar to what was observed at cognate protein-RNA interfaces, positively charged amino acids have high propensities at noncognate protein-RNA interfaces and preferentially form hydrogen bonds with RNA phosphate groups. In contrast, nonpolar residues are less frequently associated with noncognate interactions. These results highlight the important roles of both electrostatic and hydrogen bonding interactions, facilitated by positively charged amino acids, in mediating both specific and nonspecific protein-RNA interactions.  相似文献   

9.
The RNA recognition motif (RRM) binds to single-stranded RNA target sites of diverse sequences and structures. A conserved mode of base recognition by the RRM involves the simultaneous formation of a network of hydrogen bonds with the base functional groups and a stacking interaction between the base and a highly conserved aromatic amino acid. We have investigated the energetic contribution of the functional groups involved in the recognition of an essential adenine, A6, in stem–loop 2 of U1 snRNA by the N-terminal RRM of the U1A protein. Previously, we found that elimination of individual hydrogen bond donors and acceptors on A6 destabilized the complex by 0.8–1.9 kcal/mol, while mutation of the aromatic amino acid (Phe56) that stacks with A6 to Ala destabilized the complex by 5.5 kcal/mol. Here we continue to probe the contribution of A6 to complex stability through mutation of both the RNA and protein. We have removed two hydrogen-bonding functional groups by introducing a U1A mutation, Ser91Ala, and replacing A6 with tubercidin, purine, or 1-deazaadenine. We find that the complex is destabilized an additional 1.2–2.6 kcal/mol by the elimination of the second hydrogen bond donor or acceptor. Surprisingly, deletion of all of the functional groups involved in hydrogen bonds with the U1A protein by substituting adenine with 4-methylindole reduced the binding free energy by only 2.0 kcal/mol. Experiments with U1A proteins containing mutations of Phe56 suggested that improved stacking interactions due to the greater hydrophobicity of 4-methylindole than adenine may be partly responsible for the small destabilization of the complex upon substitution of 4-methylindole for A6. The data imply that hydrophobic interactions can compensate energetically for the disruption of the complex hydrogen-bonding network between nucleotide and protein.  相似文献   

10.
Nucleotide bases are recognized by amino acid residues in a variety of DNA/RNA binding and nucleotide binding proteins. In this study, a total of 446 crystal structures of nucleotide-protein complexes are analyzed manually and pseudo pairs together with single and bifurcated hydrogen bonds observed between bases and amino acids are classified and annotated. Only 5 of the 20 usual amino acid residues, Asn, Gln, Asp, Glu and Arg, are able to orient in a coplanar fashion in order to form pseudo pairs with nucleotide bases through two hydrogen bonds. The peptide backbone can also form pseudo pairs with nucleotide bases and presents a strong bias for binding to the adenine base. The Watson-Crick side of the nucleotide bases is the major interaction edge participating in such pseudo pairs. Pseudo pairs between the Watson-Crick edge of guanine and Asp are frequently observed. The Hoogsteen edge of the purine bases is a good discriminatory element in recognition of nucleotide bases by protein side chains through the pseudo pairing: the Hoogsteen edge of adenine is recognized by various amino acids while the Hoogsteen edge of guanine is only recognized by Arg. The sugar edge is rarely recognized by either the side-chain or peptide backbone of amino acid residues.  相似文献   

11.
In addition to the canonical base-pairs comprising the standard Watson-Crick (C:G and U:A) and wobble U:G conformations, an analysis of the base-pair types and conformations in the rRNAs in the high-resolution crystal structures of the Thermus thermophilus 30S and Haloarcula marismortui 50S ribosomal subunits has identified a wide variety of non-canonical base-pair types and conformations. However, the existing nomenclatures do not describe all of the observed non-canonical conformations or describe them with some ambiguity. Thus, a standardized system is required to classify all of these non-canonical conformations appropriately. Here, we propose a new, simple and systematic nomenclature that unambiguously classifies base-pair conformations occurring in base-pairs, base-triples and base-quadruples that are associated with secondary and tertiary interactions. This system is based on the topological arrangement of the two bases and glycosidic bonds in a given base-pair. Base-pairs in the internal positions of regular secondary structure helices usually form with canonical base-pair groups (C:G, U:A, and U:G) and canonical conformations (C:G WC, U:A WC, and U:G Wb). In contrast, non-helical base-pairs outside of regular structure helices usually have non-canonical base-pair groups and conformations. In addition, many non-helical base-pairs are involved in RNA motifs that form a defined set of non-canonical conformations. Thus, each rare non-canonical conformation may be functionally and structurally important. Finally, the topology-based isostericity of base-pair conformations can rationalize base-pair exchanges in the evolution of RNA molecules.  相似文献   

12.
Free energies for stacking of unpaired nucleotides (dangling ends) at the termini of oligoribonucleotide Watson-Crick helixes (DeltaG(0)37,stack) depend on sequence for 3' ends but are always small for 5' ends. Here, these free energies are correlated with stacking at helix termini in a database of 34 RNA structures determined by X-ray crystallography and NMR spectroscopy. Stacking involving GA pairs is considered separately. A base is categorized as stacked by its distance from (相似文献   

13.
The "ribose zipper", an important element of RNA tertiary structure, is characterized by consecutive hydrogen-bonding interactions between ribose 2'-hydroxyls from different regions of an RNA chain or between RNA chains. These tertiary contacts have previously been observed to also involve base-backbone and base-base interactions (A-minor type). We searched for ribose zipper tertiary interactions in the crystal structures of the large ribosomal subunit RNAs of Haloarcula marismortui and Deinococcus radiodurans, and the small ribosomal subunit RNA of Thermus thermophilus and identified a total of 97 ribose zippers. Of these, 20 were found in T. thermophilus 16 S rRNA, 44 in H. marismortui 23 S rRNA (plus 2 bridging 5 S and 23 S rRNAs) and 30 in D. radiodurans 23 S rRNA (plus 1 bridging 5 S and 23 S rRNAs). These were analyzed in terms of sequence conservation, structural conservation and stability, location in secondary structure, and phylogenetic conservation. Eleven types of ribose zippers were defined based on ribose-base interactions. Of these 11, seven were observed in the ribosomal RNAs. The most common of these is the canonical ribose zipper, originally observed in the P4-P6 group I intron fragment. All ribose zippers were formed by antiparallel chain interactions and only a single example extended beyond two residues, forming an overlapping ribose zipper of three consecutive residues near the small subunit A-site. Almost all ribose zippers link stem (Watson-Crick duplex) or stem-like (base-paired), with loop (external, internal, or junction) chain segments. About two-thirds of the observed ribose zippers interact with ribosomal proteins. Most of these ribosomal proteins bridge the ribose zipper chain segments with basic amino acid residues hydrogen bonding to the RNA backbone. Proteins involved in crucial ribosome function and in early stages of ribosomal assembly also stabilize ribose zipper interactions. All ribose zippers show strong sequence conservation both within these three ribosomal RNA structures and in a large database of aligned prokaryotic sequences. The physical basis of the sequence conservation is stacked base triples formed between consecutive base-pairs on the stem or stem-like segment with bases (often adenines) from the loop-side segment. These triples have previously been characterized as Type I and Type II A-minor motifs and are stabilized by base-base and base-ribose hydrogen bonds. The sequence and structure conservation of ribose zippers can be directly used in tertiary structure prediction and may have applications in molecular modeling and design.  相似文献   

14.
The EcoRV restriction endonuclease recognises palindromic GATATC sequences and cuts between the central T and dA bases in a reaction that has an absolute requirement for a divalent metal ion, physiologically Mg(2+). Use has been made of base analogues, which delete hydrogen bonds between the protein and DNA (or hydrophobic interactions in the case of the 5-CH(3) group of thymine), to evaluate the roles of the outer two base-pairs (GATATC) in DNA recognition. Selectivity arises at both the binding steps leading to the formation of the enzyme-DNA-metal ion ternary complex (assayed by measuring the dissociation constant in the presence of the non-reactive metal Ca(2+)) and the catalytic step (evaluated using single-turnover hydrolysis in the presence of Mg(2+)), with each protein-DNA contact contributing to recognition. With the A:T base-pair, binding was reduced by the amount expected for the simple loss of a single contact; much more severe effects were observed with the G:C base-pair, suggesting additional conformational perturbation. Most of the modified bases lowered the rate of hydrolysis; furthermore, the presence of an analogue in one strand of the duplex diminished cutting at the second, unmodified strand, indicative of communication between DNA binding and the active site. The essential metal ion Mg(2+) plays a key role in mediating interactions between the DNA binding site and active centre and in many instances rescue of hydrolysis was seen with Mn(2+). It is suggested that contacts between the GATATC site are required for tight binding and for the correct assembly of metal ions and bound water at the catalytic site, functions important in providing acid/base catalysis and transition state stabilisation.  相似文献   

15.
Studies have been made of conformational parameters in co-crystal complexes and compounds of nucleic acid bases in which there is the possibility of formation of hetero-base-pairs. Using published data extracted from the Cambridge structural database, a total of 37 base-pairs were found, of which 25 were hetero-pairs and 12 homo-pairs. These base-pairs were subject to analysis to reveal hydrogen bond parameters, propeller twist, buckle and C1'-C1' separation (or a similar parameter if C1' atoms were not present). Hetero-pairs were found to show larger twists than homo-pairs, the magnitude of twist being unrelated to hydrogen bond parameters or buckle value. The propeller twisting is less pronounced in these nucleic acid bases than in nucleosides, but still has a significant magnitude. Propeller twisting in hetero-pairs is found to be larger than in homo-pairs. Hetero-pairs appear to be formed preferentially in competitive situations.  相似文献   

16.
Geometric nomenclature and classification of RNA base pairs   总被引:26,自引:9,他引:17       下载免费PDF全文
  相似文献   

17.
The genome of the human immunodeficiency virus type-1 (HIV-1) contains a stretch of approximately 120 nucleotides known as the psi-site that is essential for RNA packaging during virus assembly. These nucleotides have been proposed to form four stem-loops (SL1-SL4) that have both independent and overlapping functions. Stem-loop SL2 is important for efficient recognition and packaging of the full-length, unspliced viral genome, and also contains the major splice-donor site (SD) for mRNA splicing. We have determined the structure of the 19-residue SL2 oligoribonucleotide by heteronuclear NMR methods. The structure is generally consistent with the most recent of two earlier secondary structure predictions, with residues G1-G2-C3-G4 and C6-U7 forming standard Watson Crick base-pairs with self-complementary residues C16-G17-C18-C19 and A12-G13, respectively. However, residue A15, which is located near the center of the stem, does not form a predicted bulge, and residues A5 and U14 do not form an expected Watson-Crick base-pair. Instead, these residues form a novel A5-U14-A15 base-triple that appears to be stabilized by hydrogen bonds from A15-H61 and -H62 to A5-N1 and U14-O2, respectively; from A5-H61 to U14-O2, and from C16-H42 to U14-O2'. A kink in the backbone allows the aromatic rings of the sequential U14-A15 residues to be approximately co-planar, adopting a stable "platform motif" that is structurally similar to the A-A (adenosine) platforms observed in the P4-P6 ribozyme domain of the Tetrahymena group I intron. Platform motifs generally function in RNA by mediating long-range interactions, and it is therefore possible that the A-U-A base-triple platform mediates long-range interactions that either stabilize the psi-RNA or facilitate splicing and/or packaging. Residue G8 of the G8-G9-U10-G11 tetraloop is stacked above the U7-A12 base-pair, and the remaining tetraloop residues are disordered and available for potential interactions with either other RNA or protein components.  相似文献   

18.
The combined biochemical and structural study of hundreds of protein-DNA complexes has indicated that sequence-specific interactions are mediated by two mechanisms termed direct and indirect readout. Direct readout involves direct interactions between the protein and base-specific atoms exposed in the major and minor grooves of DNA. For indirect readout, the protein recognizes DNA by sensing conformational variations in the structure dependent on nucleotide sequence, typically through interactions with the phosphodiester backbone. Based on our recent structure of Ndt80 bound to DNA in conjunction with a search of the existing PDB database, we propose a new method of sequence-specific recognition that utilizes both direct and indirect readout. In this mode, a single amino acid side-chain recognizes two consecutive base-pairs. The 3'-base is recognized by canonical direct readout, while the 5'-base is recognized through a variation of indirect readout, whereby the conformational flexibility of the particular dinucleotide step, namely a 5'-pyrimidine-purine-3' step, facilitates its recognition by the amino acid via cation-pi interactions. In most cases, this mode of DNA recognition helps explain the sequence specificity of the protein for its target DNA.  相似文献   

19.
20.
A 4 ns molecular dynamics simulation of an RNA duplex (r-GGACUUCGGUCC)(2 )in solution with Na+ and Cl- as counterions was performed. The X-ray structure of this duplex includes two water-mediated uracil-cytosine pairs. In contrast to the other base-pairs in the duplex the water-mediated pairs switch between different conformations. One conformation corresponds to the geometry of the water-mediated UC pairs in the duplex X-ray structure with water acting both as hydrogen-bond donor and acceptor. Another conformation is close to that of a water-mediated UC base-pair found in the X-ray structure of the 23 S rRNA sarcin/ricin domain. In this case the oxygen of the water molecule is linked to two-base donor sites. For a very short time also a direct UC base-pair and a further conformation that is similar to the one found in the RNA duplex structure but exhibits an increased H3(U)...N3(C) distance is observed. Water molecules with unusually long residence times are involved in the water-mediated conformations. These results indicate that the dynamic behaviour of the water-mediated UC base-pairs differs from that of the duplex Watson-Crick and non-canonical guanine-uracil pairs with two or three direct hydrogen bonds. The conformational variability and increased flexibility has to be taken into account when considering these base-pairs as RNA building blocks and as recognition motifs.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号