首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
2.
The trans Watson-Crick/Watson-Crick family of base pairs represent a geometric class that play important structural and possible functional roles in the ribosome, tRNA, and other functional RNA molecules. They nucleate base triplets and quartets, participate as loop closing terminal base pairs in hair pin motifs and are also responsible for several tertiary interactions that enable sequentially distant regions to interact with each other in RNA molecules. Eleven representative examples spanning nine systems belonging to this geometric family of RNA base pairs, having widely different occurrence statistics in the PDB database, were studied at the HF/6-31G (d, p) level using Morokuma decomposition, Atoms in Molecules as well as Natural Bond Orbital methods in the optimized gas phase geometries and in their crystal structure geometries, respectively. The BSSE and deformation energy corrected interaction energy values for the optimized geometries are compared with the corresponding values in the crystal geometries of the base pairs. For non protonated base pairs in their optimized geometry, these values ranged from -8.19 kcal/mol to -21.84 kcal/mol and compared favorably with those of canonical base pairs. The interaction energies of these base pairs, in their respective crystal geometries, were, however, lesser to varying extents and in one case, that of A:A W:W trans, it was actually found to be positive. The variation in RMSD between the two geometries was also large and ranged from 0.32-2.19 A. Our analysis shows that the hydrogen bonding characteristics and interaction energies obtained, correlated with the nature and type of hydrogen bonds between base pairs; but the occurrence frequencies, interaction energies, and geometric variabilities were conspicuous by the absence of any apparent correlation. Instead, the nature of local interaction energy hyperspace of different base pairs as inferred from the degree of their respective geometric variability could be correlated with the identities of free and bound hydrogen bond donor/acceptor groups present in interacting bases in conjunction with their tertiary and neighboring group interaction potentials in the global context. It also suggests that the concept of isostericity alone may not always determine covariation potentials for base pairs, particularly for those which may be important for RNA dynamics. These considerations are more important than the absolute values of the interaction energies in their respective optimized geometries in rationalizing their occurrences in functional RNAs. They highlight the importance of revising some of the existing DNA based structure analysis approaches and may have significant implications for RNA structure and dynamics, especially in the context of structure prediction algorithms.  相似文献   

3.
Abstract

The trans Watson-Crick/Watson-Crick family of base pairs represent a geometric class that play important structural and possible functional roles in the ribosome, tRNA, and other functional RNA molecules. They nucleate base triplets and quartets, participate as loop closing terminal base pairs in hair pin motifs and are also responsible for several tertiary interactions that enable sequentially distant regions to interact with each other in RNA molecules. Eleven representative examples spanning nine systems belonging to this geometric family of RNA base pairs, having widely different occurrence statistics in the PDB database, were studied at the HF/6–31G (d, p) level using Morokuma decomposition, Atoms in Molecules as well as Natural Bond Orbital methods in the optimized gas phase geometries and in their crystal structure geometries, respectively. The BSSE and deformation energy corrected interaction energy values for the optimized geometries are compared with the corresponding values in the crystal geometries of the base pairs. For non protonated base pairs in their optimized geometry, these values ranged from ?8.19 kcal/mol to ?21.84 kcal/mol and compared favorably with those of canonical base pairs. The interaction energies of these base pairs, in their respective crystal geometries, were, however, lesser to varying extents and in one case, that of A:A W:W trans, it was actually found to be positive. The variation in RMSD between the two geometries was also large and ranged from 0.32–2.19 Å. Our analysis shows that the hydrogen bonding characteristics and interaction energies obtained, correlated with the nature and type of hydrogen bonds between base pairs; but the occurrence frequencies, interaction energies, and geometric variabilities were conspicuous by the absence of any apparent correlation. Instead, the nature of local interaction energy hyperspace of different base pairs as inferred from the degree of their respective geometric variability could be correlated with the identities of free and bound hydrogen bond donor/acceptor groups present in interacting bases in conjunction with their tertiary and neighboring group interaction potentials in the global context. It also suggests that the concept of isostericity alone may not always determine covariation potentials for base pairs, particularly for those which may be important for RNA dynamics. These considerations are more important than the absolute values of the interaction energies in their respective optimized geometries in rationalizing their occurrences in functional RNAs. They highlight the importance of revising some of the existing DNA based structure analysis approaches and may have significant implications for RNA structure and dynamics, especially in the context of structure prediction algorithms.  相似文献   

4.
Atomic resolution RNA structures are being published at an increasing rate. It is common to find a modest number of non-canonical base pairs in these structures in addition to the usual Watson-Crick pairs. This database summarizes the occurrence of these rare base pairs in accordance with standard nomenclature. The database, http://prion.bchs.uh.edu/, contains information such as sequence context, sugar pucker conformation, anti / syn base conformations, chemical shift, p K (a)values, melting temperature and free energy. Of the 29 anticipated pairs with two or more hydrogen bonds, 20 have been encountered to date. In addition, four unexpected pairs with two hydrogen bonds have been reported bringing the total to 24. Single hydrogen bond versions of five of the expected geometries have been encountered among the single hydrogen bond interactions. In addition, 18 different types of base triplets have been encountered, each of which involves three to six hydrogen bonds. The vast majority of the rare base pairs are antiparallel with the bases in the anti configuration relative to the ribose. The most common are the GU wobble, the Sheared GA pair, the Reverse Hoogsteen pair and the GA imino pair.  相似文献   

5.
Base triples are recurrent clusters of three RNA nucleobases interacting edge-to-edge by hydrogen bonding. We find that the central base in almost all triples forms base pairs with the other two bases of the triple, providing a natural way to geometrically classify base triples. Given 12 geometric base pair families defined by the Leontis-Westhof nomenclature, combinatoric enumeration predicts 108 potential geometric base triple families. We searched representative atomic-resolution RNA 3D structures and found instances of 68 of the 108 predicted base triple families. Model building suggests that some of the remaining 40 families may be unlikely to form for steric reasons. We developed an on-line resource that provides exemplars of all base triples observed in the structure database and models for unobserved, predicted triples, grouped by triple family, as well as by three-base combination (http://rna.bgsu.edu/Triples). The classification helps to identify recurrent triple motifs that can substitute for each other while conserving RNA 3D structure, with applications in RNA 3D structure prediction and analysis of RNA sequence evolution.  相似文献   

6.
The natural bases of nucleic acids form a great variety of base pairs with at least two hydrogen bonds between them. They are classified in twelve main families, with the Watson–Crick family being one of them. In a given family, some of the base pairs are isosteric between them, meaning that the positions and the distances between the C1′ carbon atoms are very similar. The isostericity of Watson–Crick pairs between the complementary bases forms the basis of RNA helices and of the resulting RNA secondary structure. Several defined suites of non-Watson–Crick base pairs assemble into RNA modules that form recurrent, rather regular, building blocks of the tertiary architecture of folded RNAs. RNA modules are intrinsic to RNA architecture are therefore disconnected from a biological function specifically attached to a RNA sequence. RNA modules occur in all kingdoms of life and in structured RNAs with diverse functions. Because of chemical and geometrical constraints, isostericity between non-Watson–Crick pairs is restricted and this leads to higher sequence conservation in RNA modules with, consequently, greater difficulties in extracting 3D information from sequence analysis. Nucleic acid helices have to be recognised in several biological processes like replication or translational decoding. In polymerases and the ribosomal decoding site, the recognition occurs on the minor groove sides of the helical fragments. With the use of alternative conformations, protonated or tautomeric forms of the bases, some base pairs with Watson–Crick-like geometries can form and be stabilized. Several of these pairs with Watson–Crick-like geometries extend the concept of isostericity beyond the number of isosteric pairs formed between complementary bases. These observations set therefore limits and constraints to geometric selection in molecular recognition of complementary Watson–Crick pairs for fidelity in replication and translation processes.  相似文献   

7.
The wide structural diversity of RNA results in part from the diversity of non-Watson-Crick interactions between bases. To examine the repertoire of possible hydrogen bond interactions among bases, we computed databases of base-pairs and base-triples by systematically matching all possible hydrogen-bond donors and acceptors between bases and evaluating the geometries of each planar configuration. For base-pairs, we find 53 arrangements having at least two hydrogen bonds, including 23 pairs with protonated bases that have not previously been modeled. A comparison with experimentally observed base-pairs reveals an unexpected G:U pair recently observed in the ribosome. For base-triples, we find 840 arrangements in which the three bases are constrained by a total of at least three hydrogen bonds. Base-triples in particular exhibit a wide range of structural diversity, suggesting how compact or elongated nucleic acid structures may be constructed using different hydrogen-bonding patterns. Base-pair and base-triple conformations were systematically compared to identify structurally isomorphic combinations, and the experimentally observed arrangements within double and triple helices are among the most isomorphic. Unexpectedly, however, other combinations in the database are even more isomorphic, including several in which all-purine arrangements overlap with all-pyrimidine arrangements. These studies highlight some of the combinatoric and geometric versatility of base interactions and help provide a framework for analyzing and modeling isomorphic interactions and potentially for designing novel nucleic acid structures.  相似文献   

8.
The traditional way to infer RNA secondary structure involves an iterative process of alignment and evaluation of covariation statistics between all positions possibly involved in basepairing. Watson-Crick basepairs typically show covariations that score well when examples of two or more possible basepairs occur. This is not necessarily the case for non-Watson-Crick basepairing geometries. For example, for sheared (trans Hoogsteen/Sugar edge) pairs, one base is highly conserved (always A or mostly A with some C or U), while the other can vary (G or A and sometimes C and U as well). RNA motifs consist of ordered, stacked arrays of non-Watson-Crick basepairs that in the secondary structure representation form hairpin or internal loops, multi-stem junctions, and even pseudoknots. Although RNA motifs occur recurrently and contribute in a modular fashion to RNA architecture, it is usually not apparent which bases interact and whether it is by edge-to-edge H-bonding or solely by stacking interactions. Using a modular sequence-analysis approach, recurrent motifs related to the sarcin-ricin loop of 23S RNA and to loop E from 5S RNA were predicted in universally conserved regions of the large ribosomal RNAs (16S- and 23S-like) before the publication of high-resolution, atomic-level structures of representative examples of 16S and 23S rRNA molecules in their native contexts. This provides the opportunity to evaluate the predictive power of motif-level sequence analysis, with the goal of automating the process for predicting RNA motifs in genomic sequences. The process of inferring structure from sequence by constructing accurate alignments is a circular one. The crucial link that allows a productive iteration of motif modeling and realignment is the comparison of the sequence variations for each putative pair with the corresponding isostericity matrix to determine which basepairs are consistent both with the sequence and the geometrical data.  相似文献   

9.
Nucleotide bases are recognized by amino acid residues in a variety of DNA/RNA binding and nucleotide binding proteins. In this study, a total of 446 crystal structures of nucleotide-protein complexes are analyzed manually and pseudo pairs together with single and bifurcated hydrogen bonds observed between bases and amino acids are classified and annotated. Only 5 of the 20 usual amino acid residues, Asn, Gln, Asp, Glu and Arg, are able to orient in a coplanar fashion in order to form pseudo pairs with nucleotide bases through two hydrogen bonds. The peptide backbone can also form pseudo pairs with nucleotide bases and presents a strong bias for binding to the adenine base. The Watson-Crick side of the nucleotide bases is the major interaction edge participating in such pseudo pairs. Pseudo pairs between the Watson-Crick edge of guanine and Asp are frequently observed. The Hoogsteen edge of the purine bases is a good discriminatory element in recognition of nucleotide bases by protein side chains through the pseudo pairing: the Hoogsteen edge of adenine is recognized by various amino acids while the Hoogsteen edge of guanine is only recognized by Arg. The sugar edge is rarely recognized by either the side-chain or peptide backbone of amino acid residues.  相似文献   

10.
New methods are described for finding recurrent three-dimensional (3D) motifs in RNA atomic-resolution structures. Recurrent RNA 3D motifs are sets of RNA nucleotides with similar spatial arrangements. They can be local or composite. Local motifs comprise nucleotides that occur in the same hairpin or internal loop. Composite motifs comprise nucleotides belonging to three or more different RNA strand segments or molecules. We use a base-centered approach to construct efficient, yet exhaustive search procedures using geometric, symbolic, or mixed representations of RNA structure that we implement in a suite of MATLAB programs, “Find RNA 3D” (FR3D). The first modules of FR3D preprocess structure files to classify base-pair and -stacking interactions. Each base is represented geometrically by the position of its glycosidic nitrogen in 3D space and by the rotation matrix that describes its orientation with respect to a common frame. Base-pairing and base-stacking interactions are calculated from the base geometries and are represented symbolically according to the Leontis/Westhof basepairing classification, extended to include base-stacking. These data are stored and used to organize motif searches. For geometric searches, the user supplies the 3D structure of a query motif which FR3D uses to find and score geometrically similar candidate motifs, without regard to the sequential position of their nucleotides in the RNA chain or the identity of their bases. To score and rank candidate motifs, FR3D calculates a geometric discrepancy by rigidly rotating candidates to align optimally with the query motif and then comparing the relative orientations of the corresponding bases in the query and candidate motifs. Given the growing size of the RNA structure database, it is impossible to explicitly compute the discrepancy for all conceivable candidate motifs, even for motifs with less than ten nucleotides. The screening algorithm that we describe finds all candidate motifs whose geometric discrepancy with respect to the query motif falls below a user-specified cutoff discrepancy. This technique can be applied to RMSD searches. Candidate motifs identified geometrically may be further screened symbolically to identify those that contain particular basepair types or base-stacking arrangements or that conform to sequence continuity or nucleotide identity constraints. Purely symbolic searches for motifs containing user-defined sequence, continuity and interaction constraints have also been implemented. We demonstrate that FR3D finds all occurrences, both local and composite and with nucleotide substitutions, of sarcin/ricin and kink-turn motifs in the 23S and 5S ribosomal RNA 3D structures of the H. marismortui 50S ribosomal subunit and assigns the lowest discrepancy scores to bona fide examples of these motifs. The search algorithms have been optimized for speed to allow users to search the non-redundant RNA 3D structure database on a personal computer in a matter of minutes.  相似文献   

11.
Sequence-specific protein-nucleic acid recognition is determined, in part, by hydrogen bonding interactions between amino acid side-chains and nucleotide bases. To examine the repertoire of possible interactions, we have calculated geometrically plausible arrangements in which amino acids hydrogen bond to unpaired bases, such as those found in RNA bulges and loops, or to the 53 possible RNA base-pairs. We find 32 possible interactions that involve two or more hydrogen bonds to the six unpaired bases (including protonated A and C), 17 of which have been observed. We find 186 "spanning" interactions to base-pairs in which the amino acid hydrogen bonds to both bases, in principle allowing particular base-pairs to be selectively targeted, and nine of these have been observed. Four calculated interactions span the Watson-Crick pairs and 15 span the G:U wobble pair, including two interesting arrangements with three hydrogen bonds to the Arg guanidinum group that have not yet been observed. The inherent donor-acceptor arrangements of the bases support many possible interactions to Asn (or Gln) and Ser (or Thr or Tyr), few interactions to Asp (or Glu) even though several already have been observed, and interactions to U (or T) only if the base is in an unpaired context, as also observed in several cases. This study highlights how complementary arrangements of donors and acceptors can contribute to base-specific recognition of RNA, predicts interactions not yet observed, and provides tools to analyze proposed contacts or design novel interactions.  相似文献   

12.
MOTIVATION: The recognition of specific RNA sequences and structures by proteins is critical to our understanding of RNA processing, gene expression and viral replication. The diversity of RNA structures suggests that RNA recognition is substantially different than that of DNA. RESULTS: The atomic coordinates of 41 protein-RNA complexes have been used to probe composite nucleoside binding pockets that form the structural and chemical underpinnings of base recognition. Composite nucleoside binding pockets were constructed using three-dimensional superpositions of each RNA nucleoside. Unlike protein-DNA interactions which are dominated by accessibility, RNA recognition frequently occurs in non-canonical and single-strand-like structures that allow interactions to occur from a much wider set of geometries and make fuller use of unique base shapes and hydrogen-bonding ability. By constructing composites that include all van der Waals, hydrogen-bonding, stacking and general non-polar interactions made to a particular nucleoside, the strategies employed are made readily visible. Protein-RNA interactions can result in the formation of a glove-like tight binding pocket around RNA bases, but the size, shape and non-polar binding patterns differ between specific RNA bases. We show that adenine can be distinguished from guanine based on the size and shape of the binding pocket and steric exclusion of the guanine N2 exocyclic amino group. The unique shape and hydrogen-bonding pattern for each RNA base allow proteins to make specific interactions through a very small number of contacts, as few as two in some cases. AVAILABILITY: The program ENTANGLE is available from http://www.bioc.rice.edu/~shamoo  相似文献   

13.
RNA is now known to possess various structural, regulatory and enzymatic functions for survival of cellular organisms. Functional RNA structures are generally created by three-dimensional organization of small structural motifs, formed by base pairing between self-complementary sequences from different parts of the RNA chain. In addition to the canonical Watson–Crick or wobble base pairs, several non-canonical base pairs are found to be crucial to the structural organization of RNA molecules. They appear within different structural motifs and are found to stabilize the molecule through long-range intra-molecular interactions between basic structural motifs like double helices and loops. These base pairs also impart functional variation to the minor groove of A-form RNA helices, thus forming anchoring site for metabolites and ligands. Non-canonical base pairs are formed by edge-to-edge hydrogen bonding interactions between the bases. A large number of theoretical studies have been done to detect and analyze these non-canonical base pairs within crystal or NMR derived structures of different functional RNA. Theoretical studies of these isolated base pairs using ab initio quantum chemical methods as well as molecular dynamics simulations of larger fragments have also established that many of these non-canonical base pairs are as stable as the canonical Watson–Crick base pairs. This review focuses on the various structural aspects of non-canonical base pairs in the organization of RNA molecules and the possible applications of these base pairs in predicting RNA structures with more accuracy.  相似文献   

14.
Protein-RNA interactions are essential for many biological processes. However, the structural mechanisms underlying these interactions are not fully understood. Here, we analyzed the protein surface shape (dented, intermediate or protruded) and the RNA base pairing properties (paired or unpaired nucleotides) at the interfaces of 91 protein-RNA complexes derived from the Protein Data Bank. Dented protein surfaces prefer unpaired nucleotides to paired ones at the interface, and hydrogen bonds frequently occur between the protein backbone and RNA bases. In contrast, protruded protein surfaces do not show such a preference, rather, electrostatic interactions initiate the formation of hydrogen bonds between positively charged amino acids and RNA phosphate groups. Interestingly, in many protein-RNA complexes that interact via an RNA loop, an aspartic acid is favored at the interface. Moreover, in most of these complexes, nucleotide bases in the RNA loop are flipped out and form hydrogen bonds with the protein, which suggests that aspartic acid is important for RNA loop recognition through a base-flipping process. This study provides fundamental insights into the role of the shape of the protein surface and RNA secondary structures in mediating protein-RNA interactions.  相似文献   

15.
Most of the hairpin, internal and junction loops that appear single-stranded in standard RNA secondary structures form recurrent 3D motifs, where non-Watson–Crick base pairs play a central role. Non-Watson–Crick base pairs also play crucial roles in tertiary contacts in structured RNA molecules. We previously classified RNA base pairs geometrically so as to group together those base pairs that are structurally similar (isosteric) and therefore able to substitute for each other by mutation without disrupting the 3D structure. Here, we introduce a quantitative measure of base pair isostericity, the IsoDiscrepancy Index (IDI), to more accurately determine which base pair substitutions can potentially occur in conserved motifs. We extract and classify base pairs from a reduced-redundancy set of RNA 3D structures from the Protein Data Bank (PDB) and calculate centroids (exemplars) for each base combination and geometric base pair type (family). We use the exemplars and IDI values to update our online Basepair Catalog and the Isostericity Matrices (IM) for each base pair family. From the database of base pairs observed in 3D structures we derive base pair occurrence frequencies for each of the 12 geometric base pair families. In order to improve the statistics from the 3D structures, we also derive base pair occurrence frequencies from rRNA sequence alignments.  相似文献   

16.
Analysis of atomic resolution structures of the rRNAs within the context of the 50S and the 30S ribosomal subunits have revealed the presence of nine examples of a recurrent structural motif, first observed in the TpsiC loop of tRNAs. The key component of this T-loop motif is a UA trans Watson-Crick/Hoogsteen base pair stacked on a Watson-Crick pair on one side. This motif is stabilized by several noncanonical hydrogen bonds, facilitating RNA-RNA as well as RNA-protein interactions. In particular, the sugar edge of the purine on the 3' side of the pivotal uridine in the UA pair frequently forms a noncanonical base pair with a distant residue. The bulged-out bases, usually seen as part of the motif, also use their Watson-Crick edges to interact with nearby residues via base-specific hydrogen bonds. In certain occurrences, a backbone reversal is stabilized by specific hydrogen bonds as is observed in the U-turn motifs and the adenosine residue of the key UA pair interacts with a third base via its Watson-Crick edge, essentially generating a base triple.  相似文献   

17.
18.
Noncanonical base pairs in RNA have strong structural and functional implications but are currently not considered for secondary structure predictions. We present results of comparative ab initio studies of stabilities and interaction energies for the three standard and 24 selected unusual RNA base pairs reported in the literature. Hydrogen added models of isolated base pairs, with heavy atoms frozen in their 'away from equilibrium' geometries, built from coordinates extracted from NDB, were geometry optimized using HF/6-31G** basis set, both before and after unfreezing the heavy atoms. Interaction energies, including BSSE and deformation energy corrections, were calculated, compared with respective single point MP2 energies, and correlated with occurrence frequencies and with types and geometries of hydrogen bonding interactions. Systems having two or more N-H...O/N hydrogen bonds had reasonable interaction energies which correlated well with respective occurrence frequencies and highlighted the possibility of some of them playing important roles in improved secondary structure prediction methods. Several of the remaining base pairs with one N-H...O/N and/or one C-H...O/N interactions respectively, had poor interaction energies and negligible occurrences. High geometry variations on optimization of some of these were suggestive of their conformational switch like characteristics.  相似文献   

19.
Noncanonical base pairs in RNA have strong structural and functional implications but are currently not considered for secondary structure predictions. We present results of comparative ab initio studies of stabilities and interaction energies for the three standard and 24 selected unusual RNA base pairs reported in the literature. Hydrogen added models of isolated base pairs, with heavy atoms frozen in their ‘away from equilibrium’ geometries, built from coordinates extracted from NDB, were geometry optimized using HF/6-31G** basis set, both before and after unfreezing the heavy atoms. Interaction energies, including BSSE and deformation energy corrections, were calculated, compared with respective single point MP2 energies, and correlated with occurrence frequencies and with types and geometries of hydrogen bonding interactions. Systems having two or more N-H...O/N hydrogen bonds had reasonable interaction energies which correlated well with respective occurrence frequencies and highlighted the possibility of some of them playing important roles in improved secondary structure prediction methods. Several of the remaining base pairs with one N-H...O/N and/or one C-H...O/N interactions respectively, had poor interaction energies and negligible occurrences. High geometry variations on optimization of some of these were suggestive of their conformational switch like characteristics.  相似文献   

20.
We investigate the sequence and structural properties of RNA-protein interaction sites in 211 RNA-protein chain pairs, the largest set of RNA-protein complexes analyzed to date. Statistical analysis confirms and extends earlier analyses made on smaller data sets. There are 24.6% of hydrogen bonds between RNA and protein that are nucleobase specific, indicating the importance of both nucleobase-specific and -nonspecific interactions. While there is no significant difference between RNA base frequencies in protein-binding and non-binding regions, distinct preferences for RNA bases, RNA structural states, protein residues, and protein secondary structure emerge when nucleobase-specific and -nonspecific interactions are considered separately. Guanine nucleobase and unpaired RNA structural states are significantly preferred in nucleobase-specific interactions; however, nonspecific interactions disfavor guanine, while still favoring unpaired RNA structural states. The opposite preferences of nucleobase-specific and -nonspecific interactions for guanine may explain discrepancies between earlier studies with regard to base preferences in RNA-protein interaction regions. Preferences for amino acid residues differ significantly between nucleobase-specific and -nonspecific interactions, with nonspecific interactions showing the expected bias towards positively charged residues. Irregular protein structures are strongly favored in interactions with the protein backbone, whereas there is little preference for specific protein secondary structure in either nucleobase-specific interaction or -nonspecific interaction. Overall, this study shows strong preferences for both RNA bases and RNA structural states in protein-RNA interactions, indicating their mutual importance in protein recognition.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号