首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
Most of the hairpin, internal and junction loops that appear single-stranded in standard RNA secondary structures form recurrent 3D motifs, where non-Watson–Crick base pairs play a central role. Non-Watson–Crick base pairs also play crucial roles in tertiary contacts in structured RNA molecules. We previously classified RNA base pairs geometrically so as to group together those base pairs that are structurally similar (isosteric) and therefore able to substitute for each other by mutation without disrupting the 3D structure. Here, we introduce a quantitative measure of base pair isostericity, the IsoDiscrepancy Index (IDI), to more accurately determine which base pair substitutions can potentially occur in conserved motifs. We extract and classify base pairs from a reduced-redundancy set of RNA 3D structures from the Protein Data Bank (PDB) and calculate centroids (exemplars) for each base combination and geometric base pair type (family). We use the exemplars and IDI values to update our online Basepair Catalog and the Isostericity Matrices (IM) for each base pair family. From the database of base pairs observed in 3D structures we derive base pair occurrence frequencies for each of the 12 geometric base pair families. In order to improve the statistics from the 3D structures, we also derive base pair occurrence frequencies from rRNA sequence alignments.  相似文献   

2.
Predicting RNA 3D structure from sequence is a major challenge in biophysics. An important sub-goal is accurately identifying recurrent 3D motifs from RNA internal and hairpin loop sequences extracted from secondary structure (2D) diagrams. We have developed and validated new probabilistic models for 3D motif sequences based on hybrid Stochastic Context-Free Grammars and Markov Random Fields (SCFG/MRF). The SCFG/MRF models are constructed using atomic-resolution RNA 3D structures. To parameterize each model, we use all instances of each motif found in the RNA 3D Motif Atlas and annotations of pairwise nucleotide interactions generated by the FR3D software. Isostericity relations between non-Watson–Crick basepairs are used in scoring sequence variants. SCFG techniques model nested pairs and insertions, while MRF ideas handle crossing interactions and base triples. We use test sets of randomly-generated sequences to set acceptance and rejection thresholds for each motif group and thus control the false positive rate. Validation was carried out by comparing results for four motif groups to RMDetect. The software developed for sequence scoring (JAR3D) is structured to automatically incorporate new motifs as they accumulate in the RNA 3D Motif Atlas when new structures are solved and is available free for download.  相似文献   

3.
Sequence variation in a widespread, recurrent, structured RNA 3D motif, the Sarcin/Ricin (S/R), was studied to address three related questions: First, how do the stabilities of structured RNA 3D motifs, composed of non-Watson–Crick (non-WC) basepairs, compare to WC-paired helices of similar length and sequence? Second, what are the effects on the stabilities of such motifs of isosteric and non-isosteric base substitutions in the non-WC pairs? And third, is there selection for particular base combinations in non-WC basepairs, depending on the temperature regime to which an organism adapts? A survey of large and small subunit rRNAs from organisms adapted to different temperatures revealed the presence of systematic sequence variations at many non-WC paired sites of S/R motifs. UV melting analysis and enzymatic digestion assays of oligonucleotides containing the motif suggest that more stable motifs tend to be more rigid. We further found that the base substitutions at non-Watson–Crick pairing sites can significantly affect the thermodynamic stabilities of S/R motifs and these effects are highly context specific indicating the importance of base-stacking and base-phosphate interactions on motif stability. This study highlights the significance of non-canonical base pairs and their contributions to modulating the stability and flexibility of RNA molecules.  相似文献   

4.
The interaction networks of structured RNAs   总被引:7,自引:6,他引:1  
All pairwise interactions occurring between bases which could be detected in three-dimensional structures of crystallized RNA molecules are annotated on new planar diagrams. The diagrams attempt to map the underlying complex networks of base–base interactions and, especially, they aim at conveying key relationships between helical domains: co-axial stacking, bending and all Watson–Crick as well as non-Watson–Crick base pairs. Although such wiring diagrams cannot replace full stereographic images for correct spatial understanding and representation, they reveal structural similarities as well as the conserved patterns and distances between motifs which are present within the interaction networks of folded RNAs of similar or unrelated functions. Finally, the diagrams could help devising methods for meaningfully transforming RNA structures into graphs amenable to network analysis.  相似文献   

5.
Kink turns (k-turns) are important structural motifs that create a sharp axial bend in RNA. Most conform to a consensus in which a three-nucleotide bulge is followed by consecutive G•A and A•G base pairs, and when these G•A pairs are modified in vitro this generally leads to a failure to adopt the k-turn conformation. Kt-23 in the 30S ribosomal subunit of Thermus thermophilus is a rare exception in which the bulge-distal A•G pair is replaced by a non-Watson–Crick A•U pair. In the context of the ribosome, Kt-23 adopts a completely conventional k-turn geometry. We show here that this sequence is induced to fold into a k-turn structure in an isolated RNA duplex by Mg2+ or Na+ ions. Therefore, the Kt-23 is intrinsically stable despite lacking the key A•G pair; its formation requires neither tertiary interactions nor protein binding. Moreover, the Kt-23 k-turn is stabilized by the same critical hydrogen-bonding interactions within the core of the structure that are found in more conventional sequences such as the near-consensus Kt-7. T. thermophilus Kt-23 has two further non-Watson–Crick base pairs within the non-canonical helix, three and four nucleotides from the bulge, and we find that the nature of these pairs influences the ability of the RNA to adopt k-turn conformation, although the base pair adjacent to the A•U pair is more important than the other.  相似文献   

6.
Metazoan organisms have many tRNA genes responsible for decoding amino acids. The set of all tRNA genes can be grouped in sets of common amino acids and isoacceptor tRNAs that are aminoacylated by corresponding aminoacyl-tRNA synthetases. Analysis of tRNA alignments shows that, despite the high number of tRNA genes, specific tRNA sequence motifs are highly conserved across multicellular eukaryotes. The conservation often extends throughout the isoacceptors and isodecoders with, in some cases, two sets of conserved isodecoders. This study is focused on non-Watson–Crick base pairs in the helical stems, especially GoU pairs. Each of the four helical stems may contain one or more conserved GoU pairs. Some are amino acid specific and could represent identity elements for the cognate aminoacyl tRNA synthetases. Other GoU pairs are found in more than a single amino acid and could be critical for native folding of the tRNAs. Interestingly, some GoU pairs are anticodon-specific, and others are found in phylogenetically-specific clades. Although the distribution of conservation likely reflects a balance between accommodating isotype-specific functions as well as those shared by all tRNAs essential for ribosomal translation, such conservations may indicate the existence of specialized tRNAs for specific translation targets, cellular conditions, or alternative functions.  相似文献   

7.
Stable RNAs are modular and hierarchical 3D architectures taking advantage of recurrent structural motifs to form extensive non-covalent tertiary interactions. Sequence and atomic structure analysis has revealed a novel submotif involving a minimal set of five nucleotides, termed the UA_handle motif (5′XU/ANnX3′). It consists of a U:A Watson–Crick: Hoogsteen trans base pair stacked over a classic Watson–Crick base pair, and a bulge of one or more nucleotides that can act as a handle for making different types of long-range interactions. This motif is one of the most versatile building blocks identified in stable RNAs. It enters into the composition of numerous recurrent motifs of greater structural complexity such as the T-loop, the 11-nt receptor, the UAA/GAN and the G-ribo motifs. Several structural principles pertaining to RNA motifs are derived from our analysis. A limited set of basic submotifs can account for the formation of most structural motifs uncovered in ribosomal and stable RNAs. Structural motifs can act as structural scaffoldings and be functionally and topologically equivalent despite sequence and structural differences. The sequence network resulting from the structural relationships shared by these RNA motifs can be used as a proto-language for assisting prediction and rational design of RNA tertiary structures.  相似文献   

8.
The analysis of atomic-resolution RNA three-dimensional (3D) structures reveals that many internal and hairpin loops are modular, recurrent, and structured by conserved non-Watson–Crick base pairs. Structurally similar loops define RNA 3D motifs that are conserved in homologous RNA molecules, but can also occur at nonhomologous sites in diverse RNAs, and which often vary in sequence. To further our understanding of RNA motif structure and sequence variability and to provide a useful resource for structure modeling and prediction, we present a new method for automated classification of internal and hairpin loop RNA 3D motifs and a new online database called the RNA 3D Motif Atlas. To classify the motif instances, a representative set of internal and hairpin loops is automatically extracted from a nonredundant list of RNA-containing PDB files. Their structures are compared geometrically, all-against-all, using the FR3D program suite. The loops are clustered into motif groups, taking into account geometric similarity and structural annotations and making allowance for a variable number of bulged bases. The automated procedure that we have implemented identifies all hairpin and internal loop motifs previously described in the literature. All motif instances and motif groups are assigned unique and stable identifiers and are made available in the RNA 3D Motif Atlas (http://rna.bgsu.edu/motifs), which is automatically updated every four weeks. The RNA 3D Motif Atlas provides an interactive user interface for exploring motif diversity and tools for programmatic data access.  相似文献   

9.
Protein synthesis must rapidly and repeatedly discriminate between a single correct and many incorrect aminoacyl-tRNAs. We have attempted to measure the frequencies of all possible missense errors by tRNA, tRNA and tRNA. The most frequent errors involve three types of mismatched nucleotide pairs, U•U, U•C, or U•G, all of which can form a noncanonical base pair with geometry similar to that of the canonical U•A or C•G Watson–Crick pairs. Our system is sensitive enough to measure errors at other potential mismatches that occur at frequencies as low as 1 in 500,000 codons. The ribosome appears to discriminate this efficiently against any pair with non-Watson–Crick geometry. This extreme accuracy may be necessary to allow discrimination against the errors involving near Watson–Crick pairing.  相似文献   

10.
The natural bases of nucleic acids form a great variety of base pairs with at least two hydrogen bonds between them. They are classified in twelve main families, with the Watson–Crick family being one of them. In a given family, some of the base pairs are isosteric between them, meaning that the positions and the distances between the C1′ carbon atoms are very similar. The isostericity of Watson–Crick pairs between the complementary bases forms the basis of RNA helices and of the resulting RNA secondary structure. Several defined suites of non-Watson–Crick base pairs assemble into RNA modules that form recurrent, rather regular, building blocks of the tertiary architecture of folded RNAs. RNA modules are intrinsic to RNA architecture are therefore disconnected from a biological function specifically attached to a RNA sequence. RNA modules occur in all kingdoms of life and in structured RNAs with diverse functions. Because of chemical and geometrical constraints, isostericity between non-Watson–Crick pairs is restricted and this leads to higher sequence conservation in RNA modules with, consequently, greater difficulties in extracting 3D information from sequence analysis. Nucleic acid helices have to be recognised in several biological processes like replication or translational decoding. In polymerases and the ribosomal decoding site, the recognition occurs on the minor groove sides of the helical fragments. With the use of alternative conformations, protonated or tautomeric forms of the bases, some base pairs with Watson–Crick-like geometries can form and be stabilized. Several of these pairs with Watson–Crick-like geometries extend the concept of isostericity beyond the number of isosteric pairs formed between complementary bases. These observations set therefore limits and constraints to geometric selection in molecular recognition of complementary Watson–Crick pairs for fidelity in replication and translation processes.  相似文献   

11.
G·U wobble base pairs are the most common and highly conserved non-Watson–Crick base pairs in RNA. Previous surface maps imply uniformly negative electrostatic potential at the major groove of G·U wobble base pairs embedded in RNA helices, suitable for entrapment of cationic ligands. In this work, we have used a Poisson–Boltzmann approach to gain a more detailed and accurate characterization of the electrostatic profile. We found that the major groove edge of an isolated G·U wobble displays distinctly enhanced negativity compared with standard GC or AU base pairs; however, in the context of different helical motifs, the electrostatic pattern varies. G·U wobbles with distinct widening have similar major groove electrostatic potentials to their canonical counterparts, whereas those with minimal widening exhibit significantly enhanced electronegativity, ranging from 0.8 to 2.5kT/e, depending upon structural features. We propose that the negativity at the major groove of G·U wobble base pairs is determined by the combined effect of the base atoms and the sugar-phosphate backbone, which is impacted by stacking pattern and groove width as a result of base sequence. These findings are significant in that they provide predictive power with respect to which G·U sites in RNA are most likely to bind cationic ligands.  相似文献   

12.
We have developed a semi-synthetic approach for preparing long stretches of DNA (>100 bp) containing internal chemical modifications and/or non-Watson–Crick structural motifs which relies on splint-free, cell-free DNA ligations and recycling of side-products by non-PCR thermal cycling. A double-stranded DNA PCR fragment containing a polylinker in its middle is digested with two restriction enzymes and a small insert (~20 bp) containing the modification or non-Watson–Crick motif of interest is introduced into the middle. Incorrect products are recycled to starting materials by digestion with appropriate restriction enzymes, while the correct product is resistant to digestion since it does not contain these restriction sites. This semi-synthetic approach offers several advantages over DNA splint-mediated ligations, including fewer steps, substantially higher yields (~60% overall yield) and ease of use. This method has numerous potential applications, including the introduction of modifications such as fluorophores and cross-linking agents into DNA, controlling the shape of DNA on a large scale and the study of non-sequence-specific nucleic acidprotein interactions.  相似文献   

13.
DNA has proved to be an excellent material for nanoscale construction because complementary DNA duplexes are programmable and structurally predictable. However, in the absence of Watson–Crick pairings, DNA can be structurally more diverse. Here, we describe the crystal structures of d(ACTCGGATGAT) and the brominated derivative, d(ACBrUCGGABrUGAT). These oligonucleotides form parallel-stranded duplexes with a crystallographically equivalent strand, resulting in the first examples of DNA crystal structures that contains four different symmetric homo base pairs. Two of the parallel-stranded duplexes are coaxially stacked in opposite directions and locked together to form a tetraplex through intercalation of the 5′-most A–A base pairs between adjacent G–G pairs in the partner duplex. The intercalation region is a new type of DNA tertiary structural motif with similarities to the i-motif. 1H–1H nuclear magnetic resonance and native gel electrophoresis confirmed the formation of a parallel-stranded duplex in solution. Finally, we modified specific nucleotide positions and added d(GAY) motifs to oligonucleotides and were readily able to obtain similar crystals. This suggests that this parallel-stranded DNA structure may be useful in the rational design of DNA crystals and nanostructures.  相似文献   

14.
The principles of mRNA decoding are conserved among all extant life forms. We present an integrative view of all the interaction networks between mRNA, tRNA and rRNA: the intrinsic stability of codon–anticodon duplex, the conformation of the anticodon hairpin, the presence of modified nucleotides, the occurrence of non-Watson–Crick pairs in the codon–anticodon helix and the interactions with bases of rRNA at the A-site decoding site. We derive a more information-rich, alternative representation of the genetic code, that is circular with an unsymmetrical distribution of codons leading to a clear segregation between GC-rich 4-codon boxes and AU-rich 2:2-codon and 3:1-codon boxes. All tRNA sequence variations can be visualized, within an internal structural and energy framework, for each organism, and each anticodon of the sense codons. The multiplicity and complexity of nucleotide modifications at positions 34 and 37 of the anticodon loop segregate meaningfully, and correlate well with the necessity to stabilize AU-rich codon–anticodon pairs and to avoid miscoding in split codon boxes. The evolution and expansion of the genetic code is viewed as being originally based on GC content with progressive introduction of A/U together with tRNA modifications. The representation we present should help the engineering of the genetic code to include non-natural amino acids.  相似文献   

15.
RNA is now known to possess various structural, regulatory and enzymatic functions for survival of cellular organisms. Functional RNA structures are generally created by three-dimensional organization of small structural motifs, formed by base pairing between self-complementary sequences from different parts of the RNA chain. In addition to the canonical Watson–Crick or wobble base pairs, several non-canonical base pairs are found to be crucial to the structural organization of RNA molecules. They appear within different structural motifs and are found to stabilize the molecule through long-range intra-molecular interactions between basic structural motifs like double helices and loops. These base pairs also impart functional variation to the minor groove of A-form RNA helices, thus forming anchoring site for metabolites and ligands. Non-canonical base pairs are formed by edge-to-edge hydrogen bonding interactions between the bases. A large number of theoretical studies have been done to detect and analyze these non-canonical base pairs within crystal or NMR derived structures of different functional RNA. Theoretical studies of these isolated base pairs using ab initio quantum chemical methods as well as molecular dynamics simulations of larger fragments have also established that many of these non-canonical base pairs are as stable as the canonical Watson–Crick base pairs. This review focuses on the various structural aspects of non-canonical base pairs in the organization of RNA molecules and the possible applications of these base pairs in predicting RNA structures with more accuracy.  相似文献   

16.
An imidazole-containing polyamide trimer, f-ImImIm, where f is a formamido group, was recently found using NMR methods to recognize T·G mismatched base pairs. In order to characterize in detail the T·G recognition affinity and specificity of imidazole-containing polyamides, f-ImIm, f-ImImIm and f-PyImIm were synthesized. The kinetics and thermodynamics for the polyamides binding to Watson–Crick and mismatched (containing one or two T·G, A·G or G·G mismatched base pairs) hairpin oligonucleotides were determined by surface plasmon resonance and circular dichroism (CD) methods. f-ImImIm binds significantly more strongly to the T·G mismatch-containing oligonucleotides than to the sequences with other mismatched or with Watson–Crick base pairs. Compared with the Watson–Crick CCGG sequence, f-ImImIm associates more slowly with DNAs containing T·G mismatches in place of one or two C·G base pairs and, more importantly, the dissociation rate from the T·G oligonucleotides is very slow (small kd). These results clearly demonstrate the binding selectivity and enhanced affinity of side-by-side imidazole/imidazole pairings for T·G mismatches and show that the affinity and specificity increase arise from much lower kd values with the T·G mismatched duplexes. CD titration studies of f-ImImIm complexes with T·G mismatched sequences produce strong induced bands at ~330 nm with clear isodichroic points, in support of a single minor groove complex. CD DNA bands suggest that the complexes remain in the B conformation.  相似文献   

17.
18.

Background

Linear motifs are short modules of protein sequences that play a crucial role in mediating and regulating many protein–protein interactions. The function of linear motifs strongly depends on the context, e.g. functional instances mainly occur inside flexible regions that are accessible for interaction. Sometimes linear motifs appear as isolated islands of conservation in multiple sequence alignments. However, they also occur in larger blocks of sequence conservation, suggesting an active role for the neighbouring amino acids.

Results

The evolution of regions flanking 116 functional linear motif instances was studied. The conservation of the amino acid sequence and order/disorder tendency of those regions was related to presence/absence of the instance. For the majority of the analysed instances, the pairs of sequences conserving the linear motif were also observed to maintain a similar local structural tendency and/or to have higher local sequence conservation when compared to pairs of sequences where one is missing the linear motif. Furthermore, those instances have a higher chance to co–evolve with the neighbouring residues in comparison to the distant ones. Those findings are supported by examples where the regulation of the linear motif–mediated interaction has been shown to depend on the modifications (e.g. phosphorylation) at neighbouring positions or is thought to benefit from the binding versatility of disordered regions.

Conclusion

The results suggest that flanking regions are relevant for linear motif–mediated interactions, both at the structural and sequence level. More interestingly, they indicate that the prediction of linear motif instances can be enriched with contextual information by performing a sequence analysis similar to the one presented here. This can facilitate the understanding of the role of these predicted instances in determining the protein function inside the broader context of the cellular network where they arise.  相似文献   

19.
I-motif or C4 is a four-stranded DNA structure with a protonated cytosine:cytosine base pair (C+:C) found in cytosine-rich sequences. We have found that oligodeoxynucleotides containing adenine and cytosine repeats form a stable secondary structure at a physiological pH with magnesium ion, which is similar to i-motif structure, and have named this structure ‘adenine:cytosine-motif (AC-motif)’. AC-motif contains C+:C base pairs intercalated with putative A+:C base pairs between protonated adenine and cytosine. By investigation of the AC-motif present in the CDKL3 promoter (AC-motifCDKL3), one of AC-motifs found in the genome, we confirmed that AC-motifCDKL3 has a key role in regulating CDKL3 gene expression in response to magnesium. This is further supported by confirming that genome-edited mutant cell lines, lacking the AC-motif formation, lost this regulation effect. Our results verify that adenine-cytosine repeats commonly present in the genome can form a stable non-canonical secondary structure with a non-Watson–Crick base pair and have regulatory roles in cells, which expand non-canonical DNA repertoires.  相似文献   

20.
A bacterial RNA functioning as both tRNA and mRNA, transfer-messenger RNA (tmRNA) rescues stalled ribosomes and clears the cell of incomplete polypeptides. For function, Escherichia coli tmRNA requires an elaborate interplay between a tRNA-like structure and an internal mRNA domain that are connected by a 295 nt long compact secondary structure. The tRNA-like structure is surrounded by 16 unpaired nt, including 10 residues that are >95% conserved among the known 140 tmRNA sequences. All these residues were mutated to define their putative role(s) in trans-translation. Both the extent of aminoacylation and the alanine incorporation into the tag sequence, reflecting the two functions of tmRNA, were measured in vitro for all variants. As anticipated from the low sequence conservation, mutating positions 8–12 and position 15 affects neither aminoacylation nor protein tagging. Mutating a set of two conserved positions 13 and 14 abolishes both functions. Probing the solution conformation indicates that this defective mutant adopts an alternate conformation of its acceptor stem that is no more aminoacylatable, and thus inactive in protein tagging. Selected point mutations at the conserved nucleotide stretches 16–20 and 333–335 seriously impair protein tagging with only minor changes in their solution conformations and aminoacylation. Point mutations at conserved positions 19 and 334 abolish trans-translation and 70S ribosome binding, although retaining nearly normal aminoacylation capacities. Two proteins that are known to interact with tmRNA were purified, and their interactions with the defective RNA variants were examined in vitro. Based on phylogenetic and functional data, an additional structural motif consisting of a quartet composed of non-Watson–Crick base pairs 5′-YGAC-3′:5′-GGAC-3′ involving some of the conserved nucleotides next to the tRNA-like portion is proposed. Overall, the highly conserved nucleotides around the tRNA-like portion are maintained for both structural and functional requirements during evolution.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号