首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The secondary structure of encapsidated MS2 genomic RNA poses an interesting RNA folding challenge. Cryoelectron microscopy has demonstrated that encapsidated MS2 RNA is well-ordered. Models of MS2 assembly suggest that the RNA hairpin-protein interactions and the appropriate placement of hairpins in the MS2 RNA secondary structure can guide the formation of the correct icosahedral particle. The RNA hairpin motif that is recognized by the MS2 capsid protein dimers, however, is energetically unfavorable, and thus free energy predictions are biased against this motif. Computer programs called Crumple, Sliding Windows, and Assembly provide useful tools for prediction of viral RNA secondary structures when the traditional assumptions of RNA structure prediction by free energy minimization may not apply. These methods allow incorporation of global features of the RNA fold and motifs that are difficult to include directly in minimum free energy predictions. For example, with MS2 RNA the experimental data from SELEX experiments, crystallography, and theoretical calculations of the path for the series of hairpins can be incorporated in the RNA structure prediction, and thus the influence of free energy considerations can be modulated. This approach thoroughly explores conformational space and generates an ensemble of secondary structures. The predictions from this new approach can test hypotheses and models of viral assembly and guide construction of complete three-dimensional models of virus particles.  相似文献   

2.
Methods for efficient and accurate prediction of RNA structure are increasingly valuable, given the current rapid advances in understanding the diverse functions of RNA molecules in the cell. To enhance the accuracy of secondary structure predictions, we developed and refined optimization techniques for the estimation of energy parameters. We build on two previous approaches to RNA free-energy parameter estimation: (1) the Constraint Generation (CG) method, which iteratively generates constraints that enforce known structures to have energies lower than other structures for the same molecule; and (2) the Boltzmann Likelihood (BL) method, which infers a set of RNA free-energy parameters that maximize the conditional likelihood of a set of reference RNA structures. Here, we extend these approaches in two main ways: We propose (1) a max-margin extension of CG, and (2) a novel linear Gaussian Bayesian network that models feature relationships, which effectively makes use of sparse data by sharing statistical strength between parameters. We obtain significant improvements in the accuracy of RNA minimum free-energy pseudoknot-free secondary structure prediction when measured on a comprehensive set of 2518 RNA molecules with reference structures. Our parameters can be used in conjunction with software that predicts RNA secondary structures, RNA hybridization, or ensembles of structures. Our data, software, results, and parameter sets in various formats are freely available at http://www.cs.ubc.ca/labs/beta/Projects/RNA-Params.  相似文献   

3.
It seems well established that translocation of at least some mRNAs through the nuclear pore is (1) an energy-dependent process, and (2) dependent on the presence of the poly(A) segment attached to most mRNA species. We describe that RNA helicase (RNA duplex unwindase) activity is present in a nuclear envelope (NE) preparation, which also appears to be involved in nucleocytoplasmic RNA transport. This activity unwinds RNA: RNA hybrids. The helicase has a pH optimum of 7.5 and a temperature optimum of 30 degrees C. Applying the sealed NE vesicle system, it was shown that duplex RNA species are readily released from the vesicles in an unidirectional manner, in contrast to single-stranded RNA, which is much slower transported into the extravesicular space. Attachment of a poly(A) segment to the RNA duplex additionally increases the efflux rate of this RNA. Efflux of duplex RNA but not efflux of single-stranded RNA was strongly inhibited by formycin B 5'-triphosphate. Our results suggest that, besides poly(A), duplex structures, if present in a given RNA, modulate and control the export of RNA.  相似文献   

4.
Fulle S  Gohlke H 《Biophysical journal》2008,94(11):4202-4219
RNA requires conformational dynamics to undergo its diverse functional roles. Here, a new topological network representation of RNA structures is presented that allows analyzing RNA flexibility/rigidity based on constraint counting. The method extends the FIRST approach, which identifies flexible and rigid regions in atomic detail in a single, static, three-dimensional molecular framework. Initially, the network rigidity of a canonical A-form RNA is analyzed by counting on constraints of network elements of increasing size. These considerations demonstrate that it is the inclusion of hydrophobic contacts into the RNA topological network that is crucial for an accurate flexibility prediction. The counting also explains why a protein-based parameterization results in overly rigid RNA structures. The new network representation is then validated on a tRNAASP structure and all NMR-derived ensembles of RNA structures currently available in the Protein Data Bank (with chain length ≥40). The flexibility predictions demonstrate good agreement with experimental mobility data, and the results are superior compared to predictions based on two previously used network representations. Encouragingly, this holds for flexibility predictions as well as mobility predictions obtained by constrained geometric simulations on these networks. Potential applications of the approach to analyzing the flexibility of DNA and RNA/protein complexes are discussed.  相似文献   

5.
《Biophysical journal》2022,121(18):3381-3392
Knowledge of RNA three-dimensional (3D) structures is critical to understanding the important biological functions of RNAs. Although various structure prediction models have been developed, the high-accuracy predictions of RNA 3D structures are still limited to the RNAs with short lengths or with simple topology. In this work, we proposed a new model, namely FebRNA, for building RNA 3D structures through fragment assembly based on coarse-grained (CG) fragment ensembles. Specifically, FebRNA is composed of four processes: establishing the library of different types of non-redundant CG fragment ensembles regardless of the sequences, building CG 3D structure ensemble through fragment assembly, identifying top-scored CG structures through a specific CG scoring function, and rebuilding the all-atom structures from the top-scored CG ones. Extensive examination against different types of RNA structures indicates that FebRNA consistently gives the reliable predictions on RNA 3D structures, including pseudoknots, three-way junctions, four-way and five-way junctions, and RNAs in the RNA-Puzzles. FebRNA is available on the Web site: https://github.com/Tan-group/FebRNA.  相似文献   

6.
Fang X  Luo Z  Yuan B  Wang J 《Bioinformation》2007,2(5):222-229
The prediction of RNA secondary structure can be facilitated by incorporating with comparative analysis of homologous sequences. However, most of existing comparative methods are vulnerable to alignment errors and thus are of low accuracy in practical application. Here we improve the prediction of RNA secondary structure by detecting and assessing conserved stems shared by all sequences in the alignment. Our method can be summarized by: 1) we detect possible stems in single RNA sequence using the so-called position matrix with which some possibly paired positions can be uncovered; 2) we detect conserved stems across multiple RNA sequences by multiplying the position matrices; 3) we assess the conserved stems using the Signal-to-Noise; 4) we compute the optimized secondary structure by incorporating the so-called reliable conserved stems with predictions by RNAalifold program. We tested our method on data sets of RNA alignments with known secondary structures. The accuracy, measured as sensitivity and specificity, of our method is greater than predictions by RNAalifold.  相似文献   

7.
Recent experimental and computational progress has revealed a large potential for RNA structure in the genome. This has been driven by computational strategies that exploit multiple genomes of related organisms to identify common sequences and secondary structures. However, these computational approaches have two main challenges: they are computationally expensive and they have a relatively high false discovery rate (FDR). Simultaneously, RNA 3D structure analysis has revealed modules composed of non-canonical base pairs which occur in non-homologous positions, apparently by independent evolution. These modules can, for example, occur inside structural elements which in RNA 2D predictions appear as internal loops. Hence one question is if the use of such RNA 3D information can improve the prediction accuracy of RNA secondary structure at a genome-wide level. Here, we use RNAz in combination with 3D module prediction tools and apply them on a 13-way vertebrate sequence-based alignment. We find that RNA 3D modules predicted by metaRNAmodules and JAR3D are significantly enriched in the screened windows compared to their shuffled counterparts. The initially estimated FDR of 47.0% is lowered to below 25% when certain 3D module predictions are present in the window of the 2D prediction. We discuss the implications and prospects for further development of computational strategies for detection of RNA 2D structure in genomic sequence.  相似文献   

8.
An approximation of loop free energy values of RNA H-pseudoknots.   总被引:3,自引:2,他引:1       下载免费PDF全文
A set of free energy values is suggested for RNA H-pseudoknot loops. The parameters are adjusted to be consistent with the theory of polymer thermodynamics and known data on pseudoknots. The values can be used for estimates of pseudoknot stabilities and computer predictions of RNA structures.  相似文献   

9.

Background  

The ability to access, search and analyse secondary structures of a large set of known RNA molecules is very important for deriving improved RNA energy models, for evaluating computational predictions of RNA secondary structures and for a better understanding of RNA folding. Currently there is no database that can easily provide these capabilities for almost all RNA molecules with known secondary structures.  相似文献   

10.
The most probable secondary structure of an RNA molecule, given the nucleotide sequence, can be computed efficiently if a stochastic context-free grammar (SCFG) is used as the prior distribution of the secondary structure. The structures of some RNA molecules contain so-called pseudoknots. Allowing all possible configurations of pseudoknots is not compatible with context-free grammar models and makes the search for an optimal secondary structure NP-complete. We suggest a probabilistic model for RNA secondary structures with pseudoknots and present a Markov-chain Monte-Carlo Method for sampling RNA structures according to their posterior distribution for a given sequence. We favor Bayesian sampling over optimization methods in this context, because it makes the uncertainty of RNA structure predictions assessable. We demonstrate the benefit of our method in examples with tmRNA and also with simulated data. McQFold, an implementation of our method, is freely available from http://www.cs.uni-frankfurt.de/~metzler/McQFold.  相似文献   

11.
RNA loop–loop interactions are essential for genomic RNA dimerization and regulation of gene expression. In this article, a statistical mechanics-based computational method that predicts the structures and thermodynamic stabilities of RNA complexes with loop–loop kissing interactions is described. The method accounts for the entropy changes for the formation of loop–loop interactions, which is a notable advancement that other computational models have neglected. Benchmark tests with several experimentally validated systems show that the inclusion of the entropy parameters can indeed improve predictions for RNA complexes. Furthermore, the method can predict not only the native structures of RNA/RNA complexes but also alternative metastable structures. For instance, the model predicts that the SL1 domain of HIV-1 RNA can form two different dimer structures with similar stabilities. The prediction is consistent with experimental observation. In addition, the model predicts two different binding sites for hTR dimerization: One binding site has been experimentally proposed, and the other structure, which has a higher stability, is structurally feasible and needs further experimental validation.  相似文献   

12.
MOTIVATION: Function derives from structure, therefore, there is need for methods to predict functional RNA structures. RESULTS: The Dynalign algorithm, which predicts the lowest free energy secondary structure common to two unaligned RNA sequences, is extended to the prediction of a set of low-energy structures. Dot plots can be drawn to show all base pairs in structures within an energy increment. Dynalign predicts more well-defined structures than structure prediction using a single sequence; in 5S rRNA sequences, the average number of base pairs in structures with energy within 20% of the lowest energy structure is 317 using Dynalign, but 569 using a single sequence. Structure prediction with Dynalign can also be constrained according to experiment or comparative analysis. The accuracy, measured as sensitivity and positive predictive value, of Dynalign is greater than predictions with a single sequence. AVAILABILITY: Dynalign can be downloaded at http://rna.urmc.rochester.edu  相似文献   

13.
14.
RNA tertiary structures from experiments or computational predictions often contain missing atoms, which prevent analyses requiring full atomic structures. Current programs for RNA reconstruction can be slow, inaccurate, and/or require specific atoms to be present in the input. We present Arena (Atomic Reconstruction of RNA), which reconstructs a full atomic RNA structure from residues that can have as few as one atom. Arena first fills in missing atoms and then iteratively refines their placement to reduce nonideal geometries. We benchmarked Arena on a dataset of 361 RNA structures, where Arena achieves high accuracy and speed compared to other structure reconstruction programs. For example, Arena was used to reconstruct full atomic structures from a single phosphorus atom per nucleotide to, on average, within 3.63 Å RMSD of the experimental structure, while virtually removing all clashes and running in <3 s, which is 353× and 46× faster than state-of-the-art programs PDBFixer and C2A, respectively. The Arena source code is available at https://github.com/pylelab/Arena and the webserver at https://zhanggroup.org/Arena/.  相似文献   

15.
16.
Predicting Ion Binding Properties for RNA Tertiary Structures   总被引:1,自引:0,他引:1  
Recent experiments pointed to the potential importance of ion correlation for multivalent ions such as Mg2+ ions in RNA folding. In this study, we develop an all-atom model to predict the ion electrostatics in RNA folding. The model can treat ion correlation effects explicitly by considering an ensemble of discrete ion distributions. In contrast to the previous coarse-grained models that can treat ion correlation, this new model is based on all-atom nucleic acid structures. Thus, unlike the previous coarse-grained models, this new model allows us to treat complex tertiary structures such as HIV-1 DIS type RNA kissing complexes. Theory-experiment comparisons for a variety of tertiary structures indicate that the model gives improved predictions over the Poisson-Boltzmann theory, which underestimates the Mg2+ binding in the competition with Na+. Further systematic theory-experiment comparisons for a series of tertiary structures lead to a set of analytical formulas for Mg2+/Na+ ion-binding to various RNA and DNA structures over a wide range of Mg2+ and Na+ concentrations.  相似文献   

17.
Facing the ever-growing list of newly discovered classes of functional RNAs, it can be expected that further types of functional RNAs are still hidden in recently completed genomes. The computational identification of such RNA genes is, therefore, of major importance. While most known functional RNAs have characteristic secondary structures, their free energies are generally not statistically significant enough to distinguish RNA genes from the genomic background. Additional information is required. Considering the wide availability of new genomic data of closely related species, comparative studies seem to be the most promising approach. Here, we show that prediction of consensus structures of aligned sequences can be a significant measure to detect functional RNAs. We report a new method to test multiple sequence alignments for the existence of an unusually structured and conserved fold. We show for alignments of six types of well-known functional RNA that an energy score consisting of free energy and a covariation term significantly improves sensitivity compared to single sequence predictions. We further test our method on a number of non-coding RNAs from Caenorhabditis elegans/Caenorhabditis briggsae and seven Saccharomyces species. Most RNAs can be detected with high significance. We provide a Perl implementation that can be used readily to score single alignments and discuss how the methods described here can be extended to allow for efficient genome-wide screens.  相似文献   

18.

Background

Ribonucleic acid (RNA) molecules play important roles in many biological processes including gene expression and regulation. Their secondary structures are crucial for the RNA functionality, and the prediction of the secondary structures is widely studied. Our previous research shows that cutting long sequences into shorter chunks, predicting secondary structures of the chunks independently using thermodynamic methods, and reconstructing the entire secondary structure from the predicted chunk structures can yield better accuracy than predicting the secondary structure using the RNA sequence as a whole. The chunking, prediction, and reconstruction processes can use different methods and parameters, some of which produce more accurate predictions than others. In this paper, we study the prediction accuracy and efficiency of three different chunking methods using seven popular secondary structure prediction programs that apply to two datasets of RNA with known secondary structures, which include both pseudoknotted and non-pseudoknotted sequences, as well as a family of viral genome RNAs whose structures have not been predicted before. Our modularized MapReduce framework based on Hadoop allows us to study the problem in a parallel and robust environment.

Results

On average, the maximum accuracy retention values are larger than one for our chunking methods and the seven prediction programs over 50 non-pseudoknotted sequences, meaning that the secondary structure predicted using chunking is more similar to the real structure than the secondary structure predicted by using the whole sequence. We observe similar results for the 23 pseudoknotted sequences, except for the NUPACK program using the centered chunking method. The performance analysis for 14 long RNA sequences from the Nodaviridae virus family outlines how the coarse-grained mapping of chunking and predictions in the MapReduce framework exhibits shorter turnaround times for short RNA sequences. However, as the lengths of the RNA sequences increase, the fine-grained mapping can surpass the coarse-grained mapping in performance.

Conclusions

By using our MapReduce framework together with statistical analysis on the accuracy retention results, we observe how the inversion-based chunking methods can outperform predictions using the whole sequence. Our chunk-based approach also enables us to predict secondary structures for very long RNA sequences, which is not feasible with traditional methods alone.
  相似文献   

19.
We employed the photoaffinity probe 8-azido-adenosine 5'-triphosphate (aATP) to identify the nuclear envelope (NE) nucleosidetriphosphatase activity (NTPase) implicated in control of RNA transport. The photoprobe was hydrolyzed at rates comparable to those for ATP, with a Michaelis constant of 0.225 mM. Photolabeling was dependent upon UV irradiation (300-nm max) and was not affected by quercetin. Unlabeled ATP or GTP competed with [32P]aATP in photolabeling experiments, and UTP was a less effective competitor, paralleling the substrate specificity of the NTPase. Incubation of NE with aATP led to a UV, time, and concentration dependent irreversible inactivation of NTPase. The inactivation could be blocked by ATP or GTP. Polyacrylamide gel electrophoresis and autoradiography of photolabeled NE showed selective, UV-dependent labeling of a 46-kDa protein with both [gamma-32P]aATP and [alpha-32P]aATP. This band was not labeled with [gamma-32P]ATP. Since the NE NTPase implicated in RNA transport is modulated by RNA, we examined the effects of RNA on the labeling process. Removal of RNA from the NE preparations (by RNase/DNase digestion) reduced NTPase by 30-40% and eliminated photolabeling of the 46-kDa band. Addition of yeast RNA to such preparations increased NTPase activity to control levels and selectively reinstated photolabeling of the 46-kDa band. These results suggest that the 46-kDa protein represents the major NTPase implicated in RNA transport.  相似文献   

20.
RNA secondary structures are important in many biological processes and efficient structure prediction can give vital directions for experimental investigations. Many available programs for RNA secondary structure prediction only use a single sequence at a time. This may be sufficient in some applications, but often it is possible to obtain related RNA sequences with conserved secondary structure. These should be included in structural analyses to give improved results. This work presents a practical way of predicting RNA secondary structure that is especially useful when related sequences can be obtained. The method improves a previous algorithm based on an explicit evolutionary model and a probabilistic model of structures. Predictions can be done on a web server at http://www.daimi.au.dk/~compbio/pfold.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号