期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

smyRNA: A Novel Ab Initio ncRNA Gene Finder

Raheleh Salari Cagri Aksay Emre Karakoc Peter J. Unrau Iman Hajirasouliha S. Cenk Sahinalp 《PloS one》2009,4(5)

Background

Non-coding RNAs (ncRNAs) have important functional roles in the cell: for example, they regulate gene expression by means of establishing stable joint structures with target mRNAs via complementary sequence motifs. Sequence motifs are also important determinants of the structure of ncRNAs. Although ncRNAs are abundant, discovering novel ncRNAs on genome sequences has proven to be a hard task; in particular past attempts for ab initio ncRNA search mostly failed with the exception of tools that can identify micro RNAs.

Methodology/Principal Findings

We present a very general ab initio ncRNA gene finder that exploits differential distributions of sequence motifs between ncRNAs and background genome sequences.

Conclusions/Significance

Our method, once trained on a set of ncRNAs from a given species, can be applied to a genome sequences of other organisms to find not only ncRNAs homologous to those in the training set but also others that potentially belong to novel (and perhaps unknown) ncRNA families. Availability: http://compbio.cs.sfu.ca/taverna/smyrna 相似文献

2.

Discovery of Novel ncRNA Sequences in Multiple Genome Alignments on the Basis of Conserved and Stable Secondary Structures

Yinghan Fu Zhenjiang Zech Xu Zhi J. Lu Shan Zhao David H. Mathews 《PloS one》2015,10(6)

相似文献

3.

RNPomics: Defining the ncRNA transcriptome by cDNA library generation from ribonucleo-protein particles

Mathieu Rederstorff Stephan H. Bernhart Andrea Tanzer Marek Zywicki Katrin Perfler Melanie Lukasser Ivo L. Hofacker Alexander Hüttenhofer 《Nucleic acids research》2010,38(10):e113

相似文献

4.

Inverse folding based pre-training for the reliable identification of intrinsic transcription terminators

Vivian B. Brandenburg Franz Narberhaus Axel Mosig 《PLoS computational biology》2022,18(7)

相似文献

5.

Profiling Caenorhabditis elegans non-coding RNA expression with a combined microarray 总被引：1，自引：0，他引：1

下载免费PDF全文

He H Cai L Skogerbø G Deng W Liu T Zhu X Wang Y Jia D Zhang Z Tao Y Zeng H Aftab MN Cui Y Liu G Chen R 《Nucleic acids research》2006,34(10):2976-2983

相似文献

6.

Pairwise local structural alignment of RNA sequences with sequence similarity less than 40% 总被引：7，自引：0，他引：7

Havgaard JH Lyngsø RB Stormo GD Gorodkin J 《Bioinformatics (Oxford, England)》2005,21(9):1815-1824

MOTIVATION: Searching for non-coding RNA (ncRNA) genes and structural RNA elements (eleRNA) are major challenges in gene finding today as these often are conserved in structure rather than in sequence. Even though the number of available methods is growing, it is still of interest to pairwise detect two genes with low sequence similarity, where the genes are part of a larger genomic region. RESULTS: Here we present such an approach for pairwise local alignment which is based on foldalign and the Sankoff algorithm for simultaneous structural alignment of multiple sequences. We include the ability to conduct mutual scans of two sequences of arbitrary length while searching for common local structural motifs of some maximum length. This drastically reduces the complexity of the algorithm. The scoring scheme includes structural parameters corresponding to those available for free energy as well as for substitution matrices similar to RIBOSUM. The new foldalign implementation is tested on a dataset where the ncRNAs and eleRNAs have sequence similarity <40% and where the ncRNAs and eleRNAs are energetically indistinguishable from the surrounding genomic sequence context. The method is tested in two ways: (1) its ability to find the common structure between the genes only and (2) its ability to locate ncRNAs and eleRNAs in a genomic context. In case (1), it makes sense to compare with methods like Dynalign, and the performances are very similar, but foldalign is substantially faster. The structure prediction performance for a family is typically around 0.7 using Matthews correlation coefficient. In case (2), the algorithm is successful at locating RNA families with an average sensitivity of 0.8 and a positive predictive value of 0.9 using a BLAST-like hit selection scheme. AVAILABILITY: The program is available online at http://foldalign.kvl.dk/ 相似文献

7.

31 Discovery of novel ncRNA by scanning multiple genome alignments

Yinghan Fu Zhenjiang Xu Zhi J. Lu Shan Zhao 《Journal of biomolecular structure & dynamics》2013,31(1)

相似文献

8.

Computational identification of non-coding RNAs in Saccharomyces cerevisiae by comparative genomics 总被引：4，自引：0，他引：4

McCutcheon JP Eddy SR 《Nucleic acids research》2003,31(14):4119-4128

相似文献

9.

Structural alignment of RNA with triple helix structure

Wong TK Yiu SM 《Journal of computational biology》2012,19(4):365-378

Structural alignment is useful in identifying members of ncRNAs. Existing tools are all based on the secondary structures of the molecules. There is evidence showing that tertiary interactions (the interaction between a single-stranded nucleotide and a base-pair) in triple helix structures are critical in some functions of ncRNAs. In this article, we address the problem of structural alignment of RNAs with the triple helix. We provide a formal definition to capture a simplified model of a triple helix structure, then develop an algorithm of O(mn(3)) time to align a query sequence (of length m) with known triple helix structure with a target sequence (of length n) with an unknown structure. The resulting algorithm is shown to be useful in identifying ncRNA members in a simulated genome. 相似文献

10.

Engineering naturally occurring trans-acting non-coding RNAs to sense molecular signals

Qi L Lucks JB Liu CC Mutalik VK Arkin AP 《Nucleic acids research》2012,40(12):5775-5786

相似文献

11.

Analyzing modular RNA structure reveals low global structural entropy in microRNA sequence

Shaw TI Manzour A Wang Y Malmberg RL Cai L 《Journal of bioinformatics and computational biology》2011,9(2):283-298

Secondary structure remains the most exploitable feature for noncoding RNA (ncRNA) gene finding in genomes. However, methods based on secondary structure prediction may generate superfluous amount of candidates for validation and have yet to deliver the desired performance that can complement experimental efforts in ncRNA gene finding. This paper investigates a novel method, unpaired structural entropy (USE) as a measurement for the structure fold stability of ncRNAs. USE proves to be effective in identifying from the genome background a class of ncRNAs, such as precursor microRNAs (pre-miRNAs) that contains a long stem hairpin loop. USE correlates well and performs better than other measures on pre-miRNAs, including the previously formulated structural entropy. As an SVM classifier, USE outperforms existing pre-miRNA classifiers. A long stem hairpin loop is common for a number of other functional RNAs including introns splicing hairpins loops and intrinsic termination hairpin loops. We believe USE can be further applied in developing ab initio prediction programs for a larger class of ncRNAs. 相似文献

12.

A computational pipeline for high- throughput discovery of cis-regulatory noncoding RNA in prokaryotes

下载免费PDF全文

Yao Z Barrick J Weinberg Z Neph S Breaker R Tompa M Ruzzo WL 《PLoS computational biology》2007,3(7):e126

Noncoding RNAs (ncRNAs) are important functional RNAs that do not code for proteins. We present a highly efficient computational pipeline for discovering cis-regulatory ncRNA motifs de novo. The pipeline differs from previous methods in that it is structure-oriented, does not require a multiple-sequence alignment as input, and is capable of detecting RNA motifs with low sequence conservation. We also integrate RNA motif prediction with RNA homolog search, which improves the quality of the RNA motifs significantly. Here, we report the results of applying this pipeline to Firmicute bacteria. Our top-ranking motifs include most known Firmicute elements found in the RNA family database (Rfam). Comparing our motif models with Rfam's hand-curated motif models, we achieve high accuracy in both membership prediction and base-pair–level secondary structure prediction (at least 75% average sensitivity and specificity on both tasks). Of the ncRNA candidates not in Rfam, we find compelling evidence that some of them are functional, and analyze several potential ribosomal protein leaders in depth. 相似文献

13.

Subtractive hybridization identifies novel differentially expressed ncRNA species in EBV-infected human B cells 总被引：3，自引：0，他引：3

Mrázek J Kreutmayer SB Grässer FA Polacek N Hüttenhofer A 《Nucleic acids research》2007,35(10):e73

相似文献

14.

Identification of putative noncoding RNA genes in the Burkholderia cenocepacia J2315 genome 总被引：1，自引：0，他引：1

Coenye T Drevinek P Mahenthiralingam E Shah SA Gill RT Vandamme P Ussery DW 《FEMS microbiology letters》2007,276(1):83-92

相似文献

15.

Hybridization-based reconstruction of small non-coding RNA transcripts from deep sequencing data

Ragan C Mowry BJ Bauer DC 《Nucleic acids research》2012,40(16):7633-7643

相似文献

16.

Plant noncoding RNA gene discovery by "single-genome comparative genomics"

Chen CJ Zhou H Chen YQ Qu LH Gautheret D 《RNA (New York, N.Y.)》2011,17(3):390-400

Plant genomes have undergone multiple rounds of duplications that contributed massively to the growth of gene families. The structure of resulting families has been studied in depth for protein-coding genes. However, little is known about the impact of duplications on noncoding RNA (ncRNA) genes. Here we perform a systematic analysis of duplicated regions in the rice genome in search of such ncRNA repeats. We observe that, just like their protein counterparts, most ncRNA genes have undergone multiple duplications that left visible sequence conservation footprints. The extent of ncRNA gene duplication in plants is such that these sequence footprints can be exploited for the discovery of novel ncRNA gene families on a large scale. We developed an SVM model that is able to retrieve likely ncRNA candidates among the 100,000+ repeat families in the rice genome, with a reasonably low false-positive discovery rate. Among the nearly 4000 ncRNA families predicted by this means, only 90 correspond to putative snoRNA or miRNA families. About half of the remaining families are classified as structured RNAs. New candidate ncRNAs are particularly enriched in UTR and intronic regions. Interestingly, 89% of the putative ncRNA families do not produce a detectable signal when their sequences are compared to another grass genome such as maize. Our results show that a large fraction of rice ncRNA genes are present in multiple copies and are species-specific or of recent origin. Intragenome comparison is a unique and potent source for the computational annotation of this major class of ncRNA. 相似文献

17.

Evolutionarily divergent spliceosomal snRNAs and a conserved non-coding RNA processing motif in Giardia lamblia

Andrew J. Hudson Ashley N. Moore David Elniski Joella Joseph Janet Yee Anthony G. Russell 《Nucleic acids research》2012,40(21):10995-11008

相似文献

18.

Critical association of ncRNA with introns

Rearick D Prakash A McSweeny A Shepard SS Fedorova L Fedorov A 《Nucleic acids research》2011,39(6):2357-2366

相似文献

19.

RNA therapeutics: Identification of novel targets leading to drug discovery

Muhammad Imran Qadir Sherien Bukhat Sumaira Rasul Hamid Manzoor Majid Manzoor 《Journal of cellular biochemistry》2020,121(2):898-929

相似文献

20.

Identification of small non-coding RNAs from mitochondria and chloroplasts 总被引：4，自引：1，他引：3

Lung B Zemann A Madej MJ Schuelke M Techritz S Ruf S Bock R Hüttenhofer A 《Nucleic acids research》2006,34(14):3842-3852

Small non-protein-coding RNAs (ncRNAs) have been identified in a wide spectrum of organisms ranging from bacteria to humans. In eukarya, systematic searches for ncRNAs have so far been restricted to the nuclear or cytosolic compartments of cells. Whether or not small stable non-coding RNA species also exist in cell organelles, in addition to tRNAs or ribosomal RNAs, is unknown. We have thus generated cDNA libraries from size-selected mammalian mitochondrial RNA and plant chloroplast RNA and searched for small ncRNA species in these two types of DNA-containing cell organelles. In total, we have identified 18 novel candidates for organellar ncRNAs in these two cellular compartments and confirmed expression of six of them by northern blot analysis or RNase A protection assays. Most candidate ncRNA genes map to intergenic regions of the organellar genomes. As found previously in bacteria, the presumptive ancestors of present-day chloroplasts and mitochondria, we also observed examples of antisense ncRNAs that potentially could target organelle-encoded mRNAs. The structural features of the identified ncRNAs as well as their possible cellular functions are discussed. The absence from our libraries of abundant small RNA species that are not encoded by the organellar genomes suggests that the import of RNAs into cell organelles is of very limited significance or does not occur at all. 相似文献