期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Coexpressionfinder: a new algorithm for finding groups of coexpressed genes

Rogov SI Momynaliev KT Govorun VM 《Journal of bioinformatics and computational biology》2006,4(4):853-864

RESULTS: A new algorithm is developed which is intended to find groups of genes whose expression values change in a concordant manner in a series of experiments with DNA arrays. This algorithm is named as CoexpressionFinder. It can find more complete and internally coordinated groups of gene expression vectors than hierarchical clustering. Also, it finds more genes having coordinated expression. The algorithm's design allows parallel execution. AVAILABILITY: The algorithm is implemented as a Java application which is freely available at: http://www.bioinformatics.ru/cf/index.jsp and http://bioinformatics.ru/cf/index.jsp. 相似文献

2.

Exploiting Oxytricha trifallax nanochromosomes to screen for non-coding RNA genes

Jung S Swart EC Minx PJ Magrini V Mardis ER Landweber LF Eddy SR 《Nucleic acids research》2011,39(17):7529-7547

相似文献

3.

Genome-wide RNA polymerase II: not genes only!

Koch F Jourquin F Ferrier P Andrau JC 《Trends in biochemical sciences》2008,33(6):265-273

相似文献

4.

R-Coffee: a method for multiple alignment of non-coding RNA

Wilm A Higgins DG Notredame C 《Nucleic acids research》2008,36(9):e52

R-Coffee is a multiple RNA alignment package, derived from T-Coffee, designed to align RNA sequences while exploiting secondary structure information. R-Coffee uses an alignment-scoring scheme that incorporates secondary structure information within the alignment. It works particularly well as an alignment improver and can be combined with any existing sequence alignment method. In this work, we used R-Coffee to compute multiple sequence alignments combining the pairwise output of sequence aligners and structural aligners. We show that R-Coffee can improve the accuracy of all the sequence aligners. We also show that the consistency-based component of T-Coffee can improve the accuracy of several structural aligners. R-Coffee was tested on 388 BRAliBase reference datasets and on 11 longer Cmfinder datasets. Altogether our results suggest that the best protocol for aligning short sequences (less than 200 nt) is the combination of R-Coffee with the RNA pairwise structural aligner Consan. We also show that the simultaneous combination of the four best sequence alignment programs with R-Coffee produces alignments almost as accurate as those obtained with R-Coffee/Consan. Finally, we show that R-Coffee can also be used to align longer datasets beyond the usual scope of structural aligners. R-Coffee is freely available for download, along with documentation, from the T-Coffee web site (www.tcoffee.org). 相似文献

5.

CMfinder--a covariance model based RNA motif finding algorithm 总被引：5，自引：0，他引：5

Yao Z Weinberg Z Ruzzo WL 《Bioinformatics (Oxford, England)》2006,22(4):445-452

相似文献

6.

A dynamic programming algorithm for finding alternative RNA secondary structures. 总被引：6，自引：8，他引：6

下载免费PDF全文

A L Williams Jr I Tinoco Jr 《Nucleic acids research》1986,14(1):299-315

Dynamic programming algorithms that predict RNA secondary structure by minimizing the free energy have had one important limitation. They were able to predict only one optimal structure. Given the uncertainties of the thermodynamic data and the effects of proteins and other environmental factors on structure, the optimal structure predicted by these methods may not have biological significance. We present a dynamic programming algorithm that can determine optimal and suboptimal secondary structures for an RNA. The power and utility of the method is demonstrated in the folding of the intervening sequence of the rRNA of Tetrahymena. By first identifying the major secondary structures corresponding to the lowest free energy minima, a secondary structure of possible biological significance is derived. 相似文献

7.

Kavosh: a new algorithm for finding network motifs

Zahra Razaghi Moghadam Kashani Hayedeh Ahrabian Elahe Elahi Abbas Nowzari-Dalini Elnaz Saberi Ansari Sahar Asadi Shahin Mohammadi Falk Schreiber Ali Masoudi-Nejad 《BMC bioinformatics》2009,10(1):318

Background

Complex networks are studied across many fields of science and are particularly important to understand biological processes. Motifs in networks are small connected sub-graphs that occur significantly in higher frequencies than in random networks. They have recently gathered much attention as a useful concept to uncover structural design principles of complex networks. Existing algorithms for finding network motifs are extremely costly in CPU time and memory consumption and have practically restrictions on the size of motifs. 相似文献

8.

RNAProfile: an algorithm for finding conserved secondary structure motifs in unaligned RNA sequences 总被引：2，自引：1，他引：2

Pavesi G Mauri G Stefani M Pesole G 《Nucleic acids research》2004,32(10):3258-3269

The recent interest sparked due to the discovery of a variety of functions for non-coding RNA molecules has highlighted the need for suitable tools for the analysis and the comparison of RNA sequences. Many trans-acting non-coding RNA genes and cis-acting RNA regulatory elements present motifs, conserved both in structure and sequence, that can be hardly detected by primary sequence analysis alone. We present an algorithm that takes as input a set of unaligned RNA sequences expected to share a common motif, and outputs the regions that are most conserved throughout the sequences, according to a similarity measure that takes into account both the sequence of the regions and the secondary structure they can form according to base-pairing and thermodynamic rules. Only a single parameter is needed as input, which denotes the number of distinct hairpins the motif has to contain. No further constraints on the size, number and position of the single elements comprising the motif are required. The algorithm can be split into two parts: first, it extracts from each input sequence a set of candidate regions whose predicted optimal secondary structure contains the number of hairpins given as input. Then, the regions selected are compared with each other to find the groups of most similar ones, formed by a region taken from each sequence. To avoid exhaustive enumeration of the search space and to reduce the execution time, a greedy heuristic is introduced for this task. We present different experiments, which show that the algorithm is capable of characterizing and discovering known regulatory motifs in mRNA like the iron responsive element (IRE) and selenocysteine insertion sequence (SECIS) stem–loop structures. We also show how it can be applied to corrupted datasets in which a motif does not appear in all the input sequences, as well as to the discovery of more complex motifs in the non-coding RNA. 相似文献

9.

Dynalign: an algorithm for finding the secondary structure common to two RNA sequences 总被引：28，自引：0，他引：28

Mathews DH Turner DH 《Journal of molecular biology》2002,317(2):191-203

With the rapid increase in the size of the genome sequence database, computational analysis of RNA will become increasingly important in revealing structure-function relationships and potential drug targets. RNA secondary structure prediction for a single sequence is 73 % accurate on average for a large database of known secondary structures. This level of accuracy provides a good starting point for determining a secondary structure either by comparative sequence analysis or by the interpretation of experimental studies. Dynalign is a new computer algorithm that improves the accuracy of structure prediction by combining free energy minimization and comparative sequence analysis to find a low free energy structure common to two sequences without requiring any sequence identity. It uses a dynamic programming construct suggested by Sankoff. Dynalign, however, restricts the maximum distance, M, allowed between aligned nucleotides in the two sequences. This makes the calculation tractable because the complexity is simplified to O(M(3)N(3)), where N is the length of the shorter sequence.The accuracy of Dynalign was tested with sets of 13 tRNAs, seven 5 S rRNAs, and two R2 3' UTR sequences. On average, Dynalign predicted 86.1 % of known base-pairs in the tRNAs, as compared to 59.7 % for free energy minimization alone. For the 5 S rRNAs, the average accuracy improves from 47.8 % to 86.4 %. The secondary structure of the R2 3' UTR from Drosophila takahashii is poorly predicted by standard free energy minimization. With Dynalign, however, the structure predicted in tandem with the sequence from Drosophila melanogaster nearly matches the structure determined by comparative sequence analysis. 相似文献

10.

Mechanism of down-regulation of RNA polymerase III-transcribed non-coding RNA genes in macrophages by Leishmania

Rana T Misra S Mittal MK Farrow AL Wilson KT Linton MF Fazio S Willis IM Chaudhuri G 《The Journal of biological chemistry》2011,286(8):6614-6626

相似文献

11.

Predicting non-coding RNA genes in Escherichia coli with boosted genetic programming 总被引：3，自引：1，他引：3

Saetrom P Sneve R Kristiansen KI Snøve O Grünfeld T Rognes T Seeberg E 《Nucleic acids research》2005,33(10):3263-3270

相似文献

12.

Coregulatory long non-coding RNA and protein-coding genes in serum starved cells

Fan Wang Rui Liang Benjamin Soibam Jin Yang Yu Liu 《Biochimica et Biophysica Acta (BBA) - Gene Regulatory Mechanisms》2019,1862(1):84-95

相似文献

13.

Outer membrane protein genes and their small non-coding RNA regulator genes in Photorhabdus luminescens

Dimitris Papamichail Nicholas Delihas 《Biology direct》2006,1(1):12-19

相似文献

14.

An efficient genetic algorithm for structural RNA pairwise alignment and its application to non-coding RNA discovery in yeast

Akito Taneda 《BMC bioinformatics》2008,9(1):521

Background

Aligning RNA sequences with low sequence identity has been a challenging problem since such a computation essentially needs an algorithm with high complexities for taking structural conservation into account. Although many sophisticated algorithms for the purpose have been proposed to date, further improvement in efficiency is necessary to accelerate its large-scale applications including non-coding RNA (ncRNA) discovery. 相似文献

15.

Characterization of a carcinogenesis-associated long non-coding RNA

Yang F Yi F Zheng Z Ling Z Ding J Guo J Mao W Wang X Wang X Ding X Liang Z Du Q 《RNA biology》2012,9(1):110-116

相似文献

16.

Automatic prediction of non-coding RNA genes in prokaryotes based on compositional statistics

Tong H Guo FB Ye YN 《Indian journal of biochemistry & biophysics》2011,48(6):416-421

Although non-coding RNA (ncRNA) genes do not encode proteins, they play vital roles in cells by producing functionally important RNAs. In this paper, we present a novel method for predicting ncRNA genes based on compositional features extracted directly from gene sequences. Our method consists of two Support Vector Machine (SVM) models--Codon model which uses codon usage features derived from ncRNA genes and protein-coding genes and Kmer model which utilizes features of nucleotide and dinucleotide frequency extracted respectively from ncRNA genes and randomly chosen genome sequences. The 10-fold cross-validation accuracy for the two models is found to be 92% and 91%, respectively. Thus, we could make an automatic prediction of ncRNA genes in one genome without manual filtration of protein-coding genes. After applying our method in Sulfolobus solfataricus genome, 25 prediction results have been generated according to 25 cut-off pairs. We have also applied the approach in E. coli and found our results comparable to those of previous studies. In general, our method enables automatic identification of ncRNA genes in newly sequenced prokaryotic genomes. 相似文献

17.

Esre: a novel essential non-coding RNA in Escherichia coli

Chen Z Wang Y Li Y Li Y Fu N Ye J Zhang H 《FEBS letters》2012,586(8):1195-1200

YigP gene (GeneID: 948915) locates between ubiquinone biosynthetic genes ubiE and ubiB in Escherichia coli. GeneBank annotates yigP as a putative protein-coding gene. In this study, we found a new essential sRNA gene, esre, locates within the region of yigP. The E. coli strain with inactive esre must rely on a complementary plasmid to survive. Moreover, RACE experiments showed esre encodes an RNA molecule of 252 nt. Further experiments revealed esre gene is immune to frame shift mutations and the function of esre depends mostly on the RNA secondary structure, which are typical traits of sRNA. Since it is difficult to predict the target of an essential sRNA, more research is needed to reveal the function and mechanism of esre. 相似文献

18.

RMRP is a non-coding RNA essential for early murine development

Rosenbluh J Nijhawan D Chen Z Wong KK Masutomi K Hahn WC 《PloS one》2011,6(10):e26270

RMRP is a non-coding RNA that is ubiquitously expressed in both humans and mice. RMRP mutations that lead to decreased RMRP levels are found in the pleiotropic syndrome Cartilage Hair Hypoplasia. To assess the effects of deleting RMRP, we engineered a targeting vector that contains loxP sequences flanking RMRP and created hemizygous mice harboring this engineered allele (RMRP conditional). We found that insertion of this cassette suppressed RMRP expression, and we failed to obtain viable mice homozygous for the RMRP conditional allele. Furthermore, we were unable to obtain viable homozygous RMRP null mice, indicating that RMRP is essential for early embryonic development. 相似文献

19.

A pathogenic non-coding RNA induces changes in dynamic DNA methylation of ribosomal RNA genes in host plants

German Martinez Mayte Castellano Maria Tortosa Vicente Pallas Gustavo Gomez 《Nucleic acids research》2014,42(3):1553-1562

相似文献

20.

RNA: Genome-wide views of long non-coding RNAs

Muers M 《Nature reviews. Genetics》2011,12(11):742

相似文献