期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Previous studies show various results obtained from different motif finders for an identical dataset. This is largely due to the fact that these tools use different strategies and possess unique features for discovering the motifs. Hence, using multiple tools and methods has been suggested because the motifs commonly reported by them are more likely to be biologically significant.

Results

The common significant motifs from multiple tools can be obtained by using MOTIFSIM tool. In this work, we evaluated the performance of MOTIFSIM in three aspects. First, we compared the pair-wise comparison technique of MOTIFSIM with the un-gapped Smith-Waterman algorithm and four common distance metrics: average Kullback-Leibler, average log-likelihood ratio, Chi-Square distance, and Pearson Correlation Coefficient. Second, we compared the performance of MOTIFSIM with RSAT Matrix-clustering tool for motif clustering. Lastly, we evaluated the performances of nineteen motif finders and the reliability of MOTIFSIM for identifying the common significant motifs from multiple tools.

Conclusions

The pair-wise comparison results reveal that MOTIFSIM attains better performance than the un-gapped Smith-Waterman algorithm and four distance metrics. The clustering results also demonstrate that MOTIFSIM achieves similar or even better performance than RSAT Matrix-clustering. Furthermore, the findings indicate if the motif detection does not require a special tool for detecting a specific type of motif then using multiple motif finders and combining with MOTIFSIM for obtaining the common significant motifs, it improved the results for DNA motif detection.

相似文献

10.

Transcription factor binding element detection using functional clustering of mutant expression data

Chen G Hata N Zhang MQ 《Nucleic acids research》2004,32(8):2362-2371

相似文献

11.

rMotifGen: random motif generator for DNA and protein sequences

Eric C Rouchka C Timothy Hardin 《BMC bioinformatics》2007,8(1):292

相似文献

12.

Assessment of Algorithms for Inferring Positional Weight Matrix Motifs of Transcription Factor Binding Sites Using Protein Binding Microarray Data

Yaron Orenstein Chaim Linhart Ron Shamir 《PloS one》2012,7(9)

相似文献

13.

Computational identification of cis-regulatory elements associated with groups of functionally related genes in Saccharomyces cerevisiae 总被引：12，自引：0，他引：12

Hughes JD Estep PW Tavazoie S Church GM 《Journal of molecular biology》2000,296(5):1205-1214

相似文献

14.

A complete workflow for the analysis of full-size ChIP-seq (and similar) data sets using peak-motifs

M Thomas-Chollier E Darbo C Herrmann M Defrance D Thieffry J van Helden 《Nature protocols》2012,7(8):1551-1568

相似文献

15.

Detection of functional DNA motifs via statistical over-representation 总被引：14，自引：0，他引：14

Frith MC Fu Y Yu L Chen JF Hansen U Weng Z 《Nucleic acids research》2004,32(4):1372-1381

相似文献

16.

SOMEA: self-organizing map based extraction algorithm for DNA motif identification with heterogeneous model

Lee NK Wang D 《BMC bioinformatics》2011,12(Z1):S16

相似文献

17.

PASSIM – an open source software system for managing information in biomedical studies

Juris Viksna Edgars Celms Martins Opmanis Karlis Podnieks Peteris Rucevskis Andris Zarins Amy Barrett Sudeshna Guha Neogi Maria Krestyaninova Mark I McCarthy Alvis Brazma Ugis Sarkans 《BMC bioinformatics》2007,8(1):1-7

相似文献

18.

A Novel Alignment-Free Method for Comparing Transcription Factor Binding Site Motifs

Minli Xu Zhengchang Su 《PloS one》2010,5(1)

相似文献

19.

iGibbs: improving Gibbs motif sampler for proteins by sequence clustering and iterative pattern sampling

Kim S Wang Z Dalkilic M 《Proteins》2007,66(3):671-681

The motif prediction problem is to predict short, conserved subsequences that are part of a family of sequences, and it is a very important biological problem. Gibbs is one of the first successful motif algorithms and it runs very fast compared with other algorithms, and its search behavior is based on the well-studied Gibbs random sampling. However, motif prediction is a very difficult problem and Gibbs may not predict true motifs in some cases. Thus, the authors explored a possibility of improving the prediction accuracy of Gibbs while retaining its fast runtime performance. In this paper, the authors considered Gibbs only for proteins, not for DNA binding sites. The authors have developed iGibbs, an integrated motif search framework for proteins that employs two previous techniques of their own: one for guiding motif search by clustering sequences and another by pattern refinement. These two techniques are combined to a new double clustering approach to guiding motif search. The unique feature of their framework is that users do not have to specify the number of motifs to be predicted when motifs occur in different subsets of the input sequences since it automatically clusters input sequences into clusters and predict motifs from the clusters. Tests on the PROSITE database show that their framework improved the prediction accuracy of Gibbs significantly. Compared with more exhaustive search methods like MEME, iGibbs predicted motifs more accurately and runs one order of magnitude faster. 相似文献

20.

Incorporating Motif Analysis into Gene Co-expression Networks Reveals Novel Modular Expression Pattern and New Signaling Pathways

Shisong Ma Smit Shah Hans J. Bohnert Michael Snyder Savithramma P. Dinesh-Kumar 《PLoS genetics》2013,9(10)

相似文献