期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

A Monte Carlo EM Algorithm for De Novo Motif Discovery in Biomolecular Sequences 总被引：1，自引：0，他引：1

Bi Chengpeng 《IEEE/ACM transactions on computational biology and bioinformatics / IEEE, ACM》2009,6(3):370-386

Motif discovery methods play pivotal roles in deciphering the genetic regulatory codes (i.e., motifs) in genomes as well as in locating conserved domains in protein sequences. The Expectation Maximization (EM) algorithm is one of the most popular methods used in de novo motif discovery. Based on the position weight matrix (PWM) updating technique, this paper presents a Monte Carlo version of the EM motif-finding algorithm that carries out stochastic sampling in local alignment space to overcome the conventional EM's main drawback of being trapped in a local optimum. The newly implemented algorithm is named as Monte Carlo EM Motif Discovery Algorithm (MCEMDA). MCEMDA starts from an initial model, and then it iteratively performs Monte Carlo simulation and parameter update until convergence. A log-likelihood profiling technique together with the top-k strategy is introduced to cope with the phase shifts and multiple modal issues in motif discovery problem. A novel grouping motif alignment (GMA) algorithm is designed to select motifs by clustering a population of candidate local alignments and successfully applied to subtle motif discovery. MCEMDA compares favorably to other popular PWM-based and word enumerative motif algorithms tested using simulated (l, d)-motif cases, documented prokaryotic, and eukaryotic DNA motif sequences. Finally, MCEMDA is applied to detect large blocks of conserved domains using protein benchmarks and exhibits its excellent capacity while compared with other multiple sequence alignment methods. 相似文献

2.

Finding motifs using random projections. 总被引：19，自引：0，他引：19

Jeremy Buhler Martin Tompa 《Journal of computational biology》2002,9(2):225-242

相似文献

3.

Finding motifs from all sequences with and without binding sites

Leung HC Chin FY 《Bioinformatics (Oxford, England)》2006,22(18):2217-2223

相似文献

4.

Combinatorial motif analysis and hypothesis generation on a genomic scale 总被引：1，自引：0，他引：1

Hu YJ Sandmeyer S McLaughlin C Kibler D 《Bioinformatics (Oxford, England)》2000,16(3):222-232

相似文献

5.

Modeling within-motif dependence for transcription factor binding site predictions 总被引：8，自引：0，他引：8

Zhou Q Liu JS 《Bioinformatics (Oxford, England)》2004,20(6):909-916

相似文献

6.

Binding site discovery from nucleic acid sequences by discriminative learning of hidden Markov models

Jonas Maaskola Nikolaus Rajewsky 《Nucleic acids research》2014,42(21):12995-13011

相似文献

7.

Discriminative motif discovery in DNA and protein sequences using the DEME algorithm

Emma Redhead Timothy L Bailey 《BMC bioinformatics》2007,8(1):385

相似文献

8.

PhyloGibbs: a Gibbs sampling motif finder that incorporates phylogeny

Siddharthan R Siggia ED van Nimwegen E 《PLoS computational biology》2005,1(7):e67

相似文献

9.

GAME: detecting cis-regulatory elements using a genetic algorithm 总被引：3，自引：0，他引：3

Wei Z Jensen ST 《Bioinformatics (Oxford, England)》2006,22(13):1577-1584

相似文献

10.

SEAM: a Stochastic EM-type Algorithm for Motif-finding in biopolymer sequences

Bi C 《Journal of bioinformatics and computational biology》2007,5(1):47-77

Position weight matrix-based statistical modeling for the identification and characterization of motif sites in a set of unaligned biopolymer sequences is presented. This paper describes and implements a new algorithm, the Stochastic EM-type Algorithm for Motif-finding (SEAM), and redesigns and implements the EM-based motif-finding algorithm called deterministic EM (DEM) for comparison with SEAM, its stochastic counterpart. The gold standard example, cyclic adenosine monophosphate receptor protein (CRP) binding sequences, together with other biological sequences, is used to illustrate the performance of the new algorithm and compare it with other popular motif-finding programs. The convergence of the new algorithm is shown by simulation. The in silico experiments using simulated and biological examples illustrate the power and robustness of the new algorithm SEAM in de novo motif discovery. 相似文献

11.

SLiMPrints: conservation-based discovery of functional motif fingerprints in intrinsically disordered protein regions

Norman E. Davey Joanne L. Cowan Denis C. Shields Toby J. Gibson Mark J. Coldwell Richard J. Edwards 《Nucleic acids research》2012,40(21):10628-10641

Large portions of higher eukaryotic proteomes are intrinsically disordered, and abundant evidence suggests that these unstructured regions of proteins are rich in regulatory interaction interfaces. A major class of disordered interaction interfaces are the compact and degenerate modules known as short linear motifs (SLiMs). As a result of the difficulties associated with the experimental identification and validation of SLiMs, our understanding of these modules is limited, advocating the use of computational methods to focus experimental discovery. This article evaluates the use of evolutionary conservation as a discriminatory technique for motif discovery. A statistical framework is introduced to assess the significance of relatively conserved residues, quantifying the likelihood a residue will have a particular level of conservation given the conservation of the surrounding residues. The framework is expanded to assess the significance of groupings of conserved residues, a metric that forms the basis of SLiMPrints (short linear motif fingerprints), a de novo motif discovery tool. SLiMPrints identifies relatively overconstrained proximal groupings of residues within intrinsically disordered regions, indicative of putatively functional motifs. Finally, the human proteome is analysed to create a set of highly conserved putative motif instances, including a novel site on translation initiation factor eIF2A that may regulate translation through binding of eIF4E. 相似文献

12.

Finding regulatory DNA motifs using alignment-free evolutionary conservation information

Raluca Gordan Leelavati Narlikar Alexander J. Hartemink 《Nucleic acids research》2010,38(6):e90

相似文献

13.

Probabilistic models for semisupervised discriminative motif discovery in DNA sequences

Kim JK Choi S 《IEEE/ACM transactions on computational biology and bioinformatics / IEEE, ACM》2011,8(5):1309-1317

相似文献

14.

Discovering motifs in ranked lists of DNA sequences

下载免费PDF全文

Eden E Lipson D Yogev S Yakhini Z 《PLoS computational biology》2007,3(3):e39

相似文献

15.

rMotifGen: random motif generator for DNA and protein sequences

Eric C Rouchka C Timothy Hardin 《BMC bioinformatics》2007,8(1):292

相似文献

16.

iGibbs: improving Gibbs motif sampler for proteins by sequence clustering and iterative pattern sampling

Kim S Wang Z Dalkilic M 《Proteins》2007,66(3):671-681

The motif prediction problem is to predict short, conserved subsequences that are part of a family of sequences, and it is a very important biological problem. Gibbs is one of the first successful motif algorithms and it runs very fast compared with other algorithms, and its search behavior is based on the well-studied Gibbs random sampling. However, motif prediction is a very difficult problem and Gibbs may not predict true motifs in some cases. Thus, the authors explored a possibility of improving the prediction accuracy of Gibbs while retaining its fast runtime performance. In this paper, the authors considered Gibbs only for proteins, not for DNA binding sites. The authors have developed iGibbs, an integrated motif search framework for proteins that employs two previous techniques of their own: one for guiding motif search by clustering sequences and another by pattern refinement. These two techniques are combined to a new double clustering approach to guiding motif search. The unique feature of their framework is that users do not have to specify the number of motifs to be predicted when motifs occur in different subsets of the input sequences since it automatically clusters input sequences into clusters and predict motifs from the clusters. Tests on the PROSITE database show that their framework improved the prediction accuracy of Gibbs significantly. Compared with more exhaustive search methods like MEME, iGibbs predicted motifs more accurately and runs one order of magnitude faster. 相似文献

17.

A profile-based deterministic sequential Monte Carlo algorithm for motif discovery

Liang KC Wang X Anastassiou D 《Bioinformatics (Oxford, England)》2008,24(1):46-55

相似文献

18.

BioOptimizer: a Bayesian scoring function approach to motif discovery 总被引：5，自引：0，他引：5

Jensen ST Liu JS 《Bioinformatics (Oxford, England)》2004,20(10):1557-1564

相似文献

19.

Comparative analysis of regulatory motif discovery tools for transcription factor binding sites

Wei W Yu XD 《基因组蛋白质组与生物信息学报(英文版)》2007,5(2):131-142

相似文献

20.

细菌GntR家族转录调控因子的研究进展 总被引：1，自引：0，他引：1

刘国芳王欣欣苏辉昭陆光涛《遗传》2021,(1):66-73

GntR家族转录调控因子是细菌中分布最为广泛的一类螺旋-转角-螺旋(helix-turn-helix,HTH)转录调控因子,此家族转录调控因子包含两个功能域,分别是N端的DNA结合结构域和C端的效应物结合结构域/寡聚化作用结构域.DNA结合结构域的氨基酸序列是非常保守的,但效应物结合结构域/寡聚化作用结构域的氨基酸序列... 相似文献