期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

This paper describes a novel evolutionary algorithm for regulatory motif discovery in DNA promoter sequences. The algorithm uses data clustering to logically distribute the evolving population across the search space. Mating then takes place within local regions of the population, promoting overall solution diversity and encouraging discovery of multiple solutions. Experiments using synthetic data sets have demonstrated the algorithm's capacity to find position frequency matrix models of known regulatory motifs in relatively long promoter sequences. These experiments have also shown the algorithm's ability to maintain diversity during search and discover multiple motifs within a single population. The utility of the algorithm for discovering motifs in real biological data is demonstrated by its ability to find meaningful motifs within muscle-specific regulatory sequences. 相似文献

6.

DREME: motif discovery in transcription factor ChIP-seq data

Bailey TL 《Bioinformatics (Oxford, England)》2011,27(12):1653-1659

相似文献

7.

A highly efficient and effective motif discovery method for ChIP-seq/ChIP-chip data using positional information

Ma X Kulkarni A Zhang Z Xuan Z Serfling R Zhang MQ 《Nucleic acids research》2012,40(7):e50

相似文献

8.

A tree-based approach for motif discovery and sequence classification

Yan R Boutros PC Jurisica I 《Bioinformatics (Oxford, England)》2011,27(15):2054-2061

相似文献

9.

Improved benchmarks for computational motif discovery

Geir Kjetil Sandve Osman Abul Vegard Walseng Finn Drabløs 《BMC bioinformatics》2007,8(1):193

相似文献

10.

A generic algorithm for layout of biological networks

Falk Schreiber Tim Dwyer Kim Marriott Michael Wybrow 《BMC bioinformatics》2009,10(1):375

Background

Biological networks are widely used to represent processes in biological systems and to capture interactions and dependencies between biological entities. Their size and complexity is steadily increasing due to the ongoing growth of knowledge in the life sciences. To aid understanding of biological networks several algorithms for laying out and graphically representing networks and network analysis results have been developed. However, current algorithms are specialized to particular layout styles and therefore different algorithms are required for each kind of network and/or style of layout. This increases implementation effort and means that new algorithms must be developed for new layout styles. Furthermore, additional effort is necessary to compose different layout conventions in the same diagram. Also the user cannot usually customize the placement of nodes to tailor the layout to their particular need or task and there is little support for interactive network exploration. 相似文献

11.

A uniform projection method for motif discovery in DNA sequences

Raphael B Liu LT Varghese G 《IEEE/ACM transactions on computational biology and bioinformatics / IEEE, ACM》2004,1(2):91-94

Buhler and Tompa (2002) introduced the random projection algorithm for the motif discovery problem and demonstrated that this algorithm performs well on both simulated and biological samples. We describe a modification of the random projection algorithm, called the uniform projection algorithm, which utilizes a different choice of projections. We replace the random selection of projections by a greedy heuristic that approximately equalizes the coverage of the projections. We show that this change in selection of projections leads to improved performance on motif discovery problems. Furthermore, the uniform projection algorithm is directly applicable to other problems where the random projection algorithm has been used, including comparison of protein sequence databases. 相似文献

12.

A sequential coalescent algorithm for chromosomal inversions

S Peischl E Koch R F Guerrero M Kirkpatrick 《Heredity》2013,111(3):200-209

Chromosomal inversions are common in natural populations and are believed to be involved in many important evolutionary phenomena, including speciation, the evolution of sex chromosomes and local adaptation. While recent advances in sequencing and genotyping methods are leading to rapidly increasing amounts of genome-wide sequence data that reveal interesting patterns of genetic variation within inverted regions, efficient simulation methods to study these patterns are largely missing. In this work, we extend the sequential Markovian coalescent, an approximation to the coalescent with recombination, to include the effects of polymorphic inversions on patterns of recombination. Results show that our algorithm is fast, memory-efficient and accurate, making it feasible to simulate large inversions in large populations for the first time. The SMC algorithm enables studies of patterns of genetic variation (for example, linkage disequilibria) and tests of hypotheses (using simulation-based approaches) that were previously intractable. 相似文献

13.

Analysis of computational approaches for motif discovery

Nan Li Martin Tompa 《Algorithms for molecular biology : AMB》2006,1(1):8-8

相似文献

14.

Discriminative motif discovery in DNA and protein sequences using the DEME algorithm

Emma Redhead Timothy L Bailey 《BMC bioinformatics》2007,8(1):385

相似文献

15.

MotifLab: a tools and data integration workbench for motif discovery and regulatory sequence analysis

Kjetil Klepper Finn Drabløs 《BMC bioinformatics》2013,14(1):1-14

相似文献

16.

POWRS: position-sensitive motif discovery

IW Davis C Benninger PN Benfey T Elich 《PloS one》2012,7(7):e40373

相似文献

17.

deFuse: an algorithm for gene fusion discovery in tumor RNA-Seq data

McPherson A Hormozdiari F Zayed A Giuliany R Ha G Sun MG Griffith M Heravi Moussavi A Senz J Melnyk N Pacheco M Marra MA Hirst M Nielsen TO Sahinalp SC Huntsman D Shah SP 《PLoS computational biology》2011,7(5):e1001138

相似文献

18.

A generic algorithm for finding restriction sites within DNA sequences

Jiang Keyuan; Zheng Jason; Higgins Stanley B. 《Bioinformatics (Oxford, England)》1991,7(2):249-256

This paper describes a generic algorithm for finding restrictionsites within DNA sequences. The ‘genericity’ ofthe algorithm is made possible through the use of set theory.Basic elements of DNA sequences, i.e. nucleotides (bases), arerepresented in sets, and DNA sequences, whether specific, ambiguousor even protein-coding, are represented as sequences of thosesets. The set intersection operation demonstrates its abilityto perform pattern-matching correctly on various DNA sequences.The performance analysis showed that the degree of complexityof the pattern matching is reduced from exponential to linear.An example is given to show the actual and potential restrictionsites, derived by the generic algorithm, in the DNA sequencetemplate coding for a synthetic calmodulin. Received on October 2, 1990; accepted on December 18, 1990 相似文献

19.

MixMir: microRNA motif discovery from gene expression data using mixed linear models

Liyang Diao Antoine Marcais Scott Norton Kevin C. Chen 《Nucleic acids research》2014,42(17):e135

相似文献

20.

A boosting approach for motif modeling using ChIP-chip data 总被引：1，自引：0，他引：1

Hong P Liu XS Zhou Q Lu X Liu JS Wong WH 《Bioinformatics (Oxford, England)》2005,21(11):2636-2643

相似文献